The world is being quietly rearranged by people who write very long documents.


The title they went with ReVEL: Multi-Turn Reflective LLM-Guided Heuristic Evolution via Structured Performance Feedback Noisy translates that to

AI learns to fix its own bad ideas by getting specific feedback on what went wrong


Researchers built a system where an AI helps design optimization algorithms, then gets feedback showing exactly which parts failed and why, then fixes them. Instead of asking an AI to write code once and hoping it works, this system lets it iterate — the AI sees its mistakes grouped by type and rewrites the weak pieces.
For years, people have tried using LLMs as one-shot code generators for hard problems. They're brittle and often fail in predictable ways. This paper shows that if you give an AI structured feedback about *where* it's failing (not just 'your solution is bad'), it can repair its own logic across multiple rounds. The practical implication: optimization problems that currently require human experts to hand-tune might become cheaper and faster to solve. But this is still a lab result on standard benchmarks. The gap between 'works better in experiments' and 'actually changes how people design algorithms in practice' remains large.
Whether teams actually use this approach to design heuristics for real optimization problems outside academia, and whether the quality of AI-generated heuristics approaches hand-crafted ones by domain experts.

If you insist
Read the original →