The world is being quietly rearranged by people who write very long documents.


The title they went with Evolutionary Search for Automated Design of Uncertainty Quantification Methods Noisy translates that to

AI can now design its own methods to detect when it's lying — and it works better than humans designed them


Researchers used one AI system to automatically write new code for detecting when other large language models are hallucinating or making false claims. The AI-generated methods beat hand-coded versions by 6.7% on verification tasks and stayed reliable even when tested on unfamiliar data. This means detecting AI failures no longer requires humans to manually engineer solutions.
For years, engineers have built hallucination detectors by hand, using intuition and domain knowledge — a slow, expensive process that doesn't scale. This shows an AI can write better detectors automatically, which matters because language models are now deployed in high-stakes domains like medicine and law where false confidence is dangerous. The second surprise: different AI models evolved completely different strategies to solve the same problem, some building complex statistical models and others using simpler tricks. That means the space of possible detection methods is larger than anyone knew.
Whether the evolved detector methods stay stable and generalizable when deployed on different types of claims and different model families, or whether they overfit to the specific tasks they were trained on.

If you insist
Read the original →