The world is being quietly rearranged by people who write very long documents.


The title they went with Agentic Tool Use in Large Language Models Noisy translates that to

AI researchers map out what's actually blocking AI agents from working in the real world


A new survey organizes fragmented research on how AI language models learn to use external tools — databases, calculators, APIs, real-world actions — into a single framework. It shows that three fundamentally different approaches exist (simple prompting, supervised learning, and reward-based training), each with different strengths and failure modes, and identifies what's blocking progress.
Right now, AI tool-use research is scattered across hundreds of papers using different tasks, different tools, and different evaluation methods — making it impossible to know which approach actually works better, or why systems fail in production. This survey is the first coherent map of the problem space, which means researchers can stop reinventing the wheel and start identifying which failures are fundamental versus solvable. It's the kind of work that either accelerates real progress by pointing to actual bottlenecks, or exposes that we've been chasing incremental improvements on the wrong problems entirely.
Look for whether subsequent AI research cites this framework to justify new work, or whether the three paradigms it identifies actually collapse into something simpler — either outcome tells you whether the survey identified real structural differences or just organized noise.

If you insist
Read the original →