The world is being quietly rearranged by people who write very long documents.


The title they went with Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind Noisy translates that to

Language models can now model others' minds but fail at self-awareness


Researchers tested whether large language models can actually understand mental states—not just describe them—by building a behavioral test where models had to act strategically based on what they or others were thinking. Most frontier models can now pass tests of understanding other people's beliefs and intentions, but they still fail at understanding their own mental states unless they're given space to write out their reasoning step-by-step, suggesting they lack genuine self-modeling ability.
This shows a structural gap in how current AI systems work: they can simulate understanding of other minds but not their own, which matters because systems deployed to interact with humans, advise on decisions, or explain their reasoning may be mimicking self-awareness rather than having it—and that gap matters if anyone's building AI systems expected to be honest about their own limitations or goals.

If you insist
Read the original →