What happened
Researchers found that you can predict whether an AI's step-by-step reasoning will be correct by watching how its confidence changes during the thinking process — not just checking final confidence scores. In practice, this means you could reject or re-run AI answers that show warning signs of uncertainty before they're used in real decisions, without needing to know how the AI works internally.
Why it matters
Right now, AI systems can sound confident while being wrong, and there's no cheap way to catch those failures before they cause problems. If this pattern actually works across different models and tasks, it gives you an early warning signal that costs almost nothing to check — which means you can safely deploy AI reasoning in higher-stakes settings by automatically catching the cases where it's likely to be unreliable.