Large language models can now tell when they need to reason, saving money

What happened

Large language models that use step-by-step reasoning can now figure out on their own when to use that reasoning, instead of always doing it. This makes them cheaper to run and more accurate, because they only use complex thought processes when a task actually requires them.

Why it matters

Large language models often waste computing power by 'thinking aloud' through every problem, even simple ones. This paper shows how models can decide for themselves when to engage in complex reasoning. This could make advanced AI cheaper and more reliable for many applications.

The signal

Watch for this method, or similar ways of using internal uncertainty, to appear in major open-source or commercial large language models.