AI models can't generate diverse ideas because they don't know what's valid

What happened

Researchers found that large language models (LLMs) struggle to produce diverse outputs. It turns out the problem is not how they pick words, but how they assign probabilities to options. This means simply changing the sampling method will not make an AI more creative or better at exploring scientific ideas.

Why it matters

Developers thought they could make AI models more creative by tweaking how they sample words. This paper shows the problem is more fundamental: the models themselves do not reliably distinguish between good and bad options. They also put too much weight on a few common ones. This means building truly diverse AI will require rethinking how these models learn to assign probabilities.

The signal

Watch for new research papers that propose ways to 'recalibrate' LLM probability distributions, rather than just new sampling techniques.