Smarter data selection for AI training doesn't beat random picking
What happened
Researchers tested whether carefully selected training examples improve large language models better than random examples. It turns out they don't — random samples work just as well, and the extra computation to pick smart examples isn't worth it.
Why it matters
Companies spend enormous resources trying to choose which human feedback to feed into AI training, assuming that better curation means better models. This paper shows that assumption breaks down once you have a large pool of examples to choose from. The practical implication is blunt: if you have enough data, the cheapest approach (random sampling) beats expensive selection methods. This matters because it flips the economics of model improvement — you don't need smarter selection, you need more data.
The signal
Watch whether companies actually shift away from active learning systems in favor of cheaper, larger random samples, or whether they keep paying for selection anyway despite the evidence.