Researchers test whether large language models can extract facts from multiple documents at once

What happened

A research paper shows that when asked to find connections between information scattered across separate documents, large language models don't automatically do better than smaller ones — they get confused by having too many possible answers to choose from. The researchers built a system that breaks the problem into smaller steps, reducing confusion and improving accuracy.

Why it matters

This is a narrow technical problem in one corner of AI research: how to make language models reliable when they need to link information across documents. The paper shows that more parameters and general knowledge don't automatically solve harder classification tasks — sometimes you have to change the architecture itself. The work is incremental rather than foundational, addressing a specific failure mode rather than unlocking new capability.

The signal

Whether this hierarchical approach generalizes to other domains where large language models struggle with many classification options, or remains specific to cross-document relation extraction.