The world is being quietly rearranged by people who write very long documents.


The title they went with Towards Automatic Soccer Commentary Generation with Knowledge-Enhanced Visual Reasoning Noisy translates that to

AI soccer commentary now identifies players by name instead of generic positions


Researchers built a system that watches soccer video and generates live commentary that names specific players and teams, rather than just describing what happened anonymously. This matters because it's the difference between 'the midfielder scores' and 'Harry Kane scores in the 67th minute' — the kind of detail that makes sports broadcasts actually watchable, and it suggests AI can now handle the harder task of connecting what it sees on screen to real-world identities.
Sports broadcasting has always required a human knowing who is on the field and what happened in previous games — context machines couldn't provide. This work shows that AI can now bridge that gap by combining live video analysis with historical statistics to generate commentary that sounds like it was written by someone actually watching the game. The structural shift is that the bottleneck moves from 'can AI describe action' to 'can AI describe action with the context a real audience expects' — and this paper suggests the answer is yes. This is the first time an AI system has outperformed a state-of-the-art large language model at identifying players correctly from video, which matters because it means commentary generation is becoming viable as an actual product rather than a research curiosity.
Watch whether any broadcast rights holder or sports streaming platform actually deploys a system like this for live games within 18 months, and whether the commentary it generates requires human correction or editing before airing.

If you insist
Read the original →