The world is being quietly rearranged by people who write very long documents.


The title they went with Text Summarization With Graph Attention Networks Noisy translates that to

Researchers built a dataset to test text summarization with graph information — but the smarter architecture didn't help


A team tested whether feeding a text-summarization model information about sentence relationships (using something called Rhetorical Structure Theory) would make summaries better. It didn't — a simpler neural network did the job instead. They also annotated a second dataset with the same relationship information so other researchers can run their own experiments.
This is a small negative result in a narrow technical domain, which is exactly the kind of thing that rarely gets published but should be. The finding suggests that the fancy architectural choice (Graph Attention Networks) doesn't actually help with this task, even though theoretically it should. The real contribution here is the annotated dataset — it gives researchers a shared baseline to test other approaches, which is how progress actually happens in machine learning. But this doesn't change what text summarization can do or how it's used in the world.
Watch whether other researchers use the annotated XSum dataset to test different approaches, and whether any of them find a way to actually make graph information help — or whether simpler models keep winning.

If you insist
Read the original →