What happened
Researchers released an open-source data processing pipeline that makes it much easier to prepare training data for AI systems that can listen and speak simultaneously in natural conversation — the kind that handles overlapping speech and natural interruptions. Right now, speech AI mostly trains on single-speaker recordings, so building systems that feel like actual back-and-forth conversation requires solving hard problems (like figuring out who is speaking when) that existing tools struggle with.