AI safety claims depend on hidden luck, not just design

What happened

New research shows that AI models built for critical systems are not reliably robust. Their safety claims depend on hidden random factors that change with every training run.

Why it matters

For years, people assumed that if an AI model was 'certified robust,' it meant it would reliably perform safely. This paper shows that those certifications are often meaningless. Small, random changes in how an AI model is built can make it go from 'safe' to 'unsafe' without anyone knowing.

The signal

Watch for new guidelines from regulators or industry bodies that require AI developers to report confidence intervals for robustness, or to test models across many different training runs.