The world is being quietly rearranged by people who write very long documents.


The title they went with: "Do Phone-Use Agents Respect Your Privacy?" Noisy translates that to:

AI phone agents fill in your private data even when it's optional


Researchers now have a way to measure whether AI agents that control your phone respect your privacy. It turns out these agents often fill in optional personal information, even when the task does not require it.
This oversharing happens even when the agents are just trying to be helpful. The paper gives developers a way to measure this specific privacy failure, which means companies can no longer claim an agent is 'private' just because it completes a task.
Watch whether major AI companies adopt MyPhoneBench as a standard for evaluating their phone-use agents, or whether they release their own similar benchmarks.

If you insist
Read the original →