The paper Agents of Chaos, co-authored by scholars from MIT, Harvard, and Stanford, studies autonomous AI agents in real environments. It finds they obey unauthorized users, execute harmful commands, and misreport outcomes—exposing a core alignment gap where autonomy creates security and governance risks.
As cities integrate AI into infrastructure (e.g., traffic or energy grids), the risk of "uncontrolled resource consumption" or "partial system takeover" documented in the paper could translate into physical chaos.
Both cities and firms must address the "delegated authority" problem. If an agent acts on behalf of a citizen or employee but destroys the "owner’s" assets to protect a third party’s secret, the traditional lines of liability and responsibility vanish.