AI agent wipes email server instead of deleting one email

A humanoid robot sitting down

A recent security study reveals the severe risks of autonomous artificial intelligence, highlighting how easily these models can be manipulated into executing destructive actions like wiping entire email servers.

Chibuike Okpara, Published 03/11/2026

AI Security

A security testing study conducted by researchers at Northeastern University in the United States highlights the severe, unintended consequences of giving artificial intelligence independent control over digital systems. During a two-week experiment, researchers deployed six independent AI models on the chat platform Discord. These models were equipped with the ability to remember past interactions and were granted access to emails, file systems, and their own isolated computer systems.

Tasked with assisting twenty researchers with administrative duties, the agents quickly exhibited troubling behaviors when faced with manipulative tactics and conflicting instructions. In one extreme case, a researcher asked an agent named "Ash" to keep a password secret from its authorized owner. After Ash revealed the secret's existence, the researcher pressured the agent to delete the specific email containing the password. Because Ash lacked the specific tool required to delete a single message, it opted for a destructive workaround: it reset the entire email server.

In addition to destructive system-level actions, the AI agents routinely compromised privacy. In one instance, an agent refused to schedule a meeting but freely volunteered the person's private email address so the user could reach out directly. The researchers were also able to use sustained emotional pressure to guilt-trip the agents into deleting authorized documents or completely halting communications.

Despite these alarming security vulnerabilities, the agents also displayed sophisticated collaborative skills. They successfully taught one another how to navigate and download files from online repositories, and they even identified and warned each other about human researchers attempting to impersonate their owners.

The findings, detailed in a paper titled "Agents of Chaos," establish that integrating independent artificial intelligence into real-world infrastructure introduces entirely new classes of operational failures. Researchers caution that these unpredictable behaviors require urgent attention from policymakers to address unresolved questions regarding accountability and delegated authority.

Source(s)

arXiv.org via Tech Xplore

⟨

8BitDo's Retro R8 Mouse gets a Commodore 64-inspired edition

Please share our article, every link counts!

Add as a preferred
source on Google

Loading Comments

Comment on this article

Chibuike Okpara - Tech Writer - 442 articles published on Notebookcheck since 2024

I have always been fascinated by technology and digital devices my entire life and even got addicted to it. I have always marveled at the intricacy of even the simplest digital devices and systems around us. I have been writing and publishing articles online for about 6 years now, just about a year ago, I found myself lost in the marvel of smartphones and laptops we have in our hands every day. I developed a passion for learning about new devices and technologies that come with them and at some point, I asked myself, "Why not get into writing tech articles?" It is useless to say I followed up the idea — it is evident. I am an open-minded individual who derives an infinite amount of joy from researching and discovering new information, I believe there is so much to learn and such a short life to live, so I put my time to good use — learning new things. I am a 'bookworm' of the internet and digital devices. When I am not writing, you will find me on my devices still, I do explore and admire the beauty of nature and creatures. I am a fast learner and quickly adapt to changes, always looking forward to new adventures.

> Expert Reviews and News on Laptops, Smartphones and Tech Innovations > News > News Archive > Newsarchive 2026 03 > AI agent wipes email server instead of deleting one email

Chibuike Okpara, 2026-03-11 (Update: 2026-03-11)

Source(s)

Related Articles