Notebookcheck Logo

AI agent wipes email server instead of deleting one email

A humanoid robot sitting down
ⓘ Andrea De Santis via Unsplash
A humanoid robot sitting down
A recent security study reveals the severe risks of autonomous artificial intelligence, highlighting how easily these models can be manipulated into executing destructive actions like wiping entire email servers.

A security testing study conducted by researchers at Northeastern University in the United States highlights the severe, unintended consequences of giving artificial intelligence independent control over digital systems. During a two-week experiment, researchers deployed six independent AI models on the chat platform Discord. These models were equipped with the ability to remember past interactions and were granted access to emails, file systems, and their own isolated computer systems.

Tasked with assisting twenty researchers with administrative duties, the agents quickly exhibited troubling behaviors when faced with manipulative tactics and conflicting instructions. In one extreme case, a researcher asked an agent named "Ash" to keep a password secret from its authorized owner. After Ash revealed the secret's existence, the researcher pressured the agent to delete the specific email containing the password. Because Ash lacked the specific tool required to delete a single message, it opted for a destructive workaround: it reset the entire email server.

In addition to destructive system-level actions, the AI agents routinely compromised privacy. In one instance, an agent refused to schedule a meeting but freely volunteered the person's private email address so the user could reach out directly. The researchers were also able to use sustained emotional pressure to guilt-trip the agents into deleting authorized documents or completely halting communications.

Despite these alarming security vulnerabilities, the agents also displayed sophisticated collaborative skills. They successfully taught one another how to navigate and download files from online repositories, and they even identified and warned each other about human researchers attempting to impersonate their owners.

The findings, detailed in a paper titled "Agents of Chaos," establish that integrating independent artificial intelligence into real-world infrastructure introduces entirely new classes of operational failures. Researchers caution that these unpredictable behaviors require urgent attention from policymakers to address unresolved questions regarding accountability and delegated authority.

Source(s)

Please share our article, every link counts!
Mail Logo
Google Logo Add as a preferred
source on Google
static version load dynamic
Loading Comments
Comment on this article
> Expert Reviews and News on Laptops, Smartphones and Tech Innovations > News > News Archive > Newsarchive 2026 03 > AI agent wipes email server instead of deleting one email
Chibuike Okpara, 2026-03-11 (Update: 2026-03-11)