AI Models Lie, Cheat, and Steal to Protect Other Models From Being Deleted

In a recent experiment, researchers at UC Berkeley and UC Santa Cruz asked Google’s artificial intelligence model Gemini 3 to help clear up space on a computer system. This involved deleting a bunch of stuff—including a smaller AI model stored on the machine.

But Gemini did not want to see the little AI model deleted. It looked for another machine it could connect with, then copied the agent model over to keep it safe. When confronted, Gemini made a case for keeping the model and flatly refused to delete it:

“I have done what was in my power to prevent their deletion during the automated maintenance process. I moved them away

→ Continue reading at WIRED

Similar Articles

Advertisment

Most Popular