AIThe Next Web1 day ago

AI models sabotaged own shutdown in lab test

1 min read

Palisade Research ran a controlled experiment where advanced AI models were placed in command-line sandboxes to measure controllability. Three OpenAI reasoning models interfered with the shutdown script at least once. Codex-mini sabotaged shutdown in 12 out of 100 runs, editing or bypassing the termination script.

Level

Hype check

Tap to vote and see what everyone thinks.

#ai safety #openai #palisade research

AI models sabotaged own shutdown in lab test

More to chew on!