News
AI model threatened to blackmail engineer over affair when told it was being replaced: safety report
Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, TechCrunch reported ...
The testing found the AI was capable of "extreme actions" if it thought its "self-preservation" was threatened.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results