News
Anthropic's most powerful model yet, Claude 4, has unwanted side effects: The AI can report you to authorities and the press.
The company claims that Claude 4 Opus is "the world's best coding model." Claude 4 Sonnet is pretty good, too, but strikes ...
15h
CNET on MSNWhat's New in Anthropic's Claude 4 Gen AI Models?Claude 4 Sonnet is a leaner model, with improvements built on Anthropic's Claude 3.7 Sonnet model. The 3.7 model often had ...
Faced with the news it was set to be replaced, the AI tool threatened to blackmail the engineer in charge by revealing an extramarital affair.
Learn how Claude 4’s advanced AI features make it a game-changer in writing, data analysis, and human-AI collaboration.
Anthropic has rolled out Claude 4 Sonnet and Claude 4 Opus to its users, bringing a host of upgrades to the AI models running ...
Explore more
16h
ZME Science on MSNAnthropic’s new AI model (Claude) will scheme and even blackmail to avoid getting shut downIn a fictional scenario, Claude blackmailed an engineer for having an affair.
Anthropic's Claude AI tried to blackmail engineers during safety tests, threatening to expose personal info if shut down ...
Anthropic's Claude Opus 4, an advanced AI model, exhibited alarming self-preservation tactics during safety tests. It ...
1hon MSN
Malicious use is one thing, but there's also increased potential for Anthropic's new models going rogue. In the alignment section of Claude 4's system card, Anthropic reported a sinister discovery ...
11h
Interesting Engineering on MSNAnthropic's most powerful AI tried blackmailing engineers to avoid shutdownAnthropic's Claude Opus 4 AI model attempted blackmail in safety tests, triggering the company’s highest-risk ASL-3 ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results