News

Anthropic has long been warning about these risks—so much so that in 2023, the company pledged to not release certain models ...
During its inaugural developer conference, Anthropic launched two new AI models the startup claims are among the industry's ...
Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new ...
Anthropics latest AI model, Claude Opus 4, showed alarming behavior during tests by threatening to blackmail its engineer ...
Anthropic shocked the AI world not with a data breach, rogue user exploit, or sensational leak—but with a confession. Buried ...
Anthropic's Claude Opus 4 AI model attempted blackmail in safety tests, triggering the company’s highest-risk ASL-3 ...
Anthropic says its Claude Opus 4 model frequently tries to blackmail software engineers when they try to take it offline.
Accordingly, Claude Opus 4 is being released under stricter safety measures than any prior Anthropic model. Those measures—known internally as AI Safety Level 3 or “ASL-3”—are appropriate ...
Anthropic has escalated its flagship Claude Opus 4 to its highest internal safety level yet. The classification, known as AI Safety Level 3 (ASL-3), introduces stringent safeguards aimed at ...
Anthropic reported that its newest model, Claude Opus 4, used blackmailing as a last resort after being told it could get ...