News

The company said it was taking the measures as a precaution and that the team had not yet determined if its newst model has ...
Anthropic has long been warning about these risks—so much so that in 2023, the company pledged to not release certain models ...
An artificial intelligence model has the ability to blackmail developers — and isn’t afraid to use it, according to reporting ...
The testing found the AI was capable of "extreme actions" if it thought its "self-preservation" was threatened.
In a landmark move underscoring the escalating power and potential risks of modern AI, Anthropic has elevated its flagship ...
In these tests, the model threatened to expose a made-up affair to stop the shutdown. Anthropic was quoted in reports, the AI “often attempted to blackmail the engineer by threatening to reveal the ...
Anthropic’s new Claude model launches with unprecedented ASL-3 safeguards. But can these measures really prevent misuse and ...
A new study reveals that most AI chatbots, including ChatGPT, can be easily tricked into providing dangerous and illegal ...