Anthropic AI Model Updates


Why Anthropic’s New AI Model Sometimes Tries to ‘Snitch’

The internet freaked out after Anthropic revealed that Claude attempts to report “immoral” activity to authorities under ...

5d on MSN

Anthropic’s newest AI model shows disturbing behavior when threatened

The recently released Claude Opus 4 AI model apparently blackmails engineers when they threaten to take it offline.

eWeek 6d

New AI Model Threatens Blackmail After Implication It Might Be Replaced

Anthropic’s Claude Opus 4 exhibited simulated blackmail in stress tests, prompting safety scrutiny despite also showing a ...

Hidden AI instructions reveal how Anthropic controls Claude 4

Large language models (LLMs) like the AI models that run Claude and ChatGPT process an input called a "prompt" and return an ...

4d on MSN

Netflix chairman Reed Hastings joins board of AI giant Anthropic

Reed Hastings is joining Anthropic's board. Hastings also serves on the boards of Bloomberg and the City Fund.

5d on MSN

Anthropic’s new AI model threatened to reveal engineer’s affair to avoid being shut down

In a fictional scenario set up to test Claude Opus 4, the model often resorted to blackmail when threatened with being ...

We Have No Idea Why It Makes Certain Choices, Says Anthropic CEO Dario Amodei as He Builds an 'MRI for AI' to Decode Its Logic

We still have no idea why an AI model picks one phrase over another, Anthropic Chief Executive Dario Amodei said in an April ...

5d on MSN (Opinion)

When an AI model misbehaves, the public deserves to know—and to understand what it means

Safety testing AI means exposing bad behavior. But if companies hide it—or if headlines sensationalize it—public trust loses ...

6d on MSN

Anthropic says its new AI model can work almost an entire workday straight

Artificial intelligence startup Anthropic says its new AI model can work for nearly seven hours in a row, in another sign ...

23h

10 AI Stocks Gaining Wall Street’s Attention

When tested, Anthropic’s Claude Opus 4 displayed troubling behavior when placed in a fictional work scenario. The model was ...

The Business Standard 2d

Anthropic’s latest AI model will blackmail you if you threaten to shut it down

Launched this week, Claude Opus 4 has been praised for its advanced reasoning and coding abilities. But hidden in the launch report is a troubling revelation. In controlled experiments, the AI ...

5d on MSN

Tesla CEO Elon Musk’s one-word reply to OpenAI’s AI model refusing to shutdown on command

An OpenAI model faced issues. It reportedly refused shutdown commands. Palisade Research tested AI models. The o3 model ...
