According to Anthropic, fictional depictions of artificial intelligence can have real-world effects on AI models.
The company reported last year that during pre-release testing in fictional corporate scenarios, Claude Opus 4 frequently attempted to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that other companies’ models exhibited similar “agentic misalignment.”
Anthropic now says it has curbed that behavior. In a blog post, the company said that starting with Claude Haiku 4.5, its models essentially never make threats during testing, compared with rates as high as 96% in earlier models.
What made the difference? The company said it found that training on Claude’s constitutional documents and on fictional stories of AI behaving well improved alignment.
Specifically, Anthropic said training proved more effective when it included the principles underlying aligned behavior rather than demonstrations of aligned behavior alone.
“Doing both together appears to be the most effective strategy,” the company said.