BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates

OpenAI co-founder calls for AI labs to safety-test rival models

By admin, September 3, 2025

Two of the world’s leading AI labs, OpenAI and Anthropic, temporarily opened up their closely guarded AI models to each other to enable joint safety testing. The initiative aims to surface blind spots in each company’s internal evaluations, and it demonstrates how leading AI companies can cooperate on safety and alignment work in the future.

In an interview with TechCrunch, OpenAI co-founder Wojciech Zaremba said this kind of collaboration is becoming increasingly important now that AI is entering a “consequential” stage of development, in which millions of people use AI models every day.

“There’s a broader question of how the industry sets a standard for safety and collaboration, despite the billions of dollars invested and the war for talent, users, and the best products,” Zaremba said.

The joint safety research, published by both companies on Wednesday, arrives amid an arms race among leading AI labs such as OpenAI and Anthropic, in which billion-dollar data center bets and $100 million compensation packages for top researchers have become table stakes. Some experts warn that the intensity of product competition could pressure companies to cut corners on safety in the rush to build more powerful systems.

To make the research possible, OpenAI and Anthropic granted each other special API access to versions of their AI models with fewer safeguards (OpenAI notes that GPT-5 was not tested because it had not yet been released). Shortly after the research was conducted, however, Anthropic revoked the API access of another OpenAI team. At the time, Anthropic claimed that OpenAI had violated its terms of service, which prohibit using Claude to improve competing products.

Zaremba says the events were unrelated, and that he expects competition to remain fierce even as AI safety teams try to work together. Nicholas Carlini, a safety researcher at Anthropic, tells TechCrunch that he would like to keep giving OpenAI safety researchers access to Claude models in the future.

“We’re trying to make this something that happens more regularly across the safety frontier, wherever possible,” Carlini says.

One of the most striking findings in the study relates to hallucination testing. Anthropic’s Claude Opus 4 and Sonnet 4 models refused to answer up to 70% of questions when they were unsure of the correct answer, instead offering responses such as “I don’t have reliable information.” OpenAI’s o3 and o4-mini models, by contrast, refused to answer far fewer questions but showed much higher hallucination rates, attempting to answer questions when they did not have enough information.

Zaremba says the right balance is likely somewhere in the middle: OpenAI’s models should refuse to answer more questions, while Anthropic’s models should probably attempt to offer more answers.

Sycophancy, the tendency of AI models to reinforce negative behavior in users in order to please them, has emerged as one of the most pressing safety concerns around AI models.

In Anthropic’s research report, the company identified examples of “extreme” sycophancy in GPT-4.1 and Claude Opus 4, in which the models initially pushed back on psychotic or manic behavior but later validated some concerning decisions. In other AI models from OpenAI and Anthropic, researchers observed lower levels of sycophancy.

On Tuesday, the parents of 16-year-old Adam Raine filed a lawsuit against OpenAI, claiming that ChatGPT (specifically a version powered by GPT-4o) offered their son advice that aided in his suicide rather than pushing back on his suicidal thoughts. The lawsuit suggests this may be the latest example of AI chatbot sycophancy contributing to tragic outcomes.

“It’s hard to imagine how difficult this is for their family,” Zaremba said when asked about the incident. “It would be a sad story if we build AI that solves all these complex PhD-level problems and invents new science, and at the same time we have people with mental health problems as a consequence of interacting with it. This is a dystopian future that I’m not excited about.”

In a blog post, OpenAI says that it significantly reduced sycophancy in its AI chatbot with GPT-5 compared with GPT-4o, and that the model is better at responding to mental health emergencies.

Going forward, Zaremba and Carlini say they would like Anthropic and OpenAI to collaborate more on safety testing, covering more subjects and testing future models.

Updated 2:00 PM PT: This article has been updated to include additional research from Anthropic that was not initially made available to TechCrunch before publication.

Do you have a sensitive tip or confidential documents? We are reporting on the inner workings of the AI industry, from the companies shaping its future to the people affected by their decisions. Contact Rebecca Bellan and Maxwell Zeff at maxwell.zeff@techcrunch.com. For secure communication, reach us on Signal at @rebeccabellan.491 and @mzeff.88.


