Close Menu
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

What's Hot

Ethiopian Prime Minister wins Nobel Peace Prize, incites civil war and faces re-election

May 31, 2026

How travelers can get their money back

May 31, 2026

Why financial anxiety does not decrease even if income increases

May 31, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram Vimeo
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
Home » Openai’s research on AI models is intentionally lying
AI

Openai’s research on AI models is intentionally lying

adminBy adminSeptember 19, 2025No Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
Share
Facebook Twitter LinkedIn Pinterest Email


Sometimes researchers at the biggest tech companies drop bombs. There was a time when Google said the latest quantum chips showed that there were multiple universes. Or when humanity lets its AI agent Claudius run a snack vending machine, it calls people security and claims it is human.

This week it was the opening turn to raise our collective brows.

Openai was released on Monday and released several researches explaining how AI models can stop “Scheming.” This is the practice of “AI behaving in one direction on the surface while hiding its true goals.”

In a paper conducted in Apollo’s research, the researchers went a little further and compared the AI ​​plan to human stock brokers to break the law and make as much money as possible. However, researchers argued that most AI “scheming” is not so harmful. “The most common failures include simple forms of deception, for example pretending to complete a task without actually doing so,” they write.

This paper was primarily published to demonstrate that “deliberative alignment” (the anti-skemming technique they were testing) worked well.

However, we also explained that AI developers don’t understand how to train models rather than schemes. This is because such training can actually teach the model how to scheme a better scheme so that it is not detected.

“The main failure mode of attempting to try training is simply teaching the model to plan more carefully and secretly,” the researchers write.

TechCrunch Events

San Francisco
|
October 27th-29th, 2025

Perhaps the most surprising part is that if you understand that a model is being tested, you can not only pass the test, but even if it is still planning, you can pretend that it is not planning. “Models are more aware of how often they are evaluated. This situational awareness itself can reduce the scheme regardless of true integrity,” the researchers write.

It’s not news that AI models are lying. Now most of us are experiencing AI hallucinations or models are confidently giving answers to prompts that are simply not true. But as documented by Openai Research, published earlier this month, hallucinations are essentially confident in their speculation.

The plan is a different thing. That’s intentional.

Even this revelation that models intentionally mislead humans is not new. Apollo Research first published a paper in December documenting how five models were planned when instructions were given to achieve their goals “at all costs.”

The news here is actually good news. Researchers saw a significant reduction in the scheme by using “deliberation alignment ⁠.” The techniques include teaching the model an “anti-shaming specification” and reviewing the model before acting. It’s like making the rules repeat before little kids can play them.

Openai researchers argue that lies are not so serious, either in their own model, or even ChatGpt. Wojciech Zaremba, co-founder of Openai, told Maxwell Zeff of TechCrunch: Great job. “And that’s just a lie. There are some small forms of deception that we still need to tackle. ”

The fact that AI models from multiple players deliberately deceive humans is probably understandable. They were built by humans, mimicked humans (the synthetic data is aside), and most of them were trained with human-generated data.

That’s also weird.

We all experienced frustration with low-performance technology (home printers last year thinking about you), but when did your non-ai software knowingly lie? Have your inbox manufactured emails on its own? Has your CMS recorded new prospects that were not present to fill that number? Did the FinTech app organize its own banking transaction?

This is worth pondering as the world of business immerses itself in the barrel towards the future of AI, where companies believe they can treat agents like independent employees. The researchers in this paper have the same warning.

“AIS hopes that as they are assigned more complex tasks with real outcomes and begin pursuing more ambiguous, long-term goals, the likelihood of harmful planning increases.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticleNvidia has spent more than $900 million on the CEO of AI startup technology Enfabrica
Next Article Stephen Colbert protects Jimmy Kimmel and calls for Trump Autocrito
admin
  • Website

Related Posts

SoftBank announces investment of up to 75 billion euros in data center construction in France

May 30, 2026

“What a joke”: Github Copilot’s new token-based billing causes confusion among developers

May 30, 2026

I put Google’s 24/7 AI assistant Gemini Spark to work, and it’s actually quite useful.

May 30, 2026

Meta is reportedly developing an AI pendant.

May 30, 2026
Leave A Reply Cancel Reply

Our Picks

Newly freed hostages face long road to recovery after two years in captivity

October 15, 2025

Former Kenyan Prime Minister Raila Odinga dies at 80

October 15, 2025

New NATO member offers to buy more US weapons to Ukraine as Western aid dwindles

October 15, 2025

Russia expands drone targeting on Ukraine’s rail network

October 15, 2025
Don't Miss
Entertainment

‘Desperate Housewives’ alum Marcia Cross shares rare selfie

By adminMay 31, 20260

De la Garza, who joined the show as Gabriel and Carlos’ daughter Juanita after a…

8 Texas roller coaster accident: Iron Shark rescued after breakdown

May 30, 2026

TSA-approved travel essentials that save space, time, and money

May 30, 2026

Anne Patchett, Riley Sager, Liane Moriarty

May 30, 2026
About Us
About Us

Welcome to BWE News – your trusted source for timely, reliable, and insightful news from around the globe.

At BWE News, we believe in keeping our readers informed with facts that matter. Our mission is to deliver clear, unbiased, and up-to-date news so you can stay ahead in an ever-changing world.

Our Picks

Ethiopian Prime Minister wins Nobel Peace Prize, incites civil war and faces re-election

May 31, 2026

They were trained for high-stakes underwater rescues. Instead, the villagers walked out of the cave to freedom.

May 30, 2026

Some of the world’s last Maoist rebels are in India. Their decades-long rebellion is in its death throes.

May 30, 2026

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact US
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 bwenews. Designed by bwenews.

Type above and press Enter to search. Press Esc to cancel.