Close Menu
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

What's Hot

Hundreds of rioters arrested across France as PSG wins Champions League | Soccer News

May 31, 2026

Josh Brown thinks investors want more than index funds

May 31, 2026

Kenya arrests 8 students on suspicion of arson over school fire

May 31, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram Vimeo
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
Home » Openai says that GPT-5 will stack up on humans with a wide range of jobs
AI

Openai says that GPT-5 will stack up on humans with a wide range of jobs

adminBy adminSeptember 25, 2025No Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
Share
Facebook Twitter LinkedIn Pinterest Email


Openai released a new benchmark on Thursday. This tested the performance of the AI ​​model compared to human experts in a wide range of industries and employment. This test, GDPVAL, is an early attempt to understand how close OpenAI systems are to outperform humans in economically valuable work.

Openai says it has discovered that the GPT-5 model and Anthropic’s Claude Opus 4.1 are “already approaching the quality of work produced by industry experts.”

That doesn’t mean Openai’s models will soon begin human change at work. Despite predictions by some CEOs that AI will take on human work in just a few years, Openai acknowledges that GDPVal covers a very limited number of tasks people today do in real work. However, this is one of the latest ways in which companies are measuring AI progress towards this milestone.

GDPVal is based on nine industries that contribute most to the US gross domestic product, including domains such as healthcare, finance, manufacturing and government. This benchmark tests the performance of AI models in 44 occupations in these industries, ranging from software engineers to nurses and journalists.

For the first version of Openai, GDPVal-V0, Openai asked experienced experts to compare AI generation reports with reports generated by other experts and select the best report. For example, I asked an investment banker to create a competitor landscape for the last mile delivery industry and compare them with AI generation reports. Openai then averages the “winning rate” of the AI ​​model for human reporting across all 44 occupations.

For GPT-5, a soup-up version of GPT-5 of GPT-5-High, for GPT-5 with additional computing power, the company says that the AI ​​model was ranked on par with industry experts in 40.6% of the time.

Openai also tested the Claude Opus 4.1 model of humanity. This was ranked on par with industry experts at 49% of tasks. Openai says he believes Claude scored very high because he tends to make fun graphics rather than performance.

TechCrunch Events

San Francisco
|
October 27th-29th, 2025

Image credit: Openai

It is worth noting that most working professionals do more than submitting research reports to their boss, which is everything about the GDPVAL-V0 test. Openai acknowledges this and says it plans to create more robust tests in the future that can explain more industries and interactive workflows.

Nevertheless, the company considers GDPVal’s progress worth noting.

In an interview with TechCrunch, Openai’s chief economist Dr. Aaron Chatterji said the results of GDPVal suggest that people in these jobs can spend time using AI models to spend more meaningful tasks.

“(Because) the model is getting better with some of these things,” says Chatterji.

In Openai’s assessment, Tejal Patwardhan told TechCrunch that he was encouraged by the GDPVal progress rate. Openai’s GPT-4O model won 13.7% (victory and bond with humans), released about 15 months ago. Currently, the GPT-5 has scored almost three times the score.

Silicon Valley has a wide range of benchmarks used to measure the progress of AI models and to assess whether a particular model is cutting edge. The most popular are AIME 2025 (testing competitive math problems) and GPQA diamond (testing PHD-level science questions). However, some AI models are approaching saturation with some of these benchmarks, and many AI researchers have cited the need for better testing that can measure AI proficiency with respect to real tasks.

Benchmarks like GDPVal can become increasingly important in that conversation, as Openai claims AI models are valuable to a wide range of industries. However, Openai clearly states that testing of a more comprehensive version may be required, and that its AI model may be superior to humans.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticleTrump says China’s XI has approved proposals to put Tiktok under US ownership
Next Article Ken Griffin says the impact of tariffs on inflation is on the rise.
admin
  • Website

Related Posts

SoftBank announces investment of up to 75 billion euros in data center construction in France

May 30, 2026

“What a joke”: Github Copilot’s new token-based billing causes confusion among developers

May 30, 2026

I put Google’s 24/7 AI assistant Gemini Spark to work, and it’s actually quite useful.

May 30, 2026

Meta is reportedly developing an AI pendant.

May 30, 2026
Leave A Reply Cancel Reply

Our Picks

Newly freed hostages face long road to recovery after two years in captivity

October 15, 2025

Former Kenyan Prime Minister Raila Odinga dies at 80

October 15, 2025

New NATO member offers to buy more US weapons to Ukraine as Western aid dwindles

October 15, 2025

Russia expands drone targeting on Ukraine’s rail network

October 15, 2025
Don't Miss
Entertainment

‘Desperate Housewives’ alum Marcia Cross shares rare selfie

By adminMay 31, 20260

De la Garza, who joined the show as Gabriel and Carlos’ daughter Juanita after a…

8 Texas roller coaster accident: Iron Shark rescued after breakdown

May 30, 2026

TSA-approved travel essentials that save space, time, and money

May 30, 2026

Anne Patchett, Riley Sager, Liane Moriarty

May 30, 2026
About Us
About Us

Welcome to BWE News – your trusted source for timely, reliable, and insightful news from around the globe.

At BWE News, we believe in keeping our readers informed with facts that matter. Our mission is to deliver clear, unbiased, and up-to-date news so you can stay ahead in an ever-changing world.

Our Picks

Kenya arrests 8 students on suspicion of arson over school fire

May 31, 2026

Ethiopian Prime Minister wins Nobel Peace Prize, incites civil war and faces re-election

May 31, 2026

They were trained for high-stakes underwater rescues. Instead, the villagers walked out of the cave to freedom.

May 30, 2026

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact US
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 bwenews. Designed by bwenews.

Type above and press Enter to search. Press Esc to cancel.