Close Menu
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

What's Hot

Cook’s AI legacy at risk at final developer conference

June 5, 2026

Cockroach Janta Party: Boston University graduate flies to India to lead Gen Z protests

June 5, 2026

Kyle Busch’s wife Samantha Busch breaks silence on his death

June 5, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram Vimeo
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
Home » Model routing on AI is a problem for OpenAI and Anthropic
Tech

Model routing on AI is a problem for OpenAI and Anthropic

adminBy adminJune 5, 2026No Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
Share
Facebook Twitter LinkedIn Pinterest Email


Fixing overspending on AI is a problem for OpenAI and Anthropic

A new spending discipline is taking hold within corporate America, as chief financial officers and boards of directors begin to crack down on inefficient spending on artificial intelligence. This change has the potential to reshape AI trade.

For the past two years, playbooks have defaulted to choosing the most powerful AI model and sending all queries through that model, regardless of complexity. Now that the AI ​​bill is so far over budget, companies are starting to question whether they actually need a top-of-the-line or frontier model for every task. Two leaders at the center of building AI told CNBC this week that a solution is emerging: model routing.

What is model routing?

Routing is a tool that adapts jobs to models, sending difficult problems to expensive frontier models and easy problems to cheaper, faster alternative models.

Scott Wu, CEO of Cognition, which develops the coding agent Devin, said the benefits of routine work are huge. For many routine tasks, he says, companies can be five to 10 times more cost-effective by using models that are good enough for the task.

Today, most companies don’t do any routing at all. Glean CEO Arvind Jain estimates that approximately 95% of enterprise AI usage is still performed on the most expensive frontier models, even for tasks that could easily be handled by cheaper alternatives. Wu gave the example of asking a model to name the third US president. No matter how expensive they are, you’ll know they were all Thomas Jefferson.

Arvind Jain, CEO of Glean, takes to the SaaS Monster stage during day 1 of Web Summit 2022 at Altice Arena in Lisbon, Portugal on November 2, 2022.

Harry Murphy | Sports File | Getty Images

The pressure behind this change is a cost curve that has taken even the biggest technology companies by surprise. Jeetu Patel, Chief Product Officer Ciscolaid out the calculations. At approximately $200 in token usage per employee per week, that’s approximately $10,000 per employee per year. A company with 90,000 employees expects to make $900 million a year. A token is a block of data that a model uses to generate information. Usage is charged according to the number of tokens processed.

Patel said the adjustment was necessary because Cisco is way over its budget and currently has 30,000 engineers developing products primarily written in AI. Cisco reallocated resources and prioritized tokens over other spending.

Vendors under pressure

AI companies are aware of the concerns.

Cognition has announced what it calls the AI ​​Productivity Guarantee. If the engineering value provided by Devin is less than the price paid by the customer, Cognition will fund up to $10 million in usage until it reaches par. Mr. Wu framed this as a way to cut through the noise of metrics that plague the industry: return on investment.

Wu said that rather than measuring activities such as tokens or lines of code consumed, Cognition estimates the number of human engineering hours agents actually save and backs up that estimate with a refund. He said you can spend billions of tokens and not do anything with it. Companies should strive to produce results, not activities.

If companies start steering easy, high-volume work to cheaper open source models in places like China, OpenAI and Anthropic won’t be able to get paid for every task. They only accept more complex tasks. Both companies have built their businesses and IPO expectations around them on the premise of huge demand at premium prices.

Patel doesn’t think that will sink Frontier Labs, and says its cutting-edge technology will continue to be valuable. But he sees the pricing model changing. Rather than simply charging more, labs will need to be more efficient in how they use their models, which Patel predicts will lead to a concerted industry effort.

The question was whether companies would continue to spend as AI-related costs skyrocket. Nowadays, many people seem to be finding ways to spend their money wisely. Pricing power is shifting from companies selling premium AI to those buying it.

Frontier Labs will still charge a premium for the most difficult research. But how much of the market do the others account for? The answer could go a long way in determining the valuation of the leading AI companies.

Never miss the most trusted news moments in business news when you choose CNBC as your preferred source on Google.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticleTed Lasso, Buffy star Anthony Head dies at age 72 from pneumonia complications
Next Article Startup Battlefield 200 applications will officially close in 3 days
admin
  • Website

Related Posts

Cook’s AI legacy at risk at final developer conference

June 5, 2026

Trump administration and OpenAI discuss possibility of government funding

June 5, 2026

Where investors may find the next “big wave” in AI trading

June 5, 2026

Alphabet aims for $85 billion after four-week losing streak

June 5, 2026
Leave A Reply Cancel Reply

Our Picks

Newly freed hostages face long road to recovery after two years in captivity

October 15, 2025

Former Kenyan Prime Minister Raila Odinga dies at 80

October 15, 2025

New NATO member offers to buy more US weapons to Ukraine as Western aid dwindles

October 15, 2025

Russia expands drone targeting on Ukraine’s rail network

October 15, 2025
Don't Miss
Entertainment

Kyle Busch’s wife Samantha Busch breaks silence on his death

By adminJune 5, 20260

“From family and friends to fans and complete strangers, thank you for coming out for…

Ted Lasso, Buffy star Anthony Head dies at age 72 from pneumonia complications

June 5, 2026

How tall is Victor Wenbanyama? Age, height, deportation rumors and more

June 5, 2026

What happened to James Handy? Michael Gledhill arrested in connection with stabbing

June 5, 2026
About Us
About Us

Welcome to BWE News – your trusted source for timely, reliable, and insightful news from around the globe.

At BWE News, we believe in keeping our readers informed with facts that matter. Our mission is to deliver clear, unbiased, and up-to-date news so you can stay ahead in an ever-changing world.

Our Picks

Cockroach Janta Party: Boston University graduate flies to India to lead Gen Z protests

June 5, 2026

Exclusive: Lebanese president accuses Iran of using Iran as bargaining chip in peace talks with US

June 5, 2026

Exclusive: Adviser to Iran’s supreme leader warns of potential war as talks over $24 billion stall

June 5, 2026

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact US
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 bwenews. Designed by bwenews.

Type above and press Enter to search. Press Esc to cancel.