BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates

DeepSeek releases a “sparse attention” model that cuts API costs by half

By admin | September 30, 2025


On Monday, researchers at DeepSeek released a new experimental model called V3.2-exp, designed to dramatically lower inference costs in long-context operations. DeepSeek announced the model in a post on Hugging Face and also published a linked academic paper on GitHub.

The most important feature of the new model is called DeepSeek Sparse Attention, an intricate system explained in detail in the diagram below. Essentially, the system uses a module called a “lightning indexer” to prioritize specific excerpts from the context window. A separate system, called the “fine-grained token selection system,” then chooses specific tokens from within those excerpts and loads them into the module’s limited attention window. Taken together, the two stages let sparse attention models operate over long portions of context with comparatively small server loads.

[Screenshot: diagram of the DeepSeek Sparse Attention system]
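
To make the two-stage idea concrete, here is a minimal Python sketch of the selection described above. It is a toy illustration, not DeepSeek's implementation: the function names (`select_tokens`, `sparse_attention`), the fixed-size blocks standing in for "excerpts," the max-pooled block score, and all parameter values are assumptions chosen only for this example. A cheap low-dimensional score plays the role of the indexer, the best-scoring excerpts are kept, the best-scoring tokens within them are selected, and full attention is computed over those tokens alone.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    shifted = x - np.max(x)
    exp = np.exp(shifted)
    return exp / exp.sum()


def select_tokens(index_scores, block_size, top_blocks, top_tokens):
    """Two-stage selection: pick excerpts (blocks) first, then tokens inside them."""
    n_blocks = index_scores.shape[0] // block_size  # assumes length is a multiple of block_size
    # Stage 1: rank each excerpt by its best token score and keep the top few.
    blocks = index_scores[: n_blocks * block_size].reshape(n_blocks, block_size)
    kept_blocks = np.argsort(blocks.max(axis=1))[-top_blocks:]
    # Stage 2: among tokens in the kept excerpts, keep the highest-scoring ones.
    candidates = np.concatenate(
        [np.arange(b * block_size, (b + 1) * block_size) for b in kept_blocks]
    )
    order = np.argsort(index_scores[candidates])[-top_tokens:]
    return np.sort(candidates[order])


def sparse_attention(query, keys, values, index_query, index_keys,
                     block_size=4, top_blocks=2, top_tokens=4):
    """Attend only to the selected tokens instead of the whole context."""
    # Hypothetical "indexer": a cheap dot product in a small scoring space.
    index_scores = index_keys @ index_query
    chosen = select_tokens(index_scores, block_size, top_blocks, top_tokens)
    # Full-size attention, but only over the chosen tokens.
    scores = keys[chosen] @ query / np.sqrt(keys.shape[1])
    weights = softmax(scores)
    return weights @ values[chosen], chosen


# Random stand-in data: 32 context tokens, 16-dim keys/values, 4-dim index vectors.
rng = np.random.default_rng(0)
n, d, d_index = 32, 16, 4
keys = rng.normal(size=(n, d))
values = rng.normal(size=(n, d))
query = rng.normal(size=d)
index_keys = rng.normal(size=(n, d_index))
index_query = rng.normal(size=d_index)

output, chosen = sparse_attention(query, keys, values, index_query, index_keys)
print("tokens attended:", len(chosen), "of", n)  # tokens attended: 4 of 32
```

In this sketch the expensive full-dimensional scoring touches 4 tokens rather than all 32, while the inexpensive index scores are still computed for every token. That asymmetry is the general intuition behind operating over long context with a smaller server load; how the real model scores and selects tokens is specified in DeepSeek's paper, not here.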

For long-context operations, the benefits of the system are significant. Preliminary testing by DeepSeek found that the price of a simple API call could be cut by as much as half in long-context situations. A more robust assessment will require further testing, but because the model is open-weight and freely available, it should not be long before third-party tests can evaluate the claims made in the paper.

DeepSeek’s new model is one of a string of recent breakthroughs tackling the problem of inference costs: that is, the server costs of operating a pre-trained AI model, as distinct from the cost of training it. In DeepSeek’s case, the researchers were looking for ways to make the fundamental transformer architecture operate more efficiently.

China-based DeepSeek has been an unusual figure in the AI boom, particularly for those who view AI research as a nationalist struggle between the US and China. The company made waves early in the year with its R1 model, which was trained primarily using reinforcement learning at a far lower cost than its American competitors. However, the model has not triggered a wholesale revolution in AI training, as some predicted, and the company has receded from the spotlight in the months since.

The new “sparse attention” approach is unlikely to cause the same uproar as R1, but it could still teach providers some of the tricks needed to keep inference costs low.


