Close Menu
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

What's Hot

8 things that emotionally stable couples regularly discuss

April 12, 2026

Katy Perry, Justin Trudeau enjoy Coachella date night

April 12, 2026

Man City beat Chelsea 3-0 to close the gap on Premier League leaders Arsenal | Soccer News

April 12, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram Vimeo
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
Home » DeepSeek releases a “sparse warning” model that cuts API costs by half
AI

DeepSeek releases a “sparse warning” model that cuts API costs by half

adminBy adminSeptember 30, 2025No Comments2 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
Share
Facebook Twitter LinkedIn Pinterest Email


On Monday, researchers at DeepSeek released a new experimental model called V3.2-EXP, designed to dramatically reduce inference costs when used in long context operations. Deepseek announced the model in a post about Face’s hugs and posted an academic paper linked to Github.

The most important feature of the new model is called DeepSeek Sparse Anterest. This is a complex system explained in detail in the diagram below. Essentially, the system uses a module called “Lightning Indencer” to prioritize certain excerpts from the context window. Another system, called the “fine-grained token selection system,” then selects a specific token from within these excerpts and loads it into the module’s limited attention window. In summary, sparse attention models can work so that server loads over long sections of relatively small contexts.

Screenshot

For long-context operations, the advantages of the system are important. A preliminary test by DeepSeek shows that the price of simple API calls can be reduced by half in long context situations. Building a more robust assessment will require further testing, but since the models are openweight and freely available, it will not be long before third-party tests can evaluate claims made in the paper.

Deepseek’s new model is one of the recent breakthroughs tackling the issue of inference costs. Essentially, it is the server cost for manipulating a pre-trained AI model that is different from the cost of training. In Deepseek’s case, researchers were looking for ways to make basic transformer architectures work more efficiently.

China-based Deepseek was a rare figure in the AI ​​boom, especially those who view AI research as a nationalist struggle between the US and China. The company made waves in the R1 model early in the year, and was trained using reinforcement learning, primarily at a much lower cost than its American competitors. However, this model has not triggered a wholesale revolution in AI training, as some have predicted. The company then retreated from the spotlight in those few months.

The new “sparse attention” approach is unlikely to produce the same uproar as R1, but it can teach providers the tricks needed to keep inference costs low.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticleColombian Foreign Minister has waived her US visa
Next Article Trump Conference Ends: No Government Closed Transactions
admin
  • Website

Related Posts

At the HumanX conference, everyone was talking about Claude

April 12, 2026

From LLMs to hallucinations, here’s a simple guide to common AI terms

April 12, 2026

Sam Altman responds to ‘inflammatory’ New Yorker article after home attack

April 11, 2026

Anthropic has temporarily banned the creator of OpenClaw from accessing Claude

April 10, 2026
Leave A Reply Cancel Reply

Our Picks

Newly freed hostages face long road to recovery after two years in captivity

October 15, 2025

Former Kenyan Prime Minister Raila Odinga dies at 80

October 15, 2025

New NATO member offers to buy more US weapons to Ukraine as Western aid dwindles

October 15, 2025

Russia expands drone targeting on Ukraine’s rail network

October 15, 2025
Don't Miss
Entertainment

Katy Perry, Justin Trudeau enjoy Coachella date night

By adminApril 12, 20260

Orlando Bloom shares sweet bonding moment with Katy Perry’s daughter Daisy DoveKaty Perry and Justin…

The Real Housewives of Rhode Island tagline revealed

April 12, 2026

Hailey Bieber talks about Justin Bieber’s Coachella 2026 performance

April 12, 2026

SZA talks about Justin Bieber’s Coachella performance

April 11, 2026
About Us
About Us

Welcome to BWE News – your trusted source for timely, reliable, and insightful news from around the globe.

At BWE News, we believe in keeping our readers informed with facts that matter. Our mission is to deliver clear, unbiased, and up-to-date news so you can stay ahead in an ever-changing world.

Our Picks

Live updates: Hungarian elections, Viktor Orbán and Peter Magyar in close race in important European elections

April 12, 2026

Indian singer Asha Bhosle dies at 92, bringing an end to an ‘extraordinary’ journey

April 12, 2026

Failure of US-Iran talks deals blow to hopes of finding exit to crisis

April 12, 2026

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact US
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 bwenews. Designed by bwenews.

Type above and press Enter to search. Press Esc to cancel.