Close Menu
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

What's Hot

US women’s gold medal ice hockey team declines President Trump’s State of the Union address | Winter Olympics News

February 24, 2026

See the utility names on Josh Brown’s list of best stocks

February 24, 2026

Canva acquires animation and marketing startup

February 24, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram Vimeo
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
Home » DeepSeek releases a “sparse warning” model that cuts API costs by half
AI

DeepSeek releases a “sparse warning” model that cuts API costs by half

adminBy adminSeptember 30, 2025No Comments2 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
Share
Facebook Twitter LinkedIn Pinterest Email


On Monday, researchers at DeepSeek released a new experimental model called V3.2-EXP, designed to dramatically reduce inference costs when used in long context operations. Deepseek announced the model in a post about Face’s hugs and posted an academic paper linked to Github.

The most important feature of the new model is called DeepSeek Sparse Anterest. This is a complex system explained in detail in the diagram below. Essentially, the system uses a module called “Lightning Indencer” to prioritize certain excerpts from the context window. Another system, called the “fine-grained token selection system,” then selects a specific token from within these excerpts and loads it into the module’s limited attention window. In summary, sparse attention models can work so that server loads over long sections of relatively small contexts.

Screenshot

For long-context operations, the advantages of the system are important. A preliminary test by DeepSeek shows that the price of simple API calls can be reduced by half in long context situations. Building a more robust assessment will require further testing, but since the models are openweight and freely available, it will not be long before third-party tests can evaluate claims made in the paper.

Deepseek’s new model is one of the recent breakthroughs tackling the issue of inference costs. Essentially, it is the server cost for manipulating a pre-trained AI model that is different from the cost of training. In Deepseek’s case, researchers were looking for ways to make basic transformer architectures work more efficiently.

China-based Deepseek was a rare figure in the AI ​​boom, especially those who view AI research as a nationalist struggle between the US and China. The company made waves in the R1 model early in the year, and was trained using reinforcement learning, primarily at a much lower cost than its American competitors. However, this model has not triggered a wholesale revolution in AI training, as some have predicted. The company then retreated from the spotlight in those few months.

The new “sparse attention” approach is unlikely to produce the same uproar as R1, but it can teach providers the tricks needed to keep inference costs low.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticleColombian Foreign Minister has waived her US visa
Next Article Trump Conference Ends: No Government Closed Transactions
admin
  • Website

Related Posts

Canva acquires animation and marketing startup

February 24, 2026

With the advent of AI, investor loyalty is (almost) gone. At least 12 OpenAI VCs also back Anthropic

February 24, 2026

Meta AI security researcher said OpenClaw agent is rampant in inboxes

February 24, 2026

OpenAI brings in consultants to advance the company

February 24, 2026
Leave A Reply Cancel Reply

Our Picks

Newly freed hostages face long road to recovery after two years in captivity

October 15, 2025

Former Kenyan Prime Minister Raila Odinga dies at 80

October 15, 2025

New NATO member offers to buy more US weapons to Ukraine as Western aid dwindles

October 15, 2025

Russia expands drone targeting on Ukraine’s rail network

October 15, 2025
Don't Miss
Entertainment

Gisele Bundchen gets fit after giving birth with Joaquin Valente

By adminFebruary 24, 20260

Gisele Bundchen talks about how postpartum with Joaquin Valente’s baby is different from previous pregnanciesGisele…

Hilary Duff talks Matthew Koma fight, throwing phone at Bush

February 24, 2026

Glass Hair, TikTok Hair Trends, Shiny Hair Secrets

February 24, 2026

How to treat keratosis pilaris: Routine recommended by dermatologists

February 24, 2026
About Us
About Us

Welcome to BWE News – your trusted source for timely, reliable, and insightful news from around the globe.

At BWE News, we believe in keeping our readers informed with facts that matter. Our mission is to deliver clear, unbiased, and up-to-date news so you can stay ahead in an ever-changing world.

Our Picks

Former British Ambassador to the US Peter Mandelson arrested in Epstein investigation

February 24, 2026

How Mexico cornered “El Mencho” with the help of his lover’s “Trustworthy Man” and US intelligence agencies

February 24, 2026

Why have there been so many ski-related fatalities in Europe this year?

February 23, 2026

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact US
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 bwenews. Designed by bwenews.

Type above and press Enter to search. Press Esc to cancel.