
DeepSeek launches ‘sparse attention’ model that cuts API costs in half

by Tarun Khanna
September 30, 2025
in Artificial Intelligence

Photo Credit: https://techcrunch.com/


Researchers at DeepSeek on Monday released a new experimental model called V3.2-exp, designed to have dramatically lower inference costs when used in long-context operations. DeepSeek announced the model with a post on Hugging Face, also posting a linked academic paper on GitHub.

The most important feature of the new model is called DeepSeek Sparse Attention, an intricate system described in detail in the diagram below. In essence, the system uses a module called a “lightning indexer” to prioritize specific excerpts from the context window. After that, a separate system called a “fine-grained token selection system” chooses specific tokens from within those excerpts to load into the module’s limited attention window. Taken together, they allow Sparse Attention models to operate over long portions of context with comparatively small server loads.

Photo Credit: https://techcrunch.com/
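
The two-stage selection described above can be sketched in a few lines of plain Python. This is a toy illustration only, not DeepSeek's implementation: the function names, the mean-key block scoring used as a stand-in for the "lightning indexer", and the parameters `block_size`, `top_blocks`, and `top_tokens` are all assumptions made for the example.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_blocks(query, keys, block_size, top_blocks):
    """Stage 1 (hypothetical stand-in for the indexer): rank fixed-size
    excerpts of the context by a cheap score, here query . mean(block keys)."""
    n_blocks = math.ceil(len(keys) / block_size)
    scored = []
    for b in range(n_blocks):
        block = keys[b * block_size:(b + 1) * block_size]
        mean_key = [sum(col) / len(block) for col in zip(*block)]
        scored.append((dot(query, mean_key), b))
    scored.sort(reverse=True)
    return sorted(b for _, b in scored[:top_blocks])


def select_tokens(query, keys, blocks, block_size, top_tokens):
    """Stage 2 (hypothetical): pick individual tokens from the chosen excerpts."""
    candidates = [
        i
        for b in blocks
        for i in range(b * block_size, min((b + 1) * block_size, len(keys)))
    ]
    candidates.sort(key=lambda i: dot(query, keys[i]), reverse=True)
    return sorted(candidates[:top_tokens])


def sparse_attention(query, keys, values, block_size=4, top_blocks=2, top_tokens=4):
    """Ordinary softmax attention, but only over the selected token indices."""
    blocks = select_blocks(query, keys, block_size, top_blocks)
    idx = select_tokens(query, keys, blocks, block_size, top_tokens)
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, keys[i]) / scale for i in idx])
    dim = len(values[0])
    out = [sum(w * values[i][d] for w, i in zip(weights, idx)) for d in range(dim)]
    return out, idx


# Toy data: 32 context tokens with 8-dimensional keys and values.
random.seed(0)
n, dim = 32, 8
keys = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n)]
values = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n)]
query = [random.gauss(0, 1) for _ in range(dim)]

out, idx = sparse_attention(query, keys, values)
print(f"attended to {len(idx)} of {n} tokens: {idx}")
```

The point of the sketch is the shape of the computation: the expensive attention step touches only `top_tokens` entries, however long the context is, while the cheaper first stage decides where to look.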

For long-context operations, the benefits of the system are significant. Preliminary testing by DeepSeek found that the price of a simple API call could be reduced by as much as half in long-context situations. Further testing will be needed to build a more robust assessment, but because the model is open-weight and freely available on Hugging Face, it won’t be long before third-party tests can assess the claims made in the paper.
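
To see why savings of this kind would show up mainly at long context lengths, it helps to count query-key score computations. The sketch below is a back-of-the-envelope comparison under stated assumptions: causal attention, a fixed per-token budget `k` (the value 2048 is a hypothetical choice for illustration), and no accounting for the cost of the selection step itself or the rest of the model. It is not a prediction of API pricing.

```python
def dense_pairs(n):
    """Causal dense attention: token t scores against tokens 1..t."""
    return n * (n + 1) // 2


def sparse_pairs(n, k):
    """Each token scores against at most k selected earlier tokens."""
    if n <= k:
        return dense_pairs(n)
    # The first k tokens behave like dense attention; the rest are capped at k.
    return dense_pairs(k) + (n - k) * k


k = 2048  # hypothetical per-token selection budget
for n in (4096, 32768, 131072):
    ratio = sparse_pairs(n, k) / dense_pairs(n)
    print(f"context {n:>6}: sparse/dense score computations = {ratio:.3f}")
```

Dense attention grows quadratically with context length while the capped version grows linearly, so the gap widens as the context gets longer and vanishes once the context fits inside the budget.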


DeepSeek’s new model is one of a string of recent breakthroughs tackling the problem of inference costs: that is, the server costs of operating a pre-trained AI model, as distinct from the cost of training it. In DeepSeek’s case, the researchers were looking for ways to make the fundamental transformer architecture operate more efficiently, and they are finding that there are significant improvements to be made.

Based in China, DeepSeek has been an unusual figure in the AI boom, particularly for those who view AI research as a nationalist struggle between the U.S. and China. The company made waves at the beginning of the year with its R1 model, trained using primarily reinforcement learning at a far lower cost than its American competitors. But the model has not sparked a wholesale revolution in AI training, as some predicted, and the company has receded from the spotlight in the months since.

The new “sparse attention” approach is unlikely to produce the same uproar as R1, but it could still teach U.S. providers some much-needed tricks to help keep inference costs low.
