Tools

News

Notícias

Classificados

Cursos

Broker

IPv4:

IPv6:

 

UpOrDown
Ping
MTR
Smokeping
MTU Detect
Portscan
DNS
HTTP/SSL
My IP
IP Calc
IP Extractor

DeepSeek Tests Sparse Attention to Cut Costs

Image © Arstechnica
DeepSeek releases v3.2-Exp, introducing DeepSeek Sparse Attention with a claimed 50% API price cut and potential for major cost reductions in long-context AI.

Long-context AI prompts slow down as attention costs grow quadratically. DeepSeek’s new v3.2-Exp release introduces DeepSeek Sparse Attention (DSA), touted to dramatically reduce compute and API costs, with the company claiming a 50% price cut.

Spars e attention is not new; it was used in models like OpenAI’s GPT-3 (2019) and Google’s Reformer (2020). Western labs’ current usage is not fully disclosed, but the technique is widely cited as a path to efficiency.

DeepSeek has been notable for other reasons: its R1 model reportedly matched OpenAI’s o1 performance at a training cost of about $6 million, and its earlier chat app briefly topped the iPhone App Store.

In v3.2-Exp, DeepSeek implements what it calls a “fine-grained sparse attention” mechanism and a “lightning indexer” that scores pairwise word relevance and keeps the top 2,048 connections for each word, skipping others without hurting overall comprehension.

The release includes open-source components under the MIT License with open weights, enabling peer review and further experimentation. Tech press has noted that early benchmarks suggest cost savings in long-context scenarios, though independent verification remains pending.

Even if results hold, the approach could reduce AI inference costs for long conversations or large-scale deployments, potentially changing how companies balance hardware and software efficiency in the coming years.

 

Arstechnica

Notícias relacionadas

Divergência MME e Aneel sobre cessão de postes
Brisanet dobra base móvel em 2025
Vivo anuncia Rogério Takayanagi como VP de engenharia e serviços
GT fará minuta da Política Nacional de Infraestruturas Críticas
Oi: Justiça prorroga blindagem de pagamentos até abril
Rogerio Takahyanagi assume Vivo como VP Engenharia

O ISP.Tools sobrevive graças aos anúncios.

Considere desativar seu bloqueador de anúncios.
Prometemos não ser intrusivos.

Consentimento para cookies

Utilizamos cookies para melhorar a sua experiência no nosso site.

Ao utilizar o nosso site, você concorda com o uso de cookies. Saiba mais