DeepSeek Tests Sparse Attention to Cut Costs

DeepSeek releases v3.2-Exp, introducing DeepSeek Sparse Attention with a claimed 50% API price cut and potential for major cost reductions in long-context AI.

Long prompts are slow and expensive to process because the cost of standard attention grows quadratically with the number of tokens. DeepSeek’s new v3.2-Exp release introduces DeepSeek Sparse Attention (DSA), which the company says sharply reduces compute, and it is accompanied by a claimed 50% cut in API prices.
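To make the quadratic growth concrete, the sketch below counts the query-key pairs that a dense causal attention layer scores for a given prompt length. The function name and the choice to count pairs rather than FLOPs are illustrative assumptions; real costs also depend on model width, head count, and kernel implementation.

```python
def dense_causal_pairs(n_tokens: int) -> int:
    """Number of (query, key) pairs scored by dense causal attention.

    Token i attends to tokens 0..i, so the total is 1 + 2 + ... + n.
    """
    if n_tokens < 0:
        raise ValueError("n_tokens must be non-negative")
    return n_tokens * (n_tokens + 1) // 2


if __name__ == "__main__":
    # Doubling the prompt length roughly quadruples the number of pairs.
    for n in (1_000, 2_000, 4_000):
        print(n, dense_causal_pairs(n))
```

Under this simplified count, a 2,000-token prompt needs about four times as many pair scores as a 1,000-token prompt, which is why long contexts become disproportionately costly.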

Sparse attention is not new: OpenAI introduced sparse transformers in 2019 and used sparse attention patterns in GPT-3 (2020), and Google’s Reformer (2020) applied similar ideas. How Western labs use the technique today is not fully disclosed, but it is widely cited as a path to efficiency.

DeepSeek has been notable for other reasons: its R1 model reportedly matched OpenAI’s o1 performance at a training cost of about $6 million, and its earlier chat app briefly topped the iPhone App Store.

In v3.2-Exp, DeepSeek implements what it calls “fine-grained sparse attention,” built around a “lightning indexer” that scores the relevance of each pair of words (tokens) and keeps only the top 2,048 connections for each word. The remaining connections are skipped, which DeepSeek says does not hurt overall comprehension.
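The select-then-attend idea described above can be sketched in a few lines of plain Python. This is a minimal illustration, not DeepSeek's implementation: the indexer here is a simple dot product over separate, hypothetical "index" vectors, the function and parameter names are invented for the example, and only the top-k figure (2,048 in the release) comes from the article.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def top_k_indices(scores: Sequence[float], k: int) -> List[int]:
    """Positions of the k largest scores, returned in ascending order."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def sparse_causal_attention(
    queries: Sequence[Vector],
    keys: Sequence[Vector],
    values: Sequence[Vector],
    index_q: Sequence[Vector],  # hypothetical cheap indexer queries
    index_k: Sequence[Vector],  # hypothetical cheap indexer keys
    top_k: int = 2048,          # per-token budget reported for v3.2-Exp
) -> List[List[float]]:
    outputs = []
    dim = len(values[0])
    for t in range(len(queries)):
        # Cheap pass: score every position up to t (causal) with the indexer.
        index_scores = [dot(index_q[t], index_k[s]) for s in range(t + 1)]
        keep = top_k_indices(index_scores, min(top_k, t + 1))

        # Expensive pass: softmax attention over the kept positions only.
        scale = 1.0 / math.sqrt(len(queries[t]))
        logits = [dot(queries[t], keys[s]) * scale for s in keep]
        peak = max(logits)  # subtract the max for numerical stability
        weights = [math.exp(l - peak) for l in logits]
        total = sum(weights)
        outputs.append(
            [sum(w * values[s][d] for w, s in zip(weights, keep)) / total
             for d in range(dim)]
        )
    return outputs
```

In this sketch the indexer still looks at every earlier token, so the saving comes from running the full attention step over at most `top_k` positions per token instead of the whole context. When `top_k` is at least the sequence length, the result is the same as dense causal attention.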

The release includes open-source components under the MIT License with open weights, enabling peer review and further experimentation. Tech press has noted that early benchmarks suggest cost savings in long-context scenarios, though independent verification remains pending.

If the results hold, the approach could reduce AI inference costs for long conversations and large-scale deployments, potentially changing how companies balance hardware and software efficiency in the coming years.

Source: Ars Technica
