Tools

News

Notícias

Classificados

Cursos

Broker

IPv4:

IPv6:

 

UpOrDown
Ping
MTR
Smokeping
MTU Detect
Portscan
DNS
HTTP/SSL
My IP
IP Calc
IP Extractor

DeepSeek Tests Sparse Attention to Cut Costs

Image © Arstechnica
DeepSeek releases v3.2-Exp, introducing DeepSeek Sparse Attention with a claimed 50% API price cut and potential for major cost reductions in long-context AI.

Long-context AI prompts slow down as attention costs grow quadratically. DeepSeek’s new v3.2-Exp release introduces DeepSeek Sparse Attention (DSA), touted to dramatically reduce compute and API costs, with the company claiming a 50% price cut.

Spars e attention is not new; it was used in models like OpenAI’s GPT-3 (2019) and Google’s Reformer (2020). Western labs’ current usage is not fully disclosed, but the technique is widely cited as a path to efficiency.

DeepSeek has been notable for other reasons: its R1 model reportedly matched OpenAI’s o1 performance at a training cost of about $6 million, and its earlier chat app briefly topped the iPhone App Store.

In v3.2-Exp, DeepSeek implements what it calls a “fine-grained sparse attention” mechanism and a “lightning indexer” that scores pairwise word relevance and keeps the top 2,048 connections for each word, skipping others without hurting overall comprehension.

The release includes open-source components under the MIT License with open weights, enabling peer review and further experimentation. Tech press has noted that early benchmarks suggest cost savings in long-context scenarios, though independent verification remains pending.

Even if results hold, the approach could reduce AI inference costs for long conversations or large-scale deployments, potentially changing how companies balance hardware and software efficiency in the coming years.

 

Arstechnica

Notícias relacionadas

Leilão de 700 MHz adiado para 2026
Claude Opus 4.5 impulsiona IA 2025
Novo Marco da Cibersegurança no Brasil
Brasil sobe para 16º lugar no ranking de IA 2025
Segurança da Informação em TI: Vazamentos em Ascensão
Ceará mira data centers no interior

O ISP.Tools sobrevive graças aos anúncios.

Considere a possibilidade de desativar seu bloqueador de anúncios.
Prometemos não ser intrusivos.

Consentimento de cookies

Usamos cookies para melhorar sua experiência em nosso site.

Ao usar nosso site, você concorda com os cookies. Saiba mais sobre o site