
Devstral 2 Narrows Gap With Proprietary Models

Mistral AI's Devstral 2, a 123B open-weights coding model, scores 72.2% on SWE-bench Verified, a sign that open-weights models are closing the distance to proprietary rivals. The release also includes Mistral Vibe, a CLI for autonomous software engineering.

French AI startup Mistral AI announced Devstral 2, a 123-billion-parameter open-weights coding model designed to work as part of an autonomous software engineering agent. The model scored 72.2 percent on SWE-bench Verified, placing it among the top open-weights coding models.

Alongside the model, Mistral rolled out Mistral Vibe, a CLI that lets developers interact with the Devstral family directly from the terminal. It can scan directory structures, inspect Git status to preserve context, modify multiple files, and run shell commands autonomously. The company released the CLI under the Apache 2.0 license.

SWE-bench Verified consists of 500 real software-engineering tasks drawn from GitHub issues in Python projects. For each task, the model must read the issue description, navigate the codebase, and produce a patch that makes the project's tests pass. Industry insiders say major AI players watch the benchmark closely, even though it tends to overrepresent simpler bug fixes.
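As a rough illustration of what a score on such a benchmark means in task counts, the sketch below computes a resolved rate from per-task pass/fail outcomes. The function name and the synthetic outcome list are hypothetical and are not part of the SWE-bench tooling; the only figures taken from the article are the 500 tasks and the 72.2 percent score.

```python
def resolved_rate(outcomes):
    """Return the percentage of tasks whose patch passed the tests."""
    if not outcomes:
        raise ValueError("no task outcomes given")
    return 100.0 * sum(outcomes) / len(outcomes)


TOTAL_TASKS = 500  # size of SWE-bench Verified, per the article

# Assumption for illustration: a 72.2% score corresponds to 361 resolved tasks.
resolved = round(0.722 * TOTAL_TASKS)

# Synthetic outcomes: True means the patch passed, False means it did not.
outcomes = [True] * resolved + [False] * (TOTAL_TASKS - resolved)

print(resolved, "of", TOTAL_TASKS, "tasks resolved")
print(f"resolved rate: {resolved_rate(outcomes):.1f}%")
```

Read this way, each resolved task moves the score by 0.2 percentage points, which helps put small differences between models in context.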

Alongside Devstral 2, Mistral released Devstral Small 2, a 24-billion-parameter model that scores 68 percent on SWE-bench Verified. It is designed to run locally on consumer hardware, including laptops without internet access. Both models support a 256,000-token context window, large enough to hold a medium-sized codebase. Devstral Small 2 is licensed under Apache 2.0, while Devstral 2 uses a modified MIT license.

The company frames Devstral 2 as a step toward more capable autonomous software engineering, though observers caution that benchmarks do not fully predict real-world performance. Still, the release shows open-weights models narrowing the gap with proprietary rivals, and it highlights ongoing debates about reliability and safety in AI-assisted coding.

 

Source: Ars Technica
