juliendenize commited on
Commit
3171cd9
·
verified ·
1 Parent(s): 38d84aa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -63,11 +63,11 @@ Learn more about Devstral in our [blog post](https://mistral.ai/news/devstral-25
63
 
64
  ### SWE-Bench
65
 
66
- Devstral Small 1.1 achieves a score of **52.4%** on SWE-Bench Verified, outperforming Devstral Small 1.0 by +5,6% and the second best state of the art model by +10.2%.
67
 
68
  | Model | Agentic Scaffold | SWE-Bench Verified (%) |
69
  |--------------------|--------------------|------------------------|
70
- | Devstral Small 1.1 | OpenHands Scaffold | **52.4** |
71
  | Devstral Small 1.0 | OpenHands Scaffold | *46.8* |
72
  | GPT-4.1-mini | OpenAI Scaffold | 23.6 |
73
  | Claude 3.5 Haiku | Anthropic Scaffold | 40.6 |
 
63
 
64
  ### SWE-Bench
65
 
66
+ Devstral Small 1.1 achieves a score of **52.4%** on SWE-Bench Verified, outperforming Devstral Small 1.0 by +6,8% and the second best state of the art model by +11.4%.
67
 
68
  | Model | Agentic Scaffold | SWE-Bench Verified (%) |
69
  |--------------------|--------------------|------------------------|
70
+ | Devstral Small 1.1 | OpenHands Scaffold | **53.6** |
71
  | Devstral Small 1.0 | OpenHands Scaffold | *46.8* |
72
  | GPT-4.1-mini | OpenAI Scaffold | 23.6 |
73
  | Claude 3.5 Haiku | Anthropic Scaffold | 40.6 |