Text Generation
Transformers
Safetensors
step3p5
conversational
custom_code
Eval Results

Add MathArena evaluation result for aime/aime_2026

#26
.eval_results/MathArena--aime_2026.yaml ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ - dataset:
2
+ id: MathArena/aime_2026
3
+ task_id: MathArena/aime_2026
4
+ value: 96.67
5
+ date: '2026-02-16'
6
+ source:
7
+ url: https://matharena.ai/?comp=aime--aime_2026
8
+ name: Official MathArena Evaluation