WeiboAI
/

VibeThinker-1.5B

Text Generation

text-generation-inference

Model card Files Files and versions

YinZhiBin commited on Nov 7

Commit

fbe15ee

·

verified ·

1 Parent(s): 1472722

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -19,9 +19,9 @@ VibeThinker-1.5B is a 1.5-billion parameter dense language model. With a total t
 ![image](https://cdn-uploads.huggingface.co/production/uploads/64d1faaa1ed6649d70d1fa2f/xpCgABkLikjM0-TwGnrmk.png)
 ## Key Performance Data
-💡 Mathematical Reasoning: On the three major math benchmarks AIME24, AIME25, and HMMT25, its scores (80.9, 73.7, and 50.1, respectively) all surpass those of the initial DeepSeek R1 model, which has over 400 times the parameters (scores of 79.8, 70.0, and 41.7, respectively).
-🌱 Code Generation: It achieved scores of 55.3 on LiveCodeBench v5 and 51 on v6. Its v6 score slightly leads Magistral Medium (50.3), underscoring its strong reasoning performance.
 🔁 On the AIME 25 benchmark, VibeThinker-1.5B significantly extends the Pareto frontier of reasoning accuracy versus model scale, demonstrating that exceptional performance can be achieved with extreme parameter efficiency.

 ![image](https://cdn-uploads.huggingface.co/production/uploads/64d1faaa1ed6649d70d1fa2f/xpCgABkLikjM0-TwGnrmk.png)
 ## Key Performance Data
+💡 Mathematical Reasoning: On the three major math benchmarks AIME24, AIME25, and HMMT25, its scores (80.3, 74.5, and 50.4, respectively) all surpass those of the initial DeepSeek R1 model, which has over 400 times the parameters (scores of 79.8, 70.0, and 41.7, respectively).
+🌱 Code Generation: It achieved scores of 55.9 on LiveCodeBench v5 and 51.1 on v6. Its v6 score slightly leads Magistral Medium (50.3), underscoring its strong reasoning performance.
 🔁 On the AIME 25 benchmark, VibeThinker-1.5B significantly extends the Pareto frontier of reasoning accuracy versus model scale, demonstrating that exceptional performance can be achieved with extreme parameter efficiency.