Update README.md
README.md
CHANGED
@@ -86,6 +86,8 @@ This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-a



+_-_Accuracy_Comparison.png)
+


 | Benchmark | Alpie-Core (32B-4bit) | DeepSeek-V2 (236B) | Qwen2.5 72B | Llama 3.1 405B | Llama 3.1 70B | Gemma-3 27B-PT | Mistral-Small-24B-Base-2501 |
@@ -113,8 +115,6 @@ This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-a

 ### Humanity's Last Exam Leaderboard Performance

-.png)
-
 | Rank | Model | Accuracy (%) | Performance vs Alpie |
 |------|-------|-------------|---------------------|
 | 1 | GPT 4.5 Preview | 5.8 | Above Alpie |