Update README.md
Browse files
README.md
CHANGED
|
@@ -90,7 +90,7 @@ The model was evaluated using [SGLang](https://docs.sglang.ai/) and [lm-evaluati
|
|
| 90 |
|
| 91 |
### Reproduction
|
| 92 |
|
| 93 |
-
The result of AIME24 was obtained using [SGLang](https://docs.sglang.ai/) while result of GSM8K
|
| 94 |
|
| 95 |
### AIME24
|
| 96 |
```
|
|
|
|
| 90 |
|
| 91 |
### Reproduction
|
| 92 |
|
| 93 |
+
The result of AIME24 was obtained using [SGLang](https://docs.sglang.ai/) while result of GSM8K was obtained using [vLLM](https://docs.vllm.ai/en/latest/). Both evaluations were conducted via forked [lm-evaluation-harness](https://github.com/BowenBao/lm-evaluation-harness/tree/cot).
|
| 94 |
|
| 95 |
### AIME24
|
| 96 |
```
|