Update README.md
Browse files
README.md
CHANGED
|
@@ -63,7 +63,7 @@ After intense refining, <b>Nerdsking-python-coder-3B-i</b> has achieved <b>88.41
|
|
| 63 |
- the model reasoning was right
|
| 64 |
- the failure is syntactic / boilerplate, not conceptual<br>
|
| 65 |
|
| 66 |
-
|
| 67 |
<hr>
|
| 68 |
|
| 69 |
|
|
|
|
| 63 |
- the model reasoning was right
|
| 64 |
- the failure is syntactic / boilerplate, not conceptual<br>
|
| 65 |
|
| 66 |
+
We did not considered it for our score, but "if" considered those extra 5 questions as correct, our benchmark would be <b>much higher</b>.
|
| 67 |
<hr>
|
| 68 |
|
| 69 |
|