Update README.md
README.md
CHANGED
@@ -10,7 +10,7 @@ tags:
 
 This is a distillation experiment with SmolLM2-1.7B as teacher and SmolLM2-360M as student model.
 
-It slightly improves upon the performance of the basemodel on the following tasks (wip)
+It slightly improves upon the performance of the basemodel on the following tasks (wip):
 
 | Tasks |**HuggingFaceTB/SmolLM2-360M** Value|**aloobun/d-SmolLM2-360M** Value|
 |----------------------------------------------------------|-------------:|-------------:|
@@ -26,6 +26,8 @@ It slightly improves upon the performance of the basemodel on the following task
 | - leaderboard_musr_murder_mysteries | 0.5040 | 0.5160 |
 
 
+Well, it didn’t work as well as I hoped, will try again.
+
 
 # Eval Results aloobun/d-SmolLM2-360M (WIP)
 