aloobun/d-SmolLM2-360M

aloobun committed on Nov 21, 2024

Commit bd857ad · verified · 1 Parent(s): 7802a05

Update README.md

Files changed (1)

README.md +3 -1

@@ -10,7 +10,7 @@ tags:
 # Eval Results aloobun/d-SmolLM2-360M (WIP)

 This is a distillation experiment with SmolLM2-1.7B as teacher and SmolLM2-360M as student model.
-It slightly improves upon the performance of the basemodel on the following tasks (wip). I guess i can do much better than this - will try again.
+It slightly improves upon the performance of the basemodel on the following tasks (wip):
 |                          Tasks                           |**HuggingFaceTB/SmolLM2-360M** Value|**aloobun/d-SmolLM2-360M** Value|
 |----------------------------------------------------------|-------------:|-------------:|
@@ -26,6 +26,8 @@ It slightly improves upon the performance of the basemodel on the following task
 | - leaderboard_musr_murder_mysteries                   |       0.5040 |       0.5160 |
+Well, it didn’t work as well as I hoped, will try again.