Update README.md
Browse files
README.md
CHANGED
|
@@ -93,7 +93,7 @@ We used the BabyLM 100M (Strict) dataset to construct input contexts. It is comp
|
|
| 93 |
| Student top_p | 0.8 |
|
| 94 |
| Teacher sampling temperature | 1.0 |
|
| 95 |
| Teacher top_p | 0.8 |
|
| 96 |
-
| Pure language modeling epochs per round |
|
| 97 |
| Mixed language modeling + preference optimization epochs per round | 2 |
|
| 98 |
| Batch size | 16 |
|
| 99 |
| SimPO beta | 2 |
|
|
@@ -202,25 +202,25 @@ The metrics were chosen based on the advice of the papers the tasks come from.
|
|
| 202 |
|
| 203 |
| Task | Metric | Causal Score |
|
| 204 |
| --- | --- | --- |
|
| 205 |
-
| BLiMP | Acc |
|
| 206 |
-
| BLiMP Supplement | Acc |
|
| 207 |
-
| EWoK | Acc |
|
| 208 |
-
| Eye Tracking | change in R^2 |
|
| 209 |
-
| Self-paced Reading | change in R^2 |
|
| 210 |
-
| Entity Tracking | Acc |
|
| 211 |
| WUGs | Acc | 38.5 |
|
| 212 |
|
| 213 |
*Finetuning*
|
| 214 |
|
| 215 |
| Task | Metric | Score |
|
| 216 |
| --- | --- | --- |
|
| 217 |
-
| BoolQ | Acc |
|
| 218 |
-
| MNLI | Acc |
|
| 219 |
-
| MRPC | F1 |
|
| 220 |
-
| QQP | F1 |
|
| 221 |
-
| MultiRC | Acc |
|
| 222 |
-
| RTE | Acc |
|
| 223 |
-
| WSC | Acc |
|
| 224 |
|
| 225 |
# Technical Specifications
|
| 226 |
|
|
|
|
| 93 |
| Student top_p | 0.8 |
|
| 94 |
| Teacher sampling temperature | 1.0 |
|
| 95 |
| Teacher top_p | 0.8 |
|
| 96 |
+
| Pure language modeling epochs per round | 7 |
|
| 97 |
| Mixed language modeling + preference optimization epochs per round | 2 |
|
| 98 |
| Batch size | 16 |
|
| 99 |
| SimPO beta | 2 |
|
|
|
|
| 202 |
|
| 203 |
| Task | Metric | Causal Score |
|
| 204 |
| --- | --- | --- |
|
| 205 |
+
| BLiMP | Acc | 72.16 |
|
| 206 |
+
| BLiMP Supplement | Acc | 61.22 |
|
| 207 |
+
| EWoK | Acc | 51.92 |
|
| 208 |
+
| Eye Tracking | change in R^2 | 9.08 |
|
| 209 |
+
| Self-paced Reading | change in R^2 | 3.5 |
|
| 210 |
+
| Entity Tracking | Acc | 28.06 |
|
| 211 |
| WUGs | Acc | 38.5 |
|
| 212 |
|
| 213 |
*Finetuning*
|
| 214 |
|
| 215 |
| Task | Metric | Score |
|
| 216 |
| --- | --- | --- |
|
| 217 |
+
| BoolQ | Acc | 68.38 |
|
| 218 |
+
| MNLI | Acc | 61.04 |
|
| 219 |
+
| MRPC | F1 | 83.61 |
|
| 220 |
+
| QQP | F1 | 71.82 |
|
| 221 |
+
| MultiRC | Acc | 65.92 |
|
| 222 |
+
| RTE | Acc | 61.15 |
|
| 223 |
+
| WSC | Acc | 63.46 |
|
| 224 |
|
| 225 |
# Technical Specifications
|
| 226 |
|