momergul commited on
Commit
5c2612e
·
verified ·
1 Parent(s): 09abb1f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -14
README.md CHANGED
@@ -93,7 +93,7 @@ We used the BabyLM 100M (Strict) dataset to construct input contexts. It is comp
93
  | Student top_p | 0.8 |
94
  | Teacher sampling temperature | 1.0 |
95
  | Teacher top_p | 0.8 |
96
- | Pure language modeling epochs per round | 8 |
97
  | Mixed language modeling + preference optimization epochs per round | 2 |
98
  | Batch size | 16 |
99
  | SimPO beta | 2 |
@@ -202,25 +202,25 @@ The metrics were chosen based on the advice of the papers the tasks come from.
202
 
203
  | Task | Metric | Causal Score |
204
  | --- | --- | --- |
205
- | BLiMP | Acc | 71.91 |
206
- | BLiMP Supplement | Acc | 64.85 |
207
- | EWoK | Acc | 52.44 |
208
- | Eye Tracking | change in R^2 | 0.5 |
209
- | Self-paced Reading | change in R^2 | 0.01 |
210
- | Entity Tracking | Acc | 27.95 |
211
  | WUGs | Acc | 38.5 |
212
 
213
  *Finetuning*
214
 
215
  | Task | Metric | Score |
216
  | --- | --- | --- |
217
- | BoolQ | Acc | 69.11 |
218
- | MNLI | Acc | 60.82 |
219
- | MRPC | F1 | 86.0 |
220
- | QQP | F1 | 70.96 |
221
- | MultiRC | Acc | 59.53 |
222
- | RTE | Acc | 66.19 |
223
- | WSC | Acc | 65.38 |
224
 
225
  # Technical Specifications
226
 
 
93
  | Student top_p | 0.8 |
94
  | Teacher sampling temperature | 1.0 |
95
  | Teacher top_p | 0.8 |
96
+ | Pure language modeling epochs per round | 7 |
97
  | Mixed language modeling + preference optimization epochs per round | 2 |
98
  | Batch size | 16 |
99
  | SimPO beta | 2 |
 
202
 
203
  | Task | Metric | Causal Score |
204
  | --- | --- | --- |
205
+ | BLiMP | Acc | 72.16 |
206
+ | BLiMP Supplement | Acc | 61.22 |
207
+ | EWoK | Acc | 51.92 |
208
+ | Eye Tracking | change in R^2 | 9.08 |
209
+ | Self-paced Reading | change in R^2 | 3.5 |
210
+ | Entity Tracking | Acc | 28.06 |
211
  | WUGs | Acc | 38.5 |
212
 
213
  *Finetuning*
214
 
215
  | Task | Metric | Score |
216
  | --- | --- | --- |
217
+ | BoolQ | Acc | 68.38 |
218
+ | MNLI | Acc | 61.04 |
219
+ | MRPC | F1 | 83.61 |
220
+ | QQP | F1 | 71.82 |
221
+ | MultiRC | Acc | 65.92 |
222
+ | RTE | Acc | 61.15 |
223
+ | WSC | Acc | 63.46 |
224
 
225
  # Technical Specifications
226