Reasoning Analysis
When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance Paper • 2509.22193 • Published Sep 26 • 37
When-Does-Reasoning-Matter/general-reasoning-ift-pairs Viewer • Updated Sep 29 • 2.97M • 243 • 3
When-Does-Reasoning-Matter/math-reasoning-ift-pairs Viewer • Updated Oct 27 • 458k • 1.47k • 7
MLM vs CLM
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 79
MLMvsCLM/610m-mlm40-42k-10000 Feature Extraction • Updated Jul 4 • 4
MLMvsCLM/610m-clm-40k-mlm20-42k Feature Extraction • Updated Jul 4 • 4
MLMvsCLM/1b-mlm40-42k Feature Extraction • Updated Jul 4 • 6
hgissbkh/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi-Choose-GPT-4 Text Generation • 13B • Updated Jul 24, 2024 • 8