akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet 2B • Updated Nov 20, 2025 • 6
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet_14BDrafter 2B • Updated Jun 16, 2025 • 5
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet_32BDrafter Updated Jun 13, 2025
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner_Mini Text Generation • 2B • Updated Jun 11, 2025 • 4
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SplitReasoner Text Generation • 2B • Updated Apr 22, 2025 • 9
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner Text Generation • 2B • Updated Apr 19, 2025 • 15 • 1
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner Text Generation • 2B • Updated Apr 17, 2025 • 401
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SelfCompress_SFT_GRPO_INDUCETEST 2B • Updated Apr 16, 2025 • 4
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SelfCompress_SFT Text Generation • 2B • Updated Apr 15, 2025 • 4
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_GRPO_14k_v3 Text Generation • 2B • Updated Apr 15, 2025 • 6
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_14k Text Generation • 2B • Updated Apr 14, 2025 • 5