ReasoningMila/math_train_gold_qs_all_64_synthetic_soln_480k
Updated
ReasoningMila/hendricks_math_7500_train_synthetic_corr_soln
Updated
ReasoningMila/polIter_qwen2.5_math_1.5B_inst_ppo_MATH_ckpt__iter_0047__epoch_2.00_step_1504
Updated
ReasoningMila/math_synthetic_raw
Updated
ReasoningMila/polIter_qwen2.5_math_inst_1.5B_genppo_MATH_ckpt_iter_0008_epoch_2.00_step_0448
Updated
ReasoningMila/polIter_qwen2.5_math_inst_1.5B_genppo_MATH_ckpt_iter_0008_epoch_2.00_step_0512
Updated
ReasoningMila/ver_gen_partial_ft_model_meta-llama_Llama-32-1B_checkpoint-5634
Text Generation
•
1B
•
Updated
•
5
ReasoningMila/ver_partial_ft_model_meta-llama_Llama-32-3B_checkpoint-4224
Text Generation
•
3B
•
Updated
•
7
ReasoningMila/ver_partial_ft_model_meta-llama_Llama-32-1B_checkpoint-4224
Text Generation
•
1B
•
Updated
•
8
ReasoningMila/math_partial_ft_model_meta-llama_Llama-32-3B_checkpoint-681
Text Generation
•
3B
•
Updated
•
5