HerrHruby
/

vanilla_grpo_acemath_rl_4b_inst_16k_step_180

Model card Files Files and versions

vanilla_grpo_acemath_rl_4b_inst_16k_step_180

8.06 GB

1 contributor

History: 2 commits

HerrHruby's picture

Upload trained model

c0481ba verified about 1 month ago