TheFloatingString
/

qwen2-0.5b-instruct-math-grpo

Generated from Trainer

Model card Files Files and versions

qwen2-0.5b-instruct-math-grpo

18.1 MB

1 contributor

History: 11 commits

TheFloatingString's picture

TheFloatingString

Training in progress, step 300

df21be1 verified about 2 months ago