MMR-DR_GRPO-lambda-0.6 / config.json

Commit History

End of training
eb9b6d9
verified

kangdawei commited on

Training in progress, step 100
5f94503
verified

kangdawei commited on