kangdawei
/

MMR-DR_GRPO-lambda-0.5

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

MMR-DR_GRPO-lambda-0.5 / trainer_state.json

kangdawei's picture

Model save

b61d851 verified 2 months ago

history contribute delete

536 kB

File too large to display, you can check the raw version instead.