FutureMa
/

Qwen2.5-7B-Instruct-GRPO-Math

Text Generation

Model card Files Files and versions

Qwen2.5-7B-Instruct-GRPO-Math / additional_config.json

FutureMa's picture

Upload GRPO fine-tuned Qwen2.5-7B-Instruct model

bc4cc58 verified about 1 month ago

history blame contribute delete

67 Bytes

{"lora_dtype": null, "lorap_lr_ratio": null, "lorap_emb_lr": 1e-06}