math-reasoning-value-function / training_config.json
Potat0-0's picture
Upload 13 files
0cf8636 verified
{
"model_name": "Qwen/Qwen2.5-0.5B-Instruct",
"lora_r": 16,
"lora_alpha": 32,
"max_seq_length": 2048,
"num_train_epochs": 3,
"learning_rate": 2e-05
}