XiaoY1/Qwen2-7B-Instruct-DPO-math-beta0.5
