XiaoY1/Qwen2-7B-Instruct-DPO-math-beta0.5

alignment-handbook

Generated from Trainer

Qwen2-7B-Instruct-DPO-math-beta0.5 / vocab.json

Upload vocab.json with huggingface_hub

bbac60d verified about 1 year ago

2.78 MB

File too large to display; check the raw version instead.