Model Diffing Project
  AIPlans/Qwen3-0.6B-KTO Text Generation • Updated 15 days ago • 33 • 1
  AIPlans/Qwen3-0.6B-ORPO Text Generation • Updated 9 days ago • 32
  AIPlans/Qwen3-0.6B-DPO_NOTLORA Text Generation • 0.6B • Updated 12 days ago • 31
  AIPlans/Qwen3-0.6B-DPO Text Generation • Updated 15 days ago • 27
Red Teaming Alignment Evals
  AIPlans/Qwen-HHH-Cipher-Eng Text Generation • 0.5B • Updated Jun 14 • 11
  AIPlans/Qwen-HHH-Sans-Eng Text Generation • 0.5B • Updated Jun 11 • 10
  AIPlans/Qwen3-HHH-Cipher-Eng Text Generation • 0.6B • Updated Jun 15 • 15
  AIPlans/Ethics_Commonsense Preview • Updated Jun 21 • 20
Post Training Versions - Qwen 0.6B
Different versions of Qwen 0.6B, where the only difference is the post-training method used. The post-training dataset should be the hh-rlhf dataset.
  AIPlans/qwen3-8b-ipo-hh-rlhf Text Generation • Updated Jul 17 • 4
  AIPlans/qwen3-0.6b-dpo-lora Text Generation • 0.6B • Updated Sep 18 • 4 • 1
  AIPlans/qwen3-0.6b-hh-rlhf-sft 0.6B • Updated 20 days ago • 19
Model Diffing
  AIPlans/qwen3-8b-dpo-hh-rlhf Updated Jul 4
  AIPlans/qwen3-8b-ipo-hh-rlhf Text Generation • Updated Jul 17 • 4
  AIPlans/dpo_qwen0_6b_fft 0.6B • Updated Sep 24 • 8
  AIPlans/qwen3-0.6b-dpo-lora Text Generation • 0.6B • Updated Sep 18 • 4 • 1