-
-
-
-
-
-
Inference Providers
Active filters: X-R1
zhengComing/zhengComing_Qwen2.5_0dot5B_R1_zero
Text Generation
• 0.5B • Updated
smartrichard/X-R1-lora-7500
Text Generation
• Updated
• 2
watermelonhjg/Qwen2.5-3B-EN-Zero
Text Generation
• 3B • Updated
watermelonhjg/Qwen2.5-7B-EN-Zero
Text Generation
• 8B • Updated
• 3
watermelonhjg/Qwen2.5-3B-Instruct-CN-Math-Zero
Text Generation
• 3B • Updated
• 2
watermelonhjg/Qwen2.5-7B-Instruct-CN-Math-Zero
Text Generation
• 8B • Updated
• 1
watermelonhjg/Qwen2.5-7B-Instruct-EN-Zero
Text Generation
• 8B • Updated
• 1
watermelonhjg/Qwen2.5-3B-Instruct-EN-Zero
Text Generation
• 3B • Updated
• 3
watermelonhjg/Qwen2.5-7B-med
Text Generation
• 8B • Updated
• 2
watermelonhjg/Qwen2.5-7B-0.01KL
Text Generation
• 8B • Updated
• 1
watermelonhjg/Qwen2.5-7B-class5
Text Generation
• 8B • Updated
• 1
watermelonhjg/Qwen2.5-7B-cn-class2
Text Generation
• 8B • Updated
• 1
watermelonhjg/Qwen2.5-Math-7B-en-zero
Text Generation
• 8B • Updated
• 1
watermelonhjg/Qwen2.5-Math-7B-cn-zero-class2
Text Generation
• 8B • Updated
• 1
IDoNotHaveAName/origin_grpo_train_1_epoch
Text Generation
• 2B • Updated
• 4
IDoNotHaveAName/GRPO-qwen2.5-1.5B-reward-process
Text Generation
• 2B • Updated
• 1
IDoNotHaveAName/GRPO-1epoch-train-by-mistake-collections-with-hint
Text Generation
• 2B • Updated
• 1
IDoNotHaveAName/GRPO-1epoch-train-by-mistake-collections-without-hint
Text Generation
• 2B • Updated
• 2
IDoNotHaveAName/X-R1-3epoch
Text Generation
• 2B • Updated
• 1
IDoNotHaveAName/2epoch-experiment
Text Generation
• 2B • Updated
• 1
IDoNotHaveAName/model-trainby-mistake
Text Generation
• 2B • Updated
• 1
mradermacher/Hint-Informed-GRPO-1.5B-GGUF
2B • Updated
• 18
GavinChan1105/X-R1-3B-cn-math
Text Generation
• 3B • Updated
• 2
mradermacher/X-R1-3B-cn-math-GGUF