LRM-Conta-Detection-Arena/sft-conta-deepseek-distill-qwen2.5-7b • Text Generation • 8B • Updated Oct 9 • 3
hdong0/deepseek-Qwen-7B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4 • Text Generation • 8B • Updated Oct 25 • 4
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k_5_epochs • Text Generation • 333k • Updated 29 days ago • 35
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k • Text Generation • 333k • Updated 29 days ago • 28
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_w_sys_4k • Text Generation • 333k • Updated 28 days ago • 25
wetsoledrysoul/restem_DeepSeek-R1-Distill-Qwen-7B_ep0 • Text Generation • 8B • Updated 22 days ago • 99