M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449
Junxiong Wang PRO
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Organizations
models
51
JunxiongWang/M1-3B
Text Generation
•
3B
•
Updated
•
4
•
2
JunxiongWang/M1-3B-SFT
Text Generation
•
3B
•
Updated
•
26
•
1
JunxiongWang/MambaInLlama1B_SFT_MATH
1B
•
Updated
•
3
JunxiongWang/MambaInLlama3B_SFT_MATH
3B
•
Updated
•
2
JunxiongWang/MambaInLlama3B_DPO2
3B
•
Updated
•
5
JunxiongWang/MambaInLlama3B_DPO1
3B
•
Updated
•
4
JunxiongWang/MambaInLlama3B_Distill_MATH
3B
•
Updated
•
3
JunxiongWang/MambaInLlama3B_v3
3B
•
Updated
•
3
JunxiongWang/MambaInLlama1B_Distill_MATH
1B
•
Updated
•
4
JunxiongWang/mamba_0_5_distill
Updated
datasets
20
JunxiongWang/QwenFineMATH
Viewer
•
Updated
•
6.71M
•
12
JunxiongWang/R1_GR_SFT
Viewer
•
Updated
•
44k
•
11
JunxiongWang/R1_SFT
Updated
•
30
JunxiongWang/R1_Sythetic_SFT
Viewer
•
Updated
•
1M
•
200
JunxiongWang/MATH_SFT
Viewer
•
Updated
•
19.1M
•
173
JunxiongWang/R1_OpenThoughts_SFT
Viewer
•
Updated
•
862k
•
55
JunxiongWang/R1_am_SFT
Viewer
•
Updated
•
1.4M
•
29
JunxiongWang/qwen1b_it_math
Viewer
•
Updated
•
19.1M
•
25
JunxiongWang/test_math
Viewer
•
Updated
•
89.1k
•
41
JunxiongWang/FineMathV4
Viewer
•
Updated
•
6.7M
•
37