hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LR-Retrained Text Generation • 3B • Updated 5 days ago • 51
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LR-Retrained Text Generation • 3B • Updated 5 days ago • 51
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-R1-QwQ-Seed-42-MLR Text Generation • 3B • Updated 7 days ago • 48
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-R1-QwQ-Seed-42-MLR Text Generation • 3B • Updated 7 days ago • 48
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-Seed-42-ROC-Seed-42 Text Generation • 3B • Updated 11 days ago • 45
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LRDRMe Text Generation • 3B • Updated 11 days ago • 54
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-Seed-42-R1-MC-Seed-42 Text Generation • 3B • Updated 11 days ago • 42
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-Seed-42-R1-MC-Seed-42 Text Generation • 3B • Updated 11 days ago • 42
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-Seed-42-ROC-Seed-42 Text Generation • 3B • Updated 11 days ago • 45
Intelligence per Watt: Measuring Intelligence Efficiency of Local AI Paper • 2511.07885 • Published 26 days ago • 6
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LRDRMi Text Generation • 3B • Updated 27 days ago • 39
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LRDRMa Text Generation • 3B • Updated 27 days ago • 29
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LRDRMe Text Generation • 3B • Updated 11 days ago • 54
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LRDRMi Text Generation • 3B • Updated 27 days ago • 39
Cartridges: Lightweight and general-purpose long context representations via self-study Paper • 2506.06266 • Published Jun 6 • 6
Archon: An Architecture Search Framework for Inference-Time Techniques Paper • 2409.15254 • Published Sep 23, 2024 • 1