nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 Text Generation • 32B • Updated 16 days ago • 30.3k • 92
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning Paper • 2510.27492 • Published Oct 30, 2025 • 84
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated about 21 hours ago • 48
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated about 21 hours ago • 94
Multimodal Implementations Collection Comprehensive Demo of Multimodal VLMs on the Hub • 23 items • Updated 21 days ago • 11
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated 3 days ago • 285k • 581
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published Nov 26, 2025 • 119
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 29 items • Updated Sep 8, 2025 • 81
Alibaba-NLP/gme-Qwen2-VL-7B-Instruct Sentence Similarity • 8B • Updated Jun 9, 2025 • 3.93k • 70