agentic
updated
NousResearch/Hermes-4-70B
Text Generation
• Updated
• 2.44k
• 169
unsloth/Kimi-K2-Instruct-0905-GGUF
1T • Updated
• 1.36k
• 52
Text-to-Image
• Updated
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models
Paper
• 2509.09675
• Published
• 28
NousResearch/Hermes-4-14B-FP8
Text Generation
• 15B • Updated
• 3.27k
• 16
NousResearch/Hermes-4-70B-FP8
Text Generation
• 71B • Updated
• 173
• 25
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos
Reinforcement Learning
• 8B • Updated
• 7
• 14
NousResearch/DeepHermes-Financial-Fundamentals-Prediction-Specialist-Atropos
Text Generation
• 8B • Updated
• 30
• 15
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated
• 8
• 3
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated
• 7
• 6
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated
• 11
• 7
NousResearch/Hermes-4-405B-FP8
Text Generation
• Updated
• 248
• 21
deepseek-ai/DeepSeek-V3.1-Terminus
Text Generation
• Updated
• 8.35k
• 362
nvidia/NVIDIA-Nemotron-Nano-9B-v2-FP8
Text Generation
• 9B • Updated
• 39.4k
• 7
nvidia/nemocurator-fineweb-nemotron-4-edu-classifier
0.1B • Updated
• 2.65k
• 11
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text
• 236B • Updated
• 2.73M
• 378
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
Paper
• 2509.15566
• Published
• 14
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
Paper
• 2509.21268
• Published
• 104
Text Generation
• Updated
• 6.63k
• 98
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation
• Updated
• 41.3k
• 963
NousResearch/Hermes-4-14B
Text Generation
• 425k • Updated
• 4.83k
• 118