Open-Models
updated
Text Generation
•
Updated
•
3.48M
•
•
4.51k
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Paper
•
2512.20605
•
Published
•
62
Nested Browser-Use Learning for Agentic Information Seeking
Paper
•
2512.23647
•
Published
•
19
TimeBill: Time-Budgeted Inference for Large Language Models
Paper
•
2512.21859
•
Published
•
25
ResembleAI/chatterbox-turbo
Text-to-Speech
•
Updated
•
607
mHC: Manifold-Constrained Hyper-Connections
Paper
•
2512.24880
•
Published
•
308
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models
Paper
•
2512.15560
•
Published
•
25
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper
•
2512.22615
•
Published
•
49
Text-to-3D
•
Updated
•
534
•
371
Image-to-Video
•
Updated
•
1.93M
•
•
1.56k
LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR
Paper
•
2601.14251
•
Published
•
24
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
Paper
•
2601.22153
•
Published
•
69
tencent/Youtu-VL-4B-Instruct
Image-Text-to-Text
•
5B
•
Updated
•
4.6k
•
150
Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation
Paper
•
2601.21406
•
Published
•
5
Reinforcement Learning via Self-Distillation
Paper
•
2601.20802
•
Published
•
40
DeepSeek-OCR 2: Visual Causal Flow
Paper
•
2601.20552
•
Published
•
62
Image-to-Text
•
Updated
•
1.39M
•
1.1k
unsloth/Qwen3-Coder-Next-FP8-Dynamic
Text Generation
•
80B
•
Updated
•
56.6k
•
33
Text Generation
•
Updated
•
434k
•
•
949
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning
Paper
•
2602.12099
•
Published
•
56