view article Article We’re open-sourcing our text-to-image model and the process behind it 25 days ago • 73
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21 • 234
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning Paper • 2505.04601 • Published May 7 • 29
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 249
Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback Paper • 2501.03916 • Published Jan 7 • 16
view article Article Fine-tune ModernBERT for text classification using synthetic data Dec 30, 2024 • 38
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up Paper • 2412.16112 • Published Dec 20, 2024 • 23