From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence • Paper • 2511.18538 • Published 14 days ago • 240
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning • Paper • 2511.22570 • Published 10 days ago • 65
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes • Paper • 2306.13649 • Published Jun 23, 2023 • 28
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance • Paper • 2511.13254 • Published 21 days ago • 134
Seer: Online Context Learning for Fast Synchronous LLM Reinforcement Learning • Paper • 2511.14617 • Published 19 days ago • 1
The Path Not Taken: RLVR Provably Learns Off the Principals • Paper • 2511.08567 • Published 26 days ago • 31
Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes • Paper • 0812.4360 • Published Dec 23, 2008 • 2
Depth Anything 3: Recovering the Visual Space from Any Views • Paper • 2511.10647 • Published 24 days ago • 92
Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI • Paper • 2511.01689 • Published Nov 3 • 4
LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics • Paper • 2511.08544 • Published 26 days ago • 6
From Memorization to Reasoning in the Spectrum of Loss Curvature • Paper • 2510.24256 • Published Oct 28 • 2