From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence • Paper • 2511.18538 • Published 13 days ago • 238
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning • Paper • 2511.22570 • Published 9 days ago • 63
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes • Paper • 2306.13649 • Published Jun 23, 2023 • 28
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance • Paper • 2511.13254 • Published 20 days ago • 134
Seer: Online Context Learning for Fast Synchronous LLM Reinforcement Learning • Paper • 2511.14617 • Published 18 days ago • 1
The Path Not Taken: RLVR Provably Learns Off the Principals • Paper • 2511.08567 • Published 25 days ago • 31
Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes • Paper • 0812.4360 • Published Dec 23, 2008 • 2
Depth Anything 3: Recovering the Visual Space from Any Views • Paper • 2511.10647 • Published 23 days ago • 92