Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions Paper • 2512.00097 • Published 11 days ago • 1 • 2
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR Paper • 2509.23808 • Published Sep 28 • 47 • 2
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR Paper • 2508.14029 • Published Aug 19 • 118 • 6
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR Paper • 2508.14029 • Published Aug 19 • 118 • 6