CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis Paper • 2503.23145 • Published Mar 29, 2025 • 35
Running 1.49k Big Code Models Leaderboard 📈 1.49k Explore and compare code generation models on a leaderboard