arxiv:2601.19532
Marthe Ballon
martheballon
AI & ML interests
None yet
Recent Activity
authored
a paper
29 days ago
Benchmarks Saturate When The Model Gets Smarter Than The Judge updated
a dataset 29 days ago
martheballon/Omni-MATH-2 submitted
a paper
29 days ago
Benchmarks Saturate When The Model Gets Smarter Than The Judge Organizations
None yet