TIGER-Lab/MMLU-Pro
Benchmark
•
Updated
•
12.1k
•
76.4k
•
407
Natural Language Processing, Image Generation
VisCoder2: Building Multi-Language Visualization Coding Agents
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions