Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Open Agent Evaluation Laboratory
university
https://boxiyu.github.io/
BoshCavendish
BoxiYu
boxi-yu-194b63279
Activity Feed
Follow
2
AI & ML interests
Code Agent, Benchmark Augmentation
Recent Activity
CWCY
updated
a dataset
2 days ago
OpenAgentLab/SWE-ABS
Bertsekas
authored
a paper
8 months ago
How Should I Build A Benchmark? Revisiting Code-Related Benchmarks For LLMs
Bertsekas
authored
a paper
8 months ago
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
View all activity
Team members
2
OpenAgentLab
's models
None public yet