Add Terminal-Bench evaluation result (30.0%)

#55
by burtenshaw HF Staff - opened
Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment