HUME: Measuring the Human-Model Performance Gap in Text Embedding Task
Paper • 2510.10062
The HUME benchmark is designed to evaluate the performance of text embedding models and humans on a comparable set of tasks. This captures areas wh...