VLM2Vec & MMEB: Benchmarking multimodal embeddings and adapting state-of-the-art multimodal large language models into embedding models.
List of Our Papers
Main VLM2Vec / MMEB Series
Other Related Papers from Our Team
- GAE-Retriever – Benchmark and model for trajectory modeling in GUI environments. (Computer-use Agents@ICML 2025)
- B3 – A novel batch mining strategy for contrastive learning. (Neurips2025)
datasets
42
Viewer
•
Updated
•
302k
•
2
VLM2Vec/MMLongBench-page-fixed
Viewer
•
Updated
•
8.91k
•
3.32k
VLM2Vec/ViDoSeek-page-fixed
Viewer
•
Updated
•
8.78k
•
3.3k
Updated
•
157
Viewer
•
Updated
•
1.03M
•
170
•
1
Viewer
•
Updated
•
1.03M
•
75
Viewer
•
Updated
•
4k
•
899
Viewer
•
Updated
•
1.8k
•
107
•
1
Viewer
•
Updated
•
1k
•
914
Viewer
•
Updated
•
4.48k
•
2.9k
•
1