FlagEval

non-profit
https://flageval.baai.ac.cn/

Recent Activity

philokey updated a dataset about 1 month ago: FlagEval/coco_val2014_sampled
philokey authored a paper about 1 month ago: Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
philokey updated a dataset about 1 month ago: FlagEval/MeasureBench

Members: Richeng Xuan, Xuannan Liu, llvvvv, Sherlock, Gray, makarov, Zheqi He, jingshu, daiteng01, lixuejing, HelloGitHub, Moyu

spaces 2

FlagEval-Arena

Running • 6 • 🐢 Arena • Mar 18

FlagEval-Debate

Running • 12 • 🐠 Display a debate interface • Mar 17

models 1

FlagEval/flageval_judgemodel

Text Generation • 33B • Updated Dec 30, 2024 • 16 • 1

datasets 13

FlagEval/ERQAPlus

Viewer • Updated 10 days ago • 800 • 14 • 1

FlagEval/coco_val2014_sampled

Viewer • Updated about 1 month ago • 1k • 90

FlagEval/MeasureBench

Viewer • Updated Nov 3 • 2.44k • 213 • 1

FlagEval/EmbodiedVerse-Bench

Viewer • Updated Jun 25 • 2.04k • 107

FlagEval/Where2Place

Viewer • Updated May 29 • 100 • 223

FlagEval/SAT

Viewer • Updated May 6 • 150 • 17

FlagEval/HMMT_2025

Viewer • Updated May 6 • 30 • 238 • 1

FlagEval/ERQA

Viewer • Updated Apr 22 • 400 • 544 • 2

FlagEval/sub_spatial

Viewer • Updated Apr 21 • 690 • 13

FlagEval/EmbSpatial-Bench

Viewer • Updated Apr 21 • 3.64k • 223 • 2