About the benchmark evaluation results

by Rebelliousgang - opened 29 days ago

29 days ago

Were the evaluation results shown in the Model Card obtained using OpenAI for evaluation, and should they be compared against the bottom row of Table 1?
UniPic2-Metaquery-9B w/o GRPO achieves competitive results across a variety of vision-language tasks:
| Task | Score |
| --- | --- |
| 🧠 GenEval | 0.86 |
| 🖼️ DPG-Bench | 83.63 |
| ✂️ GEditBench-EN | 6.90 |
| 🧪 ImgEdit-Bench | 4.10 |

OrlandoHugBot

Skywork org 26 days ago

Yes.
