About the benchmark evaluation results

#3
by Rebelliousgang - opened

Were the evaluation results in the Model Card obtained using OpenAI for evaluation, and should they be compared with the bottom row of Table 1?
UniPic2-Metaquery-9B w/o GRPO achieves competitive results across a variety of vision-language tasks:
Task Score
🧠 GenEval 0.86
🖼️ DPG-Bench 83.63
✂️ GEditBench-EN 6.90
🧪 ImgEdit-Bench 4.10

Yes.
