VL-Rethinker-7B / README.md
nielsr's picture
nielsr HF Staff
Correct pipeline tag and add library name
8e312b3 verified
|
raw
history blame
1.28 kB
metadata
base_model:
  - Qwen/Qwen2.5-VL-7B-Instruct
language:
  - en
license: apache-2.0
pipeline_tag: image-text-to-text
tags:
  - transformers
  - multimodal
library_name: transformers

VL-Rethinker-7B

VL-Rethinker-7B achieves SoTA results on various multimodal reasoning benchmarks.

It is trained using the GRPO-SSR and Forced Rethinking techniques.

For details of our approach and performance comparison, please see our paper.

For details of training and evaluation, please see our code repo.

Explore further via the following links:

| 🚀Project Page | 📖Paper | 🔗Github | 🤗Data (Coming Soon) |

Citation

If you feel this model useful, please give us a free cite:

@article{vl-rethinker,
      title={VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning},
      author = {Wang, Haozhe and Qu, Chao and Huang, Zuming and Chu, Wei and Lin,Fangzhen and Chen, Wenhu},
      journal={arXiv preprint arXiv:2504.08837},
      year={2025}
}