Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought!
YM Qin
Wakals
AI & ML interests
Computer Vision, Vision-language Model, Generative Model
Recent Activity
updated
a model
3 days ago
Wakals/CoVT-LLaVA-13B-depth
updated
a model
3 days ago
Wakals/CoVT-7B-seg
updated
a model
3 days ago
Wakals/CoVT-7B-depth
Organizations
None yet