Zhaokai Wang
wzk1015
AI & ML interests
Computer Vision
Music Generation
Multimodal Large Language Models
Recent Activity
upvoted a paper, about 13 hours ago: EditThinker: Unlocking Iterative Reasoning for Any Image Editor
liked a model, about 2 months ago: Zhenxin-Lei/MetaCaptioner
upvoted a paper, about 2 months ago: OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding
LLM