Wang Han
zjuwh
ยท
AI & ML interests
LLM Post-Training
Recent Activity
upvoted a paper 9 days ago
V-Zero: Self-Improving Multimodal Reasoning with Zero Annotation liked
a dataset 2 months ago
zjuwh/self_train_set updated
a dataset 2 months ago
zjuwh/self_train_set