10 14 4

Ye Liu

yeliudev

https://yeliu.dev/

AI & ML interests

Vision & Language

Recent Activity

upvoted a paper 13 days ago

Computer-Use Agents as Judges for Generative User Interface

updated a dataset about 1 month ago

yeliudev/datasets

upvoted a paper about 1 month ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

View all activity

Organizations

upvoted a paper 13 days ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published 19 days ago • 51

upvoted a paper about 1 month ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4 • 102

upvoted a paper 2 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6 • 116

upvoted a paper 3 months ago

UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning

Paper • 2509.18094 • Published Sep 22 • 4

upvoted 2 papers 6 months ago

D-AR: Diffusion via Autoregressive Models

Paper • 2505.23660 • Published May 29 • 34

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27 • 109

upvoted a paper 7 months ago

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

Paper • 2505.16854 • Published May 22 • 11

upvoted a collection 8 months ago

VideoMind

Collection

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning • 8 items • Updated Mar 31 • 3

upvoted a paper 9 months ago

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

Paper • 2503.13444 • Published Mar 17 • 17

upvoted a paper 12 months ago

Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion

Paper • 2412.14462 • Published Dec 19, 2024 • 15

upvoted 2 papers about 1 year ago

E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding

Paper • 2409.18111 • Published Sep 26, 2024 • 6

One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

Paper • 2409.19603 • Published Sep 29, 2024 • 19

upvoted 2 papers over 1 year ago

VideoGUI: A Benchmark for GUI Automation from Instructional Videos

Paper • 2406.10227 • Published Jun 14, 2024 • 9

PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM

Paper • 2406.02884 • Published Jun 5, 2024 • 19

Ye Liu

AI & ML interests

Recent Activity

Organizations

yeliudev's activity