9 99 17

Xiaoye Qu

Xiaoye08

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Qwen3-VL Technical Report

upvoted a paper 3 days ago

Thinking with Programming Vision: Towards a Unified View for Thinking with Images

upvoted a paper 4 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

View all activity

Organizations

upvoted 2 papers 3 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 10 days ago • 106

Thinking with Programming Vision: Towards a Unified View for Thinking with Images

Paper • 2512.03746 • Published 4 days ago • 15

upvoted a paper 4 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 5 days ago • 167

upvoted 2 papers 5 days ago

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

Paper • 2511.20549 • Published 11 days ago • 23

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 13 days ago • 239

upvoted a paper 6 days ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published 9 days ago • 145

upvoted a paper 9 days ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published 11 days ago • 111

upvoted a paper 13 days ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published 17 days ago • 104

liked a model 16 days ago

chamber111/VPPO-8B

Image-Text-to-Text • 9B • Updated 30 days ago • 20 • 2

upvoted a paper 17 days ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published 17 days ago • 42

upvoted a paper 25 days ago

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published 28 days ago • 24

commented a paper 25 days ago

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published 28 days ago • 24 •

upvoted a paper 30 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published about 1 month ago • 208

upvoted a paper about 2 months ago

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21 • 66

upvoted a collection about 2 months ago

VPPO Model

Collection

SOTA models for multimodal reasoning, fine-tuned with VPPO. Achieves superior performance by focusing on critical visual tokens. • 4 items • Updated 30 days ago • 4

liked 2 models about 2 months ago

chamber111/VPPO-7B

Image-Text-to-Text • 8B • Updated 30 days ago • 36 • 5

chamber111/VPPO-32B

33B • Updated Oct 16 • 3 • 2

commented 2 papers about 2 months ago

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published Oct 10 • 36 •

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published Oct 10 • 36 •

upvoted a paper about 2 months ago

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published Oct 10 • 36

Xiaoye Qu

AI & ML interests

Recent Activity

Organizations

Xiaoye08's activity