Yulei Qin

yolay

https://yuleichin.github.io/

AI & ML interests

Medical Imaging, Computer Vision, Language Models

Recent Activity

updated a model 1 day ago

yolay/SPEAR-SearchQA-Qwen2.5-14B

updated a model 2 days ago

yolay/SPEAR-SearchQA-Qwen2.5-7B

updated a collection 5 days ago

SPEAR

upvoted a paper 12 days ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Paper • 2511.20102 • Published 14 days ago • 26

upvoted a paper 15 days ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published 19 days ago • 105

upvoted 3 papers about 1 month ago

Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents

Paper • 2507.23698 • Published Jul 31 • 10

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 121

LTD-Bench: Evaluating Large Language Models by Letting Them Draw

Paper • 2511.02347 • Published Nov 4 • 8

upvoted an article about 1 month ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30

upvoted a paper 2 months ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9 • 44

upvoted a collection 2 months ago

Reinforcement learning

Collection

78 items • Updated 4 days ago • 7

upvoted a paper 2 months ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26 • 29

upvoted 5 papers 4 months ago

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

Paper • 2508.02150 • Published Aug 4 • 36

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 135

upvoted 2 papers 5 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 89

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 123

upvoted a collection 6 months ago

RAIF

Collection

Datasets and models in the paper "Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models" [github.com/yuleiqin/RAIF]. • 12 items • Updated Jul 17 • 2

upvoted 2 papers 6 months ago

WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks

Paper • 2506.01952 • Published Jun 2 • 10

Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Paper • 2506.01413 • Published Jun 2 • 16

upvoted an article 10 months ago