Yangyi Chen

YangyiYY

https://yangyi-chen.github.io/

AI & ML interests

Multimodal, Large Language Models

Recent Activity

liked a model 14 days ago

nvidia/Nemotron-Cascade-8B-Intermediate-ckpts

authored a paper 15 days ago

CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets

authored a paper 15 days ago

R-Tuning: Teaching Large Language Models to Refuse Unknown Questions

Organizations

None yet

upvoted a paper 15 days ago

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Paper • 2512.13607 • Published 18 days ago • 27

upvoted a collection 18 days ago

Nemotron-Cascade

Collection • Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 1 day ago • 40

upvoted a paper about 1 month ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 111

upvoted a paper 6 months ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8, 2025 • 47

upvoted a paper 7 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 143

upvoted a paper 8 months ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5, 2025 • 79

upvoted a paper 9 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 93

upvoted an article 10 months ago

Putting RL back in RLHF

Article • Jun 12, 2024 • 109

upvoted 2 papers 12 months ago

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Paper • 2501.11733 • Published Jan 20, 2025 • 28

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

Paper • 2501.04561 • Published Jan 8, 2025 • 17
