27 86 261

Yinxu Pan

cppowboy

https://github.com/Cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity

new activity 2 days ago

zai-org/GLM-4.7-Flash:unsupport glm4-moe-lite

liked a model 3 days ago

openbmb/AgentCPM-Report

liked a model 3 days ago

openbmb/AgentCPM-Explore

View all activity

Organizations

upvoted 2 papers 8 days ago

Ministral 3

Paper • 2601.08584 • Published 10 days ago • 45

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published 8 days ago • 121

upvoted a paper 13 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 14 days ago • 203

upvoted a paper 17 days ago

Recursive Language Models

Paper • 2512.24601 • Published 23 days ago • 73

upvoted 3 papers 18 days ago

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation

Paper • 2601.00664 • Published 21 days ago • 54

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published 23 days ago • 115

Deep Delta Learning

Paper • 2601.00417 • Published 21 days ago • 32

upvoted 2 papers 23 days ago

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published 25 days ago • 26

End-to-End Test-Time Training for Long Context

Paper • 2512.23675 • Published 24 days ago • 20

upvoted a paper 24 days ago

Nested Browser-Use Learning for Agentic Information Seeking

Paper • 2512.23647 • Published 24 days ago • 18

upvoted 2 papers 25 days ago

SWE-RM: Execution-free Feedback For Software Engineering Agents

Paper • 2512.21919 • Published 28 days ago • 10

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

Paper • 2512.22047 • Published 28 days ago • 27

upvoted 3 papers 29 days ago

SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios

Paper • 2512.18470 • Published Dec 20, 2025 • 11

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published about 1 month ago • 35

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published about 1 month ago • 35

upvoted a paper about 1 month ago

SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories

Paper • 2512.17419 • Published Dec 19, 2025 • 10

upvoted an article about 1 month ago

Article

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Dec 15, 2025

•

106

upvoted 3 papers about 2 months ago

Yinxu Pan

AI & ML interests

Recent Activity

Organizations

cppowboy's activity

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models