Zhilong Zheng's picture

Zhilong Zheng

zzzzl-h

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens

View all activity

Organizations

None yet

authored a paper 3 days ago

STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens

Paper • 2602.15620 • Published 4 days ago • 3