- Agentic Reasoning for Large Language Models
  Paper • 2601.12538 • Published • 196
- ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas
  Paper • 2601.21558 • Published • 58
- Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
  Paper • 2601.22975 • Published • 100
yjg (YJG)
AI & ML interests
None yet
Recent Activity
updated a collection 19 days ago: agent
updated a collection 19 days ago: agent
upvoted a paper 19 days ago: ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas
Organizations
None yet