ultramit19/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-pesty_roaring_panther Text Generation • 0.5B • Updated 9 days ago • 253
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10, 2025 • 661