pawl_jack's picture

3 1

pawl_jack

pangdingjack

·

AI & ML interests

None yet

Organizations

None yet

upvoted 3 articles 6 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

271

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

219

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

+7

Jun 3, 2025

•

310