PrimeIntellect
/

INTELLECT-2

Model card Files Files and versions

justus27 commited on May 11

Commit

407f1d7

·

verified ·

1 Parent(s): a0e0098

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -6,9 +6,11 @@ datasets:
 # INTELLECT-2
-INTELLECT-2 is a 32B parameter reasoning model trained through a reinforcement learning run leveraging globally distributed community-contributed GPU resources.
-To learn more about how INTELLECT-2 was trained, you can check out its [technical report](link)
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/0NFEBL9eAObkU4IQ_hAo0.png)

 # INTELLECT-2
+INTELLECT-2 is a 32 billion parameter language model trained through globally distributed reinforcement learning (RL) run on permissionless, community-contributed GPU resources.
+The model was trained using [prime-rl], a framework designed for distributed asynchronous RL, using GRPO over verifiable rewards along with modifications for improved training stability.
+For detailed information, see our [technical report](link).
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/0NFEBL9eAObkU4IQ_hAo0.png)