Safetensors
qwen2
justus27 commited on
Commit
407f1d7
·
verified ·
1 Parent(s): a0e0098

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -6,9 +6,11 @@ datasets:
6
 
7
  # INTELLECT-2
8
 
9
- INTELLECT-2 is a 32B parameter reasoning model trained through a reinforcement learning run leveraging globally distributed community-contributed GPU resources.
10
 
11
- To learn more about how INTELLECT-2 was trained, you can check out its [technical report](link)
 
 
12
 
13
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/0NFEBL9eAObkU4IQ_hAo0.png)
14
 
 
6
 
7
  # INTELLECT-2
8
 
9
+ INTELLECT-2 is a 32 billion parameter language model trained through globally distributed reinforcement learning (RL) run on permissionless, community-contributed GPU resources.
10
 
11
+ The model was trained using [prime-rl], a framework designed for distributed asynchronous RL, using GRPO over verifiable rewards along with modifications for improved training stability.
12
+
13
+ For detailed information, see our [technical report](link).
14
 
15
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/0NFEBL9eAObkU4IQ_hAo0.png)
16