Update README.md
Browse files
README.md
CHANGED
|
@@ -6,9 +6,11 @@ datasets:
|
|
| 6 |
|
| 7 |
# INTELLECT-2
|
| 8 |
|
| 9 |
-
INTELLECT-2 is a
|
| 10 |
|
| 11 |
-
|
|
|
|
|
|
|
| 12 |
|
| 13 |

|
| 14 |
|
|
|
|
| 6 |
|
| 7 |
# INTELLECT-2
|
| 8 |
|
| 9 |
+
INTELLECT-2 is a 32 billion parameter language model trained through globally distributed reinforcement learning (RL) run on permissionless, community-contributed GPU resources.
|
| 10 |
|
| 11 |
+
The model was trained using [prime-rl], a framework designed for distributed asynchronous RL, using GRPO over verifiable rewards along with modifications for improved training stability.
|
| 12 |
+
|
| 13 |
+
For detailed information, see our [technical report](link).
|
| 14 |
|
| 15 |

|
| 16 |
|