Update README.md
Browse files
README.md
CHANGED
|
@@ -12,7 +12,7 @@ datasets:
|
|
| 12 |
<img alt="OLMo Logo" src="https://cdn-uploads.huggingface.co/production/uploads/65316953791d5a2611426c20/nC44-uxMD6J6H3OHxRtVU.png" width="242px" style="margin-left:'auto' margin-right:'auto' display:'block'">
|
| 13 |
|
| 14 |
|
| 15 |
-
# Model Card for Olmo 3 RL-Zero Code
|
| 16 |
|
| 17 |
We introduce Olmo 3, a new family of 7B and 32B models both Instruct and Think variants. Long chain-of-thought thinking improves reasoning tasks like math and coding.
|
| 18 |
|
|
@@ -26,10 +26,10 @@ For the other Olmo 3 RL-Zero models see:
|
|
| 26 |
| **Domain** | **Model** | **RLVR Dataset**
|
| 27 |
|--------------------------|---------------|---------------|
|
| 28 |
| **Base Model** | [Olmo-3-7B](https://huggingface.co/allenai/Olmo-3-1025-7B) |
|
| 29 |
-
| **Math** | [Olmo-3-RLZero-Math
|
| 30 |
-
| **Code** | [Olmo-3-RLZero-Code
|
| 31 |
-
| **IF** | [Olmo-3-RLZero-IF
|
| 32 |
-
| **Mix** | [Olmo-3-RLZero-Mix
|
| 33 |
|
| 34 |
For the core Olmo 3 models see:
|
| 35 |
|
|
|
|
| 12 |
<img alt="OLMo Logo" src="https://cdn-uploads.huggingface.co/production/uploads/65316953791d5a2611426c20/nC44-uxMD6J6H3OHxRtVU.png" width="242px" style="margin-left:'auto' margin-right:'auto' display:'block'">
|
| 13 |
|
| 14 |
|
| 15 |
+
# Model Card for Olmo 3 7B RL-Zero Code
|
| 16 |
|
| 17 |
We introduce Olmo 3, a new family of 7B and 32B models both Instruct and Think variants. Long chain-of-thought thinking improves reasoning tasks like math and coding.
|
| 18 |
|
|
|
|
| 26 |
| **Domain** | **Model** | **RLVR Dataset**
|
| 27 |
|--------------------------|---------------|---------------|
|
| 28 |
| **Base Model** | [Olmo-3-7B](https://huggingface.co/allenai/Olmo-3-1025-7B) |
|
| 29 |
+
| **Math** | [Olmo-3-7B-RLZero-Math](https://huggingface.co/allenai/Olmo-3-7B-RLZero-Math/) | [Dolci-RLZero-Math-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-Math-7B)
|
| 30 |
+
| **Code** | [Olmo-3-7B-RLZero-Code](https://huggingface.co/allenai/Olmo-3-7B-RLZero-Code/) | [Dolci-RLZero-Code-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-Code-7B)
|
| 31 |
+
| **IF** | [Olmo-3-7B-RLZero-IF](https://huggingface.co/allenai/Olmo-3-7B-RLZero-IF/) | [Dolci-RLZero-IF-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-IF-7B)
|
| 32 |
+
| **Mix** | [Olmo-3-7B-RLZero-Mix](https://huggingface.co/allenai/Olmo-3-7B-RLZero-Mix/) | [Dolci-RLZero-Mix-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-Mix-7B)
|
| 33 |
|
| 34 |
For the core Olmo 3 models see:
|
| 35 |
|