allenai
/

Olmo-3-7B-RL-Zero-Code

Text Generation

Model card Files Files and versions

mnoukhov commited on 17 days ago

Commit

4e0f769

·

verified ·

1 Parent(s): e181e6d

Update README.md

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ datasets:
 <img alt="OLMo Logo" src="https://cdn-uploads.huggingface.co/production/uploads/65316953791d5a2611426c20/nC44-uxMD6J6H3OHxRtVU.png" width="242px" style="margin-left:'auto' margin-right:'auto' display:'block'">
-# Model Card for Olmo 3 RL-Zero Code
 We introduce Olmo 3, a new family of 7B and 32B models both Instruct and Think variants. Long chain-of-thought thinking improves reasoning tasks like math and coding.
@@ -26,10 +26,10 @@ For the other Olmo 3 RL-Zero models see:
 | **Domain**               | **Model**  | **RLVR Dataset**
 |--------------------------|---------------|---------------|
 | **Base Model**           | [Olmo-3-7B](https://huggingface.co/allenai/Olmo-3-1025-7B) |
-| **Math**                 | [Olmo-3-RLZero-Math-7B](https://huggingface.co/allenai/Olmo-3-7B-RLZero-Math/) | [Dolci-RLZero-Math-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-Math-7B)
-| **Code**                 | [Olmo-3-RLZero-Code-7B](https://huggingface.co/allenai/Olmo-3-7B-RLZero-Code/) | [Dolci-RLZero-Code-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-Code-7B)
-| **IF**                   | [Olmo-3-RLZero-IF-7B](https://huggingface.co/allenai/Olmo-3-7B-RLZero-IF/) | [Dolci-RLZero-IF-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-IF-7B)
-| **Mix**                  | [Olmo-3-RLZero-Mix-7B](https://huggingface.co/allenai/Olmo-3-7B-RLZero-Mix/) | [Dolci-RLZero-Mix-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-Mix-7B)
 For the core Olmo 3 models see:

 <img alt="OLMo Logo" src="https://cdn-uploads.huggingface.co/production/uploads/65316953791d5a2611426c20/nC44-uxMD6J6H3OHxRtVU.png" width="242px" style="margin-left:'auto' margin-right:'auto' display:'block'">
+# Model Card for Olmo 3 7B RL-Zero Code
 We introduce Olmo 3, a new family of 7B and 32B models both Instruct and Think variants. Long chain-of-thought thinking improves reasoning tasks like math and coding.
 | **Domain**               | **Model**  | **RLVR Dataset**
 |--------------------------|---------------|---------------|
 | **Base Model**           | [Olmo-3-7B](https://huggingface.co/allenai/Olmo-3-1025-7B) |
+| **Math**                 | [Olmo-3-7B-RLZero-Math](https://huggingface.co/allenai/Olmo-3-7B-RLZero-Math/) | [Dolci-RLZero-Math-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-Math-7B)
+| **Code**                 | [Olmo-3-7B-RLZero-Code](https://huggingface.co/allenai/Olmo-3-7B-RLZero-Code/) | [Dolci-RLZero-Code-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-Code-7B)
+| **IF**                   | [Olmo-3-7B-RLZero-IF](https://huggingface.co/allenai/Olmo-3-7B-RLZero-IF/) | [Dolci-RLZero-IF-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-IF-7B)
+| **Mix**                  | [Olmo-3-7B-RLZero-Mix](https://huggingface.co/allenai/Olmo-3-7B-RLZero-Mix/) | [Dolci-RLZero-Mix-7B](https://huggingface.co/datasets/allenai/Dolci-RLZero-Mix-7B)
 For the core Olmo 3 models see: