PrimeIntellect/Qwen3-0.6B-Reverse-Text-SFT

mikasenghaas committed on Sep 24, 2025

Commit 30c7b98 · verified · 1 parent: 641cf83

Update README.md

Files changed (1)

README.md +3 -19

@@ -1,29 +1,13 @@
 ---
-library_name: transformers
 license: apache-2.0
 datasets:
-- PrimeIntellect/Reverse-Text-SFT
 base_model:
 - PrimeIntellect/Qwen3-0.6B
 ---
-# Qwen3-0.6B-Reverse-Text-SFT
 <!-- Provide a quick summary of what the model is/does. -->
-A debug model fine-tuned on `willcb/R1-reverse-wikipedia-paragraphs-v1-1000`. To be used as warmed up model to RL in `vf-reverse-text`.
-Created with this training command from [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) (commit hash: `8262560`)
-```bash
-uv run torchrun --nproc-per-node 8 src/prime_rl/trainer/sft/train.py \
-  --model.name PrimeIntellect/Qwen3-0.6B \
-  --data.name willcb/R1-reverse-wikipedia-paragraphs-v1-1000 \
-  --max-steps 100 \
-  --data.batch-size 16 \
-  --data.micro-batch-size 1 \
-  --data.seq-len 4096 \
-  --optim.lr 2e-5
-```
-Check the run out on [W&B](https://wandb.ai/primeintellect/mika/runs/odsfiekx?nw=nwusermikasenghaas_).

 ---
 license: apache-2.0
 datasets:
+- willcb/V3-wordle
 base_model:
 - PrimeIntellect/Qwen3-0.6B
 ---
+# Qwen-0.6B-Reverse-Text-SFT
 <!-- Provide a quick summary of what the model is/does. -->
+A SFT fine-tune of `PrimeIntellect/Qwen-0.6B`. Details [here](https://github.com/PrimeIntellect-ai/prime-rl/tree/main/examples/reverse_text).
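
The training command removed in this diff fixes the step count, batch size, and sequence length, so the size of the run can be estimated with a little arithmetic. The sketch below is only an illustration: it assumes `--data.batch-size` is a global batch size shared across all 8 processes and that every sample occupies the full `--data.seq-len`, neither of which is confirmed by the README. The helper names are hypothetical and are not part of prime-rl.

```python
# Values copied from the training command removed in the diff above.
MAX_STEPS = 100        # --max-steps
BATCH_SIZE = 16        # --data.batch-size (ASSUMED to be global, not per process)
MICRO_BATCH_SIZE = 1   # --data.micro-batch-size
SEQ_LEN = 4096         # --data.seq-len
NUM_PROCS = 8          # --nproc-per-node


def tokens_per_step(batch_size: int, seq_len: int) -> int:
    """Upper bound on tokens per optimizer step (ASSUMES full-length samples)."""
    return batch_size * seq_len


def grad_accum_steps(batch_size: int, micro_batch_size: int, num_procs: int) -> int:
    """Micro-batches each process runs per optimizer step under the global-batch assumption."""
    samples_per_pass = micro_batch_size * num_procs
    if batch_size % samples_per_pass != 0:
        raise ValueError("batch size must be divisible by micro_batch_size * num_procs")
    return batch_size // samples_per_pass


if __name__ == "__main__":
    per_step = tokens_per_step(BATCH_SIZE, SEQ_LEN)
    print("tokens per step (upper bound):", per_step)
    print("tokens over the run (upper bound):", per_step * MAX_STEPS)
    print("micro-batches per process per step:",
          grad_accum_steps(BATCH_SIZE, MICRO_BATCH_SIZE, NUM_PROCS))
```

Under these assumptions the run sees at most 65,536 tokens per step and about 6.55 million tokens over 100 steps. If the batch size were instead per process, both figures would scale by the number of processes.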