metadata
license: apache-2.0
datasets:
- TIGER-Lab/rStar-Critique-Data
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen3-4B
tags:
- code
Model
We release the 4B model trained with Critique-Coder.
Data
Data Construction Pipeline is shown:
Paper
Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning
Project Page
https://tiger-ai-lab.github.io/Critique-Coder
Code
https://github.com/TIGER-AI-Lab/Critique-Coder
Sample Usage
You can download this dataset using the Hugging Face CLI:
hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo dataset
Citation
@article{ruan2025critiquecoder,
title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning},
author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu},
journal={ArXiv},
year={2025},
volume={2509.22824}
}
