---
license: apache-2.0
datasets:
- TIGER-Lab/rStar-Critique-Data
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen3-4B
tags:
- code
---

## Model
We release the 4B model trained with [Critique-Coder](https://github.com/TIGER-AI-Lab/Critique-Coder).

## Data
Data Construction Pipeline is shown:

![pipeline](https://github.com/TIGER-AI-Lab/Critique-Coder/blob/main/assets/images/dataset.png?raw=true)

## Paper
[Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning](https://huggingface.co/papers/2509.22824)

## Project Page
https://tiger-ai-lab.github.io/Critique-Coder

## Code
https://github.com/TIGER-AI-Lab/Critique-Coder

## Sample Usage

You can download this dataset using the Hugging Face CLI:

```bash
hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo dataset
```

## Citation
```
@article{ruan2025critiquecoder,
    title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning},
    author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu},
    journal={ArXiv},
    year={2025},
    volume={2509.22824}
}
```