Safetensors
English
qwen3
code
Critique-Coder-4B / README.md
wenhu's picture
Update README.md
5f45ee8 verified
metadata
license: apache-2.0
datasets:
  - TIGER-Lab/rStar-Critique-Data
language:
  - en
metrics:
  - accuracy
base_model:
  - Qwen/Qwen3-4B
tags:
  - code

Model

We release the 4B model trained with Critique-Coder.

Data

Data Construction Pipeline is shown:

pipeline

Paper

Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning

Project Page

https://tiger-ai-lab.github.io/Critique-Coder

Code

https://github.com/TIGER-AI-Lab/Critique-Coder

Sample Usage

You can download this dataset using the Hugging Face CLI:

hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo dataset

Citation

@article{ruan2025critiquecoder,
    title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning},
    author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu},
    journal={ArXiv},
    year={2025},
    volume={2509.22824}
}