--- license: apache-2.0 datasets: - TIGER-Lab/rStar-Critique-Data language: - en metrics: - accuracy base_model: - Qwen/Qwen3-4B tags: - code --- ## Model We release the 4B model trained with [Critique-Coder](https://github.com/TIGER-AI-Lab/Critique-Coder). ## Data Data Construction Pipeline is shown: ![pipeline](https://github.com/TIGER-AI-Lab/Critique-Coder/blob/main/assets/images/dataset.png?raw=true) ## Paper [Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning](https://huggingface.co/papers/2509.22824) ## Project Page https://tiger-ai-lab.github.io/Critique-Coder ## Code https://github.com/TIGER-AI-Lab/Critique-Coder ## Sample Usage You can download this dataset using the Hugging Face CLI: ```bash hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo dataset ``` ## Citation ``` @article{ruan2025critiquecoder, title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning}, author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu}, journal={ArXiv}, year={2025}, volume={2509.22824} } ```