Safetensors
English
qwen3
code
wenhu commited on
Commit
5f45ee8
·
verified ·
1 Parent(s): 8d1b7ed

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -1
README.md CHANGED
@@ -12,4 +12,38 @@ tags:
12
  - code
13
  ---
14
 
15
- We release the 4B model trained with [Critique-Coder](https://github.com/TIGER-AI-Lab/Critique-Coder).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  - code
13
  ---
14
 
15
+ ## Model
16
+ We release the 4B model trained with [Critique-Coder](https://github.com/TIGER-AI-Lab/Critique-Coder).
17
+
18
+ ## Data
19
+ Data Construction Pipeline is shown:
20
+
21
+ ![pipeline](https://github.com/TIGER-AI-Lab/Critique-Coder/blob/main/assets/images/dataset.png?raw=true)
22
+
23
+ ## Paper
24
+ [Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning](https://huggingface.co/papers/2509.22824)
25
+
26
+ ## Project Page
27
+ https://tiger-ai-lab.github.io/Critique-Coder
28
+
29
+ ## Code
30
+ https://github.com/TIGER-AI-Lab/Critique-Coder
31
+
32
+ ## Sample Usage
33
+
34
+ You can download this dataset using the Hugging Face CLI:
35
+
36
+ ```bash
37
+ hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo dataset
38
+ ```
39
+
40
+ ## Citation
41
+ ```
42
+ @article{ruan2025critiquecoder,
43
+ title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning},
44
+ author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu},
45
+ journal={ArXiv},
46
+ year={2025},
47
+ volume={2509.22824}
48
+ }
49
+ ```