Update README.md
Browse files
README.md
CHANGED
|
@@ -17,8 +17,8 @@ We propose **GenPRM**, a strong generative process reward model with the followi
|
|
| 17 |
|
| 18 |
GenPRM achieves state-of-the-art performance across multiple benchmarks in two key roles:
|
| 19 |
|
| 20 |
-
- As a verifier
|
| 21 |
-
- As a critic
|
| 22 |
|
| 23 |

|
| 24 |
|
|
|
|
| 17 |
|
| 18 |
GenPRM achieves state-of-the-art performance across multiple benchmarks in two key roles:
|
| 19 |
|
| 20 |
+
- **As a verifier**: GenPRM-7B outperforms all classification-based PRMs of comparable size and even surpasses **Qwen2.5-Math-PRM-72B** via test-time scaling.
|
| 21 |
+
- **As a critic**: GenPRM-7B demonstrates superior critique capabilities, achieving **3.4×** greater performance gains than DeepSeekR1-Distill-Qwen-7B after 3 refinement iterations.
|
| 22 |
|
| 23 |

|
| 24 |
|