ypwang61 commited on
Commit
46abcb1
·
verified ·
1 Parent(s): 80c902d

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -0
README.md ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ base_model:
6
+ - Qwen/Qwen2.5-Math-1.5B
7
+ datasets:
8
+ - ypwang61/One-Shot-RLVR-Datasets
9
+ ---
10
+
11
+ This repository contains the model presented in [Reinforcement Learning for Reasoning in Large Language Models with One Training Example](https://huggingface.co/papers/2504.20571).
12
+
13
+ Code: https://github.com/ypwang61/One-Shot-RLVR