beyoru committed
Commit 0924888 · verified · 1 Parent(s): 469ec4d

Update README.md

Files changed (1)
  1. README.md +4 -1
README.md CHANGED
@@ -9,4 +9,7 @@ language:
  ---
 
  ## Model details
- Make a model learn from doing an examiner...
+ This model is fine-tuned from Qwen3-4B-Instruct using a custom reinforcement learning (RL) framework that rewards the model for producing solutions passing automated test cases — similar to the process of programming task evaluation on LeetCode.
+
+ Instead of relying on labeled ground truth answers, the model learns through test-case-based rewards, promoting generalization and reasoning ability in algorithmic problem-solving.
+
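
The added README lines describe a reward based on passing automated test cases rather than on labeled answers. The commit does not show how that reward is computed, so the following is only a minimal sketch under stated assumptions: the function name `reward_from_tests`, the `solve` entry point, the `(args, expected)` test format, and the "fraction of tests passed" scoring are all hypothetical and not taken from the author's framework.

```python
from typing import Any, Sequence, Tuple

# Hypothetical test format: (positional arguments, expected return value).
TestCase = Tuple[tuple, Any]


def reward_from_tests(
    candidate_source: str,
    tests: Sequence[TestCase],
    entry_point: str = "solve",  # assumed name of the function the model must define
) -> float:
    """Return the fraction of test cases the candidate passes (0.0 to 1.0)."""
    namespace: dict = {}
    try:
        # NOTE: exec() runs untrusted code with no sandboxing; a real training
        # setup would need process isolation and time limits.
        exec(candidate_source, namespace)
    except Exception:
        return 0.0  # code that does not even load earns no reward

    func = namespace.get(entry_point)
    if not callable(func) or not tests:
        return 0.0

    passed = 0
    for args, expected in tests:
        try:
            if func(*args) == expected:
                passed += 1
        except Exception:
            pass  # a runtime error counts as a failed test
    return passed / len(tests)


# Usage: score two candidate solutions for an "add two numbers" task.
tests = [((2, 3), 5), ((0, 0), 0), ((-1, 1), 0)]
good = "def solve(a, b):\n    return a + b\n"
bad = "def solve(a, b):\n    return a * b\n"
good_reward = reward_from_tests(good, tests)
bad_reward = reward_from_tests(bad, tests)
```

In this sketch no reference solution is needed: the score comes only from executing the candidate against the tests, which is the property the README text emphasizes. A partial-credit score is one possible design; an all-or-nothing reward (1.0 only if every test passes) would be an equally plausible reading of the description.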