beyoru
/

MinCoder-4B-Exp

Text Generation

text-generation-inference

Model card Files Files and versions

beyoru commited on Nov 1, 2025

Commit

0924888

·

verified ·

1 Parent(s): 469ec4d

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -9,4 +9,7 @@ language:
 ---
 ## Model details
-Make a model learn from doing an examiner...

 ---
 ## Model details
+This model is fine-tuned from Qwen3-4B-Instruct using a custom reinforcement learning (RL) framework that rewards the model for producing solutions passing automated test cases — similar to the process of programming task evaluation on LeetCode.
+Instead of relying on labeled ground truth answers, the model learns through test-case-based rewards, promoting generalization and reasoning ability in algorithmic problem-solving.