| base_model: | |
| - Qwen/Qwen3-4B-Base | |
| datasets: | |
| - RuleReasoner/rule-reasoning | |
| language: | |
| - en | |
| library_name: transformers | |
| license: mit | |
| metrics: | |
| - accuracy | |
| pipeline_tag: text-generation | |
| tags: | |
| - rule-based reasoning | |
| new_version: RuleReasoner/RuleReasoner-4B | |
| If you use the model in your research, please cite the original papers as below. | |
| ```latex | |
| @article{liu2025rulereasoner, | |
| title={RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling}, | |
| author={Yang Liu and Jiaqi Li and Zilong Zheng}, | |
| year={2025}, | |
| eprint={2506.08672}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.CL}, | |
| url={https://arxiv.org/abs/2506.08672}, | |
| } | |
| ``` | |
| Code: https://github.com/bigai-nlco/RuleReasoner |