⚠️This model isn't just a quantized model of unsloth/Qwen3-4B-Base

冬休みの自由研究としてUnslothのGRPOを使ってトレーニングしたQwen3-4B-Baseモデル。数学推論に特化させた…つもりなだけで実際はあんまりうまく動作しない。

Downloads last month
56
GGUF
Model size
4B params
Architecture
qwen3
Hardware compatibility
Log In to view the estimation

4-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 1 Ask for provider support

Model tree for PixN/MY_FIRST_RL

Base model

Qwen/Qwen3-4B-Base
Quantized
(12)
this model