Gemma Judge
Collection
This is a collection of compact LLM-as-a-judge models fine-tuned from Gemma 3 4B.
This model is a fine-tuned version of unsloth/gemma-3-4b-it,
trained on the Feedback-Collection dataset from Prometheus-Eval.
Fine-tuning framework: fine-tuned with Unsloth using LoRA adapters.
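
As background on what a LoRA adapter does, here is a minimal pure-Python sketch of the standard LoRA weight update, `W' = W + (alpha / r) * B @ A`. The helper names (`matmul`, `lora_effective_weight`) and the toy matrices are illustrative assumptions; this card does not state the rank, alpha, or target modules actually used, and this is not the Unsloth API.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A for a frozen weight W.

    w: d_out x d_in frozen base weight
    a: r x d_in low-rank factor (trained)
    b: d_out x r low-rank factor (trained)
    alpha: scaling hyperparameter (value here is illustrative)
    """
    r = len(a)  # adapter rank is the number of rows of A
    scale = alpha / r
    delta = matmul(b, a)  # d_out x d_in update with rank at most r
    return [
        [w[i][j] + scale * delta[i][j] for j in range(len(w[0]))]
        for i in range(len(w))
    ]


# Toy example: a 2x2 base weight with a rank-1 adapter.
base = [[1.0, 0.0], [0.0, 1.0]]
factor_a = [[1.0, 2.0]]      # 1 x 2
factor_b = [[1.0], [0.0]]    # 2 x 1
merged = lora_effective_weight(base, factor_a, factor_b, alpha=2.0)
```

Only the two small factors are trained while the base weight stays frozen, which is why adapters are cheap to train and store relative to full fine-tuning.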
| Model | Benchmark | Pearson r | Spearman ρ |
|---|---|---|---|
| 🟩 This model | Feedback Bench | 0.9198 | 0.9210 |
| 🟨 Prometheus 2 (8×7B) (Kim et al., 2024) | Feedback Bench / Preference Bench | ≈ 0.898 / – | ≈ 0.90 / – |
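
The Pearson r and Spearman ρ columns measure agreement between the judge's scores and reference scores. The sketch below shows how such correlations can be computed in plain Python; the function names and the sample scores are made up for illustration and are not taken from Feedback Bench or from this model's evaluation script.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of items tied with the item at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the tie-averaged ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical 1-5 rubric scores, for illustration only.
judge_scores = [5, 3, 4, 1, 2, 4]
reference_scores = [5, 2, 4, 1, 3, 5]
r = pearson(judge_scores, reference_scores)
rho = spearman(judge_scores, reference_scores)
```

Because rubric scores are integers on a short scale, ties are common, so the rank step averages tied ranks instead of breaking ties arbitrarily.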
This model is released under the Apache 2.0 License.
However, because it is derived from Google’s Gemma 3, your use of this model must also comply with the Gemma Terms of Use.
By using this model, you agree to the Gemma Terms of Use.
For full details, see: https://ai.google.dev/gemma/terms