altaidevorg/gemma-judge-preferences-v0.1


Hyperakan committed on Oct 24

Commit fe1add5 · verified · 1 Parent(s): 9f9f700

Create README.md

Files changed (1)

README.md +42 -0

README.md ADDED

	@@ -0,0 +1,42 @@

+---
+license: apache-2.0
+datasets:
+- prometheus-eval/Preference-Collection
+language:
+- en
+base_model:
+- unsloth/gemma-3-4b-it
+---
+## 📘 Model Summary
+`merged_prometheus_gemma3` is a fine-tuned **preference evaluation model** based on **`unsloth/gemma-3-4b-it`**, trained with the **Unsloth framework** on the **[`prometheus-eval/Preference-Collection`](https://huggingface.co/datasets/prometheus-eval/Preference-Collection)** dataset.
+It is designed for **pairwise preference comparison** (judging which of two candidate responses better satisfies an instruction) and **alignment evaluation**, following the approach of the **Prometheus** framework (*Kim et al., 2023*).
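As a rough illustration of pairwise judging, the sketch below builds a comparison prompt and extracts the verdict from the judge's text output. The prompt wording, the `build_pairwise_prompt` and `parse_verdict` helpers, and the trailing `[RESULT] A` / `[RESULT] B` marker are assumptions modelled on the Prometheus relative-grading style; this model card does not specify the exact template the model expects.

```python
import re
from typing import Optional

# Assumption: the judge closes its feedback with "[RESULT] A" or "[RESULT] B".
# This marker is not documented in the model card and may need adjusting.
_VERDICT = re.compile(r"\[RESULT\]\s*([AB])\b")


def build_pairwise_prompt(instruction: str, response_a: str,
                          response_b: str, rubric: str) -> str:
    """Assemble a hypothetical pairwise-comparison prompt for a judge model."""
    return (
        "###Instruction:\n" + instruction + "\n\n"
        "###Response A:\n" + response_a + "\n\n"
        "###Response B:\n" + response_b + "\n\n"
        "###Score Rubric:\n" + rubric + "\n\n"
        "Write feedback comparing the two responses, then finish with "
        "'[RESULT] A' or '[RESULT] B'."
    )


def parse_verdict(judge_output: str) -> Optional[str]:
    """Return 'A' or 'B' from the last verdict marker, or None if absent."""
    matches = _VERDICT.findall(judge_output)
    return matches[-1] if matches else None


prompt = build_pairwise_prompt(
    instruction="Explain what a hash map is.",
    response_a="A hash map stores key-value pairs using a hash function.",
    response_b="It is a kind of map.",
    rubric="Is the explanation accurate and sufficiently detailed?",
)
verdict = parse_verdict("Feedback: Response A is more complete. [RESULT] A")
```

Returning `None` for unparseable output lets the caller decide whether to count such cases as errors or to retry, rather than silently defaulting to one side.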
+---
+## 🧮 Performance Benchmark
+| Model | Benchmark | Pairwise accuracy (%) |
+|:------|:-----------|:-----------------------:|
+| 🟦 **merged_prometheus_gemma3** | Preference Bench | **95.6** |
+| 🟨 **Prometheus 2 (8×7B)** *(Kim et al., 2024)* | Preference Bench | 90.65 |
+
+**Highlights:**
+- Outperforms **Prometheus 2 (8×7B)** by **4.95 percentage points** on Preference Bench (95.6% vs. 90.65%), while being **smaller** (built on a 4B base model).
+- Optimized for **efficiency, alignment scoring, and feedback consistency**.
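To make the comparison concrete, here is a minimal sketch of how pairwise accuracy and the gap between the two reported scores can be computed. The helper names and the toy labels are hypothetical; only the 95.6 and 90.65 figures come from the table above, and the actual Preference Bench evaluation script may differ.

```python
from typing import Sequence, Tuple


def pairwise_accuracy(predicted: Sequence[str], gold: Sequence[str]) -> float:
    """Percentage of pairs where the judge's pick ('A' or 'B') matches the gold label."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        raise ValueError("at least one labelled pair is required")
    correct = sum(p == g for p, g in zip(predicted, gold))
    return 100.0 * correct / len(gold)


def accuracy_gap(ours: float, baseline: float) -> Tuple[float, float]:
    """Return (absolute gap in percentage points, relative gain in percent)."""
    absolute = ours - baseline
    relative = 100.0 * absolute / baseline
    return absolute, relative


# Toy labels for illustration only, not benchmark data.
toy_accuracy = pairwise_accuracy(["A", "B", "A", "B"], ["A", "B", "B", "B"])

# Scores reported in the table above.
abs_gap, rel_gap = accuracy_gap(95.6, 90.65)
print(f"gap: {abs_gap:.2f} percentage points ({rel_gap:.2f}% relative)")
```

The absolute difference is the figure quoted in the highlights; the relative gain is a different quantity, which is why the gap is best stated in percentage points.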
+---
+## 🧾 License
+This model is released under the **Apache 2.0 License**.
+However, because it is derived from **Google’s Gemma 3**, your use of this model must also comply with the **[Gemma Terms of Use](https://ai.google.dev/gemma/terms)**.
+By using this model, you agree to:
+- Follow Google’s **Gemma Model Terms of Use**, including restrictions on misuse and redistribution.
+- Attribute Google as the original provider of the Gemma 3 base model.
+For full details, see: [https://ai.google.dev/gemma/terms](https://ai.google.dev/gemma/terms)
+---