Safetensors
English
gemma3
Committed by Hyperakan
Commit fe1add5 · verified · 1 Parent(s): 9f9f700

Create README.md

Files changed (1)
  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
---
license: apache-2.0
datasets:
- prometheus-eval/Preference-Collection
language:
- en
base_model:
- unsloth/gemma-3-4b-it
---

## 📘 Model Summary

`merged_prometheus_gemma3` is a fine-tuned **preference evaluation model** based on **`unsloth/gemma-3-4b-it`**, trained with the **Unsloth framework** on the **[`prometheus-eval/Preference-Collection`](https://huggingface.co/datasets/prometheus-eval/Preference-Collection)** dataset.
It is designed for **pairwise preference comparison** (judging which of two candidate responses is better) and **alignment evaluation**, following the approach of the **Prometheus** framework (*Kim et al., 2023*).
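
As an illustration of what pairwise preference comparison involves, the sketch below builds a judge prompt for two candidate responses and extracts an A/B verdict from the judge's reply. The prompt wording, the `[RESULT]` marker, and the helper names `build_pairwise_prompt` and `parse_verdict` are assumptions made for this example. This card does not document the exact template used in training, so treat the sketch as a starting point rather than the model's official format.

```python
import re
from typing import Optional


def build_pairwise_prompt(instruction: str, response_a: str,
                          response_b: str, rubric: str) -> str:
    """Assemble a pairwise judging prompt (hypothetical template)."""
    return (
        "You are a fair judge. Compare the two responses to the instruction "
        "using the rubric, write brief feedback, and end with "
        "'[RESULT] A' or '[RESULT] B'.\n\n"
        f"### Instruction:\n{instruction}\n\n"
        f"### Response A:\n{response_a}\n\n"
        f"### Response B:\n{response_b}\n\n"
        f"### Rubric:\n{rubric}\n\n"
        "### Feedback:"
    )


def parse_verdict(judge_output: str) -> Optional[str]:
    """Return 'A' or 'B' if the assumed '[RESULT]' marker is present, else None."""
    match = re.search(r"\[RESULT\]\s*\(?([AB])\)?", judge_output)
    return match.group(1) if match else None


prompt = build_pairwise_prompt(
    instruction="Explain what a hash map is.",
    response_a="A hash map stores key-value pairs using a hash function.",
    response_b="It is a kind of map.",
    rubric="Is the explanation accurate and sufficiently detailed?",
)
verdict = parse_verdict("Response A is more complete. [RESULT] A")
```

The prompt string would be sent to the model through whatever inference stack you use, and the generated reply passed to `parse_verdict`. Returning `None` for replies without a verdict lets the caller decide whether to retry or skip the example.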

---

## 🧮 Performance Benchmark

| Model | Benchmark | Pairwise Accuracy (%) |
|:------|:-----------|:-----------------------:|
| 🟦 **merged_prometheus_gemma3** | Preference Bench | **95.6** |
| 🟨 **Prometheus 2 (8×7B)** *(Kim et al., 2024)* | Preference Bench | 90.65 |

**Highlights:**
- Outperforms **Prometheus 2 (8×7B)** by **4.95 percentage points** (95.6 vs. 90.65) on Preference Bench, despite building on a much **smaller** 4B-parameter base model.
- Optimized for **efficiency, alignment scoring, and feedback consistency**.
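
To make the numbers above concrete, here is a minimal sketch of how pairwise accuracy and the gap between the two reported scores could be computed. The helper names and the toy labels are hypothetical and are not taken from the actual evaluation script; only the two scores (95.6 and 90.65) come from the table.

```python
from typing import Sequence


def pairwise_accuracy(predictions: Sequence[str], gold: Sequence[str]) -> float:
    """Percentage of comparisons where the predicted winner matches the gold winner."""
    if len(predictions) != len(gold):
        raise ValueError("predictions and gold must have the same length")
    if not gold:
        raise ValueError("at least one comparison is required")
    correct = sum(p == g for p, g in zip(predictions, gold))
    return 100.0 * correct / len(gold)


def gap_in_points(score_a: float, score_b: float) -> float:
    """Difference between two accuracies, in percentage points."""
    return round(score_a - score_b, 2)


# Toy labels (hypothetical): each entry names the preferred response, "A" or "B".
toy_predictions = ["A", "B", "B", "A"]
toy_gold = ["A", "B", "A", "A"]
toy_accuracy = pairwise_accuracy(toy_predictions, toy_gold)

# Scores reported in the table above.
reported_gap = gap_in_points(95.6, 90.65)
```

The gap is expressed in percentage points, an absolute difference between two accuracies, rather than as a relative percentage improvement.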

---

## 🧾 License

This model is released under the **Apache 2.0 License**.
However, because it is derived from **Google’s Gemma 3**, your use of this model must also comply with the **[Gemma Terms of Use](https://ai.google.dev/gemma/terms)**.

By using this model, you agree to:
- Follow Google’s **Gemma Terms of Use**, including its restrictions on misuse and redistribution.
- Attribute Google as the original provider of the Gemma 3 base model.

For full details, see: [https://ai.google.dev/gemma/terms](https://ai.google.dev/gemma/terms)

---