Update README.md
Browse files
README.md
CHANGED
|
@@ -10,7 +10,7 @@ base_model:
|
|
| 10 |
|
| 11 |
## π Model Summary
|
| 12 |
|
| 13 |
-
|
| 14 |
It is designed to perform **pairwise preference comparison** and **alignment evaluation** tasks, inspired by the **Prometheus** framework (*Kim et al., 2023*).
|
| 15 |
|
| 16 |
---
|
|
@@ -19,7 +19,7 @@ It is designed to perform **pairwise preference comparison** and **alignment eva
|
|
| 19 |
|
| 20 |
| Model | Benchmark | Accuracy (%) (Pairwise) |
|
| 21 |
|:------|:-----------|:-----------------------:|
|
| 22 |
-
| π¦ **
|
| 23 |
| π¨ **Prometheus 2 (8Γ7B)** *(Kim et al., 2024)* | Preference Bench | 90.65 |
|
| 24 |
|
| 25 |
**Highlights:**
|
|
|
|
| 10 |
|
| 11 |
## π Model Summary
|
| 12 |
|
| 13 |
+
This model is a fine-tuned **preference evaluation model** based on **`unsloth/gemma-3-4b-it`**, trained on the **[`prometheus-eval/Preference-Collection`](https://huggingface.co/datasets/prometheus-eval/Preference-Collection)** dataset.
|
| 14 |
It is designed to perform **pairwise preference comparison** and **alignment evaluation** tasks, inspired by the **Prometheus** framework (*Kim et al., 2023*).
|
| 15 |
|
| 16 |
---
|
|
|
|
| 19 |
|
| 20 |
| Model | Benchmark | Accuracy (%) (Pairwise) |
|
| 21 |
|:------|:-----------|:-----------------------:|
|
| 22 |
+
| π¦ **This model** | Preference Bench | **95.6** |
|
| 23 |
| π¨ **Prometheus 2 (8Γ7B)** *(Kim et al., 2024)* | Preference Bench | 90.65 |
|
| 24 |
|
| 25 |
**Highlights:**
|