Update README.md
Browse files
README.md
CHANGED
|
@@ -6,6 +6,8 @@ language:
|
|
| 6 |
- en
|
| 7 |
pipeline_tag: text-generation
|
| 8 |
---
|
|
|
|
|
|
|
| 9 |
Self-Play Preference Optimization for Language Model Alignment (https://arxiv.org/abs/2405.00675)
|
| 10 |
|
| 11 |
# Llama-3-Instruct-8B-SPPO-Iter3
|
|
|
|
| 6 |
- en
|
| 7 |
pipeline_tag: text-generation
|
| 8 |
---
|
| 9 |
+
Quantized to exl2 using [Exllamav2 0.1.6](https://github.com/turboderp/exllamav2)
|
| 10 |
+
|
| 11 |
Self-Play Preference Optimization for Language Model Alignment (https://arxiv.org/abs/2405.00675)
|
| 12 |
|
| 13 |
# Llama-3-Instruct-8B-SPPO-Iter3
|