Create README.md
README.md
ADDED
@@ -0,0 +1,12 @@
# MergeMonster
An unsupervised merging algorithm for Transformers-based language models. It uses a list of phrases (both good and bad) and a fully automated strategy that strives to decrease (or increase) the probability of these phrases occurring in the final merge.

[Code available here](https://github.com/Gryphe/MergeMonster/tree/main)

A place to store various promising quantizations of models produced by MergeMonster.
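
The phrase-driven strategy described above can be illustrated with a small toy sketch. This is not MergeMonster's actual implementation (see the linked code for that); it assumes a hypothetical "model" that is just one logit per phrase, a hypothetical objective (probability of bad phrases minus probability of good phrases), and a simple greedy search over blend ratios. All function names, phrases, and numbers are made up for illustration.

```python
import math


def phrase_probs(logits):
    """Softmax over per-phrase logits: a toy stand-in for a language model."""
    peak = max(logits.values())
    exps = {phrase: math.exp(value - peak) for phrase, value in logits.items()}
    total = sum(exps.values())
    return {phrase: e / total for phrase, e in exps.items()}


def objective(logits, bad, good):
    """Lower is better: probability mass on bad phrases minus mass on good ones."""
    probs = phrase_probs(logits)
    return sum(probs[p] for p in bad) - sum(probs[p] for p in good)


def greedy_merge(base, donor, bad, good, ratios=(0.25, 0.5, 0.75, 1.0)):
    """Blend one parameter at a time toward the donor, keeping only improvements."""
    merged = dict(base)
    best = objective(merged, bad, good)
    for key in base:
        for ratio in ratios:
            candidate = dict(merged)
            candidate[key] = (1 - ratio) * base[key] + ratio * donor[key]
            score = objective(candidate, bad, good)
            if score < best:  # accept the blend only if the objective drops
                merged, best = candidate, score
    return merged, best


# Hypothetical toy parameters: one logit per phrase for each "model".
base = {"overused phrase A": 2.0, "overused phrase B": 0.5, "fresh phrase": 0.0}
donor = {"overused phrase A": 0.5, "overused phrase B": 2.0, "fresh phrase": 1.5}
bad = ["overused phrase A", "overused phrase B"]
good = ["fresh phrase"]

start_score = objective(base, bad, good)
merged, final_score = greedy_merge(base, donor, bad, good)
print(merged)
print(start_score, final_score)
```

In this toy run the donor's values are kept for "overused phrase A" and "fresh phrase", while its higher logit for "overused phrase B" is rejected because accepting it would raise the objective. A real merge would score phrases with the candidate language model itself rather than with a lookup table of logits.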

## Current variations

- **MergeMonster-7b** - First model: Mistral 0.1, with the objective of reducing overused GPT-style roleplaying phrases.
- **MergeMonster-13b** - First model: MythoMax, with the objective of reducing overused GPT-style roleplaying phrases.
- **MergeMonster-Decensored-7b** - First model: Mistral 0.1, with the objective of reducing refusals on a global level.