havok2
/

Kartoffelbox-v0.1_0.65h2

Model card Files Files and versions

havok2 commited on Jul 2

Commit

e3af40a

·

verified ·

1 Parent(s): db954d1

Update README.md

Files changed (1) hide show

README.md +56 -7

README.md CHANGED Viewed

@@ -1,8 +1,57 @@
 ---
-license: unknown
-language:
-- en
-- de
-base_model:
-- SebastianBodza/Kartoffelbox-v0.1
----

 ---
+license: apache-2.0
+language: de
+library_name: transformers
+tags:
+- text-to-speech
+- tts
+- german
+- chatterbox
+- voice-cloning
+- zero-shot
+- merged-model
+---
+# Kartoffelbox-v0.1_0.65h2: A Merged German Chatterbox-TTS Model
+## Model Description
+This repository contains an experimental German Text-to-Speech model created by merging two fine-tuned [Chatterbox-TTS](https://github.com/anotherjesse/Chatterbox-TTS) models.
+This model is a **hybrid** that combines the characteristics of the well-known German `Kartoffelbox` model with a custom model trained on a specific [male/female] voice. The goal was to fuse the natural pronunciation of "Kartoffelbox" with the unique vocal identity and robustness of an extensively trained custom model.
+The final model is the result of a weighted-sum merge, using **65% of the weights** from the custom-trained model and 35% from "Kartoffelbox".
+**Key Features:**
+- **Language:** German
+- **Type:** Hybrid, Merged Model
+- **Capabilities:** High-quality speech synthesis and Zero-Shot Voice Cloning.
+- **Vocal Characteristics:** [Describe what you hear here. E.g., A clear, male voice with a very natural German intonation, sounding less robotic than many standard models.]
+## Intended Use and Limitations
+#### Intended Use
+This model is intended for the following use cases:
+- Generating natural-sounding German speech.
+- Experiments in the field of model merging for TTS.
+- As a foundation for further fine-tuning on other German voices.
+#### Ethical Considerations and Limitations
+(This section can remain as is)
+Speech synthesis and voice cloning technologies carry risks of misuse. This model should **not** be used to:
+- Clone the voices of individuals without their explicit consent.
+- Create misinformation or "deepfakes".
+- Impersonate, deceive, or harass individuals.
+The user of this model is responsible for its ethical use and for complying with all applicable laws.
+## How to Use the Model
+Usage is identical to other Chatterbox models. You will need the `chatterbox-tts` library.
+**1. Installation**
+```bash
+# Clone the Chatterbox repository and install its dependencies
+git clone https://github.com/anotherjesse/Chatterbox-TTS.git
+cd Chatterbox-TTS
+pip install -e .