Akjava commited on
Commit
65c3e7f
·
verified ·
1 Parent(s): 932d1cb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +51 -45
README.md CHANGED
@@ -1,45 +1,51 @@
1
- ---
2
- license: mit
3
- ---
4
- This model is a 100-speaker multispeaker model in the Matcha-TTS format/architecture.(trained with Japanese)
5
-
6
- <div class="audio-container">
7
- <h4>家具商人のフィシェルは、荷車と仔馬を貸してくれた。</h4>
8
- <h5>spk10:A lower-pitched female voice with a strong core</h5>
9
- <audio controls src="https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group006/resolve/main/examples/qwen100_checkpoint_epoch=5874_ch10_kagu.wav"></audio>
10
- <h4>私はあなたのことが心配です</h4>
11
- <h5>spk99:A slightly quirky female voice that leaves a strong impression</h5>
12
- <audio controls src="https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group006/resolve/main/examples/qwen100_checkpoint_epoch=5874_ch99_watashi.wav"></audio>
13
- <h4>僕はいつか面白いゲームを作りたい</h4>
14
- <h5>spk26:AI-Game-Bu:SEAN</h5>
15
- <audio controls src="https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group006/resolve/main/examples/qwen100_checkpoint_epoch=5874_ch26_boku.wav"></audio>
16
- </div>
17
-
18
- This model is replaced 10 qwen-character to chatterbox(common voice) character.
19
-
20
- trained mel_mean/mel_std is difference than group005qw
21
- ## Qwen3-TTS and Chatterbox Multingual Mixed
22
- [https://huggingface.co/ResembleAI/chatterbox]
23
- I faild to confirm watermark because of technical probrom,but maybe chatterbox watermark is exist.
24
- If you don't like the watermark, use qwen3-tts only version
25
-
26
- - there are similar [qwen3-tts only](https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group005qw-100) version
27
-
28
- ## license
29
- This model license is under MIT
30
-
31
- My training data is created by Apache Licensed/mit model output.
32
- https://huggingface.co/Qwen/Qwen3-TTS-12Hz-1.7B-Base
33
-
34
- Matcha-TTS is MIT
35
- https://github.com/shivammehta25/Matcha-TTS
36
-
37
- ## Training
38
- need checkpoint from there
39
- https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group003f-CL-V2
40
-
41
- Use this.
42
- https://github.com/akjava/Matcha-TTS-Japanese
43
-
44
- ## Demo
45
- https://ai-game-bu.itch.io/ai-gaming-voice
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - Akjava/QWEN3-TTS-Voice-Clone-100-Japanese-Female-ITA-Corpus-Emotion
5
+ base_model:
6
+ - Akjava/matcha-tts_ja_100speakers_group005qw-100
7
+ ---
8
+ This model is a 100-speaker multispeaker model in the Matcha-TTS format/architecture.(trained with Japanese)
9
+
10
+ <div class="audio-container">
11
+ <h4>家具商人のフィシェルは、荷車と仔馬を貸してくれた。</h4>
12
+ <h5>spk10:A lower-pitched female voice with a strong core</h5>
13
+ <audio controls src="https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group006/resolve/main/examples/qwen100_checkpoint_epoch=5874_ch10_kagu.wav"></audio>
14
+ <h4>私はあなたのことが心配です</h4>
15
+ <h5>spk99:A slightly quirky female voice that leaves a strong impression</h5>
16
+ <audio controls src="https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group006/resolve/main/examples/qwen100_checkpoint_epoch=5874_ch99_watashi.wav"></audio>
17
+ <h4>僕はいつか面白いゲームを作りたい</h4>
18
+ <h5>spk26:AI-Game-Bu:SEAN</h5>
19
+ <audio controls src="https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group006/resolve/main/examples/qwen100_checkpoint_epoch=5874_ch26_boku.wav"></audio>
20
+ </div>
21
+
22
+ This model is replaced 10 qwen-character to chatterbox(common voice) character.
23
+
24
+ trained mel_mean/mel_std is difference than group005qw
25
+ ## Qwen3-TTS and Chatterbox Multingual Mixed
26
+ [Chatterbox](https://huggingface.co/ResembleAI/chatterbox)
27
+ I faild to confirm watermark because of technical probrom,but maybe chatterbox watermark is exist.
28
+ If you don't like the watermark, use qwen3-tts only version
29
+
30
+ - there are similar [qwen3-tts only](https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group005qw-100) version
31
+
32
+ ## license
33
+ This model license is under MIT
34
+
35
+ My training data is created by Apache Licensed/mit model output.
36
+ https://huggingface.co/Qwen/Qwen3-TTS-12Hz-1.7B-Base
37
+
38
+ Matcha-TTS is MIT
39
+ https://github.com/shivammehta25/Matcha-TTS
40
+
41
+ ## Training
42
+ need checkpoint and audio from there
43
+ https://huggingface.co/Akjava/matcha-tts_ja_100speakers_group005qw-100
44
+
45
+ Use this.
46
+ https://github.com/akjava/Matcha-TTS-Japanese
47
+
48
+
49
+
50
+ ## Demo
51
+ https://ai-game-bu.itch.io/ai-gaming-voice