TTS Arena V2
Vote on the latest TTS models!
Projects I've worked on (or contributed to)
Vote on the latest TTS models!
Note Contributed to the Emolia dataset.
Note Apache 2.0 retrain of F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Note Unofficial demo for E2/F5-TTS, which supports zero-shot voice cloning. Not affiliated with the authors of F5-TTS
Note Universal voice acting model!
A demo of OpenDalle V1.1 on a ZERO GPU.
Note Note: I did not create the model, just the demo.
Fast & efficient ASR outperforming Whisper!
Note Unofficial demo for the Moonshine ASR model, an efficient & fast ASR model by Useful Sensors Moonshine ASR: https://github.com/usefulsensors/moonshine
Did StyleTTS 2 generate that audio?!?
Note A Whipser-based audio classification model to detect StyleTTS 2
Fast, efficient, & multilingual text-to-speech
Note Demo for MeloTTS: Multilingual, multispeaker text-to-speech licensed under the MIT license
Generate MIDI music using RWKV v4!
Note My newest project, a demo of RWKV 4 Music (the MIDI model).
Efficient, fast, and natural text to speech with StyleTTS 2!
Note My most successful project: an online demo for StyleTTS 2. Reached HF Spaces of the Week and was the most popular Space of the Week. Note: I did not create StyleTTS 2, just the demo.
Note A frankenmerge of NeuralHermes 2.5 and OpenOrca
Note A multilingual dataset of text-phoneme pairs supporting 15 languages.
Generate a video from audio with customizable waveform visualization
TTS for any emotion, now with non-verbal sounds!