Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
johnsett 's Collections
robotics
image-text-to-text
CUA
audio
interesting
tts
video
image

tts

updated Nov 24, 2025
Upvote
-

  • myshell-ai/MeloTTS-English

    Text-to-Speech • Updated Dec 24, 2024 • 1.04M • 301

    Note really nice tts


  • coqui/XTTS-v2

    Text-to-Speech • Updated Dec 11, 2023 • 4.79M • 3.3k

    Note change voice and language tts


  • facebook/musicgen-small

    Text-to-Audio • 0.6B • Updated Nov 17, 2023 • 82.4k • 468

  • Running
    3

    Spleeter And ASR

    🚀
    3

    Separate audio into vocals and accompaniment, transcribe vocals

    Note take a auto file and split out the voice from music and then extract text from the voice


  • Running
    31

    Speaker Diarization

    🔥
    31

    Speaker diarization, speake segmentation,


  • pyannote/segmentation-3.0

    Voice Activity Detection • Updated May 10, 2024 • 14.3M • 745

  • Running on Zero
    430

    Seed Voice Conversion

    🎤
    430

    Convert voice to match another's style or tone


  • Supertone/supertonic

    Text-to-Speech • Updated Dec 10, 2025 • 4.07k • 458

  • maya-research/maya1

    Text-to-Speech • 3B • Updated Nov 12, 2025 • 41.2k • 844

  • hexgrad/Kokoro-82M

    Text-to-Speech • Updated Apr 10, 2025 • 1.81M • • 5.56k
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs