istupakov/parakeet-tdt-0.6b-v2-onnx Automatic Speech Recognition • Updated Jul 13, 2025 • 1.07k • 21
litagin/anime_speaker_embedding_by_va_ecapa_tdnn_groupnorm Audio Classification • Updated Jun 22, 2025 • 3
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion Paper • 2303.09057 • Published Mar 16, 2023 • 3
Voice Separation with an Unknown Number of Multiple Speakers Paper • 2003.01531 • Published Feb 29, 2020 • 3
MulliVC: Multi-lingual Voice Conversion With Cycle Consistency Paper • 2408.04708 • Published Aug 8, 2024 • 8
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Paper • 2408.16532 • Published Aug 29, 2024 • 50
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators Paper • 2505.09558 • Published May 14, 2025 • 10