Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DiscreteSpeech
/
DSTK
like
8
Follow
Discrete Speech Project
6
English
Chinese
speech
tokenization
detokenization
text2token
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
DSTK
/
semantic_tokenizer
3.87 GB
1 contributor
History:
4 commits
gooorillax
refine readme, add logo, and fix a punct normalization problem in tn
bdecca1
3 months ago
f40ms
refine readme, add logo, and fix a punct normalization problem in tn
3 months ago
__init__.py
0 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
3 months ago