Matcha-TTS NgNgNgan

Demo 🤗 HuggingFace space: https://huggingface.co./spaces/doof-ferb/MatchaTTS_ngngngan

License

In accordance with the terms of the CC-BY-NC-SA-4.0 license, the use of my checkpoints and any audio output generated by them for commercial purposes is strictly prohibited. This includes, but is not limited to:

online and offline voice cloning as a service
online and offline text-to-speech as a service
content creation for monetization on social media platforms

Căn cứ vào các điều khoản của giấp phép CC-BY-NC-SA-4.0, việc sử dụng các checkpoints này và bất kỳ đầu ra âm thanh nào được tạo bởi chúng đều bị nghiêm cấm sử dụng cho mục đích thương mại. Điều này bao gồm, nhưng không giới hạn ở:

các dịch vụ nhân bản giọng nói trực tuyến và ngoại tuyến
các dịch vụ chuyển văn bản thành giọng nói trực tuyến và ngoại tuyến
tạo nội dung để kiếm tiền trên các nền tảng mạng xã hội

What is Matcha-TTS?

original: https://github.com/shivammehta25/Matcha-TTS

vocoder copied from:

hifigan_univ_v1: https://github.com/shivammehta25/Matcha-TTS-checkpoints/releases/download/v1.0/g_02500000
hifigan_T2_v1: https://github.com/shivammehta25/Matcha-TTS-checkpoints/releases/download/v1.0/generator_v1

About this repo

speaker: Vietnamese M.C. Nguyễn Ngọc Ngạn
data scraping code: https://github.com/phineas-pta/speech-synthesis-ngngngan
4h50min audio, 6.6k samples
batch size = 16 ⇒ 1 epoch = 363 steps
train locally from scratch, ≈ 3 minute/epoch
train 600 epochs, save ckpt every 20 epoch, select ckpt at 420th epoch
i haven’t tested all the checkpoints 1 by 1

doof-ferb
/

matcha_ngngngan

Matcha-TTS NgNgNgan

License

What is Matcha-TTS?

About this repo

Spaces using doof-ferb/matcha_ngngngan 3