--- tags: - espnet - audio - text-to-speech language: en datasets: - ljspeech license: cc-by-4.0 --- TTS model trained with Montreal Forced Aligner. To replicate or continue training from the given checkpoint, download [LJSpeech](https://keithito.com/LJ-Speech-Dataset/), install [MFA](https://github.com/MontrealCorpusTools/Montreal-Forced-Aligner) and follow the steps [here](https://github.com/espnet/espnet/blob/master/egs2/TEMPLATE/tts1/README.md). I recommend downloading the [pretrained MFA models](https://mfa-models.readthedocs.io/en/latest/) and running `mfa.sh` with `--train false`. This would help to save time, but one disadvantage is you have to use the MFA g2p for inference (which has a non-standard phoneme set).