Kotoba-Speech-v0.1
Kotoba-Speech v0.1 is a 1.2B Transformer-based speech generative model. It supports the following properties:
- Fluent text-to-speech generation in Japanese
- One-shot voice cloning through speech prompt
Usage
Plesae check out our HF Spaces demo.
Model Details
- Model type: Our model is end-to-end transformers.
- Language(s): Japanese
- Library: We'll releasde our training code soon. Inference and model code are largely adopted from metavoice.
Acknowledgements
- We thank meta-voice for opensourcing their code.
License
Apache License Version 2.0, January 2004