Kotoba-Speech-v0.1

Kotoba-Speech v0.1 is a 1.2B-parameter Transformer-based generative speech model. It supports the following:

  1. Fluent text-to-speech generation in Japanese
  2. One-shot voice cloning from a speech prompt

Usage

Please check out our HF Spaces demo.

Model Details

  • Model type: End-to-end Transformer
  • Language(s): Japanese
  • Library: We'll release our training code soon. The inference and model code are largely adapted from metavoice.

Acknowledgements

  • We thank meta-voice for open-sourcing their code.

License

Apache License, Version 2.0, January 2004
