kotoba-tech/kotoba-speech-v0.1

Kotoba-Speech-v0.1

Kotoba-Speech v0.1 is a 1.2B-parameter Transformer-based generative speech model. It supports the following:

Fluent text-to-speech generation in Japanese
One-shot voice cloning from a speech prompt

Usage

Please check out our Hugging Face Spaces demo.
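
For readers who want the model files locally rather than through the demo, the following is a minimal sketch, not an official usage recipe. It assumes the `huggingface_hub` package is installed and that the repository is public; the helper name `download_model` and the `local_dir` default are hypothetical. It only fetches the files: running inference additionally requires the model and inference code, which this card does not describe in detail.

```python
# Sketch only: fetch the repository files from the Hugging Face Hub.
# REPO_ID comes from this model card; everything else is illustrative.
REPO_ID = "kotoba-tech/kotoba-speech-v0.1"


def download_model(local_dir: str = "kotoba-speech-v0.1") -> str:
    """Download all files of the model repository and return the local path.

    `download_model` and its `local_dir` default are hypothetical names
    introduced for this example.
    """
    # Imported lazily so that defining this helper does not require the package.
    from huggingface_hub import snapshot_download

    # snapshot_download returns the path of the folder holding the files.
    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)
```

Calling `download_model()` needs network access and enough disk space for a 1.2B-parameter checkpoint.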

Model Details

Model type: End-to-end Transformer
Language(s): Japanese
Library: We'll release our training code soon. The inference and model code are largely adopted from metavoice.

Acknowledgements

We thank meta-voice for open-sourcing their code.

License

Apache License, Version 2.0, January 2004
