---
license: mit
title: KKR2
sdk: gradio
colorFrom: blue
colorTo: green
---

# Text-to-Speech App with Kokoro-82M-ONNX

This is a Gradio-based text-to-speech (TTS) app that uses the [Kokoro-82M-ONNX](https://huggingface.co/onnx-community/Kokoro-82M-ONNX) model from Hugging Face. It generates speech from text, offers several speaker voices, and lets you download the resulting audio file.

## Features

- **Text-to-Speech Conversion**: Convert any input text into speech.
- **Multiple Speakers**: Choose from different speaker voices.
- **Download Audio**: Download the generated speech as a `.wav` file.

## How to Use

1. **Enter Text**: Type or paste your text into the input box.
2. **Select Speaker**: Choose a speaker from the dropdown menu.
3. **Generate Speech**: Click the "Submit" button to generate the speech.
4. **Download Audio**: Once the speech is generated, listen to it in the browser or download the `.wav` file.

## Example Inputs

- **Text**: "Hello, welcome to the text-to-speech app!"
- **Speaker**: "Speaker 1"

## Requirements

The app requires the following Python packages:

- `onnxruntime`
- `torch`
- `gradio`
- `scipy`
- `numpy`
- `huggingface_hub`

These dependencies are installed automatically when the Space is built.

## Model Details

The app uses the [Kokoro-82M-ONNX](https://huggingface.co/onnx-community/Kokoro-82M-ONNX) model, a lightweight text-to-speech model in ONNX format. The model supports multiple speakers.

## Limitations

- The model may not handle very long texts efficiently.
- Speaker options are limited to the embeddings supported by the model.

## Feedback and Contributions

If you encounter any issues or have suggestions for improvement, please open an issue on the [GitHub repository](https://github.com/your-repo-link) or contact me directly.

---

Enjoy using the app! 🎉
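
### Working around long inputs (sketch)

The limitation on very long texts noted above can be eased by splitting the input into shorter pieces and synthesizing each one separately. The following is a minimal sketch, not the app's actual code: the helper name `split_into_chunks` and the `max_chars` default of 200 are hypothetical, and the right chunk size for Kokoro-82M-ONNX is not stated here, so treat the value as a placeholder to tune.

```python
import re


def split_into_chunks(text: str, max_chars: int = 200) -> list[str]:
    """Split text into chunks of at most max_chars characters.

    Hypothetical helper: breaks at sentence boundaries where possible and
    hard-wraps any single sentence that is longer than max_chars.
    """
    # Split after '.', '!' or '?' when followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]

    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # A sentence that exceeds the limit on its own is cut into fixed-size pieces.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]

        # Otherwise, pack sentences together until the limit would be exceeded.
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence

    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be passed to the TTS function in turn and the resulting audio arrays joined (for example with `numpy.concatenate`) before writing the `.wav` file. Breaking at sentence boundaries keeps pauses in natural places, which tends to sound better than cutting at a fixed character count.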