Metadata-Version: 2.1 Name: MyShell-OpenVoice Version: 0.0.0 Summary: Instant voice cloning by MyShell. Home-page: https://github.com/myshell-ai/OpenVoice Author: MyShell Author-email: ethan@myshell.ai License: MIT License Project-URL: Documentation, https://github.com/myshell-ai/OpenVoice/blob/main/docs/USAGE.md Project-URL: Changes, https://github.com/myshell-ai/OpenVoice/releases Project-URL: Code, https://github.com/myshell-ai/OpenVoice Project-URL: Issue tracker, https://github.com/myshell-ai/OpenVoice/issues Keywords: text-to-speech,tts,voice-clone,zero-shot-tts Requires-Python: >=3.9 Description-Content-Type: text/markdown License-File: LICENSE Requires-Dist: librosa==0.9.1 Requires-Dist: faster-whisper==0.9.0 Requires-Dist: pydub==0.25.1 Requires-Dist: wavmark==0.0.3 Requires-Dist: numpy==1.22.0 Requires-Dist: eng_to_ipa==0.0.2 Requires-Dist: inflect==7.0.0 Requires-Dist: unidecode==1.3.7 Requires-Dist: whisper-timestamped==1.14.2 Requires-Dist: pypinyin==0.50.0 Requires-Dist: cn2an==0.5.22 Requires-Dist: jieba==0.42.1 Requires-Dist: gradio==3.48.0 Requires-Dist: langid==1.1.6
 
[Paper](https://arxiv.org/abs/2312.01479) | [Website](https://research.myshell.ai/open-voice)
## Introduction ### OpenVoice V1 As we detailed in our [paper](https://arxiv.org/abs/2312.01479) and [website](https://research.myshell.ai/open-voice), the advantages of OpenVoice are three-fold: **1. Accurate Tone Color Cloning.** OpenVoice can accurately clone the reference tone color and generate speech in multiple languages and accents. **2. Flexible Voice Style Control.** OpenVoice enables granular control over voice styles, such as emotion and accent, as well as other style parameters including rhythm, pauses, and intonation. **3. Zero-shot Cross-lingual Voice Cloning.** Neither of the language of the generated speech nor the language of the reference speech needs to be presented in the massive-speaker multi-lingual training dataset. ### OpenVoice V2 In April 2024, we released OpenVoice V2, which includes all features in V1 and has: **1. Better Audio Quality.** OpenVoice V2 adopts a different training strategy that delivers better audio quality. **2. Native Multi-lingual Support.** English, Spanish, French, Chinese, Japanese and Korean are natively supported in OpenVoice V2. **3. Free Commercial Use.** Starting from April 2024, both V2 and V1 are released under MIT License. Free for commercial use. [Video](https://github.com/myshell-ai/OpenVoice/assets/40556743/3cba936f-82bf-476c-9e52-09f0f417bb2f) OpenVoice has been powering the instant voice cloning capability of [myshell.ai](https://app.myshell.ai/explore) since May 2023. Until Nov 2023, the voice cloning model has been used tens of millions of times by users worldwide, and witnessed the explosive user growth on the platform. ## Main Contributors - [Zengyi Qin](https://www.qinzy.tech) at MIT and MyShell - [Wenliang Zhao](https://wl-zhao.github.io) at Tsinghua University - [Xumin Yu](https://yuxumin.github.io) at Tsinghua University - [Ethan Sun](https://twitter.com/ethan_myshell) at MyShell ## How to Use Please see [usage](docs/USAGE.md) for detailed instructions. ## Common Issues Please see [QA](docs/QA.md) for common questions and answers. We will regularly update the question and answer list. ## Join Our Community Join our [Discord community](https://discord.gg/myshell) and select the `Developer` role upon joining to gain exclusive access to our developer-only channel! Don't miss out on valuable discussions and collaboration opportunities. ## Citation ``` @article{qin2023openvoice, title={OpenVoice: Versatile Instant Voice Cloning}, author={Qin, Zengyi and Zhao, Wenliang and Yu, Xumin and Sun, Xin}, journal={arXiv preprint arXiv:2312.01479}, year={2023} } ``` ## License OpenVoice V1 and V2 are MIT Licensed. Free for both commercial and research use. ## Acknowledgements This implementation is based on several excellent projects, [TTS](https://github.com/coqui-ai/TTS), [VITS](https://github.com/jaywalnut310/vits), and [VITS2](https://github.com/daniilrobnikov/vits2). Thanks for their awesome work!