Benjamin-png
committed on
Commit
•
fec23e8
1
Parent(s):
3398b6d
updated Readme.md
README.md
CHANGED
@@ -1,3 +1,52 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
---
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
---
|
4 |
+
|
5 |
+
|
6 |
+
# Swahili MMS TTS - Finetuned Model
|
7 |
+
|
8 |
+
This is a fine-tuned version of Facebook's MMS (Massively Multilingual Speech) model for Swahili text-to-speech (TTS). It was fine-tuned on custom audio datasets to improve Swahili pronunciation and overall performance.
|
9 |
+
|
10 |
+
## Model Details
|
11 |
+
|
12 |
+
- **Model Name**: Swahili MMS TTS - Finetuned
|
13 |
+
- **Languages Supported**: Swahili
|
14 |
+
- **Base Model**: Facebook MMS
|
15 |
+
- **Use Case**: Text-to-speech for the Swahili language, suitable for generating high-quality speech from text.
|
16 |
+
|
17 |
+
## Training Details
|
18 |
+
|
19 |
+
The model was fine-tuned on a custom dataset of Swahili voice samples to improve the fluency and accuracy of the original MMS model. This resulted in better pronunciation and more natural-sounding Swahili speech.
|
20 |
+
|
21 |
+
The code and process used for fine-tuning are available in the [GitHub repository](https://github.com/benny-png/Swahili-model-for-Audio-Text-to-Speech).
|
22 |
+
|
23 |
+
## How to Use
|
24 |
+
|
25 |
+
You can load and use the model directly from the Hugging Face model hub:
|
26 |
+
|
27 |
+
```python
|
28 |
+
from transformers import pipeline
|
29 |
+
|
30 |
+
# Load the fine-tuned model
|
31 |
+
tts = pipeline("text-to-speech", model="Benjamin-png/swahili-mms-tts-finetuned")
|
32 |
+
|
33 |
+
# Generate speech from text; the result is a dict with "audio" and "sampling_rate" keys
|
34 |
+
speech = tts("Habari, karibu kwenye mfumo wetu wa kusikiliza kwa Kiswahili.")
|
35 |
+
```
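
The pipeline call above returns the waveform in memory rather than writing a file. As a minimal sketch of one way to save it, the hypothetical helper `save_speech` below (not part of this repository or of `transformers`) writes the result to a 16-bit mono WAV file using only the standard library. It assumes the result is a dict with `"audio"` (float samples in the range -1 to 1, possibly a NumPy array shaped `(1, n)`) and `"sampling_rate"`.

```python
import struct
import wave


def save_speech(output, path):
    """Write a text-to-speech result dict to a 16-bit mono WAV file.

    Hypothetical helper: assumes output["audio"] holds float samples in
    [-1, 1] and output["sampling_rate"] holds the sample rate in Hz.
    Returns the number of samples written.
    """
    audio = output["audio"]
    # NumPy arrays (for example shaped (1, n)) are flattened to a plain list.
    if hasattr(audio, "ravel"):
        audio = audio.ravel().tolist()
    # Clip each sample to [-1, 1] and scale it to a signed 16-bit integer.
    pcm = [int(max(-1.0, min(1.0, float(s))) * 32767) for s in audio]
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)   # mono
        wav_file.setsampwidth(2)   # 2 bytes per sample = 16-bit
        wav_file.setframerate(int(output["sampling_rate"]))
        wav_file.writeframes(struct.pack(f"<{len(pcm)}h", *pcm))
    return len(pcm)
```

With the `speech` variable from the snippet above, `save_speech(speech, "habari.wav")` would write the generated audio to `habari.wav`.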
|
36 |
+
|
37 |
+
## Example Notebook
|
38 |
+
|
39 |
+
To reproduce the fine-tuning process or adapt it to a similar task, see the Google Colab notebook, which outlines the entire process:
|
40 |
+
|
41 |
+
- [Google Colab Notebook](upload file to Google Drive and provide the link here)
|
42 |
+
|
43 |
+
The notebook includes detailed steps on how to fine-tune the MMS model for Swahili TTS.
|
44 |
+
|
45 |
+
## GitHub Repository
|
46 |
+
|
47 |
+
For further exploration and code snippets, visit the [GitHub repository](https://github.com/benny-png/Swahili-model-for-Audio-Text-to-Speech) where you’ll find additional scripts, datasets, and instructions for customizing the model.
|
48 |
+
|
49 |
+
## License
|
50 |
+
|
51 |
+
This project is licensed under the terms of the Apache License 2.0.
|
52 |
+
|