Update README.md
Browse files
README.md
CHANGED
@@ -93,13 +93,20 @@ pip install git+https://github.com/langtech-bsc/Matcha-TTS.git@dev-cat
|
|
93 |
|
94 |
The model was trained on 2 **Catalan** speech datasets
|
95 |
|
96 |
-
| Dataset | Language | Hours |
|
97 |
-
|
98 |
-
| Festcat | ca | 22 |
|
99 |
-
| OpenSLR69 | ca | 5 |
|
100 |
|
101 |
### Training procedure
|
102 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
103 |
|
104 |
## Evaluation
|
105 |
|
|
|
93 |
|
94 |
The model was trained on 2 **Catalan** speech datasets
|
95 |
|
96 |
+
| Dataset | Language | Hours | Num. Speakers |
|
97 |
+
|---------------------|----------|---------|-----------------|
|
98 |
+
| Festcat | ca | 22 | 11 |
|
99 |
+
| OpenSLR69 | ca | 5 | 36 |
|
100 |
|
101 |
### Training procedure
|
102 |
|
103 |
+
***Catalan Matcha-TTS*** was not trained from scratch. Instead, we finetuned the model from the English multispeaker checkpoint
|
104 |
+
(trained with the [VCTK dataset](https://huggingface.co/datasets/vctk)) provided by the authors.
|
105 |
+
The embedding layer was initialized with the number of catalan speakers (47) and original hyperparameters were kept.
|
106 |
+
|
107 |
+
### Training Hyperparameters
|
108 |
+
|
109 |
+
|
110 |
|
111 |
## Evaluation
|
112 |
|