Text-to-Speech
PyTorch
ONNX
Catalan
matcha-tts
acoustic modelling
speech
multispeaker
AlexK-PL commited on
Commit
917b3c1
1 Parent(s): 2a49114

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -4
README.md CHANGED
@@ -93,13 +93,20 @@ pip install git+https://github.com/langtech-bsc/Matcha-TTS.git@dev-cat
93
 
94
  The model was trained on 2 **Catalan** speech datasets
95
 
96
- | Dataset | Language | Hours |
97
- |---------------------|----------|---------|
98
- | Festcat | ca | 22 |
99
- | OpenSLR69 | ca | 5 |
100
 
101
  ### Training procedure
102
 
 
 
 
 
 
 
 
103
 
104
  ## Evaluation
105
 
 
93
 
94
  The model was trained on 2 **Catalan** speech datasets
95
 
96
+ | Dataset | Language | Hours | Num. Speakers |
97
+ |---------------------|----------|---------|-----------------|
98
+ | Festcat | ca | 22 | 11 |
99
+ | OpenSLR69 | ca | 5 | 36 |
100
 
101
  ### Training procedure
102
 
103
+ ***Catalan Matcha-TTS*** was not trained from scratch. Instead, we finetuned the model from the English multispeaker checkpoint
104
+ (trained with the [VCTK dataset](https://huggingface.co/datasets/vctk)) provided by the authors.
105
+ The embedding layer was initialized with the number of catalan speakers (47) and original hyperparameters were kept.
106
+
107
+ ### Training Hyperparameters
108
+
109
+
110
 
111
  ## Evaluation
112