Update README.md
Browse files
README.md
CHANGED
@@ -33,9 +33,10 @@ datasets:
|
|
33 |
|
34 |
## Model description
|
35 |
|
36 |
-
Matcha-TTS is an encoder-decoder architecture designed for fast acoustic modelling in TTS. The encoder predicts phoneme durations and its mean feature vectors
|
37 |
-
|
38 |
-
with 1D instead of 2D CNNs, making a high reduction on memory consumption and speedy synthesis.
|
|
|
39 |
Matcha-TTS is non-autorregressive and is trained using optimal-transport conditional flow matching (OT-CFM).
|
40 |
This yields an ODE-based decoder capable of high output quality in fewer synthesis steps than models trained using score matching.
|
41 |
|
@@ -99,6 +100,20 @@ Data comes from two different datasets: festcat and openslr69
|
|
99 |
|
100 |
### Results
|
101 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
102 |
|
103 |
## Additional information
|
104 |
|
|
|
33 |
|
34 |
## Model description
|
35 |
|
36 |
+
Matcha-TTS is an encoder-decoder architecture designed for fast acoustic modelling in TTS. The encoder predicts phoneme durations and its mean feature vectors.
|
37 |
+
And the decoder is essentially a U-Net inspired by Grad-TTS, that is based on Transformers architecture combined
|
38 |
+
with 1D instead of 2D CNNs, making a high reduction on memory consumption and speedy synthesis.
|
39 |
+
|
40 |
Matcha-TTS is non-autorregressive and is trained using optimal-transport conditional flow matching (OT-CFM).
|
41 |
This yields an ODE-based decoder capable of high output quality in fewer synthesis steps than models trained using score matching.
|
42 |
|
|
|
100 |
|
101 |
### Results
|
102 |
|
103 |
+
## Citation
|
104 |
+
|
105 |
+
If this code contributes to your research, please cite the work:
|
106 |
+
|
107 |
+
```
|
108 |
+
@misc{mehta2024matchatts,
|
109 |
+
title={Matcha-TTS: A fast TTS architecture with conditional flow matching},
|
110 |
+
author={Shivam Mehta and Ruibo Tu and Jonas Beskow and Éva Székely and Gustav Eje Henter},
|
111 |
+
year={2024},
|
112 |
+
eprint={2309.03199},
|
113 |
+
archivePrefix={arXiv},
|
114 |
+
primaryClass={eess.AS}
|
115 |
+
}
|
116 |
+
```
|
117 |
|
118 |
## Additional information
|
119 |
|