Text-to-Speech
PyTorch
ONNX
Catalan
matcha-tts
acoustic modelling
speech
multispeaker
AlexK-PL commited on
Commit
b780397
1 Parent(s): 2913be0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -1
README.md CHANGED
@@ -15,7 +15,7 @@ datasets:
15
  - projecte-aina/openslr-slr69-ca-trimmed-denoised
16
  ---
17
 
18
- # Matcha TTS Catalan
19
 
20
  ## Table of Contents
21
  <details>
@@ -33,6 +33,11 @@ datasets:
33
 
34
  ## Model description
35
 
 
 
 
 
 
36
  ## Intended uses and limitations
37
 
38
  ## How to use
 
15
  - projecte-aina/openslr-slr69-ca-trimmed-denoised
16
  ---
17
 
18
+ # Matcha-TTS Catalan Multispeaker
19
 
20
  ## Table of Contents
21
  <details>
 
33
 
34
  ## Model description
35
 
36
+ Matcha-TTS is an encoder-decoder architecture designed for fast acoustic modelling in TTS. The encoder side is inspired by previous works (Grad-TTS and Glow-TTS)
37
+ modelling alignment with Monotonic Alignment Search (MOS). The decoder is essentially a U-Net inspired by Grad-TTS based on Transformers architecture combined with 1D CNNs,
38
+ making a high reduction on memory consumption while increasing synthesis speed. Matcha-TTS is probabilistic, non-autorregressive and is trained using optimal-transport
39
+ conditional flow matching (OT-CFM). This yields an ODE-based decoder capable of high output quality in fewer synthesis steps than models trained using score matching.
40
+
41
  ## Intended uses and limitations
42
 
43
  ## How to use