projecte-aina
/

matxa-tts-cat-multispeaker

acoustic modelling

Model card Files Files and versions Community

AlexK-PL commited on Mar 29

Commit

b780397

•

1 Parent(s): 2913be0

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ datasets:
 - projecte-aina/openslr-slr69-ca-trimmed-denoised
 ---
-# Matcha TTS Catalan
 ## Table of Contents
 <details>
@@ -33,6 +33,11 @@ datasets:
 ## Model description
 ## Intended uses and limitations
 ## How to use

 - projecte-aina/openslr-slr69-ca-trimmed-denoised
 ---
+# Matcha-TTS Catalan Multispeaker
 ## Table of Contents
 <details>
 ## Model description
+Matcha-TTS is an encoder-decoder architecture designed for fast acoustic modelling in TTS. The encoder side is inspired by previous works (Grad-TTS and Glow-TTS)
+modelling alignment with Monotonic Alignment Search (MOS). The decoder is essentially a U-Net inspired by Grad-TTS based on Transformers architecture combined with 1D CNNs,
+making a high reduction on memory consumption while increasing synthesis speed. Matcha-TTS is probabilistic, non-autorregressive and is trained using optimal-transport
+conditional flow matching (OT-CFM). This yields an ODE-based decoder capable of high output quality in fewer synthesis steps than models trained using score matching.
 ## Intended uses and limitations
 ## How to use