YourTTS paper
README.md
CHANGED
@@ -50,7 +50,10 @@ xVAPitch_5820651 model sample: <audio controls>
 Your browser does not support the audio element.
 </audio>
 
-
+Papers:
+- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone - https://arxiv.org/abs/2112.02418
+
+Referenced papers within code:
 - Multi-head attention with Relative Positional embedding - https://arxiv.org/pdf/1809.04281.pdf
 - Transformer with Relative Potional Encoding- https://arxiv.org/abs/1803.02155
 - SDP - https://arxiv.org/pdf/2106.06103.pdf