Prosody Ttransfer ETTS
Prosody is underspecified by the text. (eg. rising intonation)
-
Add a multi-speaker model to Tacotron. (speaker embedding)
-
reference encoder: using a fixed-dimension embeddings.
>
Prosody is underspecified by the text. (eg. rising intonation)
Add a multi-speaker model to Tacotron. (speaker embedding)
reference encoder: using a fixed-dimension embeddings.