Prosody Ttransfer ETTS
Prosody is underspecified by the text. (eg. rising intonation)
- 
Add a multi-speaker model to Tacotron. (speaker embedding)
 - 
reference encoder: using a fixed-dimension embeddings.
 
>
Prosody is underspecified by the text. (eg. rising intonation)
Add a multi-speaker model to Tacotron. (speaker embedding)
reference encoder: using a fixed-dimension embeddings.