Prosody Ttransfer ETTS

May 12, 2022 in Paper

Prosody is underspecified by the text. (eg. rising intonation) Add a multi-speaker model to Tacotron. (speaker embedding) reference encoder: using a fixed-dimension embeddings. Reference https://arxiv.org/pdf/1803.09047.pdf

Prosody Ttransfer ETTS

LI WEI