Prosody Transfer ETTS

May 12, 2022 in Paper

Prosody is underspecified by the text (e.g., rising intonation).

Extend Tacotron to a multi-speaker model by conditioning on a speaker embedding.
Reference encoder: compresses a reference utterance into a fixed-dimension prosody embedding.
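
The two notes above can be illustrated with a minimal sketch. It assumes the fixed-dimension embeddings are broadcast over time and concatenated onto each text encoder state. The function names, the toy data, and the mean-pool-plus-tanh "encoder" are hypothetical stand-ins chosen to keep the example dependency-free; they are not the paper's reference encoder architecture.

```python
import math


def toy_reference_encoder(frames, dim):
    """Map a variable-length list of feature frames to one fixed-size vector.

    Hypothetical stand-in: averages the first `dim` features over time and
    applies tanh. It only demonstrates the "any length in, fixed size out"
    property, not a learned encoder.
    """
    if not frames:
        raise ValueError("reference must contain at least one frame")
    n = len(frames)
    return [math.tanh(sum(f[i] for f in frames) / n) for i in range(dim)]


def condition_encoder_outputs(text_states, speaker_emb, prosody_emb):
    """Repeat both embeddings at every time step and concatenate them."""
    return [list(s) + list(speaker_emb) + list(prosody_emb) for s in text_states]


# Toy data (assumed values): two references of very different lengths.
short_ref = [[0.1, 0.2, 0.3, 0.4]] * 5
long_ref = [[0.4, 0.3, 0.2, 0.1]] * 50

p_short = toy_reference_encoder(short_ref, dim=2)
p_long = toy_reference_encoder(long_ref, dim=2)

# Seven text encoder states of width 3, plus a 2-dimensional speaker embedding.
text_states = [[0.0, 0.0, 0.0] for _ in range(7)]
speaker_emb = [1.0, -1.0]

conditioned = condition_encoder_outputs(text_states, speaker_emb, p_long)
```

Both references yield an embedding of the same size regardless of their length, so the decoder input width stays constant (here 3 + 2 + 2 per time step) no matter which reference utterance is used.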

Reference

https://arxiv.org/pdf/1803.09047.pdf

LI WEI
