> Paper
Prosody is underspecified by the text. (eg. rising intonation) Add a multi-speaker model to Tacotron. (speaker embedding) reference encoder: using a fixed-dimension embeddings. Reference https://arxiv.org/pdf/1803.09047.pdf

Continue reading

Author's picture

LI WEI

苟日新,日日新,又日新

Not yet

Tokyo