> Intensity control of speech synthesis

Intensity control of speech synthesis

November 4, 2023 in TTS

1. Notes

(1) We can’t regard different non-neutral speech pair as similar set, otherwise the emotional intensity labels attract each other.

(2) The intensity predictor should be fixed while training the text-to-speech model, otherwise the intensity cannot be controlled. (maybe because the label for each sample always fluctuates)

LI WEI

苟日新，日日新，又日新

Not yet

Tokyo