Analysis of Speech Emotion Intensity
1. Relative Attributes
- Extract emotion related features by OpenSMILE toolkit. (384-dim)
- Train Ranking function (Linear) based on the RA.
- Normalize the intensity values to the range 0 ~ 1.
2. Intensity Distribution
3. Intensity Embedding
-
Map the real number to high dimensional embedding: (effective) $$Inty*W$$
-
Combine the neutral and emotional embeddings: (not so effective) $$Neu*(1-Inty)+Emo*Inty$$
-
Combine the neutral and emotional embeddings: (To be done) $$Neu.detach()(1-Inty)+EmoInty$$