Fernando Martinez, Juntao Chen, et al.
AAAI 2025
Experiments in emotive spoken language user interfaces are described. Optimal use of multimodal information, synthesizing and recognizing information in both the audio and video modalities, is found to improve recognition accuracy and synthesis quality. Specific topics covered include: speech and emotion recognition by humans; automatic audiovisual speech and emotion recognition; audiovisual speech synthesis; emotive prosody; and emotionally nuanced audiovisual speech.
W.C. Tang, H. Rosen, et al.
SPIE Optics, Electro-Optics, and Laser Applications in Science and Engineering 1991
J. LaRue, C. Ting
Proceedings of SPIE 1989
A.R. Gourlay, G. Kaye, et al.
Proceedings of SPIE 1989