Oznur Alkan, Massimilliano Mattetti, et al.
INFORMS 2020
This paper focuses on the use of both text and phonetic information in a speech translation system in order to make translation results more robust to speech recognition errors. Conventional statistical speech translation formulas are extended to exploit both text-form and phonetic speech recognition results. A novel data-driven word/text tying algorithm is then proposed to group words based on both pronunciation similarity and meaning equivalency. In our speech-to-text translation experiments, significant improvement was achieved by using phonetic information and the proposed word tying algorithm. ©2006 IEEE.
Oznur Alkan, Massimilliano Mattetti, et al.
INFORMS 2020
Casey Dugan, Werner Geyer, et al.
CHI 2010
Rajesh Balchandran, Leonid Rachevsky, et al.
INTERSPEECH 2009
Elaine Hill
Human-Computer Interaction