Title :
JANUS-II: towards spontaneous Spanish speech recognition
Author :
Zhan, Puming ; Ries, Klaus ; Gavalda, Marsal ; Gates, Donna ; Lavie, Alon ; Waibel, Alex
Author_Institution :
Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
JANUS-II is a research system for investigating various issues in speech-to-speech translations and has been implemented for translations in many languages. In this paper, we address the Spanish speech recognition part of JANUS-II. First, we report the bootstrapping and optimization of the recognition system. Then we investigate the difference between push-to-talk and cross-talk dialogs, which are two different kinds of data in our database. We give a detailed noise analysis for the push-to-talk and cross-talk dialogs and present some recognition results for comparison. We have observed that the cross-talk dialogs are harder than the push-to-talk dialogs for speech recognition, because they are more noisy than the latter. Currently, the error rate of our Spanish recognizer is 27% for the push-to-talk test set and 32% for the cross-talk test set
Keywords :
acoustic noise; crosstalk; language translation; optimisation; speech recognition; JANUS-II; bootstrapping; cross-talk dialogue; database; noise analysis; optimization; push-to-talk dialogue; recognition error rate; speech-to-speech translation; spontaneous Spanish speech recognition; Acoustic testing; Background noise; Crosstalk; Databases; Hidden Markov models; Humans; Loudspeakers; Speech enhancement; Speech recognition; Vocabulary;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607263