Title :
Acoustic and language modeling of human and nonhuman noises for human-to-human spontaneous speech recognition
Author :
Schultz, T. ; Rogina, I.
Author_Institution :
Interactive Syst. Lab., Karlsruhe Univ., Germany
Abstract :
Several improvements of our speech-to-speech translation system JANUS on spontaneous human-to-human dialogs are presented. Common phenomena in spontaneous speech are described, followed by a classification of different types of noise. To handle the variety of spontaneous effects in human-to-human dialogs, special noise models are introduced representing both human and nonhuman noise, as well as word fragments. It is shown that both the acoustic and the language modeling of the noise increase the recognition performance significantly. In the experiments, a clustering of the noise classes is performed and the resulting cluster variants are compared, thus allowing one to determine the best tradeoff between the sensitivity and trainability of the models
Keywords :
acoustic signal processing; interactive systems; language translation; natural languages; speech processing; speech recognition; JANUS; acoustic modeling; cluster variants; experiments; human noise; human-to-human dialogs; human-to-human spontaneous speech recognition; language modeling; noise classes clustering; noise classification; noise models; nonhuman noise; recognition performance; sensitivity; speech-to-speech translation system; trainability; word fragments; Acoustic noise; Acoustic testing; Databases; Error analysis; Hidden Markov models; Humans; Interactive systems; Natural languages; Speech recognition; Speech synthesis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479531