Title :
Towards reliable speech recognition in operating room noise environment
Author :
Zelinka, Petr ; Sigmund, Milan
Author_Institution :
Dept. of Radio Electron., Brno Univ. of Technol., Brno, Czech Republic
Abstract :
This paper describes several practical steps for accurate statistical modeling of a known acoustical noise environment to attain good performance of a small vocabulary speech recognizer for isolated words based on whole-word hidden Markov models. Hierarchical segmentation based on Bayes information criterion and k-means clustering followed by split-merge Gaussian mixture model training were utilized for noise model estimation. Parallel model combination technique produces final noise-corrupted speech models for a small group of speakers. Experiments were carried out on a real operating room ambient noise recorded during a neurosurgery at the University Hospital in Marburg.
Keywords :
Bayes methods; Gaussian processes; estimation theory; hidden Markov models; hospitals; noise (working environment); speech recognition; statistical analysis; vocabulary; Bayes information criterion; Marburg University Hospital; acoustical noise environment; final noise-corrupted speech models; hierarchical segmentation; isolated words; k-means clustering; neurosurgery; noise model estimation; operating room noise environment; parallel model combination technique; real operating room ambient noise; small vocabulary speech recognizer; speech recognition; split-merge Gaussian mixture model training; statistical modeling; whole-word hidden Markov models; Automatic speech recognition; Hidden Markov models; Noise generators; Noise reduction; Noise robustness; Phase noise; Speech enhancement; Speech recognition; Vocabulary; Working environment noise; Bayes information criterion; Speech recognition; hidden Markov models; parallel model combination;
Conference_Titel :
Radioelektronika (RADIOELEKTRONIKA), 2010 20th International Conference
Conference_Location :
Brno
Print_ISBN :
978-1-4244-6318-3
DOI :
10.1109/RADIOELEK.2010.5478597