Title :
Acoustic model training for non-audible murmur recognition using transformed normal speech data
Author :
Babani, Denis ; Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro
Author_Institution :
Nara Inst. of Sci. & Technol., Ikoma, Japan
Abstract :
In this paper we present a novel approach to acoustic model training for non-audible murmur (NAM) recognition using normal speech data transformed into NAM data. NAM is extremely soft murmur, that is so quiet that people around the speaker can hardly hear it. It is detected directly through the soft tissue of the head using a special body-conductive microphone, NAM microphone, placed on the neck below the ear. NAM recognition is one of the promising silent speech interfaces for man-machine speech communication. We have previously shown the effectiveness of speaker adaptive training (SAT) based on constrained maximum likelihood linear regression (CMLLR) in NAM acoustic model training. However, since the amount of available NAM data is still small, the effect of SAT is limited. In this paper we propose modified SAT methods capable of using a larger amount of normal speech data by transforming them into NAM data. The experimental results demonstrate that the pro posed methods yield an absolute increase of approximately 2% in word accuracy compared with the conventional method.
Keywords :
maximum likelihood estimation; regression analysis; speech recognition; CMLLR; NAM; acoustic model training; body-conductive microphone; constrained maximum likelihood linear regression; man-machine speech communication; nonaudible murmur recognition; normal speech data; speaker adaptive training; Acoustics; Adaptation models; Data models; Hidden Markov models; Speech; Speech recognition; Transforms; acoustic model; non-audible murmur recognition; silent speech interfaces; speaker adaptive training; transformed normal speech;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947535