Acoustic model training for non-audible murmur recognition using transformed normal speech data

Author

Babani, Denis ; Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro

Author_Institution

Nara Inst. of Sci. & Technol., Ikoma, Japan

fYear

2011

fDate

22-27 May 2011

Firstpage

5224

Lastpage

5227

Abstract

In this paper we present a novel approach to acoustic model training for non-audible murmur (NAM) recognition using normal speech data transformed into NAM data. NAM is extremely soft murmur, that is so quiet that people around the speaker can hardly hear it. It is detected directly through the soft tissue of the head using a special body-conductive microphone, NAM microphone, placed on the neck below the ear. NAM recognition is one of the promising silent speech interfaces for man-machine speech communication. We have previously shown the effectiveness of speaker adaptive training (SAT) based on constrained maximum likelihood linear regression (CMLLR) in NAM acoustic model training. However, since the amount of available NAM data is still small, the effect of SAT is limited. In this paper we propose modified SAT methods capable of using a larger amount of normal speech data by transforming them into NAM data. The experimental results demonstrate that the pro posed methods yield an absolute increase of approximately 2% in word accuracy compared with the conventional method.

Keywords

maximum likelihood estimation; regression analysis; speech recognition; CMLLR; NAM; acoustic model training; body-conductive microphone; constrained maximum likelihood linear regression; man-machine speech communication; nonaudible murmur recognition; normal speech data; speaker adaptive training; Acoustics; Adaptation models; Data models; Hidden Markov models; Speech; Speech recognition; Transforms; acoustic model; non-audible murmur recognition; silent speech interfaces; speaker adaptive training; transformed normal speech;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location

Prague

ISSN

1520-6149

Print_ISBN

978-1-4577-0538-0

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2011.5947535

Filename

5947535