• DocumentCode
    2178519
  • Title

    Acoustic model training for non-audible murmur recognition using transformed normal speech data

  • Author

    Babani, Denis ; Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro

  • Author_Institution
    Nara Inst. of Sci. & Technol., Ikoma, Japan
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5224
  • Lastpage
    5227
  • Abstract
    In this paper we present a novel approach to acoustic model training for non-audible murmur (NAM) recognition using normal speech data transformed into NAM data. NAM is extremely soft murmur, that is so quiet that people around the speaker can hardly hear it. It is detected directly through the soft tissue of the head using a special body-conductive microphone, NAM microphone, placed on the neck below the ear. NAM recognition is one of the promising silent speech interfaces for man-machine speech communication. We have previously shown the effectiveness of speaker adaptive training (SAT) based on constrained maximum likelihood linear regression (CMLLR) in NAM acoustic model training. However, since the amount of available NAM data is still small, the effect of SAT is limited. In this paper we propose modified SAT methods capable of using a larger amount of normal speech data by transforming them into NAM data. The experimental results demonstrate that the pro posed methods yield an absolute increase of approximately 2% in word accuracy compared with the conventional method.
  • Keywords
    maximum likelihood estimation; regression analysis; speech recognition; CMLLR; NAM; acoustic model training; body-conductive microphone; constrained maximum likelihood linear regression; man-machine speech communication; nonaudible murmur recognition; normal speech data; speaker adaptive training; Acoustics; Adaptation models; Data models; Hidden Markov models; Speech; Speech recognition; Transforms; acoustic model; non-audible murmur recognition; silent speech interfaces; speaker adaptive training; transformed normal speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947535
  • Filename
    5947535