• DocumentCode
    2178184
  • Title

    Gesture-based Dynamic Bayesian Network for noise robust speech recognition

  • Author

    Mitra, Vikramjit ; Nam, Hosung ; Espy-Wilson, Carol Y. ; Saltzman, Elliot ; Goldstein, Louis

  • Author_Institution
    Dept. of ECE, Univ. of Maryland, College Park, MD, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5172
  • Lastpage
    5175
  • Abstract
    Previously we have proposed different models for estimating articulatory gestures and vocal tract variable (TV) trajectories from synthetic speech. We have shown that when deployed on natural speech, such models can help to improve the noise robustness of a hidden Markov model (HMM) based speech recognition system. In this paper we propose a model for estimating TVs trained on natural speech and present a Dynamic Bayesian Network (DBN) based speech recognition architecture that treats vocal tract constriction gestures as hidden variables, eliminating the necessity for explicit gesture recognition. Using the proposed architecture we performed a word recognition task for the noisy data of Aurora 2. Significant improvement was observed in using the gestural information as hidden variables in a DBN architecture over using only the mel-frequency cepstral coefficient based HMM or DBN backend. We also compare our results with other noise-robust front ends.
  • Keywords
    belief networks; hidden Markov models; speech recognition; DBN architecture; HMM; TV trajectory; dynamic Bayesian network; gesture-based dynamic Bayesian network; hidden Markov model; noise robust speech recognition; noise-robust front ends; vocal tract variable trajectories; Acoustics; Artificial neural networks; Hidden Markov models; Speech; Speech recognition; TV; Trajectory; Articulatory Phonology; Articulatory Speech Recognition; Dynamic Bayesian Network; Noise-robust Speech Recognition; Task Dynamic model; Vocal-Tract variables;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947522
  • Filename
    5947522