• DocumentCode
    3640864
  • Title

    Hierarchical tandem feature extraction

  • Author

    Sunil Sivadas;Hynek Hermansky

  • Author_Institution
    Oregon Graduate Institute of Science and Technology, Portland, USA
  • Volume
    1
  • fYear
    2002
  • fDate
    5/1/2002 12:00:00 AM
  • Abstract
    We present a hierarchical architecture for tandem acoustic modeling. In the tandem acoustic modeling paradigm a Multi Layer Perceptron (MLP) is discriminatively trained to estimate phoneme posterior probabilities on a labeled database. The outputs of the MLP after nonlinear transformation and whitening are used as features in a Gaussian Mixture Model (GMM) based recognizer. In this paper we replace the large monolithic MLP with hierarchies of MLP experts. We apply this approach on Speech in Noisy Environments (SPINE 1) evaluation conducted by the Naval Research Laboratory (NRL). We observe a reduction in word error rate of 30% with context-independent models and 5% WER with context-dependent models relative to PLP features.
  • Keywords
    "Artificial neural networks","Books","Speech"
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743841
  • Filename
    5743841