• DocumentCode
    2703031
  • Title

    An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling

  • Author

    Cetin, Omer ; Kantor, Amir ; King, Simon ; Bartels, Christopher ; Magimai-Doss, Mathew ; Frankel, Jorg ; Livescu, Karen

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    The so-called tandem approach, where the posteriors of a multilayer perceptron (MLP) classifier are used as features in an automatic speech recognition (ASR) system has proven to be a very effective method. Most tandem approaches up to date have relied on MLPs trained for phone classification, and appended the posterior features to some standard feature hidden Markov model (HMM). In this paper, we develop an alternative tandem approach based on MLPs trained for articulatory feature (AF) classification. We also develop a factored observation model for characterizing the posterior and standard features at the HMM outputs, allowing for separate hidden mixture and state-tying structures for each factor. In experiments on a subset of Switchboard, we show that the AF-based tandem approach is as effective as the phone-based approach, and that the factored observation model significantly outperforms the simple feature concatenation approach while using fewer parameters.
  • Keywords
    hidden Markov models; multilayer perceptrons; speech processing; speech recognition; ASR; MLP; articulatory feature-based tandem approach; automatic speech recognition; factored observation modeling; feature concatenation approach; hidden Markov model; multilayer perceptron; phone classification; state-tying structures; Acoustic signal processing; Automatic speech recognition; Computer science; Concatenated codes; Feature extraction; Hidden Markov models; Multilayer perceptrons; Speech processing; Speech recognition; Standards development; Multilayer Perceptrons; Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366995
  • Filename
    4218183