• DocumentCode
    3422547
  • Title

    Polyphase speech recognition

  • Author

    Lin, Hui ; Bilmes, Jeff

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Washington, Seattle, WA
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4109
  • Lastpage
    4112
  • Abstract
    We propose a model for speech recognition that consists of multiple semi-synchronized recognizers operating on a polyphase decomposition of standard speech features. Specifically, we consider multiple out-of-phase downsampled speech features as separate streams which are modeled separately at the lowest level, and are then integrated at the higher level (words) during first-pass decoding. Our model lessens the severity of the oversampling problem in many speech recognition systems - i.e., that speech modulation energy is most important below 25 Hz but a 100 Hz frame rate gives a modulation bandwidth of 50 Hz. Our polyphase approach moreover captures wider and more diverse dynamics within the speech signal. Our integrative network is high-level, namely it couples together and decodes word strings from different recognizers simultaneously and asynchronously. We provide preliminary results on the 10-word vocabulary version of the switchboard (small-vocabulary switchboard) task and show that our polyphase recognition system significantly outperforms an optimized baseline (HMM) approach.
  • Keywords
    decoding; speech recognition; downsampled speech features; first-pass decoding; out-of-phase speech features; polyphase decomposition; polyphase recognition system; semisynchronized recognizers; speech modulation energy; speech recognition; switchboard; Acoustics; Automatic speech recognition; Bandwidth; Bayesian methods; Decoding; Frequency modulation; Hidden Markov models; Power system modeling; Speech recognition; Streaming media; dynamic Bayesian network; polyphase speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518558
  • Filename
    4518558