• DocumentCode
    3010584
  • Title

    Point process models of spectro-temporal modulation events for speech recognition

  • Author

    Jansen, Aren ; Mesgarani, Nima ; Niyogi, Partha

  • fYear
    2010
  • fDate
    7-10 Nov. 2010
  • Firstpage
    104
  • Lastpage
    108
  • Abstract
    Neurobiological research has uncovered the existence of cortical neurons in various animal species tuned to particular spectro-temporal modulations (STM) in the auditory stimulus. Other findings indicate that temporal statistics of the resulting neural spike trains may encode the underlying content of species-specific communication calls. With this motivation, we present an alternative approach to speech recognition based on point process statistical models of the local maxima events produced by a cortically-inspired spectro-temporal filter bank. We demonstrate the computational adequacy of this approach on the practical task of keyword spotting.
  • Keywords
    speech recognition; statistics; animal species; auditory stimulus; cortical neurons; cortically-inspired spectro-temporal filter bank; neural spike trains; neurobiological research; point process models; species-specific communication; spectro-temporal modulation events; spectro-temporal modulations; speech recognition; temporal statistics; Acoustics; Detectors; Hidden Markov models; Modulation; Speech; Speech recognition; Training; point process models; spectro-temporal modulation features; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers (ASILOMAR), 2010 Conference Record of the Forty Fourth Asilomar Conference on
  • Conference_Location
    Pacific Grove, CA
  • ISSN
    1058-6393
  • Print_ISBN
    978-1-4244-9722-5
  • Type

    conf

  • DOI
    10.1109/ACSSC.2010.5757477
  • Filename
    5757477