DocumentCode
3010584
Title
Point process models of spectro-temporal modulation events for speech recognition
Author
Jansen, Aren ; Mesgarani, Nima ; Niyogi, Partha
fYear
2010
fDate
7-10 Nov. 2010
Firstpage
104
Lastpage
108
Abstract
Neurobiological research has uncovered the existence of cortical neurons in various animal species tuned to particular spectro-temporal modulations (STM) in the auditory stimulus. Other findings indicate that temporal statistics of the resulting neural spike trains may encode the underlying content of species-specific communication calls. With this motivation, we present an alternative approach to speech recognition based on point process statistical models of the local maxima events produced by a cortically-inspired spectro-temporal filter bank. We demonstrate the computational adequacy of this approach on the practical task of keyword spotting.
Keywords
speech recognition; statistics; animal species; auditory stimulus; cortical neurons; cortically-inspired spectro-temporal filter bank; neural spike trains; neurobiological research; point process models; species-specific communication; spectro-temporal modulation events; spectro-temporal modulations; speech recognition; temporal statistics; Acoustics; Detectors; Hidden Markov models; Modulation; Speech; Speech recognition; Training; point process models; spectro-temporal modulation features; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals, Systems and Computers (ASILOMAR), 2010 Conference Record of the Forty Fourth Asilomar Conference on
Conference_Location
Pacific Grove, CA
ISSN
1058-6393
Print_ISBN
978-1-4244-9722-5
Type
conf
DOI
10.1109/ACSSC.2010.5757477
Filename
5757477
Link To Document