DocumentCode
2769498
Title
Speech recognition with localized time-frequency pattern detectors
Author
Schutte, Ken ; Glass, James
Author_Institution
MIT Comput. Sci. & Artificial Intelligence Lab., Cambridge
fYear
2007
fDate
9-13 Dec. 2007
Firstpage
341
Lastpage
346
Abstract
A method for acoustic modeling of speech is presented which is based on learning and detecting the occurrence of localized time-frequency patterns in a spectrogram. A boosting algorithm is applied to both build classifiers and perform feature selection from a large set of features derived by filtering spectrograms. Initial experiments are performed to discriminate digits in the Aurora database. The system succeeds in learning sequences of localized time-frequency patterns which are highly interpretable from an acoustic-phonetic viewpoint. While the work and the results are preliminary, they suggest that pursuing these techniques further could lead to new approaches to acoustic modeling for ASR which are more noise robust and offer better encoding of temporal dynamics than typical features such as frame-based cepstra.
Keywords
acoustic signal processing; speech recognition; time-frequency analysis; acoustic speech modeling; localized time-frequency pattern detectors; spectrogram; speech recognition; Acoustic noise; Acoustic signal detection; Automatic speech recognition; Boosting; Detectors; Filtering algorithms; Spatial databases; Spectrogram; Speech recognition; Time frequency analysis; acoustic modeling; automatic speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location
Kyoto
Print_ISBN
978-1-4244-1746-9
Electronic_ISBN
978-1-4244-1746-9
Type
conf
DOI
10.1109/ASRU.2007.4430135
Filename
4430135
Link To Document