DocumentCode :
2769498
Title :
Speech recognition with localized time-frequency pattern detectors
Author :
Schutte, Ken ; Glass, James
Author_Institution :
MIT Comput. Sci. & Artificial Intelligence Lab., Cambridge
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
341
Lastpage :
346
Abstract :
A method for acoustic modeling of speech is presented which is based on learning and detecting the occurrence of localized time-frequency patterns in a spectrogram. A boosting algorithm is applied to both build classifiers and perform feature selection from a large set of features derived by filtering spectrograms. Initial experiments are performed to discriminate digits in the Aurora database. The system succeeds in learning sequences of localized time-frequency patterns which are highly interpretable from an acoustic-phonetic viewpoint. While the work and the results are preliminary, they suggest that pursuing these techniques further could lead to new approaches to acoustic modeling for ASR which are more noise robust and offer better encoding of temporal dynamics than typical features such as frame-based cepstra.
Keywords :
acoustic signal processing; speech recognition; time-frequency analysis; acoustic speech modeling; localized time-frequency pattern detectors; spectrogram; speech recognition; Acoustic noise; Acoustic signal detection; Automatic speech recognition; Boosting; Detectors; Filtering algorithms; Spatial databases; Spectrogram; Speech recognition; Time frequency analysis; acoustic modeling; automatic speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430135
Filename :
4430135
Link To Document :
بازگشت