مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech recognition with localized time-frequency pattern detectors

DocumentCode :

2769498

Title :

Speech recognition with localized time-frequency pattern detectors

Author :

Schutte, Ken ; Glass, James

Author_Institution :

MIT Comput. Sci. & Artificial Intelligence Lab., Cambridge

fYear :

2007

fDate :

9-13 Dec. 2007

Firstpage :

341

Lastpage :

346

Abstract :

A method for acoustic modeling of speech is presented which is based on learning and detecting the occurrence of localized time-frequency patterns in a spectrogram. A boosting algorithm is applied to both build classifiers and perform feature selection from a large set of features derived by filtering spectrograms. Initial experiments are performed to discriminate digits in the Aurora database. The system succeeds in learning sequences of localized time-frequency patterns which are highly interpretable from an acoustic-phonetic viewpoint. While the work and the results are preliminary, they suggest that pursuing these techniques further could lead to new approaches to acoustic modeling for ASR which are more noise robust and offer better encoding of temporal dynamics than typical features such as frame-based cepstra.

Keywords :

acoustic signal processing; speech recognition; time-frequency analysis; acoustic speech modeling; localized time-frequency pattern detectors; spectrogram; speech recognition; Acoustic noise; Acoustic signal detection; Automatic speech recognition; Boosting; Detectors; Filtering algorithms; Spatial databases; Spectrogram; Speech recognition; Time frequency analysis; acoustic modeling; automatic speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on

Conference_Location :

Kyoto

Print_ISBN :

978-1-4244-1746-9

Electronic_ISBN :

978-1-4244-1746-9

Type :

conf

DOI :

10.1109/ASRU.2007.4430135

Filename :

4430135

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2769498