مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2800555

Title :

Sparse coding for speech recognition

Author :

Sivaram, G.S.V.S. ; Nemala, Sridhar Krishna ; Elhilali, Mounya ; Tran, Trac D. ; Hermansky, Hynek

Author_Institution :

Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA

fYear :

2010

fDate :

14-19 March 2010

Firstpage :

4346

Lastpage :

4349

Abstract :

This paper proposes a novel feature extraction technique for speech recognition based on the principles of sparse coding. The idea is to express a spectro-temporal pattern of speech as a linear combination of an overcomplete set of basis functions such that the weights of the linear combination are sparse. These weights (features) are subsequently used for acoustic modeling. We learn a set of overcomplete basis functions (dictionary) from the training set by adopting a previously proposed algorithm which iteratively minimizes the reconstruction error and maximizes the sparsity of weights. Furthermore, features are derived using the learned basis functions by applying the well established principles of compressive sensing. Phoneme recognition experiments show that the proposed features outperform the conventional features in both clean and noisy conditions.

Keywords :

feature extraction; speech coding; speech recognition; acoustic modeling; compressive sensing; feature extraction technique; overcomplete basis functions; phoneme recognition; sparse coding; spectro temporal speech pattern; speech recognition; Automatic speech recognition; Dictionaries; Feature extraction; Gabor filters; Image reconstruction; Iterative algorithms; Neurons; Speech recognition; Training data; Vectors; compressive sensing; feature extraction; sparse coding; speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location :

Dallas, TX

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2010.5495649

Filename :

5495649

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2800555