DocumentCode :
1694183
Title :
Embedding time warping in exemplar-based sparse representations of speech
Author :
Yilmaz, Emre ; Gemmeke, Jort F. ; Van hamme, Hugo
Author_Institution :
Dept. ESAT, KU Leuven, Leuven, Belgium
fYear :
2013
Firstpage :
8076
Lastpage :
8080
Abstract :
This paper describes a new sparse representation model for speech that allows time warping as an extension to a recently proposed sparse representations-based speech recognition system. This recognition system uses exemplars to model the acoustics which are labeled speech occurrences of different length extracted from the training data. Exemplars are organized in multiple dictionaries on the basis of their class and length. Input speech segments are approximated as a sparse linear combination of the exemplars using these dictionaries and a reconstruction error-based decoding is adopted in order to find the best matching class sequence. With the current sparse representation model using a dictionary and a weight vector to approximate an input speech segment, it is not possible to compare input speech segments with exemplars of different lengths. The goal of this work is to introduce a novel sparse representation model which allows time warping using a third matrix which linearly combines consecutive frames in order to shrink or expand the approximation. Preliminary results have shown the feasibility of the proposed sparse representation model.
Keywords :
acoustic signal processing; decoding; matrix algebra; signal representation; speech coding; speech recognition; acoustics model; dictionaries; exemplars; labeled speech occurrences; matching class sequence; reconstruction error-based decoding; sparse linear combination; sparse representation model; sparse representations-based speech recognition system; speech segmentation; third matrix; time warping; weight vector; Dictionaries; Mathematical model; Sparse matrices; Speech; Speech recognition; Time-frequency analysis; Vectors; Exemplar-based speech recognition; sparse representations; time warping;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639238
Filename :
6639238
Link To Document :
بازگشت