DocumentCode
3145406
Title
Weakly supervised keyword learning using sparse representations of speech
Author
Driesen, Joris ; Gemmeke, Jort ; Van hamme, Hugo
Author_Institution
Dept. Electr. Eng., Katholieke Univ. Leuven, Leuven, Belgium
fYear
2012
fDate
25-30 March 2012
Firstpage
5145
Lastpage
5148
Abstract
When applied to speech, Non-negative Matrix Factorization is capable of learning a small vocabulary of words, foregoing any prior linguistic knowledge. This makes it adequate for small-scale speech applications where flexibility is of the utmost importance, e.g. assistive technology for the speech impaired. However, its performance depends on the way its inputs are represented. We propose the use of exemplar-based sparse representations of speech, and explore the influence of some of these representation´s basic parameters, such as the total number of exemplars considered and the sparseness imposed on them. We show that the resulting learning performance compares favorably with those of previously proposed approaches.
Keywords
handicapped aids; learning (artificial intelligence); matrix decomposition; speech processing; speech recognition; exemplar based sparse representations; learning performance; nonnegative matrix factorization; small scale speech applications; sparse speech representations; speech impaired assistive technology; weakly supervised keyword learning; Acoustics; Adaptation models; Histograms; Speech; Speech recognition; Vectors; Vocabulary; Exemplars; Lasso; Nonnegative Matrix Factorization; Sparseness; Vocabulary Acquisition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6287950
Filename
6287950
Link To Document