DocumentCode :
1024901
Title :
Discovering Phone Patterns in Spoken Utterances by Non-Negative Matrix Factorization
Author :
Stouten, Veronique ; Demuynck, Kris ; Van hamme, Hugo
Author_Institution :
Katholieke Univ. Leuven, Leuven
Volume :
15
fYear :
2008
fDate :
6/30/1905 12:00:00 AM
Firstpage :
131
Lastpage :
134
Abstract :
We present a technique to automatically discover the (word-sized) phone patterns that are present in speech utterances. These patterns are learnt from a set of phone lattices generated from the utterances. Just like children acquiring language, our system does not have prior information on what the meaningful patterns are. By applying the non-negative matrix factorization algorithm to a fixed-length high-dimensional vector representation of the speech utterances, a decomposition in terms of additive units is obtained. We illustrate that these units correspond to words in case of a small vocabulary task. Our result also raises questions about whether explicit segmentation and clustering are needed in an unsupervised learning context.
Keywords :
matrix decomposition; speaker recognition; telephone sets; unsupervised learning; language acquisition; matrix factorization; phone lattices; speech utterances; unsupervised learning; vector representation; word segmentation; Automatic speech recognition; Humans; Lattices; Matrix decomposition; Natural languages; Pattern recognition; Pediatrics; Principal component analysis; Speech recognition; Streaming media; Language acquisition; matrix factorization; phone lattices; word segmentation;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2007.911723
Filename :
4418411
Link To Document :
بازگشت