DocumentCode
1024901
Title
Discovering Phone Patterns in Spoken Utterances by Non-Negative Matrix Factorization
Author
Stouten, Veronique ; Demuynck, Kris ; Van hamme, Hugo
Author_Institution
Katholieke Univ. Leuven, Leuven
Volume
15
fYear
2008
fDate
6/30/1905 12:00:00 AM
Firstpage
131
Lastpage
134
Abstract
We present a technique to automatically discover the (word-sized) phone patterns that are present in speech utterances. These patterns are learnt from a set of phone lattices generated from the utterances. Just like children acquiring language, our system does not have prior information on what the meaningful patterns are. By applying the non-negative matrix factorization algorithm to a fixed-length high-dimensional vector representation of the speech utterances, a decomposition in terms of additive units is obtained. We illustrate that these units correspond to words in case of a small vocabulary task. Our result also raises questions about whether explicit segmentation and clustering are needed in an unsupervised learning context.
Keywords
matrix decomposition; speaker recognition; telephone sets; unsupervised learning; language acquisition; matrix factorization; phone lattices; speech utterances; unsupervised learning; vector representation; word segmentation; Automatic speech recognition; Humans; Lattices; Matrix decomposition; Natural languages; Pattern recognition; Pediatrics; Principal component analysis; Speech recognition; Streaming media; Language acquisition; matrix factorization; phone lattices; word segmentation;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2007.911723
Filename
4418411
Link To Document