Title :
Discovering Phone Patterns in Spoken Utterances by Non-Negative Matrix Factorization
Author :
Stouten, Veronique ; Demuynck, Kris ; Van hamme, Hugo
Author_Institution :
Katholieke Univ. Leuven, Leuven
fDate :
6/30/1905 12:00:00 AM
Abstract :
We present a technique to automatically discover the (word-sized) phone patterns that are present in speech utterances. These patterns are learnt from a set of phone lattices generated from the utterances. Just like children acquiring language, our system does not have prior information on what the meaningful patterns are. By applying the non-negative matrix factorization algorithm to a fixed-length high-dimensional vector representation of the speech utterances, a decomposition in terms of additive units is obtained. We illustrate that these units correspond to words in case of a small vocabulary task. Our result also raises questions about whether explicit segmentation and clustering are needed in an unsupervised learning context.
Keywords :
matrix decomposition; speaker recognition; telephone sets; unsupervised learning; language acquisition; matrix factorization; phone lattices; speech utterances; unsupervised learning; vector representation; word segmentation; Automatic speech recognition; Humans; Lattices; Matrix decomposition; Natural languages; Pattern recognition; Pediatrics; Principal component analysis; Speech recognition; Streaming media; Language acquisition; matrix factorization; phone lattices; word segmentation;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2007.911723