• DocumentCode
    1024901
  • Title

    Discovering Phone Patterns in Spoken Utterances by Non-Negative Matrix Factorization

  • Author

    Stouten, Veronique ; Demuynck, Kris ; Van hamme, Hugo

  • Author_Institution
    Katholieke Univ. Leuven, Leuven
  • Volume
    15
  • fYear
    2008
  • fDate
    6/30/1905 12:00:00 AM
  • Firstpage
    131
  • Lastpage
    134
  • Abstract
    We present a technique to automatically discover the (word-sized) phone patterns that are present in speech utterances. These patterns are learnt from a set of phone lattices generated from the utterances. Just like children acquiring language, our system does not have prior information on what the meaningful patterns are. By applying the non-negative matrix factorization algorithm to a fixed-length high-dimensional vector representation of the speech utterances, a decomposition in terms of additive units is obtained. We illustrate that these units correspond to words in case of a small vocabulary task. Our result also raises questions about whether explicit segmentation and clustering are needed in an unsupervised learning context.
  • Keywords
    matrix decomposition; speaker recognition; telephone sets; unsupervised learning; language acquisition; matrix factorization; phone lattices; speech utterances; unsupervised learning; vector representation; word segmentation; Automatic speech recognition; Humans; Lattices; Matrix decomposition; Natural languages; Pattern recognition; Pediatrics; Principal component analysis; Speech recognition; Streaming media; Language acquisition; matrix factorization; phone lattices; word segmentation;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2007.911723
  • Filename
    4418411