• DocumentCode
    3485812
  • Title

    Efficient spoken term discovery using randomized algorithms

  • Author

    Jansen, Aren ; Van Durme, Benjamin

  • Author_Institution
    Human Language Technol. Center of Excellence, Johns Hopkins Univ., Baltimore, MD, USA
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    401
  • Lastpage
    406
  • Abstract
    Spoken term discovery is the task of automatically identifying words and phrases in speech data by searching for long repeated acoustic patterns. Initial solutions relied on exhaustive dynamic time warping-based searches across the entire similarity matrix, a method whose scalability is ultimately limited by the O(n2) nature of the search space. Recent strategies have attempted to improve search efficiency by using either unsupervised or mismatched-language acoustic models to reduce the complexity of the feature representation. Taking a completely different approach, this paper investigates the use of randomized algorithms that operate directly on the raw acoustic features to produce sparse approximate similarity matrices in O(n) space and O(n log n) time. We demonstrate these techniques facilitate spoken term discovery performance capable of outperforming a model-based strategy in the zero resource setting.
  • Keywords
    speech recognition; acoustic patterns; exhaustive dynamic time warping-based searches; randomized algorithms; raw acoustic features; speech data; spoken term discovery; Acoustics; Approximation algorithms; Approximation methods; Image segmentation; Sparse matrices; Speech; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163965
  • Filename
    6163965