• DocumentCode
    730834
  • Title

    Order-free spoken term detection

  • Author

    Mangu, Lidia ; Saon, George ; Picheny, Michael ; Kingsbury, Brian

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5331
  • Lastpage
    5335
  • Abstract
    In this paper, we propose Time-Marked Word (TMW) lists as a replacement for the lattices and Confusion Networks (CNs) widely used as indexing vehicles for Spoken Term Detection (STD). In a TMW list, candidates are simply tagged with posterior probabilities and time information and stored as a large list of words: the additional ordering present in a lattice or CN is discarded. TMW lists compactly summarize a large ASR search space. Representing a large search space is critical for STD metrics such as ATWV that heavily penalize misses of rare keywords. Comparisons on the OpenKWS 2014 Tamil limited language pack task [1] show that the new TMW-based indexing results in better performance while being faster and having a smaller footprint.
  • Keywords
    audio signal processing; indexing; speech processing; speech recognition; OpenKWS 2014 Tamil limited language pack task; STD; TMW-based indexing; confusion networks; order-free spoken term detection; posterior probabilities; rare keywords; time information; time-marked word lists; Art; Indexes; Lattices; audio indexing; confusion networks; keyword search; keyword spotting; spoken term detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178989
  • Filename
    7178989