• DocumentCode
    730811
  • Title
    Language independent query-by-example spoken term detection using N-best phone sequences and partial matching
  • Author
    Haihua Xu ; Peng Yang ; Xiong Xiao ; Lei Xie ; Cheung-Chi Leung ; Hongjie Chen ; Yu Jia ; Hang, L.V. ; Lei Wang ; Su Jun Leow ; Bin Ma ; Eng Siong Chng ; Haizhou Li
  • Author_Institution
    Temasek Lab., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5191
  • Lastpage
    5195
  • Abstract
    In this paper, we propose a symbolic search (SS) method based on partial sequence matching for the task of language-independent query-by-example spoken term detection. One main drawback of the conventional SS approach is its high miss rate for long queries. This is due to the high variation in the symbol representations of the query and the search audio, especially in the language-independent scenario. Successfully matching a query with its instances in the search audio becomes exponentially more difficult as the query grows longer. To reduce the miss rate, we propose a partial matching strategy in which all partial phone sequences of a query are used to search for query instances. Partial matching is also suitable for real-life applications, where an exact match is usually not necessary and word prefix, suffix, and order should not affect the search result. When applied to the QUESST 2014 task, the partial matching of phone sequences significantly reduces the miss rate of long queries compared with the conventional full matching method. In addition, for the most challenging inexact matching queries (type 3), it also shows a clear advantage over DTW-based methods.
  • Keywords
    query formulation; query processing; speech recognition; QUESST 2014 task; inexact matching queries; language independent query-by-example spoken term detection; language independent scenario; miss rate; partial matching strategy; partial phone sequences; partial sequence matching based symbolic search method; search audios; symbol representation; Acoustics; Audio databases; Indexing; Keyword search; Lattices; Search problems; Speech; keyword search; partial matching; phone tokenizer; query-by-example; spoken term detection
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type
    conf
  • DOI
    10.1109/ICASSP.2015.7178961
  • Filename
    7178961
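
The partial matching idea described in the abstract (searching with all partial phone sequences of a query rather than only the full sequence) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `min_len` cutoff, the exact-substring comparison, and the length-ratio score are all assumptions made for illustration, and the N-best phone sequences mentioned in the title are not modelled.

```python
from typing import Dict, List, Sequence, Tuple


def partial_sequences(phones: Sequence[str], min_len: int = 3) -> List[Tuple[str, ...]]:
    """Return all contiguous sub-sequences of at least min_len phones, longest first.

    min_len is a hypothetical cutoff, not a value taken from the paper.
    """
    n = len(phones)
    parts = []
    for length in range(n, min_len - 1, -1):
        for start in range(n - length + 1):
            parts.append(tuple(phones[start:start + length]))
    return parts


def find_hits(
    query: Sequence[str],
    utterances: Dict[str, Sequence[str]],
    min_len: int = 3,
) -> Dict[str, float]:
    """Score each utterance by the longest partial query sequence it contains.

    The score (matched length / query length) is an illustrative assumption.
    """
    parts = partial_sequences(query, min_len)
    hits = {}
    for utt_id, utt in utterances.items():
        for part in parts:
            k = len(part)
            found = any(
                tuple(utt[i:i + k]) == part for i in range(len(utt) - k + 1)
            )
            if found:
                # Parts are ordered longest first, so the first match is the best one.
                hits[utt_id] = k / len(query)
                break
    return hits


# Toy phone strings, invented for this example.
query = ["k", "ae", "t", "s"]
utterances = {
    "u1": ["dh", "ax", "k", "ae", "t", "s", "ae", "t"],
    "u2": ["b", "ae", "t", "s", "ih"],
    "u3": ["d", "ao", "g"],
}
hits = find_hits(query, utterances)
```

With full matching only, `u2` would be a miss; with partial matching it is retrieved at a lower score because it contains a three-phone part of the query.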