Title :
Query-by-example spoken term detection using phonetic posteriorgram templates
Author :
Hazen, Timothy J. ; Shen, Wade ; White, Christopher
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
This paper examines a query-by-example approach to spoken term detection in audio files. The approach is designed for low-resource situations in which limited or no in-domain training material is available and accurate word-based speech recognition capability is unavailable. Instead of using word or phone strings as search terms, the user presents the system with audio snippets of desired search terms to act as the queries. Query and test materials are represented using phonetic posteriorgrams obtained from a phonetic recognition system. Query matches in the test data are located using a modified dynamic time warping search between query templates and test utterances. Experiments using this approach are presented using data from the Fisher corpus.
Keywords :
query processing; speech processing; speech recognition; Fisher corpus; audio snippets; modified dynamic time warping search; phonetic posteriorgram template; query-by-example spoken term detection; test utterances; word based speech recognition capability; Acoustic measurements; Acoustic testing; Hidden Markov models; Laboratories; Materials testing; Music information retrieval; Research and development; Speech recognition; System testing; Vocabulary;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5372889