DocumentCode :
2963934
Title :
Query-by-example spoken term detection using phonetic posteriorgram templates
Author :
Hazen, Timothy J. ; Shen, Wade ; White, Christopher
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
fYear :
2009
fDate :
Nov. 13 2009-Dec. 17 2009
Firstpage :
421
Lastpage :
426
Abstract :
This paper examines a query-by-example approach to spoken term detection in audio files. The approach is designed for low-resource situations in which limited or no in-domain training material is available and accurate word-based speech recognition capability is unavailable. Instead of using word or phone strings as search terms, the user presents the system with audio snippets of desired search terms to act as the queries. Query and test materials are represented using phonetic posteriorgrams obtained from a phonetic recognition system. Query matches in the test data are located using a modified dynamic time warping search between query templates and test utterances. Experiments using this approach are presented using data from the Fisher corpus.
Keywords :
query processing; speech processing; speech recognition; Fisher corpus; audio snippets; modified dynamic time warping search; phonetic posteriorgram template; query-by-example spoken term detection; test utterances; word based speech recognition capability; Acoustic measurements; Acoustic testing; Hidden Markov models; Laboratories; Materials testing; Music information retrieval; Research and development; Speech recognition; System testing; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
Type :
conf
DOI :
10.1109/ASRU.2009.5372889
Filename :
5372889