DocumentCode :
3716116
Title :
Query by example search with segmented dynamic time warping for non-exact spoken queries
Author :
Jorge Proença;Arlindo Veiga;Fernando Perdigão
Author_Institution :
Instituto de Telecomunicações, Coimbra, Portugal; Electrical and Computer Eng. Department, University of Coimbra, Portugal
fYear :
2015
Firstpage :
1661
Lastpage :
1665
Abstract :
This paper presents an approach to the Query-by-Example task of finding spoken queries on speech databases when the intended match may be non-exact or slightly complex. The built system is low-resource as it tries to solve the problem where the language of queries and searched audio is unspecified. Our method is based on a modified Dynamic Time Warping (DTW) algorithm using posterior-grams and extracting intricate paths to account for special cases of query match such as word re-ordering, lexical variations and filler content. This system was evaluated on the MediaEval 2014 task of Query by Example Search on Speech (QUESST) where the spoken data is from different languages, unknown to the participant. We combined the results of five DTW modifications computed on the output of three phoneme recognizers of different languages. The combination of all systems provided the best performance overall and improved detection of complex case queries.
Keywords :
"Speech","Databases","Europe","Signal processing","Acoustics","Search problems","Signal processing algorithms"
Publisher :
IEEE
Conference_Title :
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN :
2076-1465
Type :
conf
DOI :
10.1109/EUSIPCO.2015.7362666
Filename :
7362666