Title :
An inner-product lower-bound estimate for dynamic time warping
Author :
Zhang, Yaodong ; Glass, James R.
Author_Institution :
MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
Abstract :
In this paper, we present a lower-bound estimate for dynamic time warping (DTW) on time series consisting of multi-dimensional posterior probability vectors known as posteriorgrams. We develop a lower-bound estimate based on the inner-product distance that has been found to be an effective metric for computing similarities between posteriorgrams. In addition to deriving the lower-bound estimate, we show how it can be efficiently used in an admissible K nearest neighbor (KNN) search for spotting matching sequences. We quantify the amount of computational savings achieved by performing a set of unsupervised spoken keyword spotting experiments using Gaussian mixture model posteriorgrams. In these experiments the proposed lower-bound estimate eliminates 89% of the DTW previously required calculations without affecting overall keyword detection performance.
Keywords :
Gaussian processes; dynamic programming; learning (artificial intelligence); pattern classification; pattern matching; probability; speech processing; time series; DTW; Gaussian mixture model posteriorgrams; K nearest neighbor search; dynamic programming technique; dynamic time warping; inner-product lower-bound estimate; matching sequence spotting; multidimensional posterior probability vectors; time series; unsupervised spoken keyword spotting experiments; Glass; Hidden Markov models; Keyword search; Speech; Speech recognition; Time series analysis; Training; dynamic time warping; posteriorgram;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947644