Title :
Fast similarity search on a large speech data set with neighborhood graph indexing
Author :
Aoyama, Kazuo ; Watanabe, Shinji ; Sawada, Hiroshi ; Minami, Yasuhiro ; Ueda, Naonori ; Saito, Kazumi
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Seika, Japan
Abstract :
This paper presents a novel graph-based approach for solving a problem of fast finding a speech model acoustically similar to a query model from a large set of speech models. Each speech model in the set is represented by a Gaussian mixture model and dissimilarity from a GMM to another is measured with a Kullback-Leibler divergence (KLD). Conventional pruning techniques based on the triangle inequality for fast similarity search are not available because the model space with a KLD is not a metric space. We propose a search method that is characterized by an index of a degree-reduced nearest neighbor (DRNN) graph. The search method can efficiently find the most similar (closest) GMM to a query, exploring the DRNN graph with a best-first manner. Experimental evaluations on utterance GMM search tasks reveal a significantly low computational cost of the proposed method.
Keywords :
Gaussian processes; graph theory; speech processing; Gaussian mixture model; Kullback-Leibler divergence; degree-reduced nearest neighbor graph; fast similarity search; graph-based approach; large speech data set; neighborhood graph indexing; pruning techniques; speech model; triangle inequality; Acoustic measurements; Computational efficiency; Extraterrestrial measurements; Indexing; Laboratories; Nearest neighbor searches; Search methods; Search problems; Signal processing; Speech; Gaussian mixture model; Graph index; Kullback-Leibler divergence; Similarity search; Utterance retrieval;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5494950