Title :
Graph index based query-by-example search on a large speech data set
Author :
Aoyama, Konosuke ; Ogawa, Anna ; Hattori, Toshihiro ; Hori, Toshikazu ; Nakamura, A.
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Keihanna Science City, Japan
Abstract :
This paper presents a neighborhood graph index approach for query-by-example search using dynamic time warping (DTW) on Gaussian mixture model (GMM) posteriorgram sequences. The approach is intended to achieve a significant speed-up of a spoken term detection (STD) task for resource-limited situations. The proposed method employs a degree-reduced k-nearest neighbor (k-DR) graph as an index. A set of k-DR graphs is pre-constructed off-line from a large number of GMM posteriorgram sequences. Given a query posteriorgram sequence, one k-DR graph is selected from the set as the index. By applying a newly introduced combination of greedy-search (GS) and breadth-first search (BFS) algorithms to the selected k-DR graph index, the proposed method efficiently achieves query-by-example STD. Experimental results on the MIT lecture corpus demonstrate that the proposed method works much faster than a state-of-art method by more than one order magnitude, keeping almost the same precision.
Keywords :
Gaussian processes; graph theory; greedy algorithms; search problems; speech synthesis; GMM posteriorgram sequences; Gaussian mixture model; MIT lecture corpus; breadth-first search (BFS) algorithms; degree-reduced k-nearest neighbor graph; dynamic time warping; graph index based query-by-example search; greedy search; k-DR graphs; large speech data set; neighborhood graph index approach; query-by-example STD; query-by-example search; resource-limited situations; spoken term detection task; Acoustics; Extraterrestrial measurements; Indexes; Search problems; Speech; Speech processing; Speech recognition; Dynamic time warping; Gaussian mixture model posteriorgram; Neighborhood graph index; Query-by-example search; Spoken term detection;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639328