DocumentCode :
730912
Title :
Segmental acoustic indexing for zero resource keyword search
Author :
Levin, Keith ; Jansen, Aren ; Van Durme, Benjamin
Author_Institution :
Human Language Technol. Center of Excellence, Johns Hopkins Univ., Baltimore, MD, USA
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
5828
Lastpage :
5832
Abstract :
The task of zero resource query-by-example keyword search has received much attention in recent years as the speech technology needs of the developing world grow. These systems traditionally rely upon dynamic time warping (DTW) based retrieval algorithms with runtimes that are linear in the size of the search collection. As a result, their scalability substantially lags that of their supervised counterparts, which take advantage of efficient word-based indices. In this paper, we present a novel audio indexing approach called Segmental Randomized Acoustic Indexing and Logarithmic-time Search (S-RAILS). S-RAILS generalizes the original frame-based RAILS methodology to word-scale segments by exploiting a recently proposed acoustic segment embedding technique. By indexing word-scale segments directly, we avoid higher cost frame-based processing of RAILS while taking advantage of the improved lexical discrimination of the embeddings. Using the same conversational telephone speech benchmark, we demonstrate major improvements in both speed and accuracy over the original RAILS system.
Keywords :
speech enhancement; speech intelligibility; speech processing; S-RAILS; acoustic segment embedding; dynamic time warping; novel audio indexing approach; retrieval algorithms; segmental acoustic indexing; segmental randomized acoustic indexing-and-logarithmic-time search; speech technology; telephone speech benchmark; word-based indices; zero resource keyword search; zero resource query-by-example keyword search task; Acoustics; Approximation methods; Indexing; Laplace equations; Rails; Speech; Zero resource; fixed-dimensional embedding; query-by-example search; speech indexing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7179089
Filename :
7179089
Link To Document :
بازگشت