Title :
Open-Vocabulary Spoken Utterance Retrieval using Confusion Networks
Author :
Hori, Toshikazu ; Hetherington, I.L. ; Hazen, Timothy J. ; Glass, James R.
Author_Institution :
NTT Commun. Sci. Lab., NTT Corp., Kyoto, Japan
Abstract :
This paper presents a novel approach to open-vocabulary spoken utterance retrieval using confusion networks. When out-of-vocabulary (OOV) words occur in queries and in the corpus, word-based indexing alone is not sufficient. To address this problem, we apply phone confusion networks and combine them with word confusion networks. Compared with typical lattice-based methods, this approach yields a more compact index table that enables robust keyword matching. In retrieval experiments on speech recordings from the MIT lecture corpus, our method using phone confusion networks outperformed lattice-based methods, especially for OOV queries.
Keywords :
indexing; information retrieval; speech processing; MIT lecture corpus; confusion networks; lattice-based methods; open-vocabulary spoken utterance retrieval; out-of-vocabulary words; phone confusion networks; robust keyword matching; speech recordings; word-based indexing; Artificial intelligence; Audio recording; Automatic speech recognition; Computer science; Error analysis; Glass; Indexing; Laboratories; Lattices; Robustness; Audio Indexing; Confusion Network; Spoken Utterance Retrieval;
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367166