DocumentCode :
2700659
Title :
Open-Vocabulary Spoken Utterance Retrieval using Confusion Networks
Author :
Hori, Toshikazu ; Hetherington, I.L. ; Hazen, Timothy J. ; Glass, James R.
Author_Institution :
NTT Commun. Sci. Lab., NTT Corp., Kyoto, Japan
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
This paper presents a novel approach to open-vocabulary spoken utterance retrieval using confusion networks. If out-of-vocabulary (OOV) words are present in queries and the corpus, word-based indexing will not be sufficient. For this problem, we apply phone confusion networks and combine them with word confusion networks. With this approach, we can generate a more compact index table that enables robust keyword matching compared with typical lattice-based methods. In the retrieval experiments with speech recordings in MIT lecture corpus, our method using phone confusion networks outperformed lattice-based methods especially for OOV queries.
Keywords :
indexing; information retrieval; speech processing; MIT lecture corpus; confusion networks; lattice-based methods; open-vocabulary spoken utterance retrieval; out-of-vocabulary words; phone confusion networks; robust keyword matching; speech recordings; word-based indexing; Artificial intelligence; Audio recording; Automatic speech recognition; Computer science; Error analysis; Glass; Indexing; Laboratories; Lattices; Robustness; Audio Indexing; Confusion Network; Spoken Utterance Retrieval;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367166
Filename :
4218040
Link To Document :
بازگشت