Title :
Open-vocabulary spoken term detection using graphone-based hybrid recognition systems
Author :
Akbacak, Murat ; Vergyri, Dimitra ; Stolcke, Andreas
Author_Institution :
SRI Int., Speech Technol. & Res. Lab., Menlo Park, CA
fDate :
March 31 2008-April 4 2008
Abstract :
We address the problem of retrieving out-of-vocabulary (OOV) words/queries from audio archives for spoken term detection (STD) task. Many STD systems use the output of an automatic speech recognition (ASR) system which has a limited and fixed vocabulary, and are not capable of detecting rare words of high information content, such as named entities. Since such words are often of great interest for a retrieval task it is important to index spoken archives in a way that allows a user to search an OOV query/term.1 In this work, we employ hybrid recognition systems which contain both words and subword units (graphones) to generate hybrid lattice indexes. We use a word-based STD system as our baseline, and present improvements by employing our proposed hybrid STD system that uses words plus graphones on the English broadcast news genre of the 2006 NIST STD task.
Keywords :
audio signal processing; query processing; speech recognition; vocabulary; word processing; audio archives; automatic speech recognition system; graphone-based hybrid recognition systems; open-vocabulary spoken term detection; out of vocabulary queries; out-of-vocabulary words; retrieval task; word-based system; Automatic speech recognition; Digital audio broadcasting; Hybrid power systems; Laboratories; Lattices; Measurement; NIST; Speech recognition; Testing; Vocabulary; audio indexing; open vocabulary; spoken term detection; voice search;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518841