Title :
Video indexing using speech recognition techniques in audio channel - preliminary system design
Author_Institution :
Florida Univ., Gainesville, FL, USA
Abstract :
Multimedia document archiving has become an important problem as the volume of multimedia documents grows, and audio/video indexing plays a crucial role in it. Most current audio indexing systems combine speech recognition with information retrieval. A large vocabulary continuous speech recognition (LVCSR) system first produces time-aligned transcripts of the given collection of speech. Information retrieval techniques are then applied to these recognized transcripts to identify locations in the text that are relevant to the search request. In this paper, we present the technical details of an audio indexing system for a specific application of the National Library of Medicine.
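The two-stage approach summarized in the abstract (time-aligned transcripts, then retrieval over them) can be illustrated with a minimal sketch. This is not the system described in the paper: the transcript data, the function names `build_index` and `search`, and the exact-match lookup are all assumptions made for illustration, standing in for real LVCSR output and a full information retrieval engine.

```python
from collections import defaultdict

# Hypothetical LVCSR output: (word, start_seconds, end_seconds).
# A real recognizer would produce this from the audio channel.
transcript = [
    ("the", 0.0, 0.2),
    ("patient", 0.2, 0.7),
    ("received", 0.7, 1.2),
    ("insulin", 1.2, 1.8),
    ("therapy", 1.8, 2.3),
    ("insulin", 5.0, 5.6),
    ("dosage", 5.6, 6.1),
]


def build_index(words):
    """Map each lowercased word to the start times at which it was spoken."""
    index = defaultdict(list)
    for word, start, _end in words:
        index[word.lower()].append(start)
    return index


def search(index, query):
    """Return (start_time, term) pairs for every query term found, in time order."""
    hits = []
    for term in query.lower().split():
        hits.extend((start, term) for start in index.get(term, []))
    return sorted(hits)


index = build_index(transcript)
results = search(index, "insulin dosage")
for start, term in results:
    print(f"{term!r} spoken at {start:.1f} s")
```

Each hit carries a timestamp, so a player could seek directly to the matching point in the audio or video. Exact term matching is used here only for brevity; a practical system would also need ranking and some tolerance for recognition errors.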
Keywords :
audio signals; information retrieval; information retrieval systems; multimedia systems; natural languages; speech processing; speech recognition; video signals; vocabulary; audio channel; audio indexing; large vocabulary continuous speech recognition system; multimedia document archiving; video indexing; Engines; Indexing; Information retrieval; Libraries; Multimedia systems; Signal processing; Speech processing; Speech recognition; Text recognition; Vocabulary
Conference_Title :
Signal Processing and Information Technology, 2004. Proceedings of the Fourth IEEE International Symposium on
Print_ISBN :
0-7803-8689-2
DOI :
10.1109/ISSPIT.2004.1433790