DocumentCode :
3240882
Title :
Video indexing using speech recognition techniques in audio channel - preliminary system design
Author :
Gu, Lingyun
Author_Institution :
Florida Univ., Gainesville, FL, USA
fYear :
2004
fDate :
18-21 Dec. 2004
Firstpage :
342
Lastpage :
345
Abstract :
Multimedia document archiving is an important issue given the current growth of multimedia documents. Among the different types of archiving, audio/video indexing plays a crucial role. Most current audio indexing systems combine speech recognition with information retrieval: a large vocabulary continuous speech recognition (LVCSR) system produces time-aligned transcripts of the given collection of speech, and information retrieval techniques are then applied to these recognized transcripts to identify locations in the text that are relevant to the search request. In this paper, we introduce the technical details of an audio indexing system for a specific application of the National Library of Medicine.
Keywords :
audio signals; information retrieval; information retrieval systems; multimedia systems; natural languages; speech processing; speech recognition; video signals; vocabulary; audio channel; audio indexing; information retrieval; large vocabulary continuous speech recognition system; multimedia document archiving; video indexing; Engines; Indexing; Information retrieval; Libraries; Multimedia systems; Signal processing; Speech processing; Speech recognition; Text recognition; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Signal Processing and Information Technology, 2004. Proceedings of the Fourth IEEE International Symposium on
Print_ISBN :
0-7803-8689-2
Type :
conf
DOI :
10.1109/ISSPIT.2004.1433790
Filename :
1433790