Regional Information Center for Science and Technology - Video indexing using speech recognition techniques in audio channel - preliminary system design

DocumentCode :

3240882

Title :

Video indexing using speech recognition techniques in audio channel - preliminary system design

Author :

Gu, Lingyun

Author_Institution :

Florida Univ., Gainesville, FL, USA

fYear :

2004

fDate :

18-21 Dec. 2004

Firstpage :

342

Lastpage :

345

Abstract :

Multimedia document archiving is an important and interesting issue given the current growth of multimedia documents. Among the different types of archiving, audio/video indexing plays a crucial role. Most current audio indexing systems use a combination of speech recognition and information retrieval. A large vocabulary continuous speech recognition (LVCSR) system is used to produce time-aligned transcripts of the given collection of speech. Information retrieval techniques are then applied to these recognized transcripts to identify locations in the text that are relevant to the search request. In this paper, we introduce the technical details of an audio indexing system for a specific application of the National Library of Medicine.

Keywords :

audio signals; information retrieval; information retrieval systems; multimedia systems; natural languages; speech processing; speech recognition; video signals; vocabulary; audio channel; audio indexing; information retrieval; large vocabulary continuous speech recognition system; multimedia document archiving; video indexing; Engines; Indexing; Information retrieval; Libraries; Multimedia systems; Signal processing; Speech processing; Speech recognition; Text recognition; Vocabulary;

fLanguage :

English

Publisher :

IEEE

Conference_Titel :

Signal Processing and Information Technology, 2004. Proceedings of the Fourth IEEE International Symposium on

Print_ISBN :

0-7803-8689-2

Type :

conf

DOI :

10.1109/ISSPIT.2004.1433790

Filename :

1433790

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3240882