مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech recognition in the Informedia Digital Video Library: uses and limitations

DocumentCode :

2923619

Title :

Speech recognition in the Informedia Digital Video Library: uses and limitations

Author :

Hauptmann, Alexander G.

Author_Institution :

Dept. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear :

1995

fDate :

5-8 Nov 1995

Firstpage :

288

Lastpage :

294

Abstract :

In principle, speech recognition technology can make any spoken data useful for library indexing and retrieval. The paper describes the Informedia Digital Video Library project and discusses how speech recognition is used for transcript creation from video, alignment with hand-generated transcripts, query interface and audio paragraph segmentation. The results show that speech recognition accuracy varies dramatically depending on the quality and type of data used. Our information retrieval experiments also show that reasonable recall and precision can be obtained with moderate speech recognition accuracy. Finally we discuss some active areas of speech research relevant to the digital video library problem

Keywords :

information retrieval; information services; interactive video; natural language interfaces; speech recognition; Informedia Digital Video Library; audio paragraph segmentation; hand-generated transcripts; information retrieval experiments; library indexing; moderate speech recognition accuracy; query interface; speech recognition; speech recognition accuracy; spoken data; transcript creation; Frequency; Humans; Image color analysis; Image segmentation; Indexing; Software libraries; Speech analysis; Speech recognition; System testing; Text analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Tools with Artificial Intelligence, 1995. Proceedings., Seventh International Conference on

Conference_Location :

Herndon, VA

ISSN :

1082-3409

Print_ISBN :

0-8186-7312-5

Type :

conf

DOI :

10.1109/TAI.1995.479616

Filename :

479616

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2923619