Title :
Speech recognition in the Informedia Digital Video Library: uses and limitations
Author :
Hauptmann, Alexander G.
Author_Institution :
Dept. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
In principle, speech recognition technology can make any spoken data useful for library indexing and retrieval. The paper describes the Informedia Digital Video Library project and discusses how speech recognition is used for transcript creation from video, alignment with hand-generated transcripts, query interface and audio paragraph segmentation. The results show that speech recognition accuracy varies dramatically depending on the quality and type of data used. Our information retrieval experiments also show that reasonable recall and precision can be obtained with moderate speech recognition accuracy. Finally we discuss some active areas of speech research relevant to the digital video library problem
Keywords :
information retrieval; information services; interactive video; natural language interfaces; speech recognition; Informedia Digital Video Library; audio paragraph segmentation; hand-generated transcripts; information retrieval experiments; library indexing; moderate speech recognition accuracy; query interface; speech recognition; speech recognition accuracy; spoken data; transcript creation; Frequency; Humans; Image color analysis; Image segmentation; Indexing; Software libraries; Speech analysis; Speech recognition; System testing; Text analysis;
Conference_Titel :
Tools with Artificial Intelligence, 1995. Proceedings., Seventh International Conference on
Conference_Location :
Herndon, VA
Print_ISBN :
0-8186-7312-5
DOI :
10.1109/TAI.1995.479616