DocumentCode :
2923619
Title :
Speech recognition in the Informedia Digital Video Library: uses and limitations
Author :
Hauptmann, Alexander G.
Author_Institution :
Dept. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
1995
fDate :
5-8 Nov 1995
Firstpage :
288
Lastpage :
294
Abstract :
In principle, speech recognition technology can make any spoken data useful for library indexing and retrieval. The paper describes the Informedia Digital Video Library project and discusses how speech recognition is used for transcript creation from video, alignment with hand-generated transcripts, query interface and audio paragraph segmentation. The results show that speech recognition accuracy varies dramatically depending on the quality and type of data used. Our information retrieval experiments also show that reasonable recall and precision can be obtained with moderate speech recognition accuracy. Finally, we discuss some active areas of speech research relevant to the digital video library problem.
Keywords :
information retrieval; information services; interactive video; natural language interfaces; speech recognition; Informedia Digital Video Library; audio paragraph segmentation; hand-generated transcripts; information retrieval experiments; library indexing; moderate speech recognition accuracy; query interface; speech recognition accuracy; spoken data; transcript creation; Frequency; Humans; Image color analysis; Image segmentation; Indexing; Software libraries; Speech analysis; System testing; Text analysis
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Tools with Artificial Intelligence, 1995. Proceedings., Seventh International Conference on
Conference_Location :
Herndon, VA
ISSN :
1082-3409
Print_ISBN :
0-8186-7312-5
Type :
conf
DOI :
10.1109/TAI.1995.479616
Filename :
479616