Title :
Spoken information retrieval for multimedia databases
Author :
Salgado-Garza, Luis R. ; Nolazco-FIores, J.A. ; Díaz-López, Pablo D.
Author_Institution :
Dept. of Comput. Sci., ITESM, Monterrey, Mexico
Abstract :
Summary form only given. This document describes the realization of a spoken information retrieval system and its application to word search into indexed multimedia databases. The multimedia database is build from a multiformat set of text, audio and video documents. The whole archive collection is indexed using preprocessing techniques to produce transcripts and indexing software tools to catalog them. The system uses a Java-based distributed client-server architecture. A Java applet is used to capture the audio signal for a spoken query, then it is transmitted to a server where an automatic speech recognition (ASR) software is applied to convert the signal into a transcripted hypothesis. Later, a query tool process the transcript sentence along with the indexed multimedia database and a set of pointers to documents are generated. Finally, a Web page with links to the resulting documents, where queried words appear, is presented to the user.
Keywords :
Java; Web sites; audio signal processing; cataloguing; client-server systems; indexing; information retrieval systems; multimedia databases; software tools; speech recognition; Java applet; Web page; archive collection; audio document; audio signal; automatic speech recognition; cataloging; distributed client-server architecture; indexed multimedia databases; indexing software tools; multiformat set; query tool; spoken information retrieval; spoken query; text document; video document; word search; Automatic speech recognition; Computer science; Decoding; Humans; Indexing; Information retrieval; Internet; Java; Multimedia databases; Search engines;
Conference_Titel :
Computer Systems and Applications, 2005. The 3rd ACS/IEEE International Conference on
Print_ISBN :
0-7803-8735-X
DOI :
10.1109/AICCSA.2005.1387135