DocumentCode
312046
Title
Keyword spotting enhancement for video soundtrack indexing
Author
Gelin, Philippe ; Wellekens, Chris J.
Author_Institution
Dept. of Multimedia Commun., Inst. Eurecom, Sophia Antipolis, France
Volume
2
fYear
1996
fDate
3-6 Oct 1996
Firstpage
586
Abstract
Multimedia databases contain an increasing number of videos that are not easily semantically accessed. Among the useful indices that can be extracted from the soundtrack, the presence of a keyword at some place plays a prominent role. This paper deals with the specificities of such a keyword spotter and the enhancements brought to our previous technique (1996) based on frame labeling. To be useful, such a keyword spotter has to be speaker-independent. Moreover, it has to be able to detect any word from an open vocabulary. This directly implies the use of a phonemic representation of the word. These constraints usually lead to an excessively time-consuming tool. The division of the indexing process into two parts-the first one off-line, the second one at query time-allows a faster response
Keywords
audio coding; audio recording; audio systems; indexing; multimedia computing; speech coding; speech recognition; video recording; visual databases; vocabulary; frame labeling; multimedia databases; off-line process; open vocabulary; phonemic representation; query time; response speed; semantic access; speaker-independent keyword spotting; video soundtrack indexing; word detection; Hidden Markov models; Indexing; Labeling; Lattices; Loudspeakers; Multimedia communication; Multimedia databases; Speech; Viterbi algorithm; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607429
Filename
607429
Link To Document