DocumentCode
1573013
Title
Extracting High Level Semantics by Means of Speech, Audio, and Image Primitives in Surveillance Applications
Author
Goldmann, L. ; Samour, A. ; Karaman, Mustafa ; Sikora, Thomas
fYear
2006
Firstpage
2397
Lastpage
2400
Abstract
Traditional surveillance systems are usually based on visual information only. With emerging multimedia analysis techniques, interest is shifting towards systems that incorporate multiple sensors and different modalities, which leads to new ways of analyzing multimedia data and to more sophisticated applications. This paper briefly reviews the ideas behind traditional surveillance systems and outlines current research interests in this domain. It then focuses on the typical structure, goals, and applications of multimedia surveillance systems. These issues are illustrated by short descriptions of selected analysis steps of such a system, which is currently under development. Some experimental results are given to illustrate the extracted semantics and to assess the performance of the individual steps.
Keywords
audio signal processing; image recognition; multimedia systems; speaker recognition; surveillance; audio primitives; image primitives; multimedia data analysis; multimedia surveillance system; multimodal analysis; semantic extraction; smart room technology; speech primitives; video surveillance; Computer vision; Data analysis; Data mining; Information analysis; Multimedia systems; Pattern recognition; Smart cameras; Speech; Surveillance; Vehicles; multimedia surveillance; multimodal analysis; smart room technologies;
fLanguage
English
Publisher
IEEE
Conference_Titel
Image Processing, 2006 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1522-4880
Print_ISBN
1-4244-0480-0
Type
conf
DOI
10.1109/ICIP.2006.312945
Filename
4107050