Title :
Relating audio-visual events caused by multiple movements: in the case of entire object movement
Author :
Chen, Jinji ; Mukai, Toshiharu ; Takeuchi, Yoshinori ; Matsumoto, Tetsuya ; Kudo, Hiroaki ; Yamamura, Tsuyoshi ; Ohnishi, Noboru
Author_Institution :
CIAIR, Nagoya Univ., Japan
Abstract :
Relating audio-visual events is important for constructing for an artificial intelligent system, which can acquire the audio-visual knowledge of movement through active observation without teaching. This paper proposes a method for relating multiple audiovisual events observed by a camera and a microphone according to general laws without object-specific knowledge (including the case of entire object movement). As corresponding cues, we use Gestalt´s grouping law; simultaneity of the occurrence of the sound and the change in movement or the same motion starting, similarity of repetition between sound and movement. Based on the correlation coefficient between auditory and visual sequence, the component of frequency at sound onset is related to the short-term space-time invariants (STSTI) of movement. We experimented in the real environment and obtained satisfactory results showing the effectiveness of the proposed method.
Keywords :
acoustic signal processing; computer vision; image motion analysis; sensor fusion; active observation; artificial intelligent system; audio-visual events; audio-visual knowledge; auditory sequence; correlation coefficient; event correspondence; grouping law; object-specific knowledge; occurrence simultaneity; repetition similarity; sensor fusion; short-term space-time invariants; visual sequence; Auditory system; Cameras; Computer aided software engineering; Control systems; Education; Layout; Mouth; Postal services; Sensor fusion; Speech;
Conference_Titel :
Information Fusion, 2002. Proceedings of the Fifth International Conference on
Conference_Location :
Annapolis, MD, USA
Print_ISBN :
0-9721844-1-4
DOI :
10.1109/ICIF.2002.1021153