DocumentCode :
3547125
Title :
Heterogeneous information saliency features´ fusion approach for machine´s environment sounds based awareness
Author :
Jingyu Wang ; Ke Zhang ; Madani, Kurash ; Sabourin, Christophe
Author_Institution :
Sch. of Astronaut., Northwestern Polytech. Univ., Xi´an, China
fYear :
2013
fDate :
2-4 Nov. 2013
Firstpage :
197
Lastpage :
205
Abstract :
Human beings are more intelligent in dealing with sound which occurred in everyday life than robots or other kind of unmanned ground vehicles because of the instinct of "sense" or "awareness", which is an ability to distinguish the most salient sound, object or events in the surrounding environment. Inspired by the biological acoustic awareness of human hearing system and the visual saliency talent of human vision, a heterogeneous information saliency feature fusion (HISFF) approach which simulates human awareness of environment sound for machine\´s awareness is proposed in this paper. The sound signal is visualized by using the Short-Time Fourier Transform (STFT) algorithm in order to convert the acoustic saliency into visual saliency, and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the human acoustic awareness. The proposed HISFF approach is tested by using the environment sound data which collected from the real world of both indoor and outdoor environment. The results show that this approach is able to extract the saliency signal from both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for machine\´s environment sounds based awareness.
Keywords :
Fourier transforms; acoustic signal processing; artificial intelligence; control engineering computing; hearing; robots; sensor fusion; visual perception; HISFF approach; Mel-frequency Cepstrum coefficient; acoustic saliency; biological acoustic awareness; heterogeneous information saliency feature fusion; human acoustic awareness; human hearing system; human vision; indoor environment sound; machine environment sound based awareness; outdoor environment sound; short-time Fourier transform; visual saliency; Equations; Feature extraction; Mathematical model; Mel frequency cepstral coefficient; Robots; Spectrogram; MFCC; STFT algorithm; environment sound signal; heterogeneous information; machine´s awareness; saliency feature fusion; spectrogram;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Awareness Science and Technology and Ubi-Media Computing (iCAST-UMEDIA), 2013 International Joint Conference on
Conference_Location :
Aizuwakamatsu
Type :
conf
DOI :
10.1109/ICAwST.2013.6765433
Filename :
6765433
Link To Document :
بازگشت