DocumentCode
3547125
Title
Heterogeneous information saliency features´ fusion approach for machine´s environment sounds based awareness
Author
Jingyu Wang ; Ke Zhang ; Madani, Kurash ; Sabourin, Christophe
Author_Institution
Sch. of Astronaut., Northwestern Polytech. Univ., Xi´an, China
fYear
2013
fDate
2-4 Nov. 2013
Firstpage
197
Lastpage
205
Abstract
Human beings are more intelligent in dealing with sound which occurred in everyday life than robots or other kind of unmanned ground vehicles because of the instinct of "sense" or "awareness", which is an ability to distinguish the most salient sound, object or events in the surrounding environment. Inspired by the biological acoustic awareness of human hearing system and the visual saliency talent of human vision, a heterogeneous information saliency feature fusion (HISFF) approach which simulates human awareness of environment sound for machine\´s awareness is proposed in this paper. The sound signal is visualized by using the Short-Time Fourier Transform (STFT) algorithm in order to convert the acoustic saliency into visual saliency, and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the human acoustic awareness. The proposed HISFF approach is tested by using the environment sound data which collected from the real world of both indoor and outdoor environment. The results show that this approach is able to extract the saliency signal from both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for machine\´s environment sounds based awareness.
Keywords
Fourier transforms; acoustic signal processing; artificial intelligence; control engineering computing; hearing; robots; sensor fusion; visual perception; HISFF approach; Mel-frequency Cepstrum coefficient; acoustic saliency; biological acoustic awareness; heterogeneous information saliency feature fusion; human acoustic awareness; human hearing system; human vision; indoor environment sound; machine environment sound based awareness; outdoor environment sound; short-time Fourier transform; visual saliency; Equations; Feature extraction; Mathematical model; Mel frequency cepstral coefficient; Robots; Spectrogram; MFCC; STFT algorithm; environment sound signal; heterogeneous information; machine´s awareness; saliency feature fusion; spectrogram;
fLanguage
English
Publisher
ieee
Conference_Titel
Awareness Science and Technology and Ubi-Media Computing (iCAST-UMEDIA), 2013 International Joint Conference on
Conference_Location
Aizuwakamatsu
Type
conf
DOI
10.1109/ICAwST.2013.6765433
Filename
6765433
Link To Document