DocumentCode
706060
Title
Using auditory saliency to understand complex auditory scenes
Author
Duangudom, Varinthira ; Anderson, David V.
Author_Institution
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
fYear
2007
fDate
3-7 Sept. 2007
Firstpage
1206
Lastpage
1210
Abstract
In this paper, we present a computational model for predicting pre-attentive, bottom-up auditory saliency. The model determines perceptually what in a scene stands out to observers and can be used to determine what part of a complex auditory scene is most important. The vision equivalency of this is visual saliency as defined by Koch and others [1]. The model is based on inhibition of features obtained from auditory spectro-temporal receptive fields (STRFs) and produces results that match well with preliminary psychoacoustic experiments. The model does well in predicting what is salient for some common auditory examples and there is a strong correlation between scenes chosen as salient by the model and scenes that human subjects selected as salient.
Keywords
audio signal processing; feature extraction; hearing; speech processing; STRF; auditory saliency; auditory spectrotemporal receptive fields; complex auditory scenes; visual saliency; Computational modeling; Correlation; Modulation; Observers; Psychoacoustic models; Spectrogram; Time-frequency analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2007 15th European
Conference_Location
Poznan
Print_ISBN
978-839-2134-04-6
Type
conf
Filename
7098996
Link To Document