DocumentCode :
683714
Title :
Audio Feature and Classifier Analysis for Efficient Recognition of Environmental Sounds
Author :
Okuyucu, C. ; Sert, M. ; Yazici, Adnan
Author_Institution :
Adv. Diagnostic Imaging Dept., Philips Med. Syst. Int. B.V, Best, Netherlands
fYear :
2013
fDate :
9-11 Dec. 2013
Firstpage :
125
Lastpage :
132
Abstract :
Environmental sounds (ES) have different characteristics, such as unstructured nature and typically noise-like and flat spectrums, which make recognition task difficult compared to speech or music sounds. Here, we perform an exhaustive feature and classifier analysis for the recognition of considerably similar ES categories and propose a best representative feature to yield higher recognition accuracy. In the experiments, thirteen (13) ES categories, namely emergency alarm, car horn, gun, explosion, automobile, helicopter, water, wind, rain, applause, crowd, and laughter are detected and tested based on eleven (11) audio features (MPEG-7 family, ZCR, MFCC, and combinations) by using the HMM and SVM classifiers. Extensive experiments have been conducted to demonstrate the effectiveness of these joint features for ES classification. Our experiments show that, the joint feature set ASFCS-H (Audio Spectrum Flatness, Centroid, Spread, and Audio Harmonicity) is the best representative feature set with an average F-measure value of 80.6%.
Keywords :
audio signal processing; feature extraction; hidden Markov models; speech recognition; support vector machines; ASFCS-H; HMM; MFCC; MPEG-7 family; SVM classifiers; ZCR; applause; audio features; audio spectrum flatness centroid spread and audio harmonicity; automobile; car horn; classifier analysis; crowd; emergency alarm; environmental sound recognition; explosion; feature analysis; gun; helicopter; laughter; rain; unstructured nature; water; wind; Explosions; Helicopters; Hidden Markov models; Mel frequency cepstral coefficient; Rain; Support vector machines; Transform coding; Environmental sound classification; HMM; MFCC; MPEG-7; SVM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia (ISM), 2013 IEEE International Symposium on
Conference_Location :
Anaheim, CA
Print_ISBN :
978-0-7695-5140-1
Type :
conf
DOI :
10.1109/ISM.2013.29
Filename :
6746781
Link To Document :
بازگشت