Title :
An Empirical Study of Feature Extraction Methods for Audio Classification
Abstract :
With the growing popularity of video sharing web sites and the increasing use of consumer-level video capture devices, new algorithms are needed for intelligent searching and indexing of such data. The audio from these video streams is particularly challenging due to its low quality and high variability. Here, we perform a broad empirical study of features used for intelligent audio processing. We perform experiments on a dataset of 200 consumer videos over which we attempt to detect 10 semantic audio concepts.
Keywords :
Web sites; audio signal processing; feature extraction; video signal processing; video streaming; audio classification; consumer-level video capture devices; feature extraction methods; intelligent audio processing; video sharing Web sites; video streams; Conferences; Feature extraction; Mel frequency cepstral coefficient; Semantics; Signal processing algorithms; Speech; Training; Audio Classification; Audio Event; Audio Features; Consumer Video; Gammatone; MFCC; Wavelet;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.1111