Title :
A neural network based audio content classification
Author :
Mitra, Vikramjit ; Wang, Chia-Jiu
Author_Institution :
Univ. of Maryland, College Park
fDate :
Oct. 30 2007-Nov. 2 2007
Abstract :
The emergence of digital music in the Internet calls for a reliable real-time tool to analyze and properly categorize them for the users. To incorporate content or genre queries in web searches, audio content analysis and classification is imperative. This paper proposes a set of audio content features and a parallel Neural Network architecture that addresses the task of automated content based audio classification. Feature sets based on signal periodicity, beat information, sub-band energy, Mel-frequency Cepstral coefficients and Wavelet transforms are proposed and each of the feature sets are individually analyzed for their pertinence in the proposed task. A parallel Multi-layered Perceptron network is proposed which offers a classification accuracy of 84.4% to distinguish between 6 different genres. The proposed architecture is compared with a Support Vector Machine based classifier and is found to perform superiorly than the later.
Keywords :
Internet; audio signal processing; cepstral analysis; multilayer perceptrons; neural net architecture; wavelet transforms; Internet calls; Mel-frequency Cepstral coefficients; Web search; audio content analysis; audio content classification; beat information; digital music; multilayered perceptron; neural network architecture; signal periodicity; subband energy; wavelet transforms; Cepstral analysis; Information analysis; Internet; Multilayer perceptrons; Neural networks; Signal analysis; Support vector machines; Wavelet analysis; Wavelet transforms; Web search;
Conference_Titel :
TENCON 2007 - 2007 IEEE Region 10 Conference
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-1272-3
Electronic_ISBN :
978-1-4244-1272-3
DOI :
10.1109/TENCON.2007.4428919