Title :
Music Genre Classification of MPEG AAC Audio Data
Author :
Kobayakawa, Michihiro ; Hoshi, Mamoru ; Yuzawa, Koichiro
Author_Institution :
Tokyo Metropolitan Coll. of Ind. Technol., Tokyo, Japan
Abstract :
In this paper, we propose a musical feature extracted from the bit stream of AAC (Advanced Audio Coding) compressed audio data without decoding to audio signals. We focus on the spectral data which are stored in the bit stream for representing the flatten MDCT (Modified Discrete Cosine Transform) of an audio signal. For computing the musical feature, we extract the spectral data and apply the Discrete Wavelet Transform (DWT) to the extracted spectral data. For musical genre classification, we use the discriminant analysis as a classifier. We experimented on 1,498 AAC compressed audio data collected from 10 musical genres and evaluated the performance of the musical feature. We got the maximum correct ratios 81.24%. The experiments showed that the musical feature based on the spectral data in the bit stream had good performance for genre classification in the MPEG-4 AAC compressed domain.
Keywords :
audio coding; discrete cosine transforms; discrete wavelet transforms; feature extraction; music; signal classification; spectral analysis; AAC compressed audio data bitstream; AAC compressed audio data collection; DWT; MPEG AAC audio data; MPEG-4 AAC compressed domain; advanced audio coding; audio signal; discrete wavelet transform; discriminant analysis; flatten MDCT representation; maximum correct ratios; modified discrete cosine transform; music genre classification; musical feature extraction; musical feature performance evaluation; spectral data extraction; Data mining; Discrete wavelet transforms; Feature extraction; Music; Reliability; Transform coding; Vectors; MPEG AAC audio; Music Genre Classification; music information retrieval;
Conference_Titel :
Multimedia (ISM), 2014 IEEE International Symposium on
Conference_Location :
Taichung
Print_ISBN :
978-1-4799-4312-8
DOI :
10.1109/ISM.2014.25