Title :
Musical genre classification of MPEG-4 TwinVQ audio data
Author :
Kobayakawa, Michihiro ; Hoshi, Mamoru
Author_Institution :
Tokyo Metropolitan Coll. of Ind. Technol., Tokyo, Japan
Abstract :
We proposed a musical feature based on LSP (Line Spectrum Pair) parameters extracted directly from the bitstream of MPEG-4 TwinVQ audio data. Our key idea is to extract musical features from the information stored in the bitstream, without decoding it to audio signals. In this paper, we propose two musical features for musical genre classification of MPEG-4 TwinVQ audio data. To extract them, we focus on the LPC (Linear Predictive Coding) cepstrum and the LPC coefficients, both computed from the LSP parameters in the TwinVQ bitstream by inverting the encoding steps. The musical features based on the LPC cepstrum and on the LPC coefficients are computed with the Discrete Wavelet Transform (DWT). For musical genre classification, we use Discriminant Analysis (DA) as the classifier. We experimented on 2,196 pieces of TwinVQ audio data collected from 10 musical genres and evaluated the performance of both features. The correct classification rates were 79.7% for the LPC coefficient-based feature and 84.1% for the LPC cepstrum-based feature. Comparing the two, the experiments showed that the LPC cepstrum-based feature performed better for musical genre classification in the compressed domain of MPEG-4 TwinVQ audio.
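The abstract names two generic building blocks, the conversion of LPC coefficients to an LPC cepstrum and a Discrete Wavelet Transform. The following is a minimal sketch of both, not the authors' implementation. It assumes the predictor convention A(z) = 1 - sum_k a_k z^-k, omits the gain term c_0, and uses a single-level Haar wavelet purely for illustration, since the abstract does not state which wavelet, decomposition depth, or feature layout the paper uses. The function names are hypothetical.

```python
import math


def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC predictor coefficients a[0..p-1] (a_1..a_p) to cepstral
    coefficients c_1..c_n_ceps using the standard recursion
        c_n = a_n + sum_{k=1}^{n-1} (k/n) * c_k * a_{n-k},
    where a_n is taken as 0 for n > p.
    Assumption: all-pole model 1 / (1 - sum_k a_k z^-k); gain term omitted.
    """
    p = len(a)
    c = [0.0] * n_ceps
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        # Only terms with 1 <= n - k <= p contribute to the sum.
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k - 1] * a[n - k - 1]
        c[n - 1] = acc
    return c


def haar_dwt_level(x):
    """One level of the orthonormal Haar DWT on an even-length sequence.
    Returns (approximation, detail) coefficient lists.
    """
    if len(x) % 2 != 0:
        raise ValueError("sequence length must be even")
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail


# Sanity check: for a single-pole model with a_1 = a, c_n equals a**n / n.
cepstrum = lpc_to_cepstrum([0.5], 3)

# Hypothetical use: transform a short time series of one cepstral coefficient.
approx, detail = haar_dwt_level([1.0, 1.0, 1.0, 1.0])
```

In a pipeline like the one described, the recursion would be applied per frame after the LSP parameters have been converted to LPC coefficients, and the wavelet transform would then summarize how the coefficients vary. That ordering is an assumption drawn from the abstract's wording.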
Keywords :
audio coding; discrete wavelet transforms; music; MPEG-4 TwinVQ audio data; audio signals; discrete wavelet transform; discriminant analysis; encoding; inverse operations; line spectrum pair; linear predictive coding; musical features; musical genre classification; Audio compression; Cepstrum; Feature extraction; Rocks; Transform coding
Conference_Title :
Multimedia and Expo (ICME), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-61284-348-3
Electronic_ISBN :
1945-7871
DOI :
10.1109/ICME.2011.6012195