Title :
Content-based methods for the management of digital music
Author_Institution :
AT&T Labs., Cambridge, UK
Abstract :
The literature on content-based music retrieval has largely finessed acoustic issues by using MIDI format music. This paper however considers content-based classification and retrieval of a typical (MPEG layer III) digital music archive. Two statistical techniques are investigated and appraised. Gaussian mixture modelling performs well with an accuracy of 92% on a music classification task. A tree-based vector quantization scheme offers marginally worse performance in a faster, scalable framework. Good results are also reported for music retrieval-by-similarity using the same techniques. Mel-frequency cepstral coefficients parameterize the audio well, though are slow to compute from the compressed domain. A new parameterization (MP3CEP), based on a partial decompression of MPEG layer III audio, is therefore proposed to facilitate music processing at user-interactive speeds. Overall, the techniques described provide useful tools in the management of a typical digital music library
Keywords :
audio coding; classification; code standards; content-based retrieval; music; statistical analysis; trees (mathematics); vector quantisation; Gaussian mixture modelling; MPEG; audio; content-based classification; content-based music retrieval; digital music management; mel-frequency cepstral coefficients; music retrieval-by-similarity; parameterization; scalable framework; statistical techniques; tree-based vector quantization; Appraisal; Cepstral analysis; Content based retrieval; Content management; Laboratories; Multiple signal classification; Music information retrieval; Software libraries; Transform coding; Vector quantization;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.859334