Title :
Audio content description with wavelets and neural nets
Author :
Rein, Stephan ; Reisslein, Martin ; Sikora, Thomas
Author_Institution :
Tech. Univ. Berlin, Germany
Abstract :
Precision audio content description is one of the key components of next generation Internet multimedia search machines. We examine the usability of a combination of 39 different wavelets and three different types of neural nets for precision audio content description. More specifically, we develop a novel wavelet dispersion measure that measures obtained ranks of wavelet coefficients. Our dispersion measure in conjunction with a probabilistic radial basis neural network trained by only three independent example sets obtains a success rate of approximately 78% in identifying unknown complex classical music movements.
Keywords :
Internet; audio signal processing; identification; learning (artificial intelligence); multimedia communication; music; radial basis function networks; search engines; wavelet transforms; independent example sets; multimedia search machines; neural nets; neural network training; next generation Internet; precision audio content description; probabilistic radial basis neural network; unknown classical music identification; usability; wavelet coefficient rank; wavelet dispersion measure; wavelets; Audio recording; Content based retrieval; Dispersion; Internet; Music information retrieval; Neural networks; Performance evaluation; Usability; Wavelet coefficients; World Wide Web;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326833