DocumentCode :
417784
Title :
Audio content description with wavelets and neural nets
Author :
Rein, Stephan ; Reisslein, Martin ; Sikora, Thomas
Author_Institution :
Tech. Univ. Berlin, Germany
Volume :
4
fYear :
2004
fDate :
17-21 May 2004
Abstract :
Precision audio content description is one of the key components of next generation Internet multimedia search machines. We examine the usability of a combination of 39 different wavelets and three different types of neural nets for precision audio content description. More specifically, we develop a novel wavelet dispersion measure that is computed from the ranks of the obtained wavelet coefficients. Our dispersion measure, in conjunction with a probabilistic radial basis neural network trained on only three independent example sets, obtains a success rate of approximately 78% in identifying unknown complex classical music movements.
Keywords :
Internet; audio signal processing; identification; learning (artificial intelligence); multimedia communication; music; radial basis function networks; search engines; wavelet transforms; independent example sets; multimedia search machines; neural nets; neural network training; next generation Internet; precision audio content description; probabilistic radial basis neural network; unknown classical music identification; usability; wavelet coefficient rank; wavelet dispersion measure; wavelets; Audio recording; Content based retrieval; Dispersion; Internet; Music information retrieval; Neural networks; Performance evaluation; Usability; Wavelet coefficients; World Wide Web;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326833
Filename :
1326833