Title :
Content-based audio classification and retrieval using the nearest feature line method
Author_Institution :
Microsoft Res., Beijing, China
fDate :
9/1/2000 12:00:00 AM
Abstract :
A method is presented for content-based audio classification and retrieval. It is based on a new pattern classification method called the nearest feature line (NFL). In the NFL, information provided by multiple prototypes per class is explored. This contrasts to the nearest neighbor (NN) classification in which the query is compared to each prototype individually. Regarding audio representation, perceptual and cepstral features and their combinations are considered. Extensive experiments are performed to compare various classification methods and feature sets. The results show that the NFL-based method produces consistently better results than the NN-based and other methods. A system resulting from this work has achieved the error rate of 9.78%, as compared to that of 18.34% of a compelling existing system, as tested on a common audio database
Keywords :
audio signal processing; content-based retrieval; database management systems; music; pattern classification; signal classification; NFL; audio representation; cepstral features; content-based audio classification; multiple prototypes; nearest feature line method; perceptual feature; retrieval; Application software; Audio databases; Cepstral analysis; Content based retrieval; Multimedia databases; Nearest neighbor searches; Neural networks; Pattern recognition; Prototypes; Roentgenium;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on