Title :
Audio Clips Retrieval Using Anchor Reference Space and Latent Semantic Analysis
Author :
Biatov, Konstantin
Author_Institution :
Fraunhofer IAIS, St. Augustin, Germany
Abstract :
This paper describes a technique for audio clips retrieval. The audio clips are modeled using a common universal codebook. The codebook is based on a bag-of-features (BOF). The features extracted from all clips are grouped into clusters using the k-means algorithm. The individual audio clips are modeled by the normalized distribution of the numbers of cluster bins. The latent semantic indexing (LSI) is applied to the feature-audio clip matrix to represent the data in latent semantic space. Then the primary audio clip description is converted to the vector in anchor reference space. Each component of the anchor vector is a probabilistic similarity between this clip and the clip corresponding to the considered component. Then LSI is applied to new feature-audio clip matrix, mapping the data to the latent semantic space based on anchor representation. For audio retrieval the nearest-neighbor (NN) algorithm is exploited. The described algorithm demonstrates high retrieval performance.
Keywords :
audio coding; content-based retrieval; feature extraction; pattern clustering; anchor reference space; anchor representation; audio clip codebok; audio clips retrieval; audio content retrieval; data mapping; feature-audio clip matrix; features extraction; k-means clustering algorithm; latent semantic analysis; latent semantic indexing; nearest-neighbor algorithm; normalized distribution; Audio databases; Euclidean distance; Information retrieval; Large scale integration; Linear discriminant analysis; Matrix converters; Mel frequency cepstral coefficient; Music information retrieval; Principal component analysis; Speech;
Conference_Titel :
Multimedia, 2009. ISM '09. 11th IEEE International Symposium on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-5231-6
Electronic_ISBN :
978-0-7695-3890-7
DOI :
10.1109/ISM.2009.102