DocumentCode
2849420
Title
Audio Clips Retrieval Using Anchor Reference Space and Latent Semantic Analysis
Author
Biatov, Konstantin
Author_Institution
Fraunhofer IAIS, St. Augustin, Germany
fYear
2009
fDate
14-16 Dec. 2009
Firstpage
32
Lastpage
37
Abstract
This paper describes a technique for audio clip retrieval. The audio clips are modeled using a common universal codebook based on a bag-of-features (BOF) representation. The features extracted from all clips are grouped into clusters using the k-means algorithm. Each audio clip is modeled by the normalized distribution of its features over the cluster bins. Latent semantic indexing (LSI) is applied to the feature-audio clip matrix to represent the data in a latent semantic space. The primary audio clip description is then converted to a vector in an anchor reference space. Each component of the anchor vector is a probabilistic similarity between the clip and the anchor clip corresponding to that component. LSI is then applied to the new feature-audio clip matrix, mapping the data to a latent semantic space based on the anchor representation. For retrieval, the nearest-neighbor (NN) algorithm is used. The described algorithm demonstrates high retrieval performance.
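The pipeline summarized in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the author's implementation: the abstract does not say which probabilistic similarity is used, so the Bhattacharyya coefficient stands in for it; the LSI step is approximated by power iteration rather than a full SVD; only the second (anchor-space) LSI stage is shown; and all function names (`kmeans`, `histogram`, `anchor_vector`, `lsi_basis`, `build_index`, `retrieve`) are hypothetical.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def nearest(p, centers):
    """Index of the center closest to point p (Euclidean distance)."""
    return min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))

def kmeans(points, k, iters=20):
    """Plain Lloyd's k-means; evenly spaced initial centers keep it deterministic."""
    centers = [list(points[i * len(points) // k]) for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centers)].append(p)
        for j, g in enumerate(groups):
            if g:  # keep the old center if a cluster goes empty
                centers[j] = [sum(c) / len(g) for c in zip(*g)]
    return centers

def histogram(frames, centers):
    """Normalized count of frames per codebook bin (bag-of-features model)."""
    counts = [0.0] * len(centers)
    for f in frames:
        counts[nearest(f, centers)] += 1.0
    return [c / len(frames) for c in counts]

def bhattacharyya(p, q):
    """ASSUMED similarity between two normalized histograms, in [0, 1]."""
    return sum(math.sqrt(a * b) for a, b in zip(p, q))

def anchor_vector(h, anchor_hists):
    """One similarity value per anchor clip."""
    return [bhattacharyya(h, a) for a in anchor_hists]

def lsi_basis(rows, rank, iters=100):
    """Top singular directions of the clip-by-feature matrix `rows`, found by
    power iteration with deflation (a stand-in for a truncated SVD)."""
    dim = len(rows[0])
    basis = []
    for i in range(rank):
        v = [1.0 / (1 + i + j) for j in range(dim)]  # fixed start vector
        for _ in range(iters):
            proj = [dot(r, v) for r in rows]
            w = [sum(p * r[j] for p, r in zip(proj, rows)) for j in range(dim)]
            for b in basis:  # remove directions already found
                c = dot(w, b)
                w = [x - c * y for x, y in zip(w, b)]
            norm = math.sqrt(dot(w, w))
            if norm < 1e-12:  # matrix has fewer than `rank` directions
                return basis
            v = [x / norm for x in w]
        basis.append(v)
    return basis

def project(vec, basis):
    return [dot(vec, b) for b in basis]

def build_index(clips, k=4, rank=2):
    """clips: list of clips, each a list of feature vectors (e.g. per frame)."""
    frames = [f for clip in clips for f in clip]
    centers = kmeans(frames, k)
    hists = [histogram(c, centers) for c in clips]
    anchors = [anchor_vector(h, hists) for h in hists]
    basis = lsi_basis(anchors, rank)
    latent = [project(a, basis) for a in anchors]
    return centers, hists, basis, latent

def retrieve(query_frames, index):
    """Index of the stored clip nearest to the query in the latent space."""
    centers, hists, basis, latent = index
    q = project(anchor_vector(histogram(query_frames, centers), hists), basis)
    return min(range(len(latent)), key=lambda i: math.dist(q, latent[i]))
```

In this sketch every indexed clip serves as an anchor, so the anchor matrix is square. Calling `build_index` on a list of clips and then `retrieve` on the feature vectors of a query clip returns the position of the nearest stored clip. A real system would likely use a library SVD and a better k-means initialization.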
Keywords
audio coding; content-based retrieval; feature extraction; pattern clustering; anchor reference space; anchor representation; audio clip codebook; audio clips retrieval; audio content retrieval; data mapping; feature-audio clip matrix; features extraction; k-means clustering algorithm; latent semantic analysis; latent semantic indexing; nearest-neighbor algorithm; normalized distribution; Audio databases; Euclidean distance; Information retrieval; Large scale integration; Linear discriminant analysis; Matrix converters; Mel frequency cepstral coefficient; Music information retrieval; Principal component analysis; Speech;
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia, 2009. ISM '09. 11th IEEE International Symposium on
Conference_Location
San Diego, CA
Print_ISBN
978-1-4244-5231-6
Electronic_ISBN
978-0-7695-3890-7
Type
conf
DOI
10.1109/ISM.2009.102
Filename
5365285