DocumentCode
2140972
Title
Automatic low-dimensional analysis of audio databases
Author
Arias, Joss Anibal ; Andre-Obrecht, Regine ; Farinas, Jerome
Author_Institution
IRIT/SAMOVA, Univ. Paul Sabatier, Toulouse
fYear
2008
fDate
18-20 June 2008
Firstpage
556
Lastpage
559
Abstract
In this paper we present an approach designed to map variable size audio sequences into fixed-length vectors, useful to discover contents of audio databases. First, we model standard audio parameters with Gaussian mixture models (GMM). Then, symmetric Kullback-Leiber divergences between models are approximated with a Monte-Carlo method. We use these statistical dissimilarities to find a low-dimensional representation of each audio sequence through Multidimensional scaling (MDS) algorithm. Vectors in low-dimensional spaces are then easily explored with kernel and clustering methods. Experiments carried out in different kind of audio databases (music, speakers and languages) show good potential of the proposed approach and provide a framework for more challenging applications.
Keywords
Gaussian processes; Monte Carlo methods; audio databases; pattern clustering; Gaussian mixture models; Monte-Carlo method; audio databases; automatic low-dimensional analysis; fixed-length vectors; multidimensional scaling algorithm; statistical dissimilarities; variable size audio sequences; Audio databases; Data analysis; Feature extraction; Hidden Markov models; Kernel; Machine learning; Machine learning algorithms; Multidimensional systems; Natural languages; Spatial databases; Unsupervised learning; audio databases; machine learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Content-Based Multimedia Indexing, 2008. CBMI 2008. International Workshop on
Conference_Location
London
Print_ISBN
978-1-4244-2043-8
Electronic_ISBN
978-1-4244-2044-5
Type
conf
DOI
10.1109/CBMI.2008.4564996
Filename
4564996
Link To Document