Title :
Unsupervised anchor space generation for similarity measurement of general audio
Author :
Lu, Lie ; Hanjalic, Alan
Author_Institution :
Microsoft Res. Asia, Beijing
Date :
March 31 2008-April 4 2008
Abstract :
Reliably measuring similarity between audio clips is critical to many applications. In contrast to the conventional approach of measuring audio similarity directly on low-level features, this paper computes similarity in an anchor space. Each dimension of such a space corresponds to a semantic category (anchor). Mapping an audio clip onto this space yields a vector that gives the membership probability of the clip with respect to each semantic category. The more similar the mappings of two audio clips, the more similar the clips are taken to be. An anchor space is typically generated in a supervised fashion, but a supervised approach is infeasible in many realistic scenarios where the semantics of the audio content are too diverse or simply unknown a priori. We therefore propose an unsupervised approach to anchor space generation. In this approach, spectral clustering groups audio clips with similar low-level features, and the resulting clusters are adopted as semantic categories. Using this semantic space for audio similarity computation yields a considerable accuracy improvement (7% in mAP) in an audio retrieval system, compared with the conventional approach based on low-level features.
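The mapping and comparison steps described in the abstract can be illustrated with a minimal Python sketch. It assumes the anchors have already been obtained (for example, as centroids of clusters found beforehand) and it does not implement the spectral clustering step. The abstract does not specify the membership model or the similarity measure, so the softmax over negative squared distances, the cosine similarity, the function names, and the toy data below are all assumptions made for illustration, not the authors' method.

```python
import math


def anchor_vector(features, anchors, temperature=1.0):
    """Map a low-level feature vector to membership probabilities over anchors.

    Assumption: membership is a softmax over negative squared Euclidean
    distances to anchor centroids. The abstract does not state the model.
    """
    d2 = [sum((f - a) ** 2 for f, a in zip(features, anchor)) for anchor in anchors]
    d_min = min(d2)  # shift by the minimum distance for numerical stability
    weights = [math.exp(-(d - d_min) / temperature) for d in d2]
    total = sum(weights)
    return [w / total for w in weights]


def cosine_similarity(p, q):
    """Cosine similarity between two anchor-space vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm


# Hypothetical anchors, standing in for cluster centroids in a 2-D feature space.
anchors = [[0.0, 0.0], [5.0, 5.0], [0.0, 5.0]]

# Hypothetical low-level feature vectors for three clips.
clip_a = [0.2, 0.1]
clip_b = [0.1, 0.3]
clip_c = [4.8, 5.1]

va, vb, vc = (anchor_vector(c, anchors) for c in (clip_a, clip_b, clip_c))
sim_ab = cosine_similarity(va, vb)  # both clips lie near the first anchor
sim_ac = cosine_similarity(va, vc)  # the clips lie near different anchors
```

In this toy setup, clips whose membership vectors concentrate on the same anchor receive a higher similarity than clips that fall near different anchors, which is the behaviour the abstract describes.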
Keywords :
audio signal processing; document handling; feature extraction; information retrieval; pattern clustering; probability; unsupervised learning; audio clip; audio document similarity measurement; audio retrieval system; semantic category; spectral clustering; unsupervised anchor space generation; Asia; Content based retrieval; Event detection; Extraterrestrial measurements; Fluctuations; Layout; Signal analysis; Signal processing; Space technology; anchor space; audio content analysis; audio segmentation; audio similarity computation
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517544