Title :
Using naïve text queries for robust audio information retrieval
Author :
Kim, Samuel ; Georgiou, Panayiotis ; Narayanan, Shrikanth ; Sundaram, Shiva
Author_Institution :
Signal Analysis & Interpretation Lab. (SAIL), Univ. of Southern California, Los Angeles, CA, USA
Abstract :
The goal of this work is to build an audio information retrieval system which provides users with flexibility in formulating their queries: from audio examples to naïve text. Specifically, the focus of this paper is on using naïve text to create input queries describing the desired information of the users. Using naïve text queries, however, raises interoperability issues between annotation and retrieval processes due to the wide variety of available audio descriptions. In this paper, we propose an intermediate audio description layer (iADL) to solve the interoperability issues between the annotation and retrieval processes. The iADL comprises two axes corresponding to semantic and onomatopoeic descriptions based on human-to-human communication experiments on how humans express sounds verbally. Various text modeling schemes, such as latent semantic analysis (LSA) and latent topic models, are utilized to transform the naïve text onto the proposed iADL.
Keywords :
audio signal processing; information retrieval; open systems; text analysis; audio descriptions; human-to-human communication experiments; intermediate audio description layer; interoperability issues; latent semantic analysis; naïve text queries; onomatopoeic descriptions; robust audio information retrieval; semantic descriptions; topic model; Audio databases; Data mining; Humans; Information retrieval; Laboratories; Natural languages; Robustness; Signal processing; Usability; Vocabulary; audio descriptions; audio information retrieval; naïve text query; out-of-vocabulary problem;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5496235