DocumentCode :
3480154
Title :
An audio representation for content based retrieval
Author :
Melih, Kathy ; Gonzalez, Ruben ; Ogunbona, Philip
Author_Institution :
Sch. of Inf. Technol., Griffith Univ., Brisbane, Qld., Australia
Volume :
1
fYear :
1997
fDate :
4-4 Dec. 1997
Firstpage :
207
Abstract :
Despite the increasing interest in multimedia data retrieval, audio data has received little attention. This is due not to a lack of interest but rather to unique difficulties posed by the medium. In particular, existing unstructured audio representations do not easily lend themselves to content based retrieval, and especially not to browsing. This paper aims to address this oversight by developing an audio representation that provides direct support for browsing and content based retrieval. This support is the result of a structured representation based on psychoacoustic principles, in which salient attributes of audio are directly accessible. In addition, the representation is compact, thus addressing the requirement for minimisation of storage.
Keywords :
acoustic signal processing; audio signals; multimedia computing; music; query processing; speech processing; audio representation; browsing; content based retrieval; multimedia data retrieval; music; noise; psychoacoustic principles; silence; speech; storage minimisation; structured representation; Audio recording; Content based retrieval; Data mining; Indexing; Information retrieval; Information technology; Music information retrieval; Psychology; Speech; Video recording;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
Conference_Location :
Brisbane, Qld., Australia
Print_ISBN :
0-7803-4365-4
Type :
conf
DOI :
10.1109/TENCON.1997.647293
Filename :
647293