Title :
Source segmentation for structured audio
Author :
Melih, Kathy ; Gonzalez, Ruben
Author_Institution :
Sch. of Inf. Technol., Griffith Univ., Gold Coast, Qld., Australia
Abstract :
With the increasing demand for content based manipulation of ever growing stores of audio data and the emergence of MPEG-7 has come the need for structured audio representations. However, while the necessity of such a representation has been recognised and, to some extent, its essential features have been identified, its actual development and implementation have generally been relegated as problems for another time or person to solve. This paper attempts to address the shortfall by defining an audio structure that will allow content-based manipulation of audio at the level of audio objects. The paper then summarises the processes required to generate such a structure. Further, details are provided as to how the second level of this structure can be derived from a low-level perceptually based audio representation previously developed by the authors to satisfy the requirements at the lowest level of the audio structure. Finally, initial experimental results are presented
Keywords :
audio signal processing; MPEG-7; audio objects; content based manipulation; low-level perceptually based audio representation; source segmentation; structured audio representations; Auditory system; Content based retrieval; Data mining; Feature extraction; Gold; Information technology; MPEG 7 Standard; Music information retrieval; Speech; Streaming media;
Conference_Titel :
Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
0-7803-6536-4
DOI :
10.1109/ICME.2000.871484