Title :
Improve audio representation by using feature structure patterns
Author :
Cai, Rui ; Lu, Lie ; Zhang, Hong-Jiang ; Cai, Lian-Hong
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Abstract :
Although statistical characteristics of audio features are widely used for audio representation in most current audio analysis systems and have proven effective, they capture only the average variation of features over time, which leads to ambiguities in some cases. Structure patterns, which describe the representative structural characteristics of both temporal and spectral features, are proposed to improve audio representation. In this paper, three structure patterns (the energy envelope pattern, the sub-band spectral shape pattern, and the harmonicity prominence pattern) are proposed or refined as a continuation of our previous work. Evaluations on a content-based audio retrieval system with more than 1500 clips showed very encouraging results.
Keywords :
audio databases; audio signal processing; content-based retrieval; feature extraction; signal representation; spectral analysis; audio representation; content-based audio retrieval system; energy envelope pattern; feature structure patterns; harmonicity prominence pattern; spectral features; sub-band spectral shape pattern; temporal features; Asia; Computer science; Content based retrieval; Frequency; Humans; Image analysis; Performance analysis; Spectral shape; Spectrogram; Statistical analysis
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326834