DocumentCode
417785
Title
Improve audio representation by using feature structure patterns
Author
Cai, Rui ; Lu, Lie ; Zhang, Hong-Jiang ; Cai, Lian-Hong
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Volume
4
fYear
2004
fDate
17-21 May 2004
Abstract
Although statistical characteristics of audio features are widely used for audio representation in most current audio analysis systems and have been proved to be effective, they only utilize the average feature variations over time, and thus lead to ambiguities in some cases. Structure patterns, which describe the representative structure characteristics of both temporal and spectral features, are proposed to improve audio representation. In this paper, three structure patterns, including energy envelope pattern, sub-band spectral shape pattern and harmonicity prominence pattern, are proposed or refined, as successive development of our previous work. Evaluations on a content-based audio retrieval system with more than 1500 clips showed very encouraging results.
Keywords
audio databases; audio signal processing; content-based retrieval; feature extraction; signal representation; spectral analysis; audio representation; content-based audio retrieval system; energy envelope pattern; feature structure patterns; harmonicity prominence pattern; spectral features; sub-band spectral shape pattern; temporal features; Asia; Computer science; Content based retrieval; Frequency; Humans; Image analysis; Performance analysis; Spectral shape; Spectrogram; Statistical analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326834
Filename
1326834
Link To Document