DocumentCode :
661391
Title :
Towards a more efficient sparse coding based audio-word feature extraction system
Author :
Yeh, Chin-Chia Michael ; Yi-Hsuan Yang
Author_Institution :
Res. Center for Inf. Technol. Innovation, Acad. Sinica, Taipei, Taiwan
fYear :
2013
fDate :
Oct. 29 2013-Nov. 1 2013
Firstpage :
1
Lastpage :
7
Abstract :
This paper is concerned with the efficiency of sparse coding based audio-word feature extraction system. In particular, we have defined and added the concept of early and late temporal pooling to the classic sparse coding based audio-word feature extraction pipeline, and we have tested them on the genre tags subset of the CAL10k data set. We define temporal pooling as any functions that are able to transforms the input time series representation into a more temporally compact representation. Under this definition, we have examined the following two temporal pooling functions for improving the feature extraction´s efficiency, and they are: Early Texture Window Pooling and Multiple Frame Representation. Early texture window pooling tremendously boost the efficiency by compromising the retrieving accuracy, while multiple frame representation slightly improve both the feature extracting efficiency and retrieving accuracy. Overall, our best feature extraction setup achieves 0.202 in mean average precision on the genre tags subset of the CAL10k data set.
Keywords :
audio coding; feature extraction; information retrieval; time series; word processing; CAL10k data set; early temporal pooling concept; early texture window pooling; feature extracting efficiency; feature retrieving accuracy; genre tag subset; input time series representation; late temporal pooling concept; multiple frame representation; sparse coding based audio-word feature extraction system; temporally compact representation; Accuracy; Computational efficiency; Dictionaries; Encoding; Feature extraction; Pipelines; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
Type :
conf
DOI :
10.1109/APSIPA.2013.6694252
Filename :
6694252
Link To Document :
بازگشت