DocumentCode :
3153583
Title :
Auditory context classification using random forests
Author :
Yang, Li ; Su, Feng
Author_Institution :
State Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
2349
Lastpage :
2352
Abstract :
High-level semantic information can be extracted from audio materials to facilitate various content-based analysis and context-awareness applications. In this paper, we propose a novel automatic auditory context classification method, which combines the characterization of audio events and the inference of auditory context category in a single ensemble analysis framework. In the proposed framework, key audio events in the context are characterized by composite features from discriminative representation models (local discriminant bases, pseudo-semantic and bag-of-audio-words) learned from samples. A random forest based ensemble learning and classification model is employed for auditory contexts, in which individual segments of audio stream are classified and aggregated by Hough voting or bagging to form the final context category. The effectiveness of the proposed approach is demonstrated by the experimental results.
Keywords :
audio streaming; learning (artificial intelligence); random processes; trees (mathematics); ubiquitous computing; Hough voting; audio stream; automatic auditory context classification; content-based analysis; context awareness; ensemble analysis; ensemble learning; high-level semantic information; random forests; Context; Context modeling; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Training; Vegetation; Auditory context; HMM; MFCC; local discriminant bases; random forest;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288386
Filename :
6288386
Link To Document :
بازگشت