DocumentCode :
1550148
Title :
Environmental audio scene and activity recognition through mobile-based crowdsourcing
Author :
Hwang, Kyuwoong ; Lee, Soo-Young
Author_Institution :
Dept. of Bio & Brain Eng, Korea Adv. Inst. of Sci. & Technol. (KAIST), Daejeon, South Korea
Volume :
58
Issue :
2
fYear :
2012
fDate :
5/1/2012 12:00:00 AM
Firstpage :
700
Lastpage :
705
Abstract :
Environmental audio recognition through mobile devices is difficult because of background noise, unseen audio events, and changes in audio channel characteristics due to the phone´s context, e.g., whether the phone is in the user´s pocket or in his hand. We propose a crowdsourcing framework that models the combination of scene, event, and phone context to overcome these issues. The framework gathers audio data from many people and shares user-generated models through a cloud server to accurately classify unseen audio data. A Gaussian histogram is used to represent an audio clip with a small number of parameters, and a k-nearest classifier allows the easy incorporation of new training data into the system. Using the Kullback-Leibler divergence between two Gaussian histograms as the distance measure, we find that audio scenes, events, and phone context are classified with 85.2%, 77.6%, and 88.9% accuracy, respectively.
Keywords :
Gaussian processes; audio signal processing; cloud computing; mobile computing; mobile handsets; pattern classification; Gaussian histogram; Kullback-Leibler divergence; activity recognition; audio channel characteristics; audio clip; background noise; cloud server; environmental audio scene recognition; k-nearest classifier; mobile devices; mobile-based crowdsourcing; unseen audio events; user-generated models; Accuracy; Context; Histograms; Noise measurement; Servers; Training; Training data; acoustic scene analysis; crowdsourcing; environment recognition; sound recognition;
fLanguage :
English
Journal_Title :
Consumer Electronics, IEEE Transactions on
Publisher :
ieee
ISSN :
0098-3063
Type :
jour
DOI :
10.1109/TCE.2012.6227479
Filename :
6227479
Link To Document :
بازگشت