• DocumentCode
    1550148
  • Title

    Environmental audio scene and activity recognition through mobile-based crowdsourcing

  • Author

    Hwang, Kyuwoong ; Lee, Soo-Young

  • Author_Institution
    Dept. of Bio & Brain Eng, Korea Adv. Inst. of Sci. & Technol. (KAIST), Daejeon, South Korea
  • Volume
    58
  • Issue
    2
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    700
  • Lastpage
    705
  • Abstract
    Environmental audio recognition through mobile devices is difficult because of background noise, unseen audio events, and changes in audio channel characteristics due to the phone´s context, e.g., whether the phone is in the user´s pocket or in his hand. We propose a crowdsourcing framework that models the combination of scene, event, and phone context to overcome these issues. The framework gathers audio data from many people and shares user-generated models through a cloud server to accurately classify unseen audio data. A Gaussian histogram is used to represent an audio clip with a small number of parameters, and a k-nearest classifier allows the easy incorporation of new training data into the system. Using the Kullback-Leibler divergence between two Gaussian histograms as the distance measure, we find that audio scenes, events, and phone context are classified with 85.2%, 77.6%, and 88.9% accuracy, respectively.
  • Keywords
    Gaussian processes; audio signal processing; cloud computing; mobile computing; mobile handsets; pattern classification; Gaussian histogram; Kullback-Leibler divergence; activity recognition; audio channel characteristics; audio clip; background noise; cloud server; environmental audio scene recognition; k-nearest classifier; mobile devices; mobile-based crowdsourcing; unseen audio events; user-generated models; Accuracy; Context; Histograms; Noise measurement; Servers; Training; Training data; acoustic scene analysis; crowdsourcing; environment recognition; sound recognition;
  • fLanguage
    English
  • Journal_Title
    Consumer Electronics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0098-3063
  • Type

    jour

  • DOI
    10.1109/TCE.2012.6227479
  • Filename
    6227479