• DocumentCode
    3716242
  • Title

    Multi-modal service operation estimation using DNN-based acoustic bag-of-features

  • Author

    Satoshi Tamura;Takuya Uno;Masanori Takehara;Satoru Hayamizu;Takeshi Kurata

  • Author_Institution
    Department of Information Science Gifu University, Japan
  • fYear
    2015
  • Firstpage
    2291
  • Lastpage
    2295
  • Abstract
    In service engineering it is important to estimate when and what a worker did, because they include crucial evidences to improve service quality and working environments. For Service Operation Estimation (SOE), acoustic information is one of useful and key modalities; particularly environmental or background sounds include effective cues. This paper focuses on two aspects: (1) extracting powerful and robust acoustic features by using stacked-denoising-autoencoder and bag-of-feature techniques, and (2) investigating a multi-modal SOE scheme by combining the audio features and the other sensor data as well as non-sensor information. We conducted evaluation experiments using multi-modal data recorded in a restaurant. We improved SOE performance in comparison to conventional acoustic features, and effectiveness of our multimodal SOE scheme is also clarified.
  • Keywords
    "Feature extraction","Speech","Mel frequency cepstral coefficient","Signal processing","Estimation","Support vector machines"
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2015 23rd European
  • Electronic_ISBN
    2076-1465
  • Type

    conf

  • DOI
    10.1109/EUSIPCO.2015.7362793
  • Filename
    7362793