DocumentCode
3716242
Title
Multi-modal service operation estimation using DNN-based acoustic bag-of-features
Author
Satoshi Tamura;Takuya Uno;Masanori Takehara;Satoru Hayamizu;Takeshi Kurata
Author_Institution
Department of Information Science Gifu University, Japan
fYear
2015
Firstpage
2291
Lastpage
2295
Abstract
In service engineering it is important to estimate when and what a worker did, because they include crucial evidences to improve service quality and working environments. For Service Operation Estimation (SOE), acoustic information is one of useful and key modalities; particularly environmental or background sounds include effective cues. This paper focuses on two aspects: (1) extracting powerful and robust acoustic features by using stacked-denoising-autoencoder and bag-of-feature techniques, and (2) investigating a multi-modal SOE scheme by combining the audio features and the other sensor data as well as non-sensor information. We conducted evaluation experiments using multi-modal data recorded in a restaurant. We improved SOE performance in comparison to conventional acoustic features, and effectiveness of our multimodal SOE scheme is also clarified.
Keywords
"Feature extraction","Speech","Mel frequency cepstral coefficient","Signal processing","Estimation","Support vector machines"
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN
2076-1465
Type
conf
DOI
10.1109/EUSIPCO.2015.7362793
Filename
7362793
Link To Document