• DocumentCode
    3703324
  • Title

    Multimodal depression recognition with dynamic visual and audio cues

  • Author

    Lang He;Dongmei Jiang;Hichem Sahli

  • Author_Institution
    NPU-VUB Joint AVSP Research lab, School of Computer Science, Northwestern Polytechnical University(NPU) Shaanxi Key Lab on Speech and Image Information Processing, 127 Youyi Xilu, Xi´an 710072, China
  • fYear
    2015
  • Firstpage
    260
  • Lastpage
    266
  • Abstract
    In this paper, we present our system design for audio visual multi-modal depression recognition. To improve the estimation accuracy of the Beck Depression Inventory (BDI) score, besides the Low Level Descriptors (LLD) features and the Local Gabor Binary Pattern-Three Orthogonal Planes (LGBP-TOP) features provided by the 2014 Audio/Visual Emotion Challenge and Workshop (AVEC2014), we extract extra features to capture key behavioural changes associated with depression. From audio we extract the speaking rate, and from video, the head pose features, the Space-Temporal Interesting Point (STIP) features, and local kinematic features via the Divergence-Curl-Shear descriptors. These features describe body movements, and spatio-temporal changes within the image sequence. We also consider global dynamic features, obtained using motion history histogram (MHH), bag of words (BOW) features and vector of local aggregated descriptors (VLAD). To capture the complementary information within the used features, we evaluate two fusion systems - the feature fusion scheme, and the model fusion scheme via local linear regression (LLR). Experiments are carried out on the training set and development set of the Depression Recognition Sub-Challenge (DSC) of AVEC2014, we obtain root mean square error (RMSE) of 7.6697, and mean absolute error (MAE) of 6.1683 on the development set, which are better or comparable with the state of the art results of the AVEC2014 challenge.
  • Keywords
    "Feature extraction","Visualization","Speech","Head","Histograms","Optical imaging","History"
  • Publisher
    ieee
  • Conference_Titel
    Affective Computing and Intelligent Interaction (ACII), 2015 International Conference on
  • Electronic_ISBN
    2156-8111
  • Type

    conf

  • DOI
    10.1109/ACII.2015.7344581
  • Filename
    7344581