Title : 
Detecting depression: A comparison between spontaneous and read speech
         
        
            Author : 
Alghowinem, Sharifa ; Goecke, Roland ; Wagner, Michael ; Epps, Julien ; Breakspear, Michael ; Parker, Gordon
         
        
            Author_Institution : 
Australian Nat. Univ., Canberra, ACT, Australia
         
        
        
        
        
            Abstract : 
Major depressive disorders are mental disorders of high prevalence, leading to a high impact on individuals, their families, society and the economy. In order to assist clinicians to better diagnose depression, we investigate an objective diagnostic aid using affective sensing technology with a focus on acoustic features. In this paper, we hypothesise that (1) classifying the general characteristics of clinical depression using spontaneous speech will give better results than using read speech, (2) that there are some acoustic features that are robust and would give good classification results in both spontaneous and read, and (3) that a `thin-slicing´ approach using smaller parts of the speech data will perform similarly if not better than using the whole speech data. By examining and comparing recognition results for acoustic features on a real-world clinical dataset of 30 depressed and 30 control subjects using SVM for classification and a leave-one-out cross-validation scheme, we found that spontaneous speech has more variability, which increases the recognition rate of depression. We also found that jitter, shimmer, energy and loudness feature groups are robust in characterising both read and spontaneous depressive speech. Remarkably, thin-slicing the read speech, using either the beginning of each sentence or the first few sentences performs better than using all reading task data.
         
        
            Keywords : 
medical diagnostic computing; medical disorders; speech recognition; support vector machines; SVM; acoustic features; affective sensing technology; clinical depression; depression detection; depressive disorder; jitter; leave-one-out cross-validation scheme; mental disorder; objective diagnostic; read speech; spontaneous speech; thin-slicing approach; Feature extraction; Jitter; Mel frequency cepstral coefficient; Speech; Speech recognition; Support vector machines; Mood detection; affective sensing; clinical depression; voice feature classification;
         
        
        
        
            Conference_Titel : 
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
         
        
            Conference_Location : 
Vancouver, BC
         
        
        
        
            DOI : 
10.1109/ICASSP.2013.6639130