• DocumentCode
    3486027
  • Title

    Detection of persons with Parkinson´s disease by acoustic, vocal, and prosodic analysis

  • Author

    Bocklet, Tobias ; Nöth, Elmar ; Stemmer, Georg ; Ruzickova, Hana ; Rusz, Jan

  • Author_Institution
    Phoniatric & Paedaudiologic Dept., Univ. Clinics Erlangen, Erlangen, Germany
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    478
  • Lastpage
    483
  • Abstract
    70% to 90% of patients with Parkinson´s disease (PD) show an affected voice. Various studies revealed, that voice and prosody is one of the earliest indicators of PD. The issue of this study is to automatically detect whether the speech/voice of a person is affected by PD. We employ acoustic features, prosodic features and features derived from a two-mass model of the vocal folds on different kinds of speech tests: sustained phonations, syllable repetitions, read texts and monologues. Classification is performed in either case by SVMs. A correlation-based feature selection was performed, in order to identify the most important features for each of these systems. We report recognition results of 91% when trying to differentiate between normal speaking persons and speakers with PD in early stages with prosodic modeling. With acoustic modeling we achieved a recognition rate of 88% and with vocal modeling we achieved 79%. After feature selection these results could greatly be improved. But we expect those results to be too optimistic. We show that read texts and monologues are the most meaningful texts when it comes to the automatic detection of PD based on articulation, voice, and prosodic evaluations. The most important prosodic features were based on energy, pauses and F0. The masses and the compliances of spring were found to be the most important parameters of the two-mass vocal fold model.
  • Keywords
    diseases; medical signal processing; speaker recognition; PD; Parkinson disease; SVM; acoustic analysis; acoustic anperson detection; acoustic modeling; articulation evaluations; classification; correlation-based feature selection; human speech; human voice; monologues; prosodic analysis; prosodic evaluations; prosodic modeling; speech tests; sustained phonations; syllable repetitions; two-mass model; vocal analysis; vocal modeling; voice evaluations; Acoustics; Computational modeling; Feature extraction; Optimization; Speech; Springs; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163978
  • Filename
    6163978