Detection of persons with Parkinson´s disease by acoustic, vocal, and prosodic analysis

Author

Bocklet, Tobias ; Nöth, Elmar ; Stemmer, Georg ; Ruzickova, Hana ; Rusz, Jan

Author_Institution

Phoniatric & Paedaudiologic Dept., Univ. Clinics Erlangen, Erlangen, Germany

fYear

2011

fDate

11-15 Dec. 2011

Firstpage

478

Lastpage

483

Abstract

70% to 90% of patients with Parkinson´s disease (PD) show an affected voice. Various studies revealed, that voice and prosody is one of the earliest indicators of PD. The issue of this study is to automatically detect whether the speech/voice of a person is affected by PD. We employ acoustic features, prosodic features and features derived from a two-mass model of the vocal folds on different kinds of speech tests: sustained phonations, syllable repetitions, read texts and monologues. Classification is performed in either case by SVMs. A correlation-based feature selection was performed, in order to identify the most important features for each of these systems. We report recognition results of 91% when trying to differentiate between normal speaking persons and speakers with PD in early stages with prosodic modeling. With acoustic modeling we achieved a recognition rate of 88% and with vocal modeling we achieved 79%. After feature selection these results could greatly be improved. But we expect those results to be too optimistic. We show that read texts and monologues are the most meaningful texts when it comes to the automatic detection of PD based on articulation, voice, and prosodic evaluations. The most important prosodic features were based on energy, pauses and F0. The masses and the compliances of spring were found to be the most important parameters of the two-mass vocal fold model.

Keywords

diseases; medical signal processing; speaker recognition; PD; Parkinson disease; SVM; acoustic analysis; acoustic anperson detection; acoustic modeling; articulation evaluations; classification; correlation-based feature selection; human speech; human voice; monologues; prosodic analysis; prosodic evaluations; prosodic modeling; speech tests; sustained phonations; syllable repetitions; two-mass model; vocal analysis; vocal modeling; voice evaluations; Acoustics; Computational modeling; Feature extraction; Optimization; Speech; Springs; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on

Conference_Location

Waikoloa, HI

Print_ISBN

978-1-4673-0365-1

Electronic_ISBN

978-1-4673-0366-8

Type

conf

DOI

10.1109/ASRU.2011.6163978

Filename

6163978