DocumentCode :
3604451
Title :
Characterization Methods for the Detection of Multiple Voice Disorders: Neurological, Functional, and Laryngeal Diseases
Author :
Orozco-Arroyave, Juan Rafael ; Belalcazar-Bolanos, Elkyn Alexander ; Arias-Londono, Julian David ; Vargas-Bonilla, Jesus Francisco ; Skodda, Sabine ; Rusz, Jan ; Daqrouq, Khaled ; Honig, Florian ; Noth, Elmar
Author_Institution :
Fac. of Eng., Univ. de Antioquia UdeA, Medellin, Colombia
Volume :
19
Issue :
6
fYear :
2015
Firstpage :
1820
Lastpage :
1828
Abstract :
This paper evaluates the accuracy of different characterization methods for the automatic detection of multiple speech disorders. The speech impairments considered include dysphonia in people with Parkinson´s disease (PD), dysphonia diagnosed in patients with different laryngeal pathologies (LP), and hypernasality in children with cleft lip and palate (CLP). Four different methods are applied to analyze the voice signals including noise content measures, spectral-cepstral modeling, nonlinear features, and measurements to quantify the stability of the fundamental frequency. These measures are tested in six databases: three with recordings of PD patients, two with patients with LP, and one with children with CLP. The abnormal vibration of the vocal folds observed in PD patients and in people with LP is modeled using the stability measures with accuracies ranging from 81% to 99% depending on the pathology. The spectral-cepstral features are used in this paper to model the voice spectrum with special emphasis around the first two formants. These measures exhibit accuracies ranging from 95% to 99% in the automatic detection of hypernasal voices, which confirms the presence of changes in the speech spectrum due to hypernasality. Noise measures suitably discriminate between dysphonic and healthy voices in both databases with speakers suffering from LP. The results obtained in this study suggest that it is not suitable to use every kind of features to model all of the voice pathologies; conversely, it is necessary to study the physiology of each impairment to choose the most appropriate set of features.
Keywords :
diseases; feature extraction; medical disorders; medical signal processing; neurophysiology; paediatrics; signal denoising; speech; speech processing; vibration measurement; Parkinson´s disease; abnormal vibration; children; cleft lip-and-palate; dysphonia; functional diseases; fundamental frequency; hypernasal voices; hypernasality; laryngeal diseases; multiple speech disorders; multiple voice disorder detection; neurological diseases; noise content; nonlinear features; spectral-cepstral modeling; speech impairments; vocal folds; voice pathologies; voice signals; voice spectrum; Noise; Noise measurement; Parkinson´s disease; Pathology; Speech; Speech processing; Hypernasality; Parkinson’s disease; Parkinson´s disease (PD); hypernasality; laryngeal pathologies; laryngeal pathologies (LP); noise measures; nonlinear behavior; periodicity; spectral-cepstral modeling; stability;
fLanguage :
English
Journal_Title :
Biomedical and Health Informatics, IEEE Journal of
Publisher :
ieee
ISSN :
2168-2194
Type :
jour
DOI :
10.1109/JBHI.2015.2467375
Filename :
7192603
Link To Document :
بازگشت