Title :
Telephone-quality pathological speech classification using empirical mode decomposition
Author :
Kaleem, M.F. ; Ghoraani, B. ; Guergachi, A. ; Krishnan, S.
Author_Institution :
Dept. of Electr. Eng., Ryerson Univ., Toronto, ON, Canada
fDate :
Aug. 30 2011-Sept. 3 2011
Abstract :
This paper presents a computationally simple and effective methodology based on empirical mode decomposition (EMD) for classification of telephone quality normal and pathological speech signals. EMD is used to decompose continuous normal and pathological speech signals into intrinsic mode functions, which are analyzed to extract physically meaningful and unique temporal and spectral features. Using continuous speech samples from a database of 51 normal and 161 pathological speakers, which has been modified to simulate telephone quality speech under different levels of noise, a linear classifier is used with the feature vector thus obtained to obtain a high classification accuracy, thereby demonstrating the effectiveness of the methodology. The classification accuracy reported in this paper (89.7% for signal-to-noise ratio 30 dB) is a significant improvement over previously reported results for the same task, and demonstrates the utility of our methodology for cost-effective remote voice pathology assessment over telephone channels.
Keywords :
feature extraction; medical signal processing; signal classification; speech processing; EMD; cost-effective remote voice pathology assessment; database; empirical mode decomposition; feature vector; intrinsic mode functions; linear classifier; pathological speakers; pathological speech signal classification; signal-to-noise ratio; spectral feature extraction; telephone channels; telephone quality normal speech signal classification; telephone-quality pathological speech classification; temporal feature extraction; Accuracy; Feature extraction; Pathology; Signal to noise ratio; Speech; Time frequency analysis; Algorithms; Cost-Benefit Analysis; Humans; Linear Models; Models, Statistical; Reproducibility of Results; Signal Processing, Computer-Assisted; Signal-To-Noise Ratio; Software; Sound; Speech; Speech Production Measurement; Telephone; Time Factors; Voice; Voice Disorders;
Conference_Titel :
Engineering in Medicine and Biology Society, EMBC, 2011 Annual International Conference of the IEEE
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-4121-1
Electronic_ISBN :
1557-170X
DOI :
10.1109/IEMBS.2011.6091793