DocumentCode :
2502101
Title :
Telephone-quality pathological speech classification using empirical mode decomposition
Author :
Kaleem, M.F. ; Ghoraani, B. ; Guergachi, A. ; Krishnan, S.
Author_Institution :
Dept. of Electr. Eng., Ryerson Univ., Toronto, ON, Canada
fYear :
2011
fDate :
Aug. 30 2011-Sept. 3 2011
Firstpage :
7095
Lastpage :
7098
Abstract :
This paper presents a computationally simple and effective methodology based on empirical mode decomposition (EMD) for classification of telephone quality normal and pathological speech signals. EMD is used to decompose continuous normal and pathological speech signals into intrinsic mode functions, which are analyzed to extract physically meaningful and unique temporal and spectral features. Using continuous speech samples from a database of 51 normal and 161 pathological speakers, which has been modified to simulate telephone quality speech under different levels of noise, a linear classifier is used with the feature vector thus obtained to obtain a high classification accuracy, thereby demonstrating the effectiveness of the methodology. The classification accuracy reported in this paper (89.7% for signal-to-noise ratio 30 dB) is a significant improvement over previously reported results for the same task, and demonstrates the utility of our methodology for cost-effective remote voice pathology assessment over telephone channels.
Keywords :
feature extraction; medical signal processing; signal classification; speech processing; EMD; cost-effective remote voice pathology assessment; database; empirical mode decomposition; feature vector; intrinsic mode functions; linear classifier; pathological speakers; pathological speech signal classification; signal-to-noise ratio; spectral feature extraction; telephone channels; telephone quality normal speech signal classification; telephone-quality pathological speech classification; temporal feature extraction; Accuracy; Feature extraction; Pathology; Signal to noise ratio; Speech; Time frequency analysis; Algorithms; Cost-Benefit Analysis; Humans; Linear Models; Models, Statistical; Reproducibility of Results; Signal Processing, Computer-Assisted; Signal-To-Noise Ratio; Software; Sound; Speech; Speech Production Measurement; Telephone; Time Factors; Voice; Voice Disorders;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society, EMBC, 2011 Annual International Conference of the IEEE
Conference_Location :
Boston, MA
ISSN :
1557-170X
Print_ISBN :
978-1-4244-4121-1
Electronic_ISBN :
1557-170X
Type :
conf
DOI :
10.1109/IEMBS.2011.6091793
Filename :
6091793
Link To Document :
بازگشت