DocumentCode :
2174702
Title :
Phase-based information for voice pathology detection
Author :
Drugman, Thomas ; Dubuisson, Thomas ; Dutoit, Thierry
Author_Institution :
TCTS Lab., Univ. of Mons, Mons, Belgium
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4612
Lastpage :
4615
Abstract :
In most current approaches of speech processing, information is extracted from the magnitude spectrum. However re cent perceptual studies have underlined the importance of the phase component. The goal of this paper is to investigate the potential of using phase-based features for automatically detecting voice disorders. It is shown that group delay functions are appropriate for characterizing irregularities in the phonation. Besides the respect of the mixed-phase model of speech is discussed. The proposed phase-based features are evaluated and compared to other parameters derived from the magnitude spectrum. Both streams are shown to be interestingly complementary. Furthermore phase-based features turn out to convey a great amount of relevant information, leading to high discrimination performance.
Keywords :
speech recognition; magnitude spectrum; mixed-phase model; phase-based information; speech processing; voice pathology detection; Delay; Estimation; Feature extraction; Pathology; Spectrogram; Speech; Speech processing; Group Delay; Mixed-Phase Model; Phase Information; Voice pathology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947382
Filename :
5947382
Link To Document :
بازگشت