DocumentCode
2174702
Title
Phase-based information for voice pathology detection
Author
Drugman, Thomas ; Dubuisson, Thomas ; Dutoit, Thierry
Author_Institution
TCTS Lab., Univ. of Mons, Mons, Belgium
fYear
2011
fDate
22-27 May 2011
Firstpage
4612
Lastpage
4615
Abstract
In most current approaches to speech processing, information is extracted from the magnitude spectrum. However, recent perceptual studies have underlined the importance of the phase component. The goal of this paper is to investigate the potential of using phase-based features for automatically detecting voice disorders. It is shown that group delay functions are appropriate for characterizing irregularities in the phonation. Adherence to the mixed-phase model of speech is also discussed. The proposed phase-based features are evaluated and compared to other parameters derived from the magnitude spectrum. Both feature streams are shown to be complementary. Furthermore, phase-based features turn out to convey a great amount of relevant information, leading to high discrimination performance.
Keywords
speech recognition; magnitude spectrum; mixed-phase model; phase-based information; speech processing; voice pathology detection; Delay; Estimation; Feature extraction; Pathology; Spectrogram; Speech; Speech processing; Group Delay; Mixed-Phase Model; Phase Information; Voice pathology;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947382
Filename
5947382