• DocumentCode
    2174702
  • Title

    Phase-based information for voice pathology detection

  • Author

    Drugman, Thomas ; Dubuisson, Thomas ; Dutoit, Thierry

  • Author_Institution
    TCTS Lab., Univ. of Mons, Mons, Belgium
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4612
  • Lastpage
    4615
  • Abstract
    In most current approaches of speech processing, information is extracted from the magnitude spectrum. However re cent perceptual studies have underlined the importance of the phase component. The goal of this paper is to investigate the potential of using phase-based features for automatically detecting voice disorders. It is shown that group delay functions are appropriate for characterizing irregularities in the phonation. Besides the respect of the mixed-phase model of speech is discussed. The proposed phase-based features are evaluated and compared to other parameters derived from the magnitude spectrum. Both streams are shown to be interestingly complementary. Furthermore phase-based features turn out to convey a great amount of relevant information, leading to high discrimination performance.
  • Keywords
    speech recognition; magnitude spectrum; mixed-phase model; phase-based information; speech processing; voice pathology detection; Delay; Estimation; Feature extraction; Pathology; Spectrogram; Speech; Speech processing; Group Delay; Mixed-Phase Model; Phase Information; Voice pathology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947382
  • Filename
    5947382