• DocumentCode
    3321934
  • Title

    Quantifying parameters of a source-filter model for oesophageal speech

  • Author

    Toole, John M O´ ; Zapirain, Begoña García

  • Author_Institution
    DeustoTech-LIFE, Univ. of Deusto, Bilbao, Spain
  • fYear
    2011
  • fDate
    14-17 Dec. 2011
  • Firstpage
    532
  • Lastpage
    537
  • Abstract
    Signal processing methods can improve the quality and intelligibility of oesophageal speech. Current methods show only moderate improvement leaving potential for better results. Quantifying parameters of oesophageal speech relative to laryngeal (normal) speech would help in the design of future enhancement methods for oesophageal speech. We quantified parameters of a source-filter model on a database of sustained vowels in Spanish from 4 oesophageal and 4 normal speakers. A ten-parameter glottal waveform model was used as the source and an autoregressive model was used as the filter. Classification, using a log-spectral distance measure, showed that all oesophageal speech samples were classified as whisper voice types; a voice type with a signal to noise ratio of -20 dB. Filter parameters representing spectral amplitudes and bandwidths had a large degree of variation for oesophageal speech comparative to the degree of variation for normal speech (Brown-Forsythe test, F <; 0.001). Source metrics, noise to harmonic ratio (NHR) and variation in fundamental frequency, were also significantly greater for oesophageal speech (i-test, P <; 0.001). These results show a greater degree of nonstationarity, and a noisier glottal waveform, for oesophageal speech comparative to normal speech.
  • Keywords
    autoregressive processes; filtering theory; natural language processing; pattern classification; speech coding; Brown-Forsythe test; Spanish; autoregressive model; fundamental frequency; laryngeal speech; log-spectral distance measure; noise-to-harmonic ratio; normal speakers; oesophageal speakers; oesophageal speech intelligibility; oesophageal speech quality; oesophageal speech sample classification; signal processing method; signal-to-noise ratio; source metrics; source-filter model; spectral amplitudes; spectral bandwidths; sustained vowel database; ten-parameter glottal waveform model; voice type; whisper voice type; Analytical models; Harmonic analysis; Jitter; Noise; Noise measurement; Speech; Speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Information Technology (ISSPIT), 2011 IEEE International Symposium on
  • Conference_Location
    Bilbao
  • Print_ISBN
    978-1-4673-0752-9
  • Electronic_ISBN
    978-1-4673-0751-2
  • Type

    conf

  • DOI
    10.1109/ISSPIT.2011.6151618
  • Filename
    6151618