• DocumentCode
    699636
  • Title

    Relative energy and intelligibility of transient speech components

  • Author

    Sungyub Yoo ; Boston, J. Robert ; Durrant, John D. ; Kovacyk, Kristie ; Karn, Stacey ; Shaiman, Susan ; El-Jaroudi, Amro ; Ching-Chung Li

  • Author_Institution
    Depts. of Electr. Eng., Univ. of Pittsburgh, Pittsburgh, PA, USA
  • fYear
    2004
  • fDate
    6-10 Sept. 2004
  • Firstpage
    1031
  • Lastpage
    1034
  • Abstract
    It is generally recognized that consonants are more critical than vowels to speech intelligibility, but we suggest that important information is contained in transient speech components, rather than the quasi-steady-state components of both consonants and vowels. Fixed-frequency filters cannot uniquely separate transients from the more steady-state vowel formants and consonant hubs, even though the former are predominately low frequency and the latter, high frequency. To study the relative speech intelligibility of the transient versus steady-state components, we employed an algorithm based on time-frequency analysis to extract quasi-steady-state energy from the speech signal, leaving a residual signal of predominantly transient components. Psychometric functions were measured for speech recognition of processed and unprocessed monosyllabic words. The transient components were found to account for approximately 2% of the energy of the original speech, yet were nearly equally intelligible. As hypothesized, the quasi-steady-state components contained much greater energy while providing significantly less intelligibility.
  • Keywords
    speech intelligibility; speech recognition; time-frequency analysis; consonant hubs; fixed-frequency filters; predominantly-transient components; processed monosyllabic word; psychometric functions; quasisteady-state component; quasisteady-state energy extraction; relative energy; relative speech intelligibility; residual signal; speech recognition; speech signal; steady-state vowel formants; time-frequency analysis; transient speech component intelligibility; unprocessed monosyllabic word; Abstracts; Cutoff frequency; Frequency estimation; Speech; Speech recognition; Transient analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2004 12th European
  • Conference_Location
    Vienna
  • Print_ISBN
    978-320-0001-65-7
  • Type

    conf

  • Filename
    7080166