DocumentCode
699636
Title
Relative energy and intelligibility of transient speech components
Author
Sungyub Yoo ; Boston, J. Robert ; Durrant, John D. ; Kovacyk, Kristie ; Karn, Stacey ; Shaiman, Susan ; El-Jaroudi, Amro ; Ching-Chung Li
Author_Institution
Depts. of Electr. Eng., Univ. of Pittsburgh, Pittsburgh, PA, USA
fYear
2004
fDate
6-10 Sept. 2004
Firstpage
1031
Lastpage
1034
Abstract
It is generally recognized that consonants are more critical than vowels to speech intelligibility, but we suggest that important information is contained in transient speech components, rather than the quasi-steady-state components of both consonants and vowels. Fixed-frequency filters cannot uniquely separate transients from the more steady-state vowel formants and consonant hubs, even though the former are predominately low frequency and the latter, high frequency. To study the relative speech intelligibility of the transient versus steady-state components, we employed an algorithm based on time-frequency analysis to extract quasi-steady-state energy from the speech signal, leaving a residual signal of predominantly transient components. Psychometric functions were measured for speech recognition of processed and unprocessed monosyllabic words. The transient components were found to account for approximately 2% of the energy of the original speech, yet were nearly equally intelligible. As hypothesized, the quasi-steady-state components contained much greater energy while providing significantly less intelligibility.
Keywords
speech intelligibility; speech recognition; time-frequency analysis; consonant hubs; fixed-frequency filters; predominantly-transient components; processed monosyllabic word; psychometric functions; quasisteady-state component; quasisteady-state energy extraction; relative energy; relative speech intelligibility; residual signal; speech recognition; speech signal; steady-state vowel formants; time-frequency analysis; transient speech component intelligibility; unprocessed monosyllabic word; Abstracts; Cutoff frequency; Frequency estimation; Speech; Speech recognition; Transient analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2004 12th European
Conference_Location
Vienna
Print_ISBN
978-320-0001-65-7
Type
conf
Filename
7080166
Link To Document