DocumentCode
323804
Title
Speech intelligibility in the presence of cross-channel spectral asynchrony
Author
Arai, Takayuki ; Greenberg, Steven
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA, USA
Volume
2
fYear
1998
fDate
12-15 May 1998
Firstpage
933
Abstract
The spectrum of spoken sentences was partitioned into quarter-octave channels and the onset of each channel shifted in time relative to the others so as to desynchronize spectral information across the frequency axis. Human listeners are remarkably tolerant of cross-channel spectral asynchrony induced in this fashion. Speech intelligibility remains relatively unimpaired until the average asynchrony spans three or more phonetic segments. Such perceptual robustness is correlated with the magnitude of the low-frequency (3-6 Hz) modulation spectrum and thus highlights the importance of syllabic segmentation and analysis for robust processing of spoken language. High-frequency channels (>1.5 kHz) play a particularly important role when the spectral asynchrony is sufficiently large as to significantly reduce the power in the low-frequency modulation spectrum (analogous to acoustic reverberation) and may thereby account for the deterioration of speech intelligibility among the hearing impaired under conditions of acoustic interference (such as background noise and reverberation) characteristic of the real world
Keywords
acoustic noise; acoustic signal processing; hearing; modulation; natural languages; reverberation; spectral analysis; speech intelligibility; speech processing; speech recognition; 3 to 6 Hz; acoustic interference; background noise; correlation; cross-channel spectral asynchrony; frequency axis; hearing impaired; high-frequency channels; low-frequency modulation spectrum; phonetic segments; quarter-octave channels; reverberation; signal processing; spectral asynchrony; spectral information desynchronization; speech intelligibility; speech recognition; spoken language processing; spoken sentences spectrum; syllabic analysis; syllabic segmentation; Decorrelation; Delay effects; Delay estimation; Energy resolution; Headphones; History; Jitter; Reverberation; Sampling methods; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675419
Filename
675419
Link To Document