Title :
The psychoacoustic approach towards enhancing speech intelligibility in noise
Author :
Chan, Paul Yaozhu ; Dong, Minghui ; Cen, Ling ; Li, Haizhou
Author_Institution :
Dept. of Human Language Technol., Agency for Sci. Technol. & Res. (A*STAR), Singapore, Singapore
fDate :
Nov. 29 2010-Dec. 3 2010
Abstract :
In this paper, we propose a psychoacoustic approach towards enhancing speech intelligibility in noise. Understanding the relationship between the short-term spectral movement of a sound and a listener´s sensitivity towards it, we conjecture that humans rely greatly on Inter-Phoneme Spectral Gradients (IPSGs) to distinguish each phoneme, especially when the short-term speech spectrum is masked by extremely high levels of noise. We then move on to explain how the IPSG may most effectively be steepened while introducing the concept of Formant Contrast. The effectiveness of this process is validated with spectral analysis and listening tests, verifying that our initial deduction is true. In these, we present a simple, yet novel and effective method of improving speech intelligibility - especially in extremely high noise environments.
Keywords :
noise; spectral analysis; speech intelligibility; speech synthesis; formant contrast; interphoneme spectral gradient; listener sensitivity; noise susceptibility; psychoacoustic approach; short term spectral movement; spectral analysis; speech intelligibility; speech synthesis; Humans; Real time systems; Signal to noise ratio; Spectrogram; Speech; Speech enhancement; formant contrast; noise susceptibility; noise tolerance; spectral gradient; speech intelligibility; speech synthesis;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684902