DocumentCode :
730794
Title :
SOBM - a binary mask for noisy speech that optimises an objective intelligibility metric
Author :
Lightburn, Leo ; Brookes, Mike
Author_Institution :
Dept. of Electr. & Electron. Eng., Imperial Coll. London, London, UK
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
5078
Lastpage :
5082
Abstract :
It is known that the intelligibility of noisy speech can be improved by applying a binary-valued gain mask to a time-frequency representation of the speech. We present the SOBM, an oracle binary mask that maximises STOI, an objective speech intelligibility metric. We show how to determine the SOBM for a deterministic noise signal and also for a stochastic noise signal with a known power spectrum. We demonstrate that applying the SOBM to noisy speech results in a higher predicted intelligibility than is obtained with other masks and show that the stochastic version is robust to mismatch errors in SNR and noise spectrum.
Keywords :
hearing; signal representation; speech intelligibility; stochastic processes; SNR; SOBM; deterministic noise signal; objective speech intelligibility metric; oracle binary-valued gain mask; power spectrum; speech time-frequency representation; stochastic noise signal; Speech enhancement; binary mask; intelligibility metric; noise reduction; speech intelligibility;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178938
Filename :
7178938
Link To Document :
بازگشت