DocumentCode
1449813
Title
An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech
Author
Taal, Cees H. ; Hendriks, Richard C. ; Heusdens, Richard ; Jensen, Jesper
Author_Institution
Signal Inf. & Process. Lab., Delft Univ. of Technol., Delft, Netherlands
Volume
19
Issue
7
fYear
2011
Firstpage
2125
Lastpage
2136
Abstract
In the development process of noise-reduction algorithms, an objective machine-driven intelligibility measure which shows high correlation with speech intelligibility is of great interest. Besides reducing time and costs compared to real listening experiments, an objective intelligibility measure could also help provide answers on how to improve the intelligibility of noisy unprocessed speech. In this paper, a short-time objective intelligibility measure (STOI) is presented, which shows high correlation with the intelligibility of noisy and time-frequency weighted noisy speech (e.g., resulting from noise reduction) of three different listening experiments. In general, STOI showed better correlation with speech intelligibility compared to five other reference objective intelligibility models. In contrast to other conventional intelligibility models which tend to rely on global statistics across entire sentences, STOI is based on shorter time segments (386 ms). Experiments indeed show that it is beneficial to take segment lengths of this order into account. In addition, a free Matlab implementation is provided.
Keywords
speech enhancement; speech intelligibility; intelligibility prediction; noise-reduction algorithm; noisy unprocessed speech; objective intelligibility model; objective machine-driven intelligibility measure; short-time objective intelligibility measure; speech intelligibility; time-frequency weighted noisy speech; Correlation; Noise measurement; Signal to noise ratio; Speech; Speech processing; Time frequency analysis; Noise reduction; objective measure; speech enhancement; speech intelligibility prediction;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2011.2114881
Filename
5713237
Link To Document