Title :
Transient-based speech transmission index for predicting intelligibility in nonlinear speech enhancement processors
Author :
Schlesinger, Anton
Author_Institution :
Inst. of Commun. Acoust., Ruhr Univ. Bochum, Bochum, Germany
Abstract :
A new speech intelligibility metric is proposed for the assessment of speech enhancement processors. These processors usually affect the fine structure in speech that is of fundamental importance to speech intelligibility. Classical metrics analyze the entire signal and thereby generally overestimate intelligibility. The measure presented here, therefore, isolates speech-transients by a cepstral smoothing technique and subsequently calculates speech intelligibility using an efficient version of the speech transmission index. By means of a genetic optimization of adjustable parameters, the proposed transition-based speech transmission index (TB STI) is adapted to the subjective data of linearly and nonlinearly processed speech. The method was assessed on untrained subjective data and showed a considerable improvement over other well-established measures.
Keywords :
genetic algorithms; speech enhancement; TB STI; cepstral smoothing technique; genetic optimization; intelligibility prediction; nonlinear speech enhancement processors; nonlinearly processed speech; speech intelligibility metric; transient-based speech transmission index; transition-based speech transmission index; untrained subjective data; Cepstral analysis; Indexes; Measurement; Optimization; Speech; Speech enhancement; Cepstrum; intelligibility; speech enhancement; speech perception; transients;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288793