DocumentCode
3161427
Title
Transient-based speech transmission index for predicting intelligibility in nonlinear speech enhancement processors
Author
Schlesinger, Anton
Author_Institution
Inst. of Commun. Acoust., Ruhr Univ. Bochum, Bochum, Germany
fYear
2012
fDate
25-30 March 2012
Firstpage
3993
Lastpage
3996
Abstract
A new speech intelligibility metric is proposed for the assessment of speech enhancement processors. These processors usually affect the fine structure in speech that is of fundamental importance to speech intelligibility. Classical metrics analyze the entire signal and thereby generally overestimate intelligibility. The measure presented here, therefore, isolates speech-transients by a cepstral smoothing technique and subsequently calculates speech intelligibility using an efficient version of the speech transmission index. By means of a genetic optimization of adjustable parameters, the proposed transition-based speech transmission index (TB STI) is adapted to the subjective data of linearly and nonlinearly processed speech. The method was assessed on untrained subjective data and showed a considerable improvement over other well-established measures.
Keywords
genetic algorithms; speech enhancement; TB STI; cepstral smoothing technique; genetic optimization; intelligibility prediction; nonlinear speech enhancement processors; nonlinearly processed speech; speech intelligibility metric; transient-based speech transmission index; transition-based speech transmission index; untrained subjective data; Cepstral analysis; Indexes; Measurement; Optimization; Speech; Speech enhancement; Cepstrum; intelligibility; speech enhancement; speech perception; transients;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288793
Filename
6288793
Link To Document