DocumentCode :
151525
Title :
A quantitative comparison of blind C50 estimators
Author :
Parada, P. Peso ; Sharma, Divya ; Lainez, J. ; Barreda, D. ; Naylor, Patrick A. ; van Waterschoot, Toon
Author_Institution :
Nuance Commun., Inc., Marlow, UK
fYear :
2014
fDate :
8-11 Sept. 2014
Firstpage :
298
Lastpage :
302
Abstract :
The problem of blind estimation of the room acoustic clarity index C50 from single-channel reverberant speech signals is presented in this paper. We analyze the performance of several machine learning methods for a regression task using 309 features derived from the speech signal and modeled with a Deep Belief Network (DBN), Classification And Regression Tree (CART) and Linear Regression (LR). These techniques are evaluated on a large test database (86 hours) that includes babble noise and reverberation using both artificial and real room impulses responses (RIRs). All methods are trained on a database which contains noise, speech and simulated RIRs different from the test set. The performance results show that the DBN model gives the lowest error for the simulated RIRs whereas the LR model gives the best generalization performance with the highest accuracy for real RIRs.
Keywords :
acoustic wave reflection; architectural acoustics; belief networks; learning (artificial intelligence); regression analysis; reverberation; signal classification; speech recognition; DBN model; LR model; RIR; babble noise; blind estimation; classification and regression tree; deep belief network; linear regression; machine learning methods; real room impulse responses; room acoustic clarity index C50; single-channel reverberant speech signals; Acoustics; Conferences; Databases; Estimation; Noise; Speech; Training; C50; CART; DBN; Linear regression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustic Signal Enhancement (IWAENC), 2014 14th International Workshop on
Conference_Location :
Juan-les-Pins
Type :
conf
DOI :
10.1109/IWAENC.2014.6954306
Filename :
6954306
Link To Document :
بازگشت