DocumentCode
151525
Title
A quantitative comparison of blind C50 estimators
Author
Parada, P. Peso ; Sharma, Divya ; Lainez, J. ; Barreda, D. ; Naylor, Patrick A. ; van Waterschoot, Toon
Author_Institution
Nuance Commun., Inc., Marlow, UK
fYear
2014
fDate
8-11 Sept. 2014
Firstpage
298
Lastpage
302
Abstract
The problem of blind estimation of the room acoustic clarity index C50 from single-channel reverberant speech signals is presented in this paper. We analyze the performance of several machine learning methods for a regression task using 309 features derived from the speech signal and modeled with a Deep Belief Network (DBN), Classification And Regression Tree (CART) and Linear Regression (LR). These techniques are evaluated on a large test database (86 hours) that includes babble noise and reverberation using both artificial and real room impulses responses (RIRs). All methods are trained on a database which contains noise, speech and simulated RIRs different from the test set. The performance results show that the DBN model gives the lowest error for the simulated RIRs whereas the LR model gives the best generalization performance with the highest accuracy for real RIRs.
Keywords
acoustic wave reflection; architectural acoustics; belief networks; learning (artificial intelligence); regression analysis; reverberation; signal classification; speech recognition; DBN model; LR model; RIR; babble noise; blind estimation; classification and regression tree; deep belief network; linear regression; machine learning methods; real room impulse responses; room acoustic clarity index C50; single-channel reverberant speech signals; Acoustics; Conferences; Databases; Estimation; Noise; Speech; Training; C50; CART; DBN; Linear regression;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustic Signal Enhancement (IWAENC), 2014 14th International Workshop on
Conference_Location
Juan-les-Pins
Type
conf
DOI
10.1109/IWAENC.2014.6954306
Filename
6954306
Link To Document