Title :
Feature mining for GMM-based speech quality measurement
Author :
Falk, Tiago H. ; Chan, Wai-Yip
Author_Institution :
Dept. of Electr. & Comput. Eng., Queen´´s Univ., Kingston, Ont., Canada
Abstract :
We propose a novel approach to objective speech quality measurement using feature mining and Gaussian mixture models (GMMs). A large pool of perceptual distortion features is extracted from the speech signal and data mining techniques are used to sift out the most relevant feature variables from the pool. We examine using multivariate adaptive regression splines (MARS), classification and regression trees (CART), a hybrid CART-MARS scheme, and the sequential forward selection (SFS) algorithm for data mining. For our speech databases, the SFS algorithm provides best performance with a five-feature, three-component GMM. A reduction of 21.7% in root-mean-squared mean opinion score estimation error is obtained in comparison with ITU-T P.862 PESQ.
Keywords :
Gaussian processes; audio databases; data mining; feature extraction; mean square error methods; regression analysis; signal classification; speech processing; splines (mathematics); trees (mathematics); GMM-based speech quality measurement; Gaussian mixture models; classification and regression trees; data mining techniques; feature mining; multivariate adaptive regression splines; perceptual distortion feature extraction; root-mean-squared mean opinion score estimation error; sequential forward selection; speech databases; speech signal; Classification tree analysis; Data mining; Distortion measurement; Electric variables measurement; Estimation error; Mars; Quality assessment; Regression tree analysis; Speech analysis; Testing;
Conference_Titel :
Signals, Systems and Computers, 2004. Conference Record of the Thirty-Eighth Asilomar Conference on
Print_ISBN :
0-7803-8622-1
DOI :
10.1109/ACSSC.2004.1399576