• DocumentCode
    2654652
  • Title

    Feature mining for GMM-based speech quality measurement

  • Author

    Falk, Tiago H. ; Chan, Wai-Yip

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Queen´´s Univ., Kingston, Ont., Canada
  • Volume
    2
  • fYear
    2004
  • fDate
    7-10 Nov. 2004
  • Firstpage
    2290
  • Abstract
    We propose a novel approach to objective speech quality measurement using feature mining and Gaussian mixture models (GMMs). A large pool of perceptual distortion features is extracted from the speech signal and data mining techniques are used to sift out the most relevant feature variables from the pool. We examine using multivariate adaptive regression splines (MARS), classification and regression trees (CART), a hybrid CART-MARS scheme, and the sequential forward selection (SFS) algorithm for data mining. For our speech databases, the SFS algorithm provides best performance with a five-feature, three-component GMM. A reduction of 21.7% in root-mean-squared mean opinion score estimation error is obtained in comparison with ITU-T P.862 PESQ.
  • Keywords
    Gaussian processes; audio databases; data mining; feature extraction; mean square error methods; regression analysis; signal classification; speech processing; splines (mathematics); trees (mathematics); GMM-based speech quality measurement; Gaussian mixture models; classification and regression trees; data mining techniques; feature mining; multivariate adaptive regression splines; perceptual distortion feature extraction; root-mean-squared mean opinion score estimation error; sequential forward selection; speech databases; speech signal; Classification tree analysis; Data mining; Distortion measurement; Electric variables measurement; Estimation error; Mars; Quality assessment; Regression tree analysis; Speech analysis; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers, 2004. Conference Record of the Thirty-Eighth Asilomar Conference on
  • Print_ISBN
    0-7803-8622-1
  • Type

    conf

  • DOI
    10.1109/ACSSC.2004.1399576
  • Filename
    1399576