• DocumentCode
    2870993
  • Title
    Maximum likelihood estimation of elliptical basis function parameters with application to speaker verification
  • Author
    Mak, M.W.; Li, C.K.; Li, X.
  • Author_Institution
    Dept. of Electron. Eng., Hong Kong Polytech. Univ., Hung Hom, Hong Kong
  • Volume
    2
  • fYear
    1998
  • fDate
    1998
  • Firstpage
    1287
  • Abstract
    The use of the K-means algorithm and the K-nearest neighbor heuristic in estimating the radial basis function (RBF) parameters may produce sub-optimal performance when the input vectors contain correlated components. This paper proposes to overcome this problem by incorporating full covariance matrices into the RBF structure and using the expectation-maximization (EM) algorithm to estimate the network parameters. The resulting networks, referred to as elliptical basis function (EBF) networks, are applied to text-independent speaker verification. To examine the robustness of the networks in a noisy environment, both clean speech and telephone speech have been used. Experimental results show that smaller EBF networks with basis function parameters determined by the EM algorithm outperform the large RBF networks trained with the conventional approach. The best error rate achieved by the EBF networks is 3.70%, while that achieved by the RBF networks is 10.37%.
  • Keywords
    covariance matrices; maximum likelihood estimation; radial basis function networks; speaker recognition; statistical analysis; K-means algorithm; K-nearest neighbor heuristic; RBF structure; clean speech; elliptical basis function networks; elliptical basis function parameters; error rates; expectation-maximization algorithm; full covariance matrices; input vectors; network parameters; radial basis function; speaker verification; telephone speech; text-independent speaker verification; Computer networks; Covariance matrix; Error analysis; Parameter estimation; Speech; Telephony; Working environment noise
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Proceedings, 1998. ICSP '98. 1998 Fourth International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    0-7803-4325-5
  • Type
    conf
  • DOI
    10.1109/ICOSP.1998.770854
  • Filename
    770854