• DocumentCode
    454701
  • Title

    Acoustic Model Adaptation Based on Coarse/Fine Training of Transfer Vectors Using Directional Statistics

  • Author

    Watanabe, Shinji ; Nakamura, Atsushi

  • Author_Institution
    NTT Commun. Sci. Lab., NTT Corp.
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    In this paper, we reformulate an adaptation scheme based on coarse/fine training (CFT) of transfer vectors in acoustic modeling by using directional statistics. In CFT, the transfer vector is decomposed into a unit direction vector and a scaling factor. By using coarse tied Gaussian class (coarse class) estimation for the unit direction vector, and fine tied Gaussian class (fine class) estimation for the scaling factor, we can obtain accurate transfer vectors with a small number of free parameters. Directional statistics is a method for analyzing geometric parameters (e.g., angles and unit vectors) using directional data, and is well suited to the analysis of the CFT representation. Using directional statistics as a basis, we analytically derive expectation-maximization algorithms for the CFT parameters using the maximum likelihood and Bayesian (maximum a posteriori) approaches. In particular, with the Bayesian approach, prior and posterior distributions for unit direction vectors are represented with a von Mises distribution, a representative distribution in directional statistics. Speaker adaptation experiments show that our proposal improves the performance of large vocabulary continuous speech recognition compared with conventional transfer vector adaptation, owing to the efficient coarse/fine representation of transfer vectors.
  • Keywords
    Bayes methods; Gaussian distribution; acoustics; expectation-maximisation algorithm; speech recognition; Bayesian approaches; Gaussian class; acoustic model adaptation; coarse/fine training; directional statistics; expectation-maximization algorithms; large vocabulary continuous speech recognition; maximum a posteriori; maximum likelihood approaches; transfer vectors; von Mises distribution; Adaptation model; Algorithm design and analysis; Bayesian methods; Expectation-maximization algorithms; Maximum likelihood estimation; Proposals; Statistical analysis; Statistical distributions; Statistics; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660193
  • Filename
    1660193
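
The coarse/fine decomposition described in the abstract can be illustrated with a small sketch. It is not the authors' EM algorithm: it only shows the idea of sharing one unit direction across a coarse class while keeping one scaling factor per fine class. The toy data, the names `decompose`, `mean_direction`, `fine_classes`, and the use of an averaged projection as the scale estimate are all assumptions made for illustration. The shared direction is taken as the normalized resultant of the unit vectors, which is the standard maximum likelihood mean direction for von Mises-type distributions.

```python
import math


def decompose(vec):
    """Split a vector into (scaling factor, unit direction vector)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("a zero vector has no direction")
    return norm, [x / norm for x in vec]


def mean_direction(unit_vectors):
    """Normalized resultant of a set of unit vectors."""
    dim = len(unit_vectors[0])
    resultant = [sum(u[i] for u in unit_vectors) for i in range(dim)]
    _, direction = decompose(resultant)
    return direction


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


# Hypothetical toy data: per-fine-class transfer vectors that all
# belong to a single coarse class (assumption, not from the paper).
fine_classes = {
    "fine_a": [[3.0, 4.0], [6.0, 8.0]],
    "fine_b": [[0.3, 0.4]],
}

# Coarse step: one unit direction shared by every fine class.
all_directions = [decompose(v)[1] for vs in fine_classes.values() for v in vs]
shared_direction = mean_direction(all_directions)

# Fine step: one scalar per fine class, here the average projection
# of its vectors onto the shared direction (illustrative estimator).
fine_scales = {
    name: sum(dot(v, shared_direction) for v in vs) / len(vs)
    for name, vs in fine_classes.items()
}

# Reconstructed transfer vector per fine class: scale * shared direction.
transfer = {
    name: [scale * d for d in shared_direction]
    for name, scale in fine_scales.items()
}
```

With D-dimensional features and K fine classes under one coarse class, this representation stores D + K numbers instead of D * K, which is the parameter saving the abstract refers to.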