• DocumentCode
    417260
  • Title

    Automatic determination of acoustic model topology using variational Bayesian estimation and clustering

  • Author

    Watanabe, Shinji ; Sako, Atsushi ; Nakamura, Atsushi

  • Author_Institution
    NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We describe the automatic determination of an acoustic model for speech recognition, which is very complicated and includes latent variables, using VBEC: variational Bayesian estimation and clustering for speech recognition. We propose an efficient Gaussian mixture model (GMM) based phonetic decision tree construction within the VBEC framework. The proposed method features a novel approach to reduce the unrealistically large number of computations needed for iterative calculations in the GMM-based decision tree method to a practical level by assuming that each Gaussian per state has the same occupancy and is represented by the same posterior distribution for the covariance parameter. The experimental results confirmed that VBEC automatically provided an optimum model topology with the highest performance level.
  • Keywords
    Bayes methods; Gaussian distribution; covariance analysis; decision trees; hidden Markov models; iterative methods; parameter estimation; pattern clustering; speech processing; speech recognition; topology; tree searching; variational techniques; GMM; Gaussian mixture model; VBEC; acoustic model topology; automatic determination; covariance parameter; iterative calculations; latent variables; optimum model topology; phonetic decision tree; posterior distribution; speech recognition; variational Bayesian estimation and clustering; Bayesian methods; Clustering algorithms; Decision trees; Distributed computing; Hidden Markov models; Iterative methods; Laboratories; Maximum likelihood estimation; Speech recognition; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326110
  • Filename
    1326110