• DocumentCode
    2174005
  • Title

    Clustering of bootstrapped acoustic model with full covariance

  • Author

    Chen, Xin ; Cui, Xiaodong ; Xue, Jian ; Olsen, Peder ; Hershey, John ; Zhou, Bowen ; Zhao, Yunxin

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4496
  • Lastpage
    4499
  • Abstract
    HMM-based acoustic models built from bootstrap are generally very large, especially when full covariance matrices are used for Gaussians. Therefore, clustering is needed to compact the acoustic model to a reasonable size for practical applications. This paper discusses and investigates multiple distance measurements and algorithms for the clustering. The distance measurements include Entropy, KL, Bhattacharyya, Chernoff and their weighted versions. For clustering algorithms, besides conventional greedy bottom-up, algorithms such as N-Best distance Refinement (NBR), K-step Look-Ahead (KLA), Breadth-First Searched (BFS) best path are proposed. A two-pass optimization approach is also proposed to improve the model structure. Experiments in the Bootstrap and Restructuring (B SRS) frame work on Pashto show that the discussed clustering approach can lead to better quality of the restructured model. It also shows that final acoustic model that is diagonalized from the full covariance yields good improvement over BSRS model directly with diagonal model and yields significant improvement over the conventional diagonal model.
  • Keywords
    Gaussian processes; covariance matrices; hidden Markov models; speech recognition; Breadth-first search; Gaussian components; HMM-based acoustic models; K-step Look-Ahead; N-best distance refinement; bootstrapped acoustic model; covariance matrices; Acoustic beams; Acoustics; Clustering algorithms; Data models; Entropy; Hidden Markov models; Optimization; Acoustic modeling; Clustering; Full covariance; Global search optimization; K-step lookahead;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947353
  • Filename
    5947353