DocumentCode :
2174005
Title :
Clustering of bootstrapped acoustic model with full covariance
Author :
Chen, Xin ; Cui, Xiaodong ; Xue, Jian ; Olsen, Peder ; Hershey, John ; Zhou, Bowen ; Zhao, Yunxin
Author_Institution :
Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4496
Lastpage :
4499
Abstract :
HMM-based acoustic models built from bootstrap are generally very large, especially when full covariance matrices are used for Gaussians. Therefore, clustering is needed to compact the acoustic model to a reasonable size for practical applications. This paper discusses and investigates multiple distance measurements and algorithms for the clustering. The distance measurements include Entropy, KL, Bhattacharyya, Chernoff and their weighted versions. For clustering algorithms, besides conventional greedy bottom-up, algorithms such as N-Best distance Refinement (NBR), K-step Look-Ahead (KLA), Breadth-First Searched (BFS) best path are proposed. A two-pass optimization approach is also proposed to improve the model structure. Experiments in the Bootstrap and Restructuring (B SRS) frame work on Pashto show that the discussed clustering approach can lead to better quality of the restructured model. It also shows that final acoustic model that is diagonalized from the full covariance yields good improvement over BSRS model directly with diagonal model and yields significant improvement over the conventional diagonal model.
Keywords :
Gaussian processes; covariance matrices; hidden Markov models; speech recognition; Breadth-first search; Gaussian components; HMM-based acoustic models; K-step Look-Ahead; N-best distance refinement; bootstrapped acoustic model; covariance matrices; Acoustic beams; Acoustics; Clustering algorithms; Data models; Entropy; Hidden Markov models; Optimization; Acoustic modeling; Clustering; Full covariance; Global search optimization; K-step lookahead;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947353
Filename :
5947353
Link To Document :
بازگشت