DocumentCode :
2176436
Title :
Eigentriphones: A basis for context-dependent acoustic modeling
Author :
Ko, Tom ; Mak, Brian
Author_Institution :
Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4892
Lastpage :
4895
Abstract :
In context-dependent acoustic modeling, it is important to strike a balance between detailed modeling and data sufficiency for robust estimation of model parameters. In the past, parameter sharing or tying is one of the most common techniques to solve the problem. In recent years, another technique which may be loosely and collectively called the subspace approach tries to express a phonetic or sub-phonetic unit in terms of a small set of canonical vectors or units. In this paper, we investigate the development of an eigenbasis over the triphones and model each triphone as a point in the basis. We call the eigenvectors in the basis eigentriphones. From another perspective, we investigate the use of the eigenvoice adaptation method as a general acoustic modeling method for training triphones - especially the less frequent triphones without tying their states so that all the triphones are really distinct from each other and thus may be more discriminative. Experimental evaluation on the 5K-vocabulary HUB2 recognition task shows that a triphone HMM system trained using only eigentriphones without state tying may achieve slightly better performance than the common tied-state triphones.
Keywords :
eigenvalues and eigenfunctions; hidden Markov models; speech recognition; 5K-vocabulary HUB2 recognition task; canonical units; canonical vectors; context-dependent acoustic modeling; eigentriphones; eigenvoice adaptation method; general acoustic modeling method; subspace approach; triphone HMM system; Acoustics; Adaptation models; Context modeling; Hidden Markov models; Speech; Speech recognition; Training; Eigenvoices; adaptation; context-dependent acoustic modeling; eigentriphones;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947452
Filename :
5947452
Link To Document :
بازگشت