DocumentCode :
2875203
Title :
Selective EM training of acoustic models based on sufficient statistics of single utterances
Author :
Cincarek, Tobias ; Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol.
fYear :
2005
fDate :
27-27 Nov. 2005
Firstpage :
168
Lastpage :
173
Abstract :
In this paper, a new algorithm for selective training of acoustic models is proposed. The algorithm is formulated for an HMM-based model with Gaussian mixture densities, but works in principle for any statistical model, which has sufficient statistics. Since there are too many possibilities for selecting a data subset from a larger database, a heuristic has to be employed. The algorithm is based on deleting single utterances from a data pool temporarily or alternating between successive deletion or addition of utterances. The optimization criterion is the likelihood of the new model parameters given some development data, which can be calculated in a short amount of time based on sufficient statistics. The method is applied to automatically obtain task-dependent acoustic models for infant and elderly speech by selecting utterances from a data pool which are acoustically close to the development data. The proposed method is computationally practical and also addresses the issue of reducing the high costs evolving from the development of applications which make use of speech recognition technology
Keywords :
Gaussian processes; acoustic signal processing; expectation-maximisation algorithm; hidden Markov models; speech recognition; speech synthesis; EM training; Gaussian mixture densities; HMM; elderly speech; infant speech; single utterances; speech recognition; task-dependent acoustic models; Acoustic applications; Costs; Databases; Hidden Markov models; Information science; Senior citizens; Speech processing; Speech recognition; Statistics; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location :
San Juan
Print_ISBN :
0-7803-9478-X
Electronic_ISBN :
0-7803-9479-8
Type :
conf
DOI :
10.1109/ASRU.2005.1566486
Filename :
1566486
Link To Document :
بازگشت