DocumentCode :
1194879
Title :
Study of a Fast Discriminative Training Algorithm for Pattern Recognition
Author :
Qi Li ; Biing-Hwang Juang
Author_Institution :
Bell Labs., Lucent Technol., Florham Park, NJ
Volume :
17
Issue :
5
fYear :
2006
Firstpage :
1212
Lastpage :
1221
Abstract :
Discriminative training refers to an approach to pattern recognition based on direct minimization of a cost function commensurate with the performance of the recognition system. This is in contrast to the procedure of probability distribution estimation as conventionally required in Bayes´ formulation of the statistical pattern recognition problem. Currently, most discriminative training algorithms for nonlinear classifier designs are based on gradient-descent (GD) methods for cost minimization. These algorithms are easy to derive and effective in practice, but are slow in training speed and have difficulty selecting the learning rates. To address the problem, we present our study on a fast discriminative training algorithm. The algorithm initializes the parameters by the expectation-maximization (EM) algorithm, and then uses a set of closed-form formulas derived in this paper to further optimize a proposed objective of minimizing error rate. Experiments in speech applications show that the algorithm provides better recognition accuracy in a fewer iterations than the EM algorithm and a neural network trained by hundreds of GD iterations. Although some convergent properties need further research, the proposed objective and derived formulas can benefit further study of the problem
Keywords :
Bayes methods; expectation-maximisation algorithm; gradient methods; neural nets; pattern recognition; probability; Bayes formulation; cost minimization; expectation-maximization algorithm; fast discriminative training algorithm; gradient-descent methods; neural network; pattern recognition; probability distribution estimation; Backpropagation algorithms; Cost function; Hidden Markov models; Maximum likelihood estimation; Minimization methods; Multilayer perceptrons; Neural networks; Pattern recognition; Probability distribution; Speech recognition; Classification; EM algorithm; discriminative training; neural networks; pattern recognition; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Computing Methodologies; Discriminant Analysis; Models, Statistical; Neural Networks (Computer); Pattern Recognition, Automated;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/TNN.2006.875992
Filename :
1687931
Link To Document :
بازگشت