Title :
Accelerating generalized iterative scaling using componentwise extrapolations for on-line conditional random fields
Author :
Yang, Hee-Deok ; Lee, Seong-Whan
Author_Institution :
Sch. of Comput. Eng., Chosun Univ., Gwangju, South Korea
Abstract :
In this paper, a simple and globally convergent method based on penalized generalized iterative scaling (GIS) with staggered Aitken acceleration is proposed to efficiently estimate the parameters for an on-line conditional random field (CRF). The staggered Aitken acceleration method, which alternates between an acceleration step and a non-acceleration step, provides numerical stability and computational simplicity in analyzing the incompleteness of data. The proposed method is based on stochastic gradient descent (SGD) and it has the following advantages: (1) it can approximate parameters close to the empirical optimum in a single pass through the training examples; (2) it can reduce the computing time by eliminating computation of the inverse of the objective function´s Hessian matrix with staggered Aitken acceleration. We show the convergence of penalized GIS based on the staggered Aitken acceleration method, compare its speed of convergence with that of other stochastic optimization methods, and also illustrate experimental results with public data sets.
Keywords :
Hessian matrices; gradient methods; learning (artificial intelligence); matrix inversion; numerical stability; random processes; Hessian matrix; componentwise extrapolation; generalized iterative scaling; inverse matrix; numerical stability; online conditional random field; staggered Aitken acceleration; stochastic gradient descent; Acceleration; Convergence; Cybernetics; Geographic Information Systems; Machine learning; Stochastic processes; Training; Aitken acceleration; Conditional random field; incremental learning; on-line learning;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-1-4244-6526-2
DOI :
10.1109/ICMLC.2010.5580707