Title :
Minimum Risk Training for Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields
Author :
Xiang-Dong Zhou ; Feng Tian ; Cheng-Lin Liu ; Hong-An Wang
Author_Institution :
Beijing Key Lab. of Human-Comput. Interaction, Inst. of Software, Beijing, China
Abstract :
Semi-Markov conditional random fields (semi-CRFs) are usually trained with maximum a posteriori (MAP) criterion which adopts the 0/1 cost for measuring the loss of misclassification. In this paper, based on our previous work on handwritten Chinese/Japanese text recognition (HCTR) using semi-CRFs, we propose an alternative parameter learning method by minimizing the risk, in which the misclassification costs are not equal, but different depending on the hypothesis and the ground-truth. The proposed method is lattice-based, i.e., the hypothesis space is the entire lattice on which the semi-CRF is defined. Experimental results on two online handwriting databases: CASIA-OLHWDB and TUAT Kondate demonstrate that minimum-risk training can yield superior string recognition rates compared to MAP training.
Keywords :
Markov processes; handwritten character recognition; learning (artificial intelligence); maximum likelihood estimation; CASIA-OLHWDB database; HCTR; MAP criterion; MAP training; TUAT Kondate database; alternative parameter learning method; handwritten Chinese text recognition; handwritten Japanese text recognition; maximum a posteriori criterion; minimum risk training; misclassification costs; semiCRF; semiMarkov conditional random fields; string recognition rates; Character recognition; Cost function; Databases; Error analysis; High definition video; Lattices; Training;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.191