DocumentCode :
3497697
Title :
Exploring the use of conditional random field models and HMMs for historical handwritten document recognition
Author :
Feng, Shaolei ; Manmatha, R. ; McCallum, Andrew
Author_Institution :
Center for Intelligent Inf. Retrieval, Univ. of Massachusetts, Amherst, MA
fYear :
2006
fDate :
27-28 April 2006
Lastpage :
37
Abstract :
In this paper we explore different approaches for improving the performance of dependency models on discrete features for handwriting recognition. Hidden Markov models have often been used for handwriting recognition. Conditional random fields (CRF´s) allow for more general dependencies and we investigate their use. We believe that this is the first attempt at apply CRF´s for handwriting recognition. We show that on the whole word recognition task, the CRF performs better than a HMM on a publicly available standard dataset of 20 pages of George Washington´s manuscripts. The scale space for the whole word recognition task is large - almost 1200 states. To make CRF computation tractable we use beam search to make inference more efficient using three different approaches. Better improvement can be obtained using the HMM by directly smoothing the discrete features using the collection frequencies. This shows the importance of smoothing and also indicates the difficulty of training CRF´s when large state spaces are involved
Keywords :
document image processing; handwriting recognition; hidden Markov models; George Washington manuscripts; HMM; conditional random field models; conditional random fields; handwriting recognition; hidden Markov models; historical handwritten document recognition; whole word recognition; Character recognition; Frequency; Handwriting recognition; Hidden Markov models; Information retrieval; Labeling; Libraries; Smoothing methods; State-space methods; Zoology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Image Analysis for Libraries, 2006. DIAL '06. Second International Conference on
Conference_Location :
Lyon
Print_ISBN :
0-7695-2531-8
Type :
conf
DOI :
10.1109/DIAL.2006.19
Filename :
1612944
Link To Document :
بازگشت