DocumentCode
2172578
Title
Markov model order optimization for text recognition
Author
Olivier, Christian ; Jouzel, Frédéric ; Avila, Manuel
Author_Institution
Rouen Univ., Mont-Saint-Aignan, France
Volume
2
fYear
1997
fDate
18-20 Aug 1997
Firstpage
548
Abstract
Markov models are currently used for printed or handwritten word recognition. The order k is a very important parameter of these models. The aim of this paper is to use model selection criteria in order to estimate the order of a Markov model. Akaike (1973) suggested the AIC criterion for the estimation of the order k of a parameterized statistical model, including the term k as penalization of the likelihood function. Yet, selection according to this criterion leads asymptotically to a strict overestimation of the order. That is why we suggest the use of other consistent criteria in a Markovian case: the Bayesian and the Hannan and Quinn information criteria (BIC and ρ, respectively). The performance of the criteria are analysed on simulated data and on a real case: a handwritten word description. We discuss the limit of these methods in relation to the number of states in the model
Keywords
Bayes methods; Markov processes; document image processing; handwriting recognition; maximum likelihood estimation; optical character recognition; optimisation; statistical analysis; AIC criterion; Bayesian method; Hannan information criteria; Markov model order optimization; Quinn information criteria; handwritten word description; handwritten word recognition; likelihood function; model selection criteria; overestimation; parameterized statistical model; performance; printed word recognition; simulated data; text recognition; Analytical models; Bayesian methods; Context modeling; Data analysis; Information theory; Markov processes; Performance analysis; Statistical analysis; Stochastic processes; Text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location
Ulm
Print_ISBN
0-8186-7898-4
Type
conf
DOI
10.1109/ICDAR.1997.620560
Filename
620560
Link To Document