• DocumentCode
    2172578
  • Title

    Markov model order optimization for text recognition

  • Author

    Olivier, Christian ; Jouzel, Frédéric ; Avila, Manuel

  • Author_Institution
    Rouen Univ., Mont-Saint-Aignan, France
  • Volume
    2
  • fYear
    1997
  • fDate
    18-20 Aug 1997
  • Firstpage
    548
  • Abstract
    Markov models are currently used for printed or handwritten word recognition. The order k is a very important parameter of these models. The aim of this paper is to use model selection criteria in order to estimate the order of a Markov model. Akaike (1973) suggested the AIC criterion for the estimation of the order k of a parameterized statistical model, including the term k as penalization of the likelihood function. Yet, selection according to this criterion leads asymptotically to a strict overestimation of the order. That is why we suggest the use of other consistent criteria in a Markovian case: the Bayesian and the Hannan and Quinn information criteria (BIC and ρ, respectively). The performance of the criteria are analysed on simulated data and on a real case: a handwritten word description. We discuss the limit of these methods in relation to the number of states in the model
  • Keywords
    Bayes methods; Markov processes; document image processing; handwriting recognition; maximum likelihood estimation; optical character recognition; optimisation; statistical analysis; AIC criterion; Bayesian method; Hannan information criteria; Markov model order optimization; Quinn information criteria; handwritten word description; handwritten word recognition; likelihood function; model selection criteria; overestimation; parameterized statistical model; performance; printed word recognition; simulated data; text recognition; Analytical models; Bayesian methods; Context modeling; Data analysis; Information theory; Markov processes; Performance analysis; Statistical analysis; Stochastic processes; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
  • Conference_Location
    Ulm
  • Print_ISBN
    0-8186-7898-4
  • Type

    conf

  • DOI
    10.1109/ICDAR.1997.620560
  • Filename
    620560