• DocumentCode
    3180268
  • Title

    A Bayesian-based probabilistic model for unconstrained handwritten offline Chinese text line recognition

  • Author

    Li, Nanxi ; Jin, Lianwen

  • Author_Institution
    Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
  • fYear
    2010
  • fDate
    10-13 Oct. 2010
  • Firstpage
    3664
  • Lastpage
    3668
  • Abstract
    A Bayesian-based probabilistic model is presented for unconstrained handwritten offline Chinese text line recognition. After pre-segmentation of a text line, plenty of invalid characters are produced which heavily interfere in the process of text line recognition. The proposed probabilistic model can incorporate isolated character recognition, character sample verification, and n-gram language model in a simple way, leading to more reliable recognition of a text line. When testing on HIT-MW database, experiments show that the proposed method can achieve character-level recognition accuracies of 63.19% without language model and 73.97% with bi-gram language model, respectively, outperforming the most recent results testing on the same dataset.
  • Keywords
    belief networks; handwritten character recognition; natural language processing; probability; text analysis; Bayesian-based probabilistic model; HIT-MW database; isolated character recognition; unconstrained handwritten offline Chinese text line recognition; Bayesian methods; Handwriting recognition; Text recognition; Chinese text line recognition; confidence measurement; handwritten Chinese character recognition; invalid character; verification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-6586-6
  • Type

    conf

  • DOI
    10.1109/ICSMC.2010.5641873
  • Filename
    5641873