• DocumentCode
    419633
  • Title

    Off-line handwritten textline recognition using a mixture of natural and synthetic training data

  • Author

    Varga, Tamás ; Bunke, Horst

  • Author_Institution
    Institut fur Informatik und angewandte Mathematik, Bern Univ., Switzerland
  • Volume
    2
  • fYear
    2004
  • fDate
    23-26 Aug. 2004
  • Firstpage
    545
  • Abstract
    In this paper the problem of off-line handwritten cursive text recognition is considered. A method for expanding the set of available training textlines by applying random perturbations is presented. The goal is to improve the recognition performance of an off-line handwritten textline recognizer by providing it with additional synthetic training data. Three important issues - quality, variability, and capacity - related to this method are discussed, and a basic strategy to make use of the possibility of expanding the training set by synthetic textlines is proposed. It is shown that significant improvement of the recognition performance is possible even when the original training set is large and the textlines are provided by many different writers.
  • Keywords
    handwritten character recognition; learning (artificial intelligence); optical character recognition; offline handwritten cursive textline recognition; synthetic training data; training textlines; Character recognition; Handwriting recognition; Humans; Nonlinear distortion; Nonlinear optics; Optical character recognition software; Optical distortion; Text recognition; Thumb; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-2128-2
  • Type

    conf

  • DOI
    10.1109/ICPR.2004.1334299
  • Filename
    1334299