• DocumentCode
    402499
  • Title

    Line separation for complex document images using fuzzy runlength

  • Author

    Shi, Zhixin ; Govindaraju, Venu

  • Author_Institution
    Center of Excellence for Document Anal. & Recognition, State Univ. of New York, Buffalo, NY, USA
  • fYear
    2004
  • fDate
    2004
  • Firstpage
    306
  • Lastpage
    312
  • Abstract
    A new text line location and separation algorithm for complex handwritten documents is proposed. The algorithm is based on the application of a fuzzy directional runlength. The proposed technique was tested on a variety of complex handwritten document images including postal parcel images and historical handwritten documents such as Newton´s and Galileo´s manuscripts. A preliminary testing showed a successful rate of 93% of the test set.
  • Keywords
    document image processing; handwritten character recognition; history; pattern classification; runlength codes; text analysis; Galileo manuscripts; Newton manuscripts; complex handwritten document images; fuzzy runlength; historical handwritten documents; postal parcel images; text line separation algorithm; Character recognition; Computational efficiency; Data mining; Graphics; Histograms; Nearest neighbor searches; Optical character recognition software; Testing; Text analysis; Venus;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Image Analysis for Libraries, 2004. Proceedings. First International Workshop on
  • Print_ISBN
    0-7695-2088-X
  • Type

    conf

  • DOI
    10.1109/DIAL.2004.1263259
  • Filename
    1263259