• DocumentCode
    1804984
  • Title

    Detection and correction of deformed historical arabic manuscripts

  • Author

    El-etriby, Sherif Said ; Amin, Khalid Mohammad

  • Author_Institution
    Fac. of Comput. & Inf., Menoufia Univ., Shebin-El-Kom, Egypt
  • fYear
    2010
  • fDate
    11-12 May 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Historical manuscripts are considered one of the most imperative human riches and a source of intellectual production. Unfortunately, due to aging effects, multiple noises and deviations are found in the document image. Moreover, challenges for several images of ancient documents show defects of inclinations and curvatures of text lines. These defects arise due to bad storage conditions, or during the digitization process. In order to improve the readability and the automatic recognition of historical Arabic manuscripts, preprocessing steps are imperative. This paper presents a novel method that consists of two major phases. The first refer to binarization and enhancement of the scanned document image. In the second phase, correction of skew angle in the text line passes by the detection of curvature/inclination of the baseline. Then, calculating the skewed angle of this line, and finally, correcting the line with a rotation relative to its centre. The proposed method was implemented on different scanned Arabic documents. The proposed methodology overcomes the defects of global binarization method, also, save the high computation effort of adaptive binarization techniques. Moreover, it works well with both Arabic handwritten words and printed text.
  • Keywords
    document image processing; image enhancement; image restoration; natural languages; text analysis; automatic recognition; curvature detection; deformed historical Arabic manuscript correction; deformed historical Arabic manuscript detection; digitization process; scanned document image binarization; scanned document image enhancement; skew angle correction; Computers; Feature extraction; Linear regression; Noise; Optical character recognition software; Pixel; Skeleton; Ancient documents; Arabic Baseline; correction of curvature; correction of inclination; preprocessing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Communication Engineering (ICCCE), 2010 International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4244-6233-9
  • Type

    conf

  • DOI
    10.1109/ICCCE.2010.5556860
  • Filename
    5556860