DocumentCode
1804984
Title
Detection and correction of deformed historical arabic manuscripts
Author
El-etriby, Sherif Said ; Amin, Khalid Mohammad
Author_Institution
Fac. of Comput. & Inf., Menoufia Univ., Shebin-El-Kom, Egypt
fYear
2010
fDate
11-12 May 2010
Firstpage
1
Lastpage
6
Abstract
Historical manuscripts are considered one of the most imperative human riches and a source of intellectual production. Unfortunately, due to aging effects, multiple noises and deviations are found in the document image. Moreover, challenges for several images of ancient documents show defects of inclinations and curvatures of text lines. These defects arise due to bad storage conditions, or during the digitization process. In order to improve the readability and the automatic recognition of historical Arabic manuscripts, preprocessing steps are imperative. This paper presents a novel method that consists of two major phases. The first refer to binarization and enhancement of the scanned document image. In the second phase, correction of skew angle in the text line passes by the detection of curvature/inclination of the baseline. Then, calculating the skewed angle of this line, and finally, correcting the line with a rotation relative to its centre. The proposed method was implemented on different scanned Arabic documents. The proposed methodology overcomes the defects of global binarization method, also, save the high computation effort of adaptive binarization techniques. Moreover, it works well with both Arabic handwritten words and printed text.
Keywords
document image processing; image enhancement; image restoration; natural languages; text analysis; automatic recognition; curvature detection; deformed historical Arabic manuscript correction; deformed historical Arabic manuscript detection; digitization process; scanned document image binarization; scanned document image enhancement; skew angle correction; Computers; Feature extraction; Linear regression; Noise; Optical character recognition software; Pixel; Skeleton; Ancient documents; Arabic Baseline; correction of curvature; correction of inclination; preprocessing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Communication Engineering (ICCCE), 2010 International Conference on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4244-6233-9
Type
conf
DOI
10.1109/ICCCE.2010.5556860
Filename
5556860
Link To Document