DocumentCode :
2765302
Title :
An efficient preprocessing block for the middle-age Persian manuscripts
Author :
Alirezaee, Shahpour ; Aghaeinia, Hassan ; Faez, Karim ; Rashidzadeh, Rashid
Author_Institution :
Dept. of Electr. Eng., Amirkabir Univ. of Technol., Tehran
fYear :
2005
fDate :
1-4 May 2005
Firstpage :
2170
Lastpage :
2173
Abstract :
In this paper, a preprocessing block for the middle-age Persian documents is proposed. The main idea is based on the mathematical morphology, connected components and clustering. The proposed algorithm is capable to simultaneously remove the noise and segment the manuscript to its basic components i.e. lines, words and characters. The proposed strategy has been tested on 200 page of the middle-age Persian. We have also used the success of the k-means algorithm on page to line segmentation as a criterion for the performance evaluation on the test data. The results show the proposed algorithm has 98.12% accuracy on page to line segmentation
Keywords :
document image processing; handwritten character recognition; mathematical morphology; mathematical morphology; middle-age Persian manuscripts; page to line segmentation; performance evaluation; preprocessing block; Character recognition; Chromium; Clustering algorithms; Gray-scale; Handwriting recognition; Histograms; Image segmentation; Morphology; Smoothing methods; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering, 2005. Canadian Conference on
Conference_Location :
Saskatoon, Sask.
ISSN :
0840-7789
Print_ISBN :
0-7803-8885-2
Type :
conf
DOI :
10.1109/CCECE.2005.1557418
Filename :
1557418
Link To Document :
بازگشت