Title :
An efficient preprocessing block for the middle-age Persian manuscripts
Author :
Alirezaee, Shahpour ; Aghaeinia, Hassan ; Faez, Karim ; Rashidzadeh, Rashid
Author_Institution :
Dept. of Electr. Eng., Amirkabir Univ. of Technol., Tehran
Abstract :
In this paper, a preprocessing block for the middle-age Persian documents is proposed. The main idea is based on the mathematical morphology, connected components and clustering. The proposed algorithm is capable to simultaneously remove the noise and segment the manuscript to its basic components i.e. lines, words and characters. The proposed strategy has been tested on 200 page of the middle-age Persian. We have also used the success of the k-means algorithm on page to line segmentation as a criterion for the performance evaluation on the test data. The results show the proposed algorithm has 98.12% accuracy on page to line segmentation
Keywords :
document image processing; handwritten character recognition; mathematical morphology; mathematical morphology; middle-age Persian manuscripts; page to line segmentation; performance evaluation; preprocessing block; Character recognition; Chromium; Clustering algorithms; Gray-scale; Handwriting recognition; Histograms; Image segmentation; Morphology; Smoothing methods; Testing;
Conference_Titel :
Electrical and Computer Engineering, 2005. Canadian Conference on
Conference_Location :
Saskatoon, Sask.
Print_ISBN :
0-7803-8885-2
DOI :
10.1109/CCECE.2005.1557418