Title :
A learning framework for degraded document image binarization using Markov Random Field
Author :
Bolan Su ; Shijian Lu ; Chew Lim Tan
Author_Institution :
Dept. of Comput. Sci., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
Document image binarization is an important preprocessing technique for document image analysis that segments the text from the document image backgrounds. Many techniques have been proposed and successfully applied in different applications, such as document image retrieval. However, these techniques may perform poorly on degraded document images. In this paper, we propose a learning framework that makes use of the Markov Random Field to improve the performance of the existing document image binarization methods for those degraded document images. Extensive experiments on the recent Document Image Bina-rization Contest datasets demonstrate that significant improvements of the existing binarization methods when applying our proposed framework.
Keywords :
Markov processes; document image processing; image retrieval; image segmentation; learning (artificial intelligence); random processes; text analysis; Markov random field; degraded document image binarization methods; document image analysis; document image backgrounds; document image binarization contest datasets; document image retrieval; learning framework; text segmentation; Equations; Image edge detection; Markov random fields; Mathematical model; Text analysis; Vectors;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4