DocumentCode
730278
Title
Blind bleed-through removal for scanned historical document images with conditional random fields
Author
Bin Sun ; Shutao Li ; Jun Sun
Author_Institution
Coll. of Electr. & Inf. Eng., Hunan Univ., Changsha, China
fYear
2015
fDate
19-24 April 2015
Firstpage
1652
Lastpage
1656
Abstract
Due to the quality of paper and long-time preservation, the ink on one side of the historical documents often seeps through and appears on the other side. In this paper, a new blind ink bleed-through removal method is proposed to deal with the scanned historical document images. The scanned historical document image generally consists of three components: foreground, bleed-through and background. In the proposed method, conditional probability distribution (CPD) models of the three components are firstly established by statistics. Then, conditional random fields (CRFs) are used to model the observed scanned image and the corresponding labels. For each input scanned image, parameters of the component-wise CPD models are estimated and belief propagation is performed on the CRFs model to determine the most possible labels. Once the bleed-through component is found, an inpainting algorithm is proposed to remove the ink bleed-through from the input historical image. Experimental results show that the proposed method preserves the foreground component very well and removes the bleed-through effectively.
Keywords
document image processing; history; probability; blind bleed-through removal; component-wise CPD models; conditional probability distribution model; conditional random fields; scanned historical document images; Computational modeling; Hidden Markov models; Image segmentation; Ink; Labeling; Measurement; Radio frequency;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178251
Filename
7178251
Link To Document