DocumentCode :
153141
Title :
Non-stationary modeling for the separation of overlapped texts in documents
Author :
Tonazzini, A. ; Savino, Pasquale ; Salerno, Emanuele
Author_Institution :
Ist. di Sci. e Tecnol. dell´Inf., Pisa, Italy
fYear :
2014
fDate :
23-25 April 2014
Firstpage :
2314
Lastpage :
2318
Abstract :
In this paper, we address the removal of severe back-to-front interferences in archival documents, when recto and verso images of the page are available. The problem is approached from a modeling point of view, considering the ideal images of the two separated texts as individual source patterns that overlap in the observed images through some parametric mixing operator. Earlier approaches were based on linear mixtures of the ideal reflectance maps, or of the ideal optical densities and absorptance maps, through unknown coefficients or blur kernels. Some approximations and/or partial user supervision were then adopted to jointly estimate the sources and the model parameters. Nevertheless, a feasible and reliable data model for this problem should at least be non-linear and space-variant, to cope with occlusions, ink saturation, and large variability of the mixing level. This is especially true for ancient documents affected by ink seeping (bleed-through). The search for such a model is still far from being concluded, or even impossible to pursue, due to the unavailability of information about the chemical and physical processes at the origin of the phenomenon. Hence, here, we propose the use of pixel-dependent parameters, within a model additive in the optical densities, to compensate not only for non-stationarity, but also for the lack or the imprecise knowledge of the non-linearity, and for modeling errors more in general.
Keywords :
document image processing; image restoration; text analysis; absorptance maps; archival documents; back-to-front interferences; blur kernels; chemical process; ink saturation; mixing level variability; nonstationary modeling; occlusions; optical densities; overlapped text separation; parametric mixing operator; physical process; pixel-dependent parameters; reflectance maps; source patterns; Adaptive optics; Data models; Image restoration; Ink; Interference; Optical imaging; Signal processing; Document restoration; non-stationary data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2014 22nd
Conference_Location :
Trabzon
Type :
conf
DOI :
10.1109/SIU.2014.6830727
Filename :
6830727
Link To Document :
بازگشت