DocumentCode
3142318
Title
Integrated algorithms for newspaper page decomposition and article tracking
Author
Gatos, B. ; Mantzaris, S.L. ; Chandrinos, K.V. ; Tsigris, A. ; Perantonis, S.J.
Author_Institution
Lambrakis Press S.A., Athens, Greece
fYear
1999
fDate
20-22 Sep 1999
Firstpage
559
Lastpage
562
Abstract
The conversion of newspaper pages into digital resources is an important task that greatly contributes to the preservation of and access to newspaper archives. In this paper, an integrated methodology is presented for segmenting newspaper pages and identifying newspaper articles. In the first stage, a succession of image processing and document analysis algorithms is employed for segmenting newspaper page images into various objects (text, images and drawings, titles). A rule based approach is subsequently applied to the objects identified during the page segmentation phase for reconstructing individual articles. Experimental results, obtained from a large testbed of old newspaper issues, are presented which clearly demonstrate the applicability of our integrated approach to successful newspaper page segmentation and identification of newspaper articles
Keywords
document image processing; image segmentation; optical character recognition; visual databases; article tracking; document analysis; document preservation; experimental results; image processing; image segmentation; newspaper archives; newspaper page decomposition; page segmentation; rule based approach; Decision support systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location
Bangalore
Print_ISBN
0-7695-0318-7
Type
conf
DOI
10.1109/ICDAR.1999.791849
Filename
791849
Link To Document