• DocumentCode
    3142318
  • Title

    Integrated algorithms for newspaper page decomposition and article tracking

  • Author

    Gatos, B. ; Mantzaris, S.L. ; Chandrinos, K.V. ; Tsigris, A. ; Perantonis, S.J.

  • Author_Institution
    Lambrakis Press S.A., Athens, Greece
  • fYear
    1999
  • fDate
    20-22 Sep 1999
  • Firstpage
    559
  • Lastpage
    562
  • Abstract
    The conversion of newspaper pages into digital resources is an important task that greatly contributes to the preservation of and access to newspaper archives. In this paper, an integrated methodology is presented for segmenting newspaper pages and identifying newspaper articles. In the first stage, a succession of image processing and document analysis algorithms is employed for segmenting newspaper page images into various objects (text, images and drawings, titles). A rule based approach is subsequently applied to the objects identified during the page segmentation phase for reconstructing individual articles. Experimental results, obtained from a large testbed of old newspaper issues, are presented which clearly demonstrate the applicability of our integrated approach to successful newspaper page segmentation and identification of newspaper articles
  • Keywords
    document image processing; image segmentation; optical character recognition; visual databases; article tracking; document analysis; document preservation; experimental results; image processing; image segmentation; newspaper archives; newspaper page decomposition; page segmentation; rule based approach; Decision support systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    0-7695-0318-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1999.791849
  • Filename
    791849