• DocumentCode
    1640732
  • Title

    ICDAR 2009 Book Structure Extraction Competition

  • Author

    Doucet, Antoine ; Kazai, Gabriella ; Dresevic, Bodin ; Uzelac, Aleksandar ; Radakovic, Bogdan ; Todic, Nikola

  • Author_Institution
    Univ. of Caen, Caen, France
  • fYear
    2009
  • Firstpage
    1408
  • Lastpage
    1412
  • Abstract
    This paper introduces the Book Structure Extraction competition run at ICDAR 2009. The goal of the competition is to evaluate and compare automatic techniques for deriving structure information from digitized books, which could then be used to aid navigation inside the books. More specifically, the task that participants are faced with is to construct hyperlinked tables of contents for a collection of 1,000 digitized books. This paper describes the setup of the competition, the book collection used in the task, and the proposed measures for the evaluation. Results of the evaluation will be presented at the ICDAR 2009 conference and will be published in the INEX 2009 proceedings.
  • Keywords
    electronic publishing; information retrieval; ICDAR 2009; book collection; book structure extraction competition; digitized books; hyperlinked tables; Books; Data mining; Focusing; Image converters; Information retrieval; Navigation; Optical character recognition software; Testing; Text analysis; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.280
  • Filename
    5277791