DocumentCode :
3490383
Title :
ICDAR 2013 Competition on Book Structure Extraction
Author :
Doucet, Arnaud ; Kazai, Gabriella ; Colutto, Sebastian ; Muhlberger, Gunter
Author_Institution :
Univ. of Normandy - Unicaen, Caen, France
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
1438
Lastpage :
1443
Abstract :
This paper summarizes the 3rd Book Structure Extraction competition that was run at the ICDAR 2013. Its goal is to evaluate and compare automatic techniques for deriving structure information from digitized books, which could then be used to aid navigation inside the books. More specifically, the task that participants are faced with is to construct hyper linked tables of contents for a collection of 1,000 digitized books. This paper reviews the setup of the competition, the book collection used in the task, and the measures used for the evaluation. The main novelty of the 2013 competition is that we were able to rely on an external provider for the ground truthing phase, hence granting the consistency of the evaluation. In addition, this permitted to nearly double the number of annotated books from the 1,040 books annotated in 2009 and 2011 to over 2,000 books. The paper further presents the result performance of the 6 participating research teams, and briefly summarizes their approaches.
Keywords :
electronic publishing; information retrieval; ICDAR 2013 competition; annotated books; book collection; book navigation; book structure extraction; digitized books; ground truthing phase; hyper linked tables; international conference on document analysis and recognition; structure information; Data mining; Educational institutions; Navigation; Optical character recognition software; Organizations; Portable document format; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.290
Filename :
6628851
Link To Document :
بازگشت