Title :
The Maurdor Project: Improving Automatic Processing of Digital Documents
Author :
Brunessaux, Stephan ; Giroux, Patrick ; Grilheres, Bruno ; Manta, Mathieu ; Bodin, Maylis ; Choukri, Khalid ; Galibert, Olivier ; Kahn, Juliette
Author_Institution :
Airbus Defence & Space, Val-de-Reuil, France
Abstract :
This paper presents the achievements of an experimental project called Maurdor (Moyens AUtomatisés deReconnaissance de Documents ecRits - Automatic Processingof Digital Documents) funded by the French DGA that aims at improving processing technologies for handwritten and typewritten documents in French, English and Arabic. The first part describes the context and objectives of the project. The second part deals with the challenge of creating a realistic corpus of 10,000 annotated documents to support the efficient development and evaluation of processing modules. The third part presents the organisation, metric definition and results of the Maurdor International evaluation campaign. The last part presents the Maurdor demonstrator with a functional and technical perspective.
Keywords :
document image processing; handwriting recognition; natural language processing; optical character recognition; Arabic; English; French; Maurdor International evaluation campaign; Maurdor Project; Moyens automatisés dereconnaissance de documents ecrits; annotated documents; automatic digital document processing; functional perspective; handwritten documents; metric definition; optical character recognition; technical perspective; typewritten documents; Databases; Facsimile; Measurement; Optical character recognition software; Semantics; Text recognition; Writing; SOA; corpus creation; demonstrator; evaluation campaign; processing chain; writing recognition;
Conference_Titel :
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
Conference_Location :
Tours
Print_ISBN :
978-1-4799-3243-6
DOI :
10.1109/DAS.2014.58