DocumentCode
1580639
Title
DMOS: a generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table structures recognition systems
Author
Coüasnon, Bertrand
Author_Institution
Dept. Inf., IRISA/INSA, Rennes, France
fYear
2001
fDate
2001
Firstpage
215
Lastpage
220
Abstract
Genericity in structured document recognition is a difficult challenge. We therefore propose a new generic document recognition method, called DMOS (Description and MOdification of Segmentation), made up of a new grammatical formalism, called EPF (Enhanced Position Formalism), and an associated parser that can introduce context into segmentation. We implement this method to obtain a generator of document recognition systems, which can automatically produce new recognition systems: it is only necessary to describe the document with an EPF grammar, which is then simply compiled. In this way, we have developed various recognition systems: one for musical scores, one for mathematical formulae and one for recursive table structures. We have also defined a specific application to damaged military forms of the 19th century. We tested the generated system on 5,000 of these military forms, which allowed us to validate the DMOS method on a real-world application.
Keywords
data structures; document image processing; grammars; history; image recognition; image segmentation; mathematics computing; military computing; music; program compilers; DMOS; EPF grammatical formalism; Enhanced Position Formalism; automatic musical score generator; compilation; damaged military forms; generic document recognition method; mathematical formulae; parser; recursive table structure recognition; segmentation context; system testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location
Seattle, WA
Print_ISBN
0-7695-1263-1
Type
conf
DOI
10.1109/ICDAR.2001.953786
Filename
953786