• DocumentCode
    1580639
  • Title

    DMOS: a generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table structures recognition systems

  • Author

    Coüasnon, Bertrand

  • Author_Institution
    Dept. Inf., IRISA/INSA, Rennes, France
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    215
  • Lastpage
    220
  • Abstract
    Genericity in structured document recognition is a difficult challenge. We therefore propose a new generic document recognition method, called DMOS (Description and MOdification of Segmentation), that is made up of a new grammatical formalism, called EPF (Enhanced Position Formalism), and an associated parser which is able to introduce context in segmentation. We implement this method to obtain a generator of document recognition systems. This generator can automatically produce new recognition systems. It is only necessary to describe the document with an EPF grammar, which is then simply compiled. In this way, we have developed various recognition systems: one on musical scores, one on mathematical formulae and one on recursive table structures. We have also defined a specific application to damaged military forms of the 19th century. We have been able to test the generated system on 5,000 of these military forms. This has permitted us to validate the DMOS method on a real-world application.
  • Keywords
    data structures; document image processing; grammars; history; image recognition; image segmentation; mathematics computing; military computing; music; program compilers; DMOS; EPF grammatical formalism; Enhanced Position Formalism; automatic musical score generator; compilation; damaged military forms; generic document recognition method; mathematical formulae; parser; recursive table structure recognition; segmentation context; system testing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    0-7695-1263-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2001.953786
  • Filename
    953786