• DocumentCode
    2489638
  • Title

    Generic scale-space process for handwriting documents analysis

  • Author

    Joutel, Guillaume ; Églin, Véronique ; Emptoz, Hubert

  • Author_Institution
    LIRIS, CNRS, Villeurbanne
  • fYear
    2008
  • fDate
    8-11 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper presents a generic architecture for handwriting documents analysis. It covers all analysis steps from the content description of the document (layout analysis, handwriting shape characterization) to three dedicated Digital Libraries applications (CBIR in great ancient documents images database, Paleo-graphical images classification and word spotting). The generic scale space tool is based on the Curvelets decomposition of images for the indexation of linear singularities of handwritten shapes. The proposed scheme for handwritten shape characterization targets to detect oriented and curved fragments at different scales: it is used in a first step to extract visual textual interest regions and secondly to use the Curvelets coefficients in various ways to satisfy the three designed applications. The complete implementation scheme is validated with a specific application of word spotting based on the orientations analysis. The proposed method is language independent and only visual orientation and appearance based. In that context, no lexical information nor any other statistical language models are required. The first proposed tests for this application are proposed on medieval documents images and on European 18th century correspondences corpus from the CERPHI. Precision-recall analysis testifies the relevance of the contribution.
  • Keywords
    database indexing; document image processing; handwriting recognition; object detection; shape recognition; transforms; content description; curvelet decomposition; digital library; generic scale-space process; handwriting documents analysis; handwritten shape characterization; handwritten shape indexing; linear singularity; target detection; visual textual interest region extraction; Anisotropic magnetoresistance; Image analysis; Image databases; Image retrieval; Image storage; Robustness; Shape; Software libraries; Testing; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
  • Conference_Location
    Tampa, FL
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-2174-9
  • Electronic_ISBN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2008.4761827
  • Filename
    4761827