• DocumentCode
    3340948
  • Title

    Pre-Printed and Hand-Filled Table-Form Analysis Aiming Cell Extraction

  • Author

    Felipe, Rafaela Dandolini ; Neves, Luiz Antonio Pereira

  • Author_Institution
    PUCPR, Pontifical Catholic Univ. of Parana, Parana
  • fYear
    2008
  • fDate
    16-19 Sept. 2008
  • Firstpage
    439
  • Lastpage
    443
  • Abstract
    This paper presents an approach to extract the structure of pre-printed and hand-filled table-forms. The first module performs the cell identification based on Watershed transform. A second module detects the wrong cells produced by handwritten and/or pre-printed data. In this module, wrong cells and other cells are filtered by a compactness, perimeter and area analysis. In a third module, the wrong cells are merged with other cells to determine the exact structure. A miscellaneous database composed of 300 pre-printed and hand-filled table-form images was used to evaluate the efficiency of our methodology. Experiments showed significant and promising results.
  • Keywords
    document image processing; edge detection; feature extraction; visual databases; cell extraction; hand-filled table-form analysis; pre-printed table-form analysis; watershed transform; Data mining; Gray-scale; Image databases; Image recognition; Image segmentation; Lakes; Pollution; Surface morphology; Surface topography; Text analysis; Image processing; Pattern recognition; Table-form document;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on
  • Conference_Location
    Nara
  • Print_ISBN
    978-0-7695-3337-7
  • Type

    conf

  • DOI
    10.1109/DAS.2008.46
  • Filename
    4669992