• DocumentCode
    146783
  • Title

    Detection of table structure and content extraction from scanned documents

  • Author

    Deivalakshmi, S. ; Chaitanya, K. ; Palanisamy, P.

  • Author_Institution
    Dept. of Electron. & Commun. Eng., Nat. Inst. of Technol., Trichy, India
  • fYear
    2014
  • fDate
    3-5 April 2014
  • Firstpage
    270
  • Lastpage
    274
  • Abstract
    Tables are one of the efficient information conveying methods used now days in larger extent. This paper report a fast, language independent (English and Tamil), skilled technique for table structure detection and its content extraction from a scanned document image based on morphological operation, connected components and labeling. From the conducted exhaustive experimentation, it is observed that the proposed method is the fastest approach because of its simple operations. In addition with that it is noticed that it does not lead to any kind of degradation in the extracted table content since after detecting contents location it is retrieved from the original image. More over it is also very interesting to note that the presented approach works well for documents with different font´s size and font styles.
  • Keywords
    document image processing; information retrieval; content extraction; information conveying method; morphological operation; scanned document image; table structure detection; Companies; Context; Labeling; Lead; Morphology; connected components; labeling; morphological operation; scanned document image; table detection and content extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Signal Processing (ICCSP), 2014 International Conference on
  • Conference_Location
    Melmaruvathur
  • Print_ISBN
    978-1-4799-3357-0
  • Type

    conf

  • DOI
    10.1109/ICCSP.2014.6949843
  • Filename
    6949843