Title :
Detection of table structure and content extraction from scanned documents
Author :
Deivalakshmi, S. ; Chaitanya, K. ; Palanisamy, P.
Author_Institution :
Dept. of Electron. & Commun. Eng., Nat. Inst. of Technol., Trichy, India
Abstract :
Tables are an efficient way of conveying information and are now widely used. This paper reports a fast, language-independent (English and Tamil) technique for detecting table structure and extracting table content from a scanned document image, based on morphological operations, connected components and labeling. Exhaustive experimentation shows that the proposed method is the fastest approach because of its simple operations. It also causes no degradation in the extracted table content, because once the location of the content is detected, the content is retrieved from the original image. Moreover, the presented approach works well for documents with different font sizes and font styles.
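The abstract names the building blocks (morphological operations, connected components, labeling, and retrieval from the original image) but not the exact pipeline. The following is only a minimal sketch of how those pieces can fit together, not the authors' implementation. It assumes an already binarized image stored as a list of rows with 1 for ink, and every name in it (`open_rows`, `ruling_lines`, `cell_boxes`, `crop`, the run length `k`) is hypothetical. Opening with a 1 x k line element is written here as "keep only runs of at least k ink pixels", which is equivalent.

```python
from collections import deque

def open_rows(img, k):
    """Opening with a 1 x k line element: keep only runs of >= k ones per row."""
    out = [[0] * len(row) for row in img]
    for y, row in enumerate(img):
        x = 0
        while x < len(row):
            if not row[x]:
                x += 1
                continue
            start = x
            while x < len(row) and row[x]:
                x += 1
            if x - start >= k:
                for i in range(start, x):
                    out[y][i] = 1
    return out

def transpose(img):
    return [list(col) for col in zip(*img)]

def ruling_lines(img, k):
    """Union of long horizontal and long vertical runs (assumed to be table rules)."""
    horiz = open_rows(img, k)
    vert = transpose(open_rows(transpose(img), k))
    return [[h | v for h, v in zip(hr, vr)] for hr, vr in zip(horiz, vert)]

def cell_boxes(lines):
    """Label 4-connected regions between the rules; return (top, left, bottom, right)
    for each region that does not touch the image border."""
    h, w = len(lines), len(lines[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if lines[y][x] or seen[y][x]:
                continue
            queue = deque([(y, x)])
            seen[y][x] = True
            top, left, bottom, right = y, x, y, x
            touches_border = False
            while queue:
                cy, cx = queue.popleft()
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                if cy in (0, h - 1) or cx in (0, w - 1):
                    touches_border = True
                for ny, nx in ((cy + 1, cx), (cy - 1, cx), (cy, cx + 1), (cy, cx - 1)):
                    if 0 <= ny < h and 0 <= nx < w and not lines[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if not touches_border:
                boxes.append((top, left, bottom, right))
    return boxes

def crop(img, box):
    """Cut a cell out of the ORIGINAL image, so its content is not degraded."""
    top, left, bottom, right = box
    return [row[left:right + 1] for row in img[top:bottom + 1]]

# Toy 7 x 11 "scan": a two-cell table with a few ink pixels standing in for text.
img = [[1 if y in (0, 6) or x in (0, 5, 10) else 0 for x in range(11)] for y in range(7)]
img[3][2] = img[3][3] = 1   # "text" in the left cell
img[2][7] = 1               # "text" in the right cell

lines = ruling_lines(img, k=5)   # short text runs are dropped, long rules survive
boxes = cell_boxes(lines)        # one bounding box per table cell
cells = [crop(img, b) for b in boxes]
```

Because the line mask is used only to locate the cells and the pixels are then cut from `img` itself, the cropped cells contain the text exactly as scanned, which mirrors the abstract's point about avoiding degradation. On real scans the run length `k` would have to be chosen larger than the longest character stroke.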
Keywords :
document image processing; information retrieval; content extraction; information conveying method; morphological operation; scanned document image; table structure detection; Companies; Context; Labeling; Lead; Morphology; connected components; table detection and content extraction
Conference_Title :
Communications and Signal Processing (ICCSP), 2014 International Conference on
Conference_Location :
Melmaruvathur
Print_ISBN :
978-1-4799-3357-0
DOI :
10.1109/ICCSP.2014.6949843