DocumentCode
146783
Title
Detection of table structure and content extraction from scanned documents
Author
Deivalakshmi, S. ; Chaitanya, K. ; Palanisamy, P.
Author_Institution
Dept. of Electron. & Commun. Eng., Nat. Inst. of Technol., Trichy, India
fYear
2014
fDate
3-5 April 2014
Firstpage
270
Lastpage
274
Abstract
Tables are one of the efficient information conveying methods used now days in larger extent. This paper report a fast, language independent (English and Tamil), skilled technique for table structure detection and its content extraction from a scanned document image based on morphological operation, connected components and labeling. From the conducted exhaustive experimentation, it is observed that the proposed method is the fastest approach because of its simple operations. In addition with that it is noticed that it does not lead to any kind of degradation in the extracted table content since after detecting contents location it is retrieved from the original image. More over it is also very interesting to note that the presented approach works well for documents with different font´s size and font styles.
Keywords
document image processing; information retrieval; content extraction; information conveying method; morphological operation; scanned document image; table structure detection; Companies; Context; Labeling; Lead; Morphology; connected components; labeling; morphological operation; scanned document image; table detection and content extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications and Signal Processing (ICCSP), 2014 International Conference on
Conference_Location
Melmaruvathur
Print_ISBN
978-1-4799-3357-0
Type
conf
DOI
10.1109/ICCSP.2014.6949843
Filename
6949843
Link To Document