DocumentCode :
146783
Title :
Detection of table structure and content extraction from scanned documents
Author :
Deivalakshmi, S. ; Chaitanya, K. ; Palanisamy, P.
Author_Institution :
Dept. of Electron. & Commun. Eng., Nat. Inst. of Technol., Trichy, India
fYear :
2014
fDate :
3-5 April 2014
Firstpage :
270
Lastpage :
274
Abstract :
Tables are one of the efficient information conveying methods used now days in larger extent. This paper report a fast, language independent (English and Tamil), skilled technique for table structure detection and its content extraction from a scanned document image based on morphological operation, connected components and labeling. From the conducted exhaustive experimentation, it is observed that the proposed method is the fastest approach because of its simple operations. In addition with that it is noticed that it does not lead to any kind of degradation in the extracted table content since after detecting contents location it is retrieved from the original image. More over it is also very interesting to note that the presented approach works well for documents with different font´s size and font styles.
Keywords :
document image processing; information retrieval; content extraction; information conveying method; morphological operation; scanned document image; table structure detection; Companies; Context; Labeling; Lead; Morphology; connected components; labeling; morphological operation; scanned document image; table detection and content extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Signal Processing (ICCSP), 2014 International Conference on
Conference_Location :
Melmaruvathur
Print_ISBN :
978-1-4799-3357-0
Type :
conf
DOI :
10.1109/ICCSP.2014.6949843
Filename :
6949843
Link To Document :
بازگشت