DocumentCode :
3775922
Title :
A hybrid method for table detection from document image
Author :
Tran Tuan Anh;Na In-Seop;Kim Soo-Hyung
Author_Institution :
Department of Computer Science, Chonnam National University, 77 Yongbong-ro, 500-757 South Korea
fYear :
2015
Firstpage :
131
Lastpage :
135
Abstract :
In this paper, we present a hybrid method consisting of three main stages for detecting tables in document images. Based on table structure, our system separates table into two main categories, ruling line table and non-ruling line table. In the first stage, the text and non-text elements in document are classified by a heuristic filter. Then, the white space analysis is used to group the text elements into text lines, while ruling line table candidates are identified from non-text elements. In the second stage, based on the text lines, text and non-text elements, a hybrid method which consist of the alternative bottom-up and top-down approaches is implemented to find the table region candidates. In the final stage, these candidates are examined to get the table regions by analyzing text lines and spare lines. Experimental results with the document database from the ICDAR2013 table competition show that the proposed method works better than the previous ones.
Keywords :
"Image color analysis","Portable document format","Text analysis","Feature extraction","White spaces","Image segmentation","Optical character recognition software"
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ACPR), 2015 3rd IAPR Asian Conference on
Electronic_ISBN :
2327-0985
Type :
conf
DOI :
10.1109/ACPR.2015.7486480
Filename :
7486480
Link To Document :
بازگشت