DocumentCode :
2023155
Title :
Extraction of Vectorized Graphical Information from Scientific Chart Images
Author :
Huang, Weihua ; Liu, Ruizhe ; Tan, Chew Lim
Author_Institution :
Nat. Univ. of Singapore, Singapore
Volume :
1
fYear :
2007
fDate :
23-26 Sept. 2007
Firstpage :
521
Lastpage :
525
Abstract :
Extracting information about graphical components is a crucial step in chart recognition and understanding. However, existing methods for extracting information from chart images are either type-dependent or rely on certain assumptions. In this paper, we present a general method to extract vectorized graphical information from scientific chart images. Our algorithm first constructs a data structure called directional single-connected chains (DSCC). It then employs ellipse-specific fitting and orthogonal diagonalization to calculate the curvatures of the chains and classify each chain as either a straight line or an arc. Finally, we combine the straight lines and the arcs accordingly and use linear regression to compute their attributes. A useful property of the DSCC is that it is less susceptible to noise. The experimental results show that our algorithm is efficient, robust and accurate.
Keywords :
charts; data visualisation; image recognition; information retrieval; regression analysis; chart recognition; directional single-connected chains; graphical components information extraction; linear regression; scientific chart images; vectorized graphical information; Computational complexity; Curve fitting; Data mining; Data structures; Graphics; Image converters; Image recognition; Linear regression; Noise robustness; Pixel;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
ISSN :
1520-5363
Print_ISBN :
978-0-7695-2822-9
Type :
conf
DOI :
10.1109/ICDAR.2007.4378764
Filename :
4378764