Title :
Extraction of Vectorized Graphical Information from Scientific Chart Images
Author :
Huang, Weihua ; Liu, Ruizhe ; Tan, Chew Lim
Author_Institution :
Nat. Univ. of Singapore, Singapore
Abstract :
Graphical components information extraction is a crucial step in the chart recognition and understanding process. However, existing methods of information extraction from chart images either are type-dependent or rely on certain assumptions. In this paper, we present a general method to extract vectorized graphical information from scientific chart images. Our algorithm firstly constructs a data structure called directional single-connected chains (DSCC). It then employs ellipse-specific fitting and orthogonal diagonalization to calculate the curvatures of the chains and classify the chains into either straight lines or arcs. Finally we combine all straight lines and all arcs accordingly and use linear regression to compute their attributes. The DSCC has a good property in that it is less susceptible to noise. The experiment results show that our algorithm is efficient, robust and accurate.
Keywords :
charts; data visualisation; image recognition; information retrieval; regression analysis; chart recognition; directional single-connected chains; graphical components information extraction; linear regression; scientific chart images; vectorized graphical information; Computational complexity; Curve fitting; Data mining; Data structures; Graphics; Image converters; Image recognition; Linear regression; Noise robustness; Pixel;
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4378764