Title :
An Automated Approach for Interpretation of Statistical Graphics
Author :
Mahmood, Arif ; Bajwa, Imran Sarwar ; Qazi, Kashifuddin
Author_Institution :
Dept. of Comput. Sci. &IT, Islamia Univ. of Bahawalpur, Bahawalpur, Pakistan
Abstract :
Text plays vital role in the analysis of quantitative data as in statistics the data representation is made through different graphical tools such as bar charts, pie charts, line charts, scatter diagram, histograms etc. Statistical graphics are the valuable tool used for visual information representation in multimodal documents. It is often observed that communicative goal of the statistical graphics is not captured by documents accompanying text. To perceive the represented information using statistical graphics is hard-hitting job for novice readers. An approach to automate the process of image classification and information extraction is presented in this paper. This study focuses on the area charts that are important type of statistical graphics used for probability distribution and testing of hypothesis process. Firstly, we have classified the area charts into different classes and then designed architecture for chart image classification and information withdrawal from each class of area chart. The extracted information is represented in the form of natural language summaries using template based approach.
Keywords :
data analysis; data structures; image classification; information retrieval; natural language processing; statistical analysis; statistical distributions; statistical testing; text detection; area charts; chart image classification process; data representation; graphical tools; hypothesis process testing; information extraction process; information withdrawal; multimodal documents; natural language summaries; probability distribution; quantitative data analysis; statistical graphics; template based approach; visual information representation; Data mining; Feature extraction; Graphics; Image recognition; Information retrieval; Natural languages; Optical character recognition software; Natural language processing; Optical character recognition; Statistical graphics; Text detection and extraction; area charts classification and detection;
Conference_Titel :
Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2014 Sixth International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-4956-4
DOI :
10.1109/IHMSC.2014.192