DocumentCode :
2853846
Title :
Happy or not: Generating topic-based emotional heatmaps for Culturomics using CyberGIS
Author :
Shook, E. ; Leetaru, K. ; Guofeng Cao ; Padmanabhan, Anand ; Shaowen Wang
Author_Institution :
Dept. of Geogr. & Geographic Inf. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2012
fDate :
8-12 Oct. 2012
Firstpage :
1
Lastpage :
6
Abstract :
The field of Culturomics exploits “big data” to explore human society at population scale. Culturomics increasingly needs to consider geographic contexts and, thus, this research develops a geospatial visual analytical approach that transforms vast amounts of textual data into emotional heatmaps with fine-grained spatial resolution. Fulltext geocoding and sentiment mining extract locations and latent “tone” from text-based data, which are combined with spatial analysis methods - kernel density estimation and spatial interpolation - to generate heatmaps that capture the interplay of location, topic, and tone toward narrative impacts. To demonstrate the effectiveness of the approach, the complete English edition of Wikipedia is processed using a supercomputer to extract all locations and tone associated with the year of 2003. An emotional heatmap of Wikipedia's discussion of “armed conflict” for that year is created using the spatial analysis methods. Unlike previous research, our approach is designed for exploratory spatial analysis of topics in text archives by incorporating multiple attributes including the prominence of each location mentioned in the text, the density of a topic at each location compared to other topics, and the tone of the topics of interest into a single analysis. The generation of such fine-grained emotional heatmaps is computationally intensive particularly when accounting for the multiple attributes at fine scales. Therefore a CyberGIS platform based on national cyberinfrastructure in the United States is used to enable the computationally intensive visual analytics.
Keywords :
Web sites; computational linguistics; cultural aspects; data analysis; data mining; data visualisation; emotion recognition; geographic information systems; interpolation; social sciences computing; text analysis; CyberGIS; United States; Wikipedia English edition; armed conflict discussion; big data; computationally intensive visual analytics; culturomics; fine-grained spatial resolution; fulltext geocoding; geographic information science; geospatial visual analytical approach; human society; kernel density estimation; latent tone extraction; location extraction; national cyberinfrastructure; population scale; sentiment mining; spatial analysis; spatial interpolation; spatial text mining; supercomputer; text archives; topic-based emotional heatmap generation; Data mining; Electronic publishing; Encyclopedias; Heating; Humans; Internet; CyberGIS; digital HASS; heatmap; sentiment mining; spatial text mining;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
E-Science (e-Science), 2012 IEEE 8th International Conference on
Conference_Location :
Chicago, IL
Print_ISBN :
978-1-4673-4467-8
Type :
conf
DOI :
10.1109/eScience.2012.6404440
Filename :
6404440