Title :
From news to facts: An Hadoop-based social graphs analysis
Author :
Puglisi, Piera Laura ; Montanari, Daniele ; Petrella, Alessandro ; Picelli, Marco ; Rossetti, Davide
Author_Institution :
GESP - Geographic Inf. Syst., Bologna, Italy
Abstract :
This paper describes a system that combines a distributed setup based on Hadoop, MapReduce, and Impala with a general semantic model focused on common entities (people, organizations, places) and their connections, expressed as co-occurrences and facts. The system gives analysts the opportunity to mine social networks. Emphasis is placed on recall rather than precision, so that the analyst is offered many possible relations and connections to explore. Early studies have shown interesting results, and further explorations are planned to extend the reach and abilities of the system.
Keywords :
data analysis; data mining; data visualisation; parallel programming; social networking (online); Hadoop-based social graphs analysis; Impala; MapReduce; general semantic model; social network mining; Big data; Distributed databases; Feature extraction; Organizations; Semantics; Text mining;
Conference_Title :
High Performance Computing & Simulation (HPCS), 2014 International Conference on
Conference_Location :
Bologna
Print_ISBN :
978-1-4799-5312-7
DOI :
10.1109/HPCSim.2014.6903702