DocumentCode :
1820976
Title :
Mining Twitter in the Cloud: A Case Study
Author :
Noordhuis, Pieter ; Heijkoop, Michiel ; Lazovik, Alexander
Author_Institution :
Comput. Sci., Univ. of Groningen, Groningen, Netherlands
fYear :
2010
fDate :
5-10 July 2010
Firstpage :
107
Lastpage :
114
Abstract :
Mining and analyzing data from social networks can be difficult because of the large amounts of data involved. Such activities are usually very expensive, as they require a lot of computational resources. With the recent success of cloud computing, data analysis is going to be more accessible due to easier access to less expensive computational resources. In this work we propose to use cloud computing services as a possible solution for analysis of large amounts of data. As a source for a large data set, we propose to use Twitter, yielding a graph with 50 million nodes and 1.8 billion edges. In this paper, we use computation of PageRank on Twitter´s social graph to investigate whether or not cloud computing, and Amazon cloud services in particular, can make these tasks more feasible and, as a side effect, whether or not PageRank provides a good ranking of Twitter users.
Keywords :
data analysis; data mining; social networking (online); Amazon cloud services; PageRank; Twitter; cloud computing; data analysis; data mining; social networks; Clouds; Crawlers; Google; Table lookup; Twitter; Web pages; amazon; data mining; pagerank; twitter; web crawl;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud Computing (CLOUD), 2010 IEEE 3rd International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-8207-8
Electronic_ISBN :
978-0-7695-4130-3
Type :
conf
DOI :
10.1109/CLOUD.2010.59
Filename :
5558003
Link To Document :
بازگشت