DocumentCode
2775971
Title
Mapping the Blogosphere--Towards a Universal and Scalable Blog-Crawler
Author
Berger, Philipp ; Hennig, Patrick ; Bross, Justus ; Meinel, Christoph
Author_Institution
IT-Syst. Eng., Hasso-Plattner Inst., Potsdam, Germany
fYear
2011
fDate
9-11 Oct. 2011
Firstpage
672
Lastpage
677
Abstract
The massive adoption of social media has provided new ways for individuals to express their opinions online. The blogosphere, an inherent part of this trend, contains a vast array of information about a variety of topics. Thus, it is a huge think tank that creates an enormous and ever-changing archive of open source intelligence. Modeling and mining this vast pool of data to extract and describe meaningful knowledge in order to leverage (content-related) structures and dynamics of emerging networks within the blogosphere is the higher-level aim of the research presented here. While the concept of our tailor-made feed-crawler was already discussed in two earlier publications, this paper focuses on our approach to extend the earlier feed-crawler to a more universal and highly scalable blog-crawler.
Keywords
social networking (online); blog-crawler; blogosphere; open source intelligence; social media; tailor-made feed-crawler; Blogs; Crawlers; Data mining; Databases; Feeds; HTML; Hardware; Blog; Blogosphere; Data Mining; Hasso Plattner; In-Memory; MapReduce; Ranking; Social Media Monitoring;
fLanguage
English
Publisher
ieee
Conference_Titel
Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third International Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on
Conference_Location
Boston, MA
Print_ISBN
978-1-4577-1931-8
Type
conf
DOI
10.1109/PASSAT/SocialCom.2011.57
Filename
6113195