• DocumentCode
    2775971
  • Title

    Mapping the Blogosphere--Towards a Universal and Scalable Blog-Crawler

  • Author

    Berger, Philipp ; Hennig, Patrick ; Bross, Justus ; Meinel, Christoph

  • Author_Institution
    IT-Syst. Eng., Hasso-Plattner Inst., Potsdam, Germany
  • fYear
    2011
  • fDate
    9-11 Oct. 2011
  • Firstpage
    672
  • Lastpage
    677
  • Abstract
    The massive adoption of social media has provided new ways for individuals to express their opinions online. The blogosphere, an inherent part of this trend, contains a vast array of information about a variety of topics. Thus, it is a huge think tank that creates an enormous and ever-changing archive of open source intelligence. Modeling and mining this vast pool of data to extract and describe meaningful knowledge in order to leverage (content-related) structures and dynamics of emerging networks within the blogo sphere is the higher-level aim of the research presented here. While the concept of our tailor-mode feed-crawler was already discussed in two earlier publications this paper focuses on our approach to extend the earlier feed crawler to a more universal and highly scalable blog-crawler.
  • Keywords
    social networking (online); blog-crawler; blogosphere; open source intelligence; social media; tailor-mode feed-crawler; Blogs; Crawlers; Data mining; Databases; Feeds; HTML; Hardware; Blog; Blogosphere; Data Mining; Hasso Plattner; In-Memory; MapReduce; Ranking; Social Media Monitoring;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4577-1931-8
  • Type

    conf

  • DOI
    10.1109/PASSAT/SocialCom.2011.57
  • Filename
    6113195