• DocumentCode
    3297439
  • Title
    Research of Blog Quality Based on Similarity and Influence Analysis
  • Author
    Chen, Xiaorui
  • Author_Institution
    Comput. Lab., Oxford Univ., Oxford, UK
  • fYear
    2008
  • fDate
    22-24 Oct. 2008
  • Firstpage
    231
  • Lastpage
    242
  • Abstract
    This work presents a system that combines several technologies (RSS feeds, Lucene, and MySQL) to acquire, parse, and optimize data from blogs efficiently. Based on an analysis of term frequency (TF) and links, it contributes to similarity analysis and influence analysis by proposing two novel measures: a similarity score and an influence score. Comparing these scores makes it easier and more effective to rank related and authoritative blogs.
  • Keywords
    Web sites; Lucene; MySQL; RSS Feed; blog quality; influence analysis; similarity analysis; term frequency analysis; Algorithm design and analysis; Blogs; Data engineering; Feeds; Frequency; Information services; Internet; Power engineering and energy; Protocols; Influence; Similarity; Term Frequency; Vector Distance
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advances in Electrical and Electronics Engineering - IAENG Special Edition of the World Congress on Engineering and Computer Science 2008, WCECS '08
  • Conference_Location
    San Francisco, CA
  • Print_ISBN
    978-1-4244-3545-6
  • Type
    conf
  • DOI
    10.1109/WCECS.2008.36
  • Filename
    5233163