Title :
Research of Blog Quality Based on Similarity and Influence Analysis
Author_Institution :
Comput. Lab., Oxford Univ., Oxford, UK
Abstract :
This work combines several techniques (RSS feeds, Lucene, and MySQL) into an efficient system that acquires, parses, and optimizes data from blogs. Based on an analysis of term frequency (TF) and links, we contribute to similarity analysis and influence analysis by proposing two novel algorithms, a similarity score and an influence score. Comparing these scores makes it easier and more effective to rank related and authoritative blogs.
Keywords :
Web sites; Lucene; MySQL; RSS Feed; blog quality; influence analysis; similarity analysis; term frequency analysis; Algorithm design and analysis; Blogs; Data engineering; Feeds; Frequency; Information services; Internet; Power engineering and energy; Protocols; Influence; Similarity; Term Frequency; Vector Distance
Conference_Title :
Advances in Electrical and Electronics Engineering - IAENG Special Edition of the World Congress on Engineering and Computer Science 2008, WCECS '08
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-4244-3545-6
DOI :
10.1109/WCECS.2008.36