DocumentCode
3297439
Title
Research of Blog Quality Based on Similarity and Influence Analysis
Author
Chen, Xiaorui
Author_Institution
Comput. Lab., Oxford Univ., Oxford, UK
fYear
2008
fDate
22-24 Oct. 2008
Firstpage
231
Lastpage
242
Abstract
This work presents a combination of several techniques (such as RSS Feed, Lucene, and MySQL) that constituted a powerful, efficient system to acquire, parse, and optimize data from blogs, and then based on analyzing TF (term frequency) and Links we make a contribution to similarity analysis and influence analysis by proposing another two novel algorithms which are similarity score and influence score. Hence it becomes much easier and more effective to rank the related and authoritative Blogs under the comparison of scores.
Keywords
Web sites; Lucene; MySQL; RSS Feed; blog quality; influence analysis; similarity analysis; term frequency analysis; Algorithm design and analysis; Blogs; Data engineering; Feeds; Frequency; Information services; Internet; Power engineering and energy; Protocols; Web sites; Influence; Similarity; Term Frequency; Vector Distance;
fLanguage
English
Publisher
ieee
Conference_Titel
World Congress on Engineering and Computer Science 2008, WCECS '08. Advances in Electrical and Electronics Engineering - IAENG Special Edition of the
Conference_Location
San Francisco, CA
Print_ISBN
978-1-4244-3545-6
Type
conf
DOI
10.1109/WCECS.2008.36
Filename
5233163
Link To Document