DocumentCode
3545958
Title
Feed Ranking Refinement with Similitary Distribution in Blog Distillation
Author
Gao, Huiji ; Xu, Weiran ; Guo, Jun
Author_Institution
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2009
fDate
21-22 Nov. 2009
Firstpage
420
Lastpage
423
Abstract
Blog Distillation is the process of finding a blog with a principle and recurring interest. In this paper, two baselines are used to validate the results of our experiments. A set of features of individual feed is firstly constructed by decision tree to represent the similarity distribution of every feed against certain interest. Features are then selected by computing their centroid distances to standard centroids of relevant feeds and irrelevant feeds. Later, SVM classifier is used to predict and re-rank the top 250 results of two baselines. The result shows that our algorithm can effectively present the feeds´ similarity distribution and re-rank them into a new order which has much better MAP.
Keywords
distillation; production engineering; support vector machines; SVM classifier; blog distillation; decision tree; distillation process; feed ranking refinement; Feeds; HTML; Information services; Information technology; Intelligent systems; Internet; Pattern recognition; Support vector machines; Web pages; Web sites; blog distillation; machine learning; ranking refinement; similarity distribution; svm;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Information Technology Application Workshops, 2009. IITAW '09. Third International Symposium on
Conference_Location
Nanchang
Print_ISBN
978-1-4244-6420-3
Electronic_ISBN
978-1-4244-6421-0
Type
conf
DOI
10.1109/IITAW.2009.112
Filename
5419592
Link To Document