DocumentCode :
1728484
Title :
An Adaptive Page Clustering Based Weighting Method for Information Retrieval
Author :
Yi-Xian Lin ; Hung-Yu Kao
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fYear :
2013
Firstpage :
199
Lastpage :
204
Abstract :
With the coming of the era of information explosion, using Internet to obtain information has become the most convenient pipeline information flow. However, the found information mostly based on keyword matching through the search engines, and the search engines do not generally conduct filtering and screening in order to enhance the returns. If the web pages pass a systematic arrangement and are divided into multiple categories or clusters, the users will be guided to obtain real help of information. In this paper, we propose an adaptive web pages clustering algorithm to perform this task. It extracts features to reduce feature dimensions, then filters automatically web pages into its appropriate cluster and enhances the features of the pages to site features for different coefficients to improve the effect. Finally, providing users a more accurate search data model. The experimental results show that compared to the traditional TF-IDF, the proposed approach can find the needed web pages and the topics of the web pages in the corresponding cluster that are highly similar.
Keywords :
Internet; Web sites; document handling; information retrieval; pattern clustering; search engines; Internet; TF-IDF; Web pages clustering algorithm; adaptive page clustering based weighting method; information explosion; information retrieval; keyword matching; pipeline information flow; search engines; systematic arrangement; Clustering algorithms; Clustering methods; Feature extraction; Keyword search; Search engines; Web pages; Correlation Coefficient; Information Retrieval; Page Clustering; Topic Feature;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Technologies and Applications of Artificial Intelligence (TAAI), 2013 Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4799-2528-5
Type :
conf
DOI :
10.1109/TAAI.2013.48
Filename :
6783867
Link To Document :
بازگشت