DocumentCode
3202679
Title
Web Page Clustering Based on Searching Keywords
Author
Li, Taoying ; Chen, Yan
Author_Institution
Transp. Manage. Coll., Dalian Maritime Univ., Dalian, China
Volume
3
fYear
2010
fDate
11-12 May 2010
Firstpage
1163
Lastpage
1166
Abstract
To improve Web page search results and enhance Web crawling, this paper proposes Web page clustering based on searching keywords. The method first uses the matching degree between Web pages and the searching keywords to decide the order in which result pages are shown. A clustering algorithm then groups the result pages according to their matching degree. Next, duplicated pages deletion detects and removes duplicated pages that have the same titles and abstracts. Finally, the proposed algorithm is applied in practice, and the results show that it is effective and feasible for coping with the information explosion on the Web.
Keywords
Internet; data mining; pattern clustering; Web crawling operation; Web page clustering; duplicated pages deletion; matching degree; searching keywords; Automation; Clustering algorithms; Couplings; Data mining; Explosions; Partitioning algorithms; Transportation; Web mining; Web pages; Web services; matching degree; searching degree; web clustering; web mining;
fLanguage
English
Publisher
IEEE
Conference_Titel
Intelligent Computation Technology and Automation (ICICTA), 2010 International Conference on
Conference_Location
Changsha
Print_ISBN
978-1-4244-7279-6
Electronic_ISBN
978-1-4244-7280-2
Type
conf
DOI
10.1109/ICICTA.2010.53
Filename
5523220