DocumentCode
3471184
Title
Detecting and Clustering Similar Results of Search Engine by Exploiting Web Page's Contents
Author
Gao, Kai ; Wu, Hui-cong
Author_Institution
Sch. of Inf. Sci. & Eng., Hebei Univ. of Sci. & Technol., Shijiazhuang
fYear
2008
fDate
12-14 Oct. 2008
Firstpage
1
Lastpage
4
Abstract
This paper presents an approach for detecting and clustering similar search engine results by analyzing pages' URLs and contents. The approach uses a novel hash function together with a Chinese key concept extractor module. A similarity measure based on the degree of key concept overlap is proposed for clustering similar retrieval results, which can effectively minimize overlap among the results. Experimental results show the feasibility of the approach. Building on this work, a search engine has been developed.
Keywords
Internet; file organisation; search engines; Chinese key concept extractor; Web page contents; hash functions; search engines; Clustering algorithms; Educational institutions; Fingerprint recognition; Information science; Internet; Mechanical engineering; Parallel robots; Search engines; Uniform resource locators; Web pages;
fLanguage
English
Publisher
IEEE
Conference_Titel
Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on
Conference_Location
Dalian
Print_ISBN
978-1-4244-2107-7
Electronic_ISBN
978-1-4244-2108-4
Type
conf
DOI
10.1109/WiCom.2008.2548
Filename
4680737