• DocumentCode
    3471184
  • Title

    Detecting and Clustering Similar Results of Search Engine by Exploiting Web Page's Contents

  • Author

    Gao, Kai ; Wu, Hui-cong

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Hebei Univ. of Sci. & Technol., Shijiazhuang
  • fYear
    2008
  • fDate
    12-14 Oct. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper presents an approach to detecting and clustering similar search engine results by analyzing pages' URLs and their contents. A novel hash function is used together with a Chinese key concept extractor module. A similarity measure based on the degree of key concept overlap is proposed for clustering similar retrieval results, which effectively minimizes overlap in the results. Experimental results show the feasibility of the approach. Building on this work, a search engine has been developed.
  • Keywords
    Internet; file organisation; search engines; Chinese key concept extractor; Web page contents; hash functions; Clustering algorithms; Educational institutions; Fingerprint recognition; Information science; Mechanical engineering; Parallel robots; Uniform resource locators; Web pages;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on
  • Conference_Location
    Dalian
  • Print_ISBN
    978-1-4244-2107-7
  • Electronic_ISBN
    978-1-4244-2108-4
  • Type

    conf

  • DOI
    10.1109/WiCom.2008.2548
  • Filename
    4680737