• DocumentCode
    3499279
  • Title

    Clustering Web Search Results Based on Interactive Suffix Tree Algorithm

  • Author

    Wang, Ying ; Zuo, Wanli ; Peng, Tao ; He, Fengling ; Hu, Hailong

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun
  • Volume
    2
  • fYear
    2008
  • fDate
    11-13 Nov. 2008
  • Firstpage
    851
  • Lastpage
    857
  • Abstract
    Clustering is an effective way to organize Web search results, allowing users to navigate to relevant documents quickly. Traditional clustering techniques are inadequate for Chinese search results and do not generate clusters with highly readable names. In this paper, we propose a new method for clustering Web search results based on an interactive suffix tree algorithm (ISTC). The method uses phrases extracted from the snippets as clustering features. During interaction with users, it returns only cluster labels in the first tier. Users who want to explore further can select a document of interest for a second round of clustering, in place of traditional recursive clustering. ISTC can be applied to both Chinese and English information processing; by avoiding the recursive algorithm, it achieves linear time complexity and improves search engine efficiency. Experimental results verify our method's feasibility and effectiveness.
  • Keywords
    information retrieval; search engines; Web search results clustering; information processing; interactive suffix tree algorithm; linear time complexity; phrase extraction; search engine; Clustering algorithms; Computer science; Educational institutions; Helium; Information processing; Information technology; Navigation; Search engines; Visualization; Web search; ISTC; Interactive clustering; Suffix Tree
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Convergence and Hybrid Information Technology, 2008. ICCIT '08. Third International Conference on
  • Conference_Location
    Busan
  • Print_ISBN
    978-0-7695-3407-7
  • Type

    conf

  • DOI
    10.1109/ICCIT.2008.108
  • Filename
    4682352