مرکز منطقه ای اطلاع رساني علوم و فناوري - A New Web Search Result Clustering based on True Common Phrase Label Discovery

DocumentCode :

3101311

Title :

A New Web Search Result Clustering based on True Common Phrase Label Discovery

Author :

Janruang, Jongkol ; Kreesuradej, Worapoj

Author_Institution :

King Mongkut´´s Inst. of Technol., Bangkok

fYear :

2006

fDate :

Nov. 28 2006-Dec. 1 2006

Firstpage :

242

Lastpage :

242

Abstract :

Web search results clustering are navigator for users to search results. Therefore the correct cluster label is important which has been index the set of web document. Suffix tree clustering (STC) is fast automatically clustering and labeling. However, STC is inadequate since they generate interrupted cluster label due to using n-gram technique. In this paper, we propose an approach for web search results clustering and labeling based on a new suffix tree data structure, a new base cluster combining algorithm with a new partial phase join operation. The algorithm for constructing the data structure is an incremental and a linear time algorithm. Thus, the proposed approach is suitable for on-the-fly the web search results clustering and labeling cluster. The proposed approach provides more readable and true common phrase of web document cluster than conventional web search result clustering. Experimental results also show that the proposed approach has better performance than that of conventional web search result clustering.

Keywords :

document handling; information retrieval; pattern clustering; search engines; tree searching; Web document; Web search; common phrase label discovery; n-gram technique; result clustering; suffix tree clustering; Clustering algorithms; Computational intelligence; Data structures; Feeds; Information technology; Labeling; Navigation; Search engines; Tree data structures; Web search;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computational Intelligence for Modelling, Control and Automation, 2006 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on

Conference_Location :

Sydney, NSW

Print_ISBN :

0-7695-2731-0

Type :

conf

DOI :

10.1109/CIMCA.2006.22

Filename :

4052852

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3101311