DocumentCode :
3625822
Title :
A Fast, Feature-based Cluster Algorithm for Information Retrieval
Author :
Martin Mehlitz;Christian Bauckhage;Sahin Albayrak
Author_Institution :
Technical University Berlin, DAI-Lab, Berlin, Germany. martin.mehlitz@dai-labor.de
fYear :
2007
Firstpage :
335
Lastpage :
341
Abstract :
The Internet is a vast resource of information. Unfortunately, finding and accessing this information is often a very cumbersome task even with existing information platforms. Searching on the WWW suffers from the fact that almost every word is ambiguous to a certain degree in the information-rich environment of the Internet. Clustering search results is a way to solve this problem. This paper introduces a novel, fast way to cluster documents based on frequent term sets.
Keywords :
"Clustering algorithms","Information retrieval","Internet","Search engines","Laboratories","World Wide Web","Clustering methods","Matrix decomposition","Singular value decomposition","Web pages"
Publisher :
IEEE
Conference_Title :
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
Print_ISBN :
1-4244-1499-7
Type :
conf
DOI :
10.1109/IRI.2007.4296643
Filename :
4296643