Title :
Clustering Technique on Search Engine Dataset Using Data Mining Tool
Author :
Ahmed, Mahrous E. ; Bansal, Poonam
Author_Institution :
Dept. of Comput. Sci. & Eng., itm Univ., Gurgaon, India
Abstract :
Unlabeled document collections are becoming increasingly common and mining such databases becomes a major challenge. It is a major issue to retrieve good websites from the larger collections of websites. As the number of available Web pages grows, it is become more difficult for users finding documents relevant to their interests. Clustering is the classification of a data set into subsets (clusters), so that the data in each subset share some common trait - often proximity according to some defined distance measure. By clustering we improve the quality of websites by grouping similar websites in groups. This paper addresses the applications of data mining tool Weka by applying k means clustering to find clusters from huge data sets and find the attributes that govern optimization of search engines.
Keywords :
Web sites; data mining; optimisation; pattern classification; search engines; Web pages; Websites; data mining tool; distance measure; huge data sets; k means clustering technique; optimization; search engine dataset; Clustering algorithms; Communications technology; Computer science; Data mining; Databases; Educational institutions; Search engines; Data mining; Dataset; Websites; Weka; k-means;
Conference_Titel :
Advanced Computing and Communication Technologies (ACCT), 2013 Third International Conference on
Conference_Location :
Rohtak
Print_ISBN :
978-1-4673-5965-8
DOI :
10.1109/ACCT.2013.15