DocumentCode
603226
Title
Clustering Technique on Search Engine Dataset Using Data Mining Tool
Author
Ahmed, Mahrous E. ; Bansal, Poonam
Author_Institution
Dept. of Comput. Sci. & Eng., itm Univ., Gurgaon, India
fYear
2013
fDate
6-7 April 2013
Firstpage
86
Lastpage
89
Abstract
Unlabeled document collections are becoming increasingly common and mining such databases becomes a major challenge. It is a major issue to retrieve good websites from the larger collections of websites. As the number of available Web pages grows, it is become more difficult for users finding documents relevant to their interests. Clustering is the classification of a data set into subsets (clusters), so that the data in each subset share some common trait - often proximity according to some defined distance measure. By clustering we improve the quality of websites by grouping similar websites in groups. This paper addresses the applications of data mining tool Weka by applying k means clustering to find clusters from huge data sets and find the attributes that govern optimization of search engines.
Keywords
Web sites; data mining; optimisation; pattern classification; search engines; Web pages; Websites; data mining tool; distance measure; huge data sets; k means clustering technique; optimization; search engine dataset; Clustering algorithms; Communications technology; Computer science; Data mining; Databases; Educational institutions; Search engines; Data mining; Dataset; Websites; Weka; k-means;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Computing and Communication Technologies (ACCT), 2013 Third International Conference on
Conference_Location
Rohtak
ISSN
2327-0632
Print_ISBN
978-1-4673-5965-8
Type
conf
DOI
10.1109/ACCT.2013.15
Filename
6524279
Link To Document