• DocumentCode
    603226
  • Title

    Clustering Technique on Search Engine Dataset Using Data Mining Tool

  • Author

    Ahmed, Mahrous E. ; Bansal, Poonam

  • Author_Institution
    Dept. of Comput. Sci. & Eng., itm Univ., Gurgaon, India
  • fYear
    2013
  • fDate
    6-7 April 2013
  • Firstpage
    86
  • Lastpage
    89
  • Abstract
    Unlabeled document collections are becoming increasingly common and mining such databases becomes a major challenge. It is a major issue to retrieve good websites from the larger collections of websites. As the number of available Web pages grows, it is become more difficult for users finding documents relevant to their interests. Clustering is the classification of a data set into subsets (clusters), so that the data in each subset share some common trait - often proximity according to some defined distance measure. By clustering we improve the quality of websites by grouping similar websites in groups. This paper addresses the applications of data mining tool Weka by applying k means clustering to find clusters from huge data sets and find the attributes that govern optimization of search engines.
  • Keywords
    Web sites; data mining; optimisation; pattern classification; search engines; Web pages; Websites; data mining tool; distance measure; huge data sets; k means clustering technique; optimization; search engine dataset; Clustering algorithms; Communications technology; Computer science; Data mining; Databases; Educational institutions; Search engines; Data mining; Dataset; Websites; Weka; k-means;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computing and Communication Technologies (ACCT), 2013 Third International Conference on
  • Conference_Location
    Rohtak
  • ISSN
    2327-0632
  • Print_ISBN
    978-1-4673-5965-8
  • Type

    conf

  • DOI
    10.1109/ACCT.2013.15
  • Filename
    6524279