Author/Authors
ALZAND, Hüssein Ridha Ali Gazi Üniversitesi - Bilişim Enstitüsü - Bilgisayar Bilimleri Anabilim Dalı, Turkey , KARACAN, Hacer Gazi Üniversitesi - Mühendislik Fakültesi - Bilgisayar Mühendisliği Bölümü, Turkey
Title Of Article
Comparison of partitioning-based clustering algorithms on differently distributed data
شماره ركورد
28223
Abstract
As a result of widespread use of technology, large volumes of collected data began to emerge. It is impossible to discover and analyze any information in large data like this, so in this case data mining comes into play. Data mining is a process that discovers unpredictable and usable knowledge from databases. In other words, data mining is defined as the process of finding relation patterns, changes, deviations and trends, as well as interesting information specific structures from large databases. One of the widely used data mining methods is a method of clustering. Clustering divides the data set into different clusters, and it tries to make the likelihood ratio as minimum inside the cluster and as maximum among other clusters depending on the options in the database. In this study, partitioning-based clustering methods are discussed by applying them on data sets with different distribution patterns. We used k-means and kernel k-means partitioning algorithms for clustering data sets. By applying clustering operations on differently distributed data sets, we compared the speed, clustering quality and the size of memory usage for these algorithms. The information that we gathered by this comparison is presented and discussed in the related sections of this paper.
From Page
56
NaturalLanguageKeyword
clustering algorithms , clustering analysis.
JournalTitle
Erciyes University Journal Of The Institute Of Science and Technology
To Page
62
JournalTitle
Erciyes University Journal Of The Institute Of Science and Technology
Link To Document