Title :
Continuous attribute discretization algorithm of Rough Set based on k-means
Author :
Xing Xiaoxue ; Guan Xiuli ; Shang Weiwei
Author_Institution :
Coll. of Commun. Eng., Jilin Univ., Changchun, China
Abstract :
In the application of the Rough Set theory to preprocess the data, continuous attribute discretization is the necessary and key step. Here, a discretization method based on the k-means algorithm was introduced. Using this method, the wholly attributes could be classified into 2 categories. Four sets of data on UCI database were chosen to verify the performance of the presented method. In this experiment, the k-means algorithm was used to implement the data discretization firstly; and then they are used to do attributes reduction through rough set; finally, the classification result is validated with KNN (k-Nearest Neighbor algorithm, k=10) classifier classification algorithm. The experimental results show that this method presented in this paper can improve the efficiency of discretization, and effectively reduce the break points.
Keywords :
pattern classification; rough set theory; KNN classifier classification algorithm; UCI database; attribute classification; attribute reduction; break point reduction; continuous attribute discretization algorithm; data discretization; data preprocessing; discretization efficiency; k-means algorithm; k-nearest neighbor algorithm classifier classification algorithm; rough set theory; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Educational institutions; Information entropy; Partitioning algorithms; discretization; k-means; kNN; rough set;
Conference_Titel :
Advanced Research and Technology in Industry Applications (WARTIA), 2014 IEEE Workshop on
Conference_Location :
Ottawa, ON
DOI :
10.1109/WARTIA.2014.6976541