Title :
Dynamic data discretization technique based on frequency and K-Nearest Neighbour algorithm
Author :
Ahmed, Almahdi Mohammed ; Bakar, Afarulrazi Abu ; Hamdan, Abdul Razak
Author_Institution :
Fac. of Technol. & Inf. Sci., Univ. Kebangsaan Malaysia, Bangi, Malaysia
Abstract :
In this paper we propose a new approach to the dynamic data discretization technique. The technique is called frequency dynamic interval class (FDIC). FDIC consists of two important phases: The dynamic intervals class phase and the interval merging phase. The first phase uses a simple statistical frequency measure to obtain the initial intervals while in the second phase a K-nearest neighbour is used to calculate the merging factor for the unknown intervals. The experimental results showed that FDIC generates more intervals in an attribute, and less number rules with comparable accuracies within three tested datasets. It indicates that FDIC managed to reduce the loss of knowledge in several other techniques that generated the very least number of intervals.
Keywords :
data mining; statistical analysis; FDIC; K-nearest neighbour algorithm; data mining; dynamic data discretization technique; dynamic intervals class phase; frequency dynamic interval class; interval merging phase; merging factor; statistical frequency measure; Algorithm design and analysis; Data mining; Discrete transforms; Frequency measurement; Information science; Knowledge management; Merging; Phase measurement; Spatial databases; Testing; Discretization; Dynamic Intervals; KNN measures;
Conference_Titel :
Data Mining and Optimization, 2009. DMO '09. 2nd Conference on
Conference_Location :
Kajand
Print_ISBN :
978-1-4244-4944-6
DOI :
10.1109/DMO.2009.5341919