Title :
Metric based attribute reduction in decision tables
Author :
Nguyen, Long Giang
Author_Institution :
Inst. of Inf. Technol., VAST, Hanoi, Vietnam
Abstract :
In an information system, each subset of attributes determines knowledge structure on the set of objects, in which each element is an equivalence class. Thus, a metric which is defined on knowledge structures is established on the attribute sets. Once a metric is established, we can use the metric to measure attributes distance, cluster and discover important attributes. As a result, effective algorithms are constructed to solve attribute reduction in information systems. With metric on knowledge structures based on the Jaccard distance between two finite sets, this paper proposes a new method for attribute reduction in decision table. The paper proves theoretically and experimentally that this metric method is more effective than other methods based on conditional Shannon entropy.
Keywords :
data mining; decision tables; equivalence classes; information systems; pattern clustering; Jaccard distance; attribute clustering; attribute discovery; attribute distance measurement; attribute sets; decision tables; equivalence class; finite sets; information system; knowledge structure determination; metric based attribute reduction; Data mining; Entropy; Equations; Information systems; Measurement; Set theory;
Conference_Titel :
Computer Science and Information Systems (FedCSIS), 2012 Federated Conference on
Conference_Location :
Wroclaw
Print_ISBN :
978-1-4673-0708-6
Electronic_ISBN :
978-83-60810-51-4