Title :
Incomplete hybrid attributes reduction based on neighborhood granulation and approximation
Author :
Zhao, Baiting ; Chen, Xijun ; Zeng, Qingshuang
Author_Institution :
Space Control & Inertia Technol. Res. Center, Harbin Inst. of Technol., HIT Harbin, Harbin, China
Abstract :
There are two semantic explanations including the lost and the unrelated for the missing values. To deal with the hybrid information systems which including the numerical attributes and the missing value attributes, a generalized incomplete rough set model is proposed based on neighborhood relations. The model approximates an arbitrary subset in the universe with neighborhood granules, and the generalized neighborhood relations are the generalization of the asymmetry similarity relations and the tolerance relations. It overcomes the shortcoming that the classical rough set can not deal with numerical attributes directly and can deal with the generalized incomplete system which has the missing values both include the lost and the unrelated. The discrimination methods of the missing value and a hybrid reduction algorithm are proposed also. The discrimination methods of the lost or the unrelated conditions are proposed based on the assumption of the consistency classification, and the influence of the noise samples and the neighborhood values to the classification accuracy is presented as well. The validity and feasibility of the reduction algorithm are demonstrated by the results of experiments on five UCI machine learning databases.
Keywords :
approximation theory; data reduction; information systems; rough set theory; asymmetry similarity relations; classical rough set; consistency classification; generalized incomplete rough set model; generalized incomplete system; generalized neighborhood relations; hybrid information systems; hybrid reduction algorithm; incomplete hybrid attributes reduction; missing value attributes; neighborhood approximation; neighborhood granulation; neighborhood granules; numerical attributes; tolerance relations; Automation; Databases; Decision making; Information systems; Machine learning; Machine learning algorithms; Mechatronics; Set theory; Space technology; Generalized incomplete; Hybrid decision system; Neighborhood; Reduction; Rough set;
Conference_Titel :
Mechatronics and Automation, 2009. ICMA 2009. International Conference on
Conference_Location :
Changchun
Print_ISBN :
978-1-4244-2692-8
Electronic_ISBN :
978-1-4244-2693-5
DOI :
10.1109/ICMA.2009.5246230