DocumentCode :
2925543
Title :
Test-cost-sensitive attribute reduction based on neighborhood rough set
Author :
Zhao, Hong ; Min, Fan ; Zhu, William
Author_Institution :
Lab. of Granular Comput., Zhangzhou Normal Univ., Zhangzhou, China
fYear :
2011
fDate :
8-10 Nov. 2011
Firstpage :
802
Lastpage :
806
Abstract :
Recent research in machine learning and data mining has produced a wide variety of algorithms for cost-sensitive learning. Most existing rough set methods on this issue deal with nominal attributes. This is because that nominal attributes produce equivalent relations and therefore are easy to process. However, in real applications, datasets often contain numerical attributes. As we know, numerical attributes are more complex than nominal ones and require more computational resources. Consequently, respective learning tasks are more challenging. This paper deals with test-cost-sensitive attribute reduction for numerical valued decision systems. Neighborhood rough set achieved success in numerical data processing, hence we adopt the model to define the minimal test cost reduct problem. Due to the complexity of the new problem, heuristic algorithms are needed to find a sub-optimal solution. We propose one kind of heuristic information, which is the sum of the positive region and weighted test cost. When the test cost is not considered, the information degrades to the positive region, which is the most commonly used one in classical rough set. Three metrics are adopted to evaluate the performance of reduction algorithms from a statistical viewpoint. Experimental results show that the proposed method takes advantages of test costs and therefore produces satisfactory results.
Keywords :
data mining; data reduction; decision making; learning (artificial intelligence); rough set theory; statistical analysis; cost sensitive learning; data mining; machine learning; neighborhood rough set; numerical valued decision systems; statistical viewpoint; test cost sensitive attribute reduction; weighted test cost; Algorithm design and analysis; Data mining; Heuristic algorithms; Iris; Measurement; Numerical models; Rough sets; Cost-sensitive learning; heuristic algorithm; neighborhood; reduction; rough set;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing (GrC), 2011 IEEE International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-4577-0372-0
Type :
conf
DOI :
10.1109/GRC.2011.6122701
Filename :
6122701
Link To Document :
بازگشت