Author_Institution :
Radio & TV Dept., Qinzhou Univ., Qinzhou, China
Abstract :
Most researches focus on two costs for building cost-sensitive decision trees, such as, misclassification costs, test costs. And the existing literatures always consider the two costs as the same scales, for instance, dollars. However, in real application, it is difficult for us to regard two costs as same scales, for instance, considering misclassification cost as a dollar unit. In this paper, a new splitting attributes criterion which is combined with classification ability, test costs and misclassification costs, is proposed under the assumption of multiple-costs scales and with missing values in the dataset. The experimental results show the proposed method outperforms the existed methods in terms of the decrease of misclassification cost.
Keywords :
cost reduction; decision trees; learning by example; pattern classification; cost-sensitive decision tree; cost-sensitive learning; dataset missing value; inductive learning; misclassification cost reduction; multiple cost scale; pattern classification; splitting attribute criterion; test cost; Algorithm design and analysis; Artificial intelligence; Buildings; Cost function; Decision trees; Design methodology; Medical diagnosis; Medical tests; TV; Testing; cost-sensitive; missing values; tree;