DocumentCode
3252917
Title
Comparative analysis of attribute selection measures used for attribute selection in decision tree induction
Author
Bhatt, A.S.
Author_Institution
Dept. of Comput. Sci. & Technol., L.D. Eng. Coll., Ahmedabad, India
fYear
2012
fDate
21-22 Dec. 2012
Firstpage
230
Lastpage
234
Abstract
Data mining is a process of finding hidden information from databases storing historical data which are also known as data-warehouses. Classification being a very well-known data mining technique, groups similar data objects by establishing relationship between the objects under test and the pre-defined class labels obtained during training phase. Of all the classification algorithms, decision tree is most commonly used. In this paper we will discuss scalability of decision tree algorithm based on the selection of the attribute selection measure. Attribute selection measure is mainly used to select the splitting criterion that best separates the given data partition. The popular attribute selection measures are Information Gain and Gain Ratio. We would perform the comparative analysis of these measures and based on their results we would determine which measure should be used in which situation in order to increase the scalability of the Decision Tree algorithm.
Keywords
data mining; data warehouses; decision trees; inference mechanisms; attribute selection measure; data mining; data warehouses; decision tree induction; gain ratio; information gain; splitting criterion; Classification algorithms; Data mining; Databases; Decision trees; Gain measurement; Training; Training data; attribute selection measure; classification; data mining technique; decision tree; gain ratio; information gain; supervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Radar, Communication and Computing (ICRCC), 2012 International Conference on
Conference_Location
Tiruvannamalai
Print_ISBN
978-1-4673-2756-5
Type
conf
DOI
10.1109/ICRCC.2012.6450584
Filename
6450584
Link To Document