Title :
Feature selection using mutual information for high- dimensional data sets
Author :
Nagpal, Arpita ; Gaur, Deepti ; Gaur, Surabhi
Author_Institution :
Comput. Sci. Deptt., ITM Univ., Gurgaon, India
Abstract :
To reduce the dimensionality of dataset, redundant and irrelevant features need to be segregated from multidimensional dataset. To remove these features, one of the feature selection techniques needs to be used. Here, a feature selection technique to remove irrelevant features has been used. Correlation measures based on the concept of mutual information has been adopted to calculate the degree of association between features. In this paper authors are proposing a new algorithm to segregate features from high dimensional data by visualizing relevant features in the form of graph as a dataset.
Keywords :
data visualisation; feature selection; trees (mathematics); correlation measures; dimensionality reduction; feature removal; feature segregation; feature selection; feature visualization; high-dimensional data sets; minimum spanning tree; multidimensional dataset; mutual information; Algorithm design and analysis; Classification algorithms; Correlation; Entropy; Filtering algorithms; Mutual information; Random variables; Correlation; data set; feature selection; minimum spanning tree;
Conference_Titel :
Advance Computing Conference (IACC), 2014 IEEE International
Conference_Location :
Gurgaon
Print_ISBN :
978-1-4799-2571-1
DOI :
10.1109/IAdCC.2014.6779292