Title :
High dimensional microarray data classification using correlation based feature selection
Author :
Hasan, Abid ; Adnan, Md Akhtaruzzaman
Author_Institution :
Dept. of Comput. Sci. & Eng., Islamic Univ. of Technol. (IUT), Gazipur, Bangladesh
Abstract :
Analyzing DNA microarray data pose a serious challenge because of their large number of features (genes) and relatively small number of samples. Extracting features, those have predictive capability for classifying these huge datasets demands appropriate approaches like feature reduction and identifying optimal set of genes. In this paper along with conventional statistical methods like filtering the dataset to reduce the number of features, one additional approach of evaluating correlation between the classes for each feature is performed. Proposed approach yields higher classification accuracy for both Acute Lymphoblastic (ALL) and High Grade Glioma cancer dataset than using only traditional statistical filtering methods.
Keywords :
DNA; blood; cancer; feature extraction; filtering theory; genetics; lab-on-a-chip; medical signal processing; signal classification; acute lymphoblastic cancer dataset; correlation based feature selection; dataset filtering; feature extraction; feature reduction; genes; high dimensional DNA microarray data classification; high grade glioma cancer dataset; statistical filtering methods; Accuracy; Cancer; Correlation; Educational institutions; Filtering; Gene expression; Redundancy; DNA microarray data; classification; correlation; feature selection;
Conference_Titel :
Biomedical Engineering (ICoBE), 2012 International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-1990-5
DOI :
10.1109/ICoBE.2012.6179029