DocumentCode :
3770067
Title :
Binary categorization of DNA data with unbalanced class distribution for prediction of hepatocellular carcinoma
Author :
V Sruthi;Naren T Kesh;R Priyanka;Shomona Gracia Jacob
Author_Institution :
CSE Department, SSN College of Engineering, Kalavakkam, Chennai
fYear :
2015
Firstpage :
490
Lastpage :
494
Abstract :
Experimental data, and real-world data in general, are often unbalanced: the classification categories are not approximately equally represented, because of subject mortality, non-response, and similar causes. The term "unbalanced" in this context refers to the distribution of records among the target classes. Working with unbalanced data has several limitations: it causes discrepancies in calculating the effective mean, leads to heterogeneity of variance across cells, and makes valid standard error estimates problematic. This paper investigates classification algorithms and compares their consistency using the Matthews Correlation Coefficient. With this motive, the authors stress the importance of balanced data for predicting defective and abnormal DNA, which aids in detecting liver ailments leading to Hepatocellular Carcinoma (liver cancer).
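The Matthews Correlation Coefficient named in the abstract can be illustrated with a minimal sketch. The function names and the confusion-matrix counts below are hypothetical and are not taken from the paper. The sketch only shows the standard binary definition of the coefficient and why it is more informative than plain accuracy when one class dominates.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Standard binary MCC computed from confusion-matrix counts.

    Returns 0.0 when the denominator is zero (a common convention,
    assumed here), e.g. when the classifier predicts a single class.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


def accuracy(tp, tn, fp, fn):
    """Fraction of records classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)


# Hypothetical unbalanced data set: 95 negative records, 5 positive records.
# A classifier that labels every record negative looks good on accuracy,
# but MCC shows it carries no predictive information about the minority class.
always_negative = dict(tp=0, tn=95, fp=0, fn=5)
print(accuracy(**always_negative))
print(matthews_corrcoef(**always_negative))
```

In this illustrative case the accuracy is high while the coefficient is zero, which is the kind of discrepancy that makes MCC a reasonable yardstick for comparing classifiers on unbalanced classes.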
Keywords :
"DNA","Classification algorithms","Cancer","Data mining","Correlation coefficient","Algorithm design and analysis","RNA"
Publisher :
IEEE
Conference_Title :
Applied and Theoretical Computing and Communication Technology (iCATccT), 2015 International Conference on
Type :
conf
DOI :
10.1109/ICATCCT.2015.7456934
Filename :
7456934