DocumentCode :
725241
Title :
Feature based classification of nuclear receptors and their subfamilies using fuzzy K-nearest neighbor
Author :
Tiwari, Arvind Kumar ; Srivastava, Rajeev
Author_Institution :
Dept. of Comput. Sci.& Eng., Indian Inst. of Technol. (BHU), Varanasi, India
fYear :
2015
fDate :
19-20 March 2015
Firstpage :
24
Lastpage :
28
Abstract :
The efficient classification of nuclear receptors and their subfamilies plays an important role in the detection of various diseases such as diabetes, cancer, and inflammatory diseases and their related drug design and discovery. As of now, few methods have been reported in literature for the same but the performance and efficacy of these methods are not up to the desired level. To address the issue of efficient classification of nuclear receptor and their subfamilies, here in this paper we propose to use a fuzzy k-nearest neighbor classifier with minimum redundancy maximum relevance for the classification of nuclear receptor and their eight subfamilies. The minimum redundancy maximum relevance algorithm is used to select the optimal feature subset and observed that highest accuracy and Matthew´s correlation coefficient is obtained with 400 features among 753 features through fuzzy kNN classifier. The performance of fuzzy kNN classifier depends on two parameter number of nearest neighbor (k) and fuzzy coefficient (m) and it is observed that the highest accuracy and MCC is obtained at k=7 and m= 1.25. The overall accuracies of 10 fold cross validation with optimal number of features, k and m are 98.09% and 97.85% and the MCC values of 0.97 and 0.90 for the prediction of nuclear receptor families and subfamilies respectively. From the obtained results and analysis it is observed that the performance of the proposed approach for the classification of nuclear receptor and their eight subfamilies is very competitive with some other standard methods available in literature.
Keywords :
cancer; fuzzy set theory; pattern classification; proteins; Matthew correlation coefficient; cancer; diabetes; drug design and discovery; fuzzy k-nearest neighbor classifier; fuzzy kNN classifier; inflammatory diseases; minimum redundancy maximum relevance algorithm; nuclear receptor feature based classification; Accuracy; Amino acids; Classification algorithms; Feature extraction; Noise measurement; Proteins; Redundancy; Matthew´s correlation coefficient; Nuclear receptor; cross validation; fuzzy k-nearest neighbor; minimum redundancy maximum relevance; sequence derived properties;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Engineering and Applications (ICACEA), 2015 International Conference on Advances in
Conference_Location :
Ghaziabad
Type :
conf
DOI :
10.1109/ICACEA.2015.7164707
Filename :
7164707
Link To Document :
بازگشت