Title :
A committee of NNA classifiers for the prediction of the binding between miRNAs and the target genes using a novel coding method
Author :
He, Zhisong ; Feng, KaiYan ; Cai, Yudong
Author_Institution :
Dept. of Bioinf., Zhejiang Univ., Hangzhou, China
Abstract :
We present a paper for the prediction of the bindings between microRNAs (miRNAs) and their target genes. A novel coding for the miRNAs, the binding sites (i.e. the target genes) and the flanking sequences of the binding sites is adopted to code the related information comprehensively. A feature selection method, Minimum Redundancy Maximum Relevance (mRMR), is used to filter out ineffective and redundant features. Because the data are severely imbalanced, a committee of NNA (Nearest Neighbor Algorithm) classifiers is applied to distribute the data more evenly between different classes. The final prediction results are gained through voting from the classifier committee. As a result, 83.33% positive samples are correctly identified with an overall correct prediction rate of 76.78%. The feature analysis, performed by mRMR feature selections using the classifier committee, shows that the seed region of miRNAs and the flanking sequences of the binding sites play a significant role in the regulation of miRNA binding.
Keywords :
bioinformatics; genetics; macromolecules; molecular biophysics; organic compounds; NNA classifiers; data distribution; flanking sequences; mRMR feature selections; miRNA binding regulation; microRNA binding sites; minimum redundancy maximum relevance; nearest neighbor algorithm; novel coding method; target genes; Bioinformatics; Biomedical imaging; Cells (biology); Cybernetics; Educational institutions; Helium; Nearest neighbor searches; Sequences; Testing; USA Councils; mRMR; miRNA target prediction;
Conference_Titel :
Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
Conference_Location :
San Antonio, TX
Print_ISBN :
978-1-4244-2793-2
Electronic_ISBN :
1062-922X
DOI :
10.1109/ICSMC.2009.5346808