DocumentCode :
3196007
Title :
miRNA and gene expression based cancer classification using self-learning and co-training approaches
Author :
Ibrahim, Roliana ; Yousri, Noha A. ; Ismail, Muhammad Ali ; El-Makky, Nagwa M.
Author_Institution :
Comput. & Syst. Eng. Dept., Alexandria Univ., Alexandria, Egypt
fYear :
2013
fDate :
18-21 Dec. 2013
Firstpage :
495
Lastpage :
498
Abstract :
A number of attempts to classify cancer samples using miRNA/gene expression profiles are known in literature. However, semi-supervised learning models have only been recently introduced to exploit the huge unlabeled expression profiles in enhancing sample classification. It is important to combine both miRNA and gene expression sets as that provides more information on the characteristics of cancer samples. The use of both of labeled and unlabeled miRNA and gene expression sets to enhance sample classification has not been explored yet. In this paper, two semi-supervised machine learning approaches, namely self-learning and co-training are adapted to enhance the quality of cancer sample classification. In self-learning, miRNA and gene based classifiers are enhanced independently. While in co-training, both miRNA and gene expression profiles are used simultaneously to provide different views of cancer samples. The approaches were evaluated using breast cancer, hepatocellular carcinoma (HCC) and lung cancer expression sets. Results show up to 20% improvement in F1-measure over Random Forests and SVM classifiers. Co-Training also outperforms Low Density Separation (LDS) approach by around 25% improvement in F1-measure in breast cancer.
Keywords :
RNA; cancer; genetics; learning (artificial intelligence); medical computing; pattern classification; support vector machines; F1-measure improvement; SVM classifiers; breast cancer expression sets; cotraining approach; gene expression based cancer classification; hepatocellular carcinoma expression sets; labeled gene expression sets; labeled miRNA expression sets; lung cancer expression; miRNA based cancer classification; random forests; sample classification enhancement; self-learning approach; semi-supervised machine learning models; unlabeled gene expression sets; unlabeled miRNA expression sets; Accuracy; Breast cancer; Gene expression; Lungs; Support vector machines; Training; Cancer sample classifiers; Co-Training; Self-Learning; Semi-supervised Approaches; miRNA and gene expression analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location :
Shanghai
Type :
conf
DOI :
10.1109/BIBM.2013.6732544
Filename :
6732544
Link To Document :
بازگشت