Title :
Random forest in semi-supervised learning (Co-Forest)
Author :
Settouti, Nesma ; El Habib Daho, Mostafa ; El Amine Lazouni, Mohammed ; Chikh, Mohammed Amine
Author_Institution :
Biomed. Eng. Lab., Tlemcen Univ. Algeria, Tlemcen, Algeria
Abstract :
The semi-supervised learning has been widely applied in many fields such as medical diagnosis, pattern recognition. The semi supervised learning methods are used to employ unlabelled data in addition to labelled data for better classification of large data sets, where only a small number of labelled examples is available. Ensemble Methods are considered as an effective solution to the problem of dimensionality and can improve the robustness and generalization ability of individual learners. In this paper, we are particularly interested in the overall algorithm Random Forest semi-supervised named Co-Forest for the classification of large biological data. The algorithm is evaluated on its ability to correctly predict the labels of unlabelled examples, and its robustness when the number of labelled examples available decreases.
Keywords :
biology computing; data analysis; learning (artificial intelligence); very large databases; co-forest; ensemble methods; generalization ability; labelled data; large biological data classification; large data sets; medical diagnosis; pattern recognition; random forest; semi-supervised learning methods; unlabelled data; Biology; Cancer; Labeling; Prediction algorithms; Semisupervised learning; Supervised learning; Tumors;
Conference_Titel :
Systems, Signal Processing and their Applications (WoSSPA), 2013 8th International Workshop on
Conference_Location :
Algiers
DOI :
10.1109/WoSSPA.2013.6602385