DocumentCode :
579772
Title :
Particle Competition and Cooperation to Prevent Error Propagation from Mislabeled Data in Semi-supervised Learning
Author :
Breve, Fabricio ; Zhao, Liang
Author_Institution :
Inst. of Geosci. & Exact Sci. (IGCE), Sao Paulo State Univ. (UNESP), Rio Claro, Brazil
fYear :
2012
fDate :
20-25 Oct. 2012
Firstpage :
79
Lastpage :
84
Abstract :
Semi-supervised learning is applied to classification problems where only a small portion of the data items is labeled. In these cases, the reliability of the labels is a crucial factor, because mislabeled items may propagate wrong labels to a large portion or even the entire data set. This paper aims to address this problem by presenting a graph-based (network-based) semi-supervised learning method, specifically designed to handle data sets with mislabeled samples. The method uses teams of walking particles, with competitive and cooperative behavior, for label propagation in the network constructed from the input data set. The proposed model is nature-inspired and it incorporates some features to make it robust to a considerable amount of mislabeled data items. Computer simulations show the performance of the method in the presence of different percentage of mislabeled data, in networks of different sizes and average node degree. Importantly, these simulations reveals the existence of the critical points of the mislabeled subset size, below which the network is free of wrong label contamination, but above which the mislabeled samples start to propagate their labels to the rest of the network. Moreover, numerical comparisons have been made among the proposed method and other representative graph-based semi-supervised learning methods using both artificial and real-world data sets. Interestingly, the proposed method has increasing better performance than the others as the percentage of mislabeled samples is getting larger.
Keywords :
graph theory; learning (artificial intelligence); pattern classification; artificial data sets; classification problems; competitive andbehavior; cooperative behavior; error propagation prevention; graph-based semisupervised learning method; label contamination; label reliability; mislabeled data; network-based semisupervised learning method; particle competition; particle cooperation; real-world data sets; walking particles; Computational modeling; Computer simulation; Error analysis; Legged locomotion; Robustness; Semisupervised learning; Vectors; Computational intelligence; Machine learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (SBRN), 2012 Brazilian Symposium on
Conference_Location :
Curitiba
ISSN :
1522-4899
Print_ISBN :
978-1-4673-2641-4
Type :
conf
DOI :
10.1109/SBRN.2012.16
Filename :
6374828
Link To Document :
بازگشت