Title :
A generalized version space learning algorithm for noisy and uncertain data
Author :
Hong, Tzung-Pei ; Shian-Shyong Tsang
Author_Institution :
Dept. of Inf. Manage., Kaohsiung Polytech. Inst., Taiwan
Abstract :
This paper generalizes the learning strategy of version space to manage noisy and uncertain training data. A new learning algorithm is proposed that consists of two main phases: searching and pruning. The searching phase generates and collects possible candidates into a large set; the pruning then prunes this set according to various criteria to find a maximally consistent version space. When the training instances cannot completely be classified, the proposed learning algorithm can make a trade-off between including positive training instances and excluding negative ones according to the requirements of different application domains. Furthermore, suitable pruning parameters are chosen according to a given time limit, so the algorithm can also make a trade-off between time complexity and accuracy. The proposed learning algorithm is then a flexible and efficient induction method that makes the version space learning strategy more practical
Keywords :
learning (artificial intelligence); uncertainty handling; application domains; generalized version space learning algorithm; learning strategy; maximally consistent version space; noisy data; positive training instances; pruning; searching; uncertain data; version space learning strategy; Computer Society; Genetic algorithms; Information management; Learning systems; Machine learning; Machine learning algorithms; Management training; Training data; Uncertainty; Working environment noise;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on