Title :
The study of KPCA active learning method based on the ROC curve
Author :
Li-lin, Cui ; Hai-chao, Zhu ; Lin-ke, Zhang ; Chao, Ma
Author_Institution :
Instn. of Noise & Vibration, Naval Univ. of Eng., Wuhan, China
Abstract :
During the course of ship machinery condition monitoring, it is universal that the number of normal condition samples is greater than the number of fault samples because the factors of test difficult or expensive testing costs. In the passive learning method to train the classifier using the all normal samples , it will not only lead to too much training time and even NP problem for some machine learning methods, but also each sample has different impacts for classifier model because noise and other factors,which would lead to the phenomenon that the classifier generalization performance degradation when the bad training samples are too many. A KPCA active learning method based on the Receiver Operating Characteristic (ROC) curve is proposed in this paper. In this method, the significance of each training sample is evaluated by the KPCA method, and then a sequence comprised all training samples is obtained based on the significance. Selecting the foregoing samples compose the train sets to train the classifier and evaluate the performance based on the Receiver Operating Characteristic curve step by step. At last, when the area under the ROC curve (AUC) of the data set is biggest, the data set is selected as the optimal training sample set, which is the finally result of the KPCA active learning method based on the ROC curve. The experiment results of 1:1 Cabin model show that the method is feasible and effective.
Keywords :
computational complexity; condition monitoring; learning (artificial intelligence); marine engineering; pattern classification; principal component analysis; sensitivity analysis; ships; KPCA active learning method; NP problem; classifier generalization performance degradation; machine learning methods; passive learning method; receiver operating characteristic curve; ship machinery condition monitoring; Active noise reduction; Chaos; Computer science education; Condition monitoring; Costs; Educational technology; Learning systems; Machinery; Marine vehicles; Testing; Active learning; Kernal principal component analysis(KPCA); Receiver Operating Characteristic curve;
Conference_Titel :
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6367-1
DOI :
10.1109/ICETC.2010.5529682