Title :
Online sequential classification of imbalanced data by combining extreme learning machine and improved SMOTE algorithm
Author :
Wentao Mao;Jinwan Wang;Liyun Wang
Author_Institution :
School of Computer and Information Engineering, Henan Normal University, Xinxiang, CHINA, 453007
fDate :
7/1/2015 12:00:00 AM
Abstract :
Presently, the data imbalance problems become more pronounced in the applications of machine learning and pattern recognition. However, many traditional machine learning methods suffer from the imbalanced data which are also collected in online sequential manner. To get fast and efficient classification for this special problem, a new online sequential extreme learning machine method with sequential SMOTE strategy is proposed. The key idea of this method is to reduce the randomness while generating virtual minority samples by means of the distribution characteristic of online sequential data. Utilizing online-sequential extreme learning machine as baseline algorithm, this method contains two stages. In offline stage, principal curve is introduced to model the each class´s distribution based on which some virtual samples are generated by synthetic minority over-sampling technique(SMOTE). In online stage, each class´s membership is determined according to the projection distance of sample to principal curve. With the help of these memberships, the redundant majority samples as well as unreasonable virtual minority samples are all excluded to lighten the imbalance level in online stage. The proposed method is evaluated on four UCI datasets and the real-world air pollutant forecasting dataset. The experimental results show that, the proposed method outperforms the classical ELM, OS-ELM and SMOTE-based OS-ELM in terms of generalization performance and numerical stability.
Keywords :
"Classification algorithms","Prediction algorithms","Standards","Niobium"
Conference_Titel :
Neural Networks (IJCNN), 2015 International Joint Conference on
Electronic_ISBN :
2161-4407
DOI :
10.1109/IJCNN.2015.7280620