DocumentCode :
1038266
Title :
Imbalanced learning with a biased minimax probability machine
Author :
Huang, Kaizhu ; Yang, Haiqin ; King, Irwin ; Lyu, Michael R.
Author_Institution :
Inf. Technol. Lab., Fujitsu R&D Center Co. Ltd., Beijing
Volume :
36
Issue :
4
fYear :
2006
Firstpage :
913
Lastpage :
923
Abstract :
Imbalanced learning is a challenged task in machine learning. In this context, the data associated with one class are far fewer than those associated with the other class. Traditional machine learning methods seeking classification accuracy over a full range of instances are not suitable to deal with this problem, since they tend to classify all the data into a majority class, usually the less important class. In this correspondence, the authors describe a new approach named the biased minimax probability machine (BMPM) to deal with the problem of imbalanced learning. This BMPM model is demonstrated to provide an elegant and systematic way for imbalanced learning. More specifically, by controlling the accuracy of the majority class under all possible choices of class-conditional densities with a given mean and covariance matrix, this model can quantitatively and systematically incorporate a bias for the minority class. By establishing an explicit connection between the classification accuracy and the bias, this approach distinguishes itself from the many current imbalanced-learning methods; these methods often impose a certain bias on the minority data by adapting intermediate factors via the trial-and-error procedure. The authors detail the theoretical foundation, prove its solvability, propose an efficient optimization algorithm, and perform a series of experiments to evaluate the novel model. The comparison with other competitive methods demonstrates the effectiveness of this new model
Keywords :
covariance matrices; learning (artificial intelligence); minimax techniques; probability; biased minimax probability machine; covariance matrix; imbalanced learning; machine learning; trial-and-error procedure; Costs; Councils; Covariance matrix; Information technology; Learning systems; Machine learning; Minimax techniques; Performance evaluation; Proposals; Sampling methods; Fractional programming (FP); imbalanced learning; receiver operating characteristic (ROC) analysis; worst case accuracy;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2006.870610
Filename :
1658302
Link To Document :
بازگشت