Title :
Optimizing the classification accuracy of imbalanced dataset based on SVM
Author :
Sheng, Zhang ; Xiuyu, Shang ; Wei, Wang ; Xiuli, Huang
Author_Institution :
Coll. of Math., Phys. & Inf., Eng., Zhejiang Normal Univ., Jinhua, China
Abstract :
A dataset can be called imbalanced if at least one class of the data is represented by significantly less number of samples than the others. Imbalanced data generally exists in the real world. The classification performance of traditional machine learning algorithm is hampered in the classification tasks of imbalanced dataset. Support Vector Machines (SVM) is a new kind of machine learning method based on structural risk minimization principle and has had the best performance so far in several challenging applications. This paper summarizes the applications of SVM in imbalance dataset first and then presents some main improved methods which greatly improved the performance of classification in imbalanced dataset.
Keywords :
learning (artificial intelligence); pattern classification; support vector machines; SVM; classification accuracy optimisation; imbalanced dataset; machine learning algorithm; structural risk minimization principle; support vector machines; Computers; Gallium nitride; Machine learning; Imbalanced Dataset; Machine Learning; SVM;
Conference_Titel :
Computer Application and System Modeling (ICCASM), 2010 International Conference on
Conference_Location :
Taiyuan
Print_ISBN :
978-1-4244-7235-2
Electronic_ISBN :
978-1-4244-7237-6
DOI :
10.1109/ICCASM.2010.5620370