Title :
Granular Support Vector Machines Using Linear Decision Hyperplanes for Fast Medical Binary Classification
Author :
Tang, Yuchun ; Jin, Bo ; Zhang, Yan-Qing ; Fang, Hao ; Wang, Binghe
Author_Institution :
Dept. of Comput. Sci., Georgia State Univ., Atlanta, GA
Abstract :
In many life science applications, due to the requirement of real time data processing and/or the very large size of the available dataset, a classifier with high efficiency is usually more preferable, or even necessary, with the prerequisite of not deteriorating effectiveness too much. That means a more desirable classifier in this context should run faster but still retain high accuracy. In this paper, we show a simple but fast method for modeling a linear granular support vector machine by splitting the feature space to two smaller subspaces and then building a SVM for each of them. The hyperplane used to halve the feature space is searched by applying the extended statistical margin maximization principle along the direction orthogonal to the first principle component. One public medical dataset is used to compare the resulting linear GSVM to one optimized SVM with the radial basis function (RBF) kernel in the whole feature space. The experimental results show that finding the splitting hyperplane is not a trivial task and the linear GSVM is even a little better than the optimized RBF-SVM in terms of testing accuracy, but the linear GSVM is more robust against noises, more stable to model parameters, and runs much faster. In general, GSVM provides an interesting new mechanism, which is competitive to kernel mapping methods, to address complex classification problems effectively and efficiently
Keywords :
classification; data mining; fuzzy set theory; medical information systems; principal component analysis; radial basis function networks; real-time systems; search problems; support vector machines; data classification; feature space splitting; kernel mapping methods; life science; linear decision hyperplanes; linear granular support vector machine; medical binary classification; principal component analysis; public medical dataset; radial basis function kernel; real time data processing; statistical margin maximization principle; Biology computing; Biomedical computing; Chemistry; Data mining; Data processing; Kernel; Noise robustness; Support vector machine classification; Support vector machines; Testing;
Conference_Titel :
Fuzzy Systems, 2005. FUZZ '05. The 14th IEEE International Conference on
Conference_Location :
Reno, NV
Print_ISBN :
0-7803-9159-4
DOI :
10.1109/FUZZY.2005.1452382