Title :
Margin Maximization Model of Text Classification Based on Support Vector Machines
Author :
Chen, Peng ; Wen, Tao
Author_Institution :
Dept. of Comput. Sci. & Technol., Neusoft Inst. of Inf., Dalian
Abstract :
Support vector machines (SVMs) are more suitable for text categorization than traditional machine learning methods by acknowledging various statistical characteristic of text learning task. By introducing the margin maximization principle in the statistical machine learning theory, the feature statistic matrix based on average document mapping (FSM-ADM) model, which partitions the set of features using weighted odds ratio, is proposed in the form of generalization capability estimation theorem with rigorous proofs and solid experimental validation. The theoretical model has successfully discovered the unexplored capability of being classified in text classification
Keywords :
matrix algebra; statistical analysis; support vector machines; text analysis; FSM-ADM; SVMs; average document mapping model; feature statistic matrix; generalization capability estimation theorem; margin maximization model; solid experimental validation; statistical machine learning theory; support vector machine; text categorization; text classification; text learning task; Computer science; Cybernetics; Electronic mail; Estimation theory; Lagrangian functions; Learning systems; Machine learning; Solid modeling; Statistics; Support vector machine classification; Support vector machines; Text categorization; FSM-ADM; Support vector machines; margin maximization; text classification;
Conference_Titel :
Machine Learning and Cybernetics, 2006 International Conference on
Conference_Location :
Dalian, China
Print_ISBN :
1-4244-0061-9
DOI :
10.1109/ICMLC.2006.258543