Title :
Hypertext Classification using Weighted Transductive Support Vector Machines
Author :
Liu, Shuang ; Jia, Chuan-Ying ; Chen, Peng
Author_Institution :
Inst. of Nautical Sci. & Technol., Dalian Maritime Univ.
Abstract :
Hypertext document is a special but important kind of text document for text classification. This paper introduces weighted transductive support vector machines (WTSVMs), which treat test samples discriminately based on their weight factors rather than treat every test sample equally in transductive support vector machines (TSVMs). A hybrid similarity function that includes hyperlink and term components is defined and computed, measuring the similarity between an unlabeled sample and labeled documents. Thus, the adjustment of the decision hyper-plane is refined due to reformulating the penalties on unlabeled samples in the training process. Experimental results on benchmark problems show the efficiency of the proposed method
Keywords :
classification; support vector machines; text analysis; TSVM; WTSVM; decision hyper-plane; hybrid similarity function; hyperlink; hypertext classification; hypertext document; labeled documents; term components; unlabeled sample documents; weighted transductive support vector machines; Computer science; Context modeling; Cybernetics; Electronic mail; Information retrieval; Labeling; Machine learning; Support vector machine classification; Support vector machines; Testing; Text categorization; Web pages; Content similarity; Hyperlink similarity; Text classification; Transductive support vector machines; Weighted transductive support vector machines;
Conference_Titel :
Machine Learning and Cybernetics, 2006 International Conference on
Conference_Location :
Dalian, China
Print_ISBN :
1-4244-0061-9
DOI :
10.1109/ICMLC.2006.258547