Title :
Latent Factor SVM for Text Categorization
Author :
Xiaofei Zhou ; Li Guo ; Ping Liu ; Yanbing Liu
Author_Institution :
Inst. of Inf. Eng., Beijing, China
Abstract :
Text categorization is an important research in nature language process and content analysis. In this paper, we present latent factor SVM (LF-SVM) for text categorization which use latent factor vectors for category representation on text categorization. We prove that latent factors extracted by PLSA (probability latent semantic analysis) can span convex structure to express text category. Based on the category expression we adopt maximal margin hyper plane to divide the categories. The experiments on normal text datasets show that our motivation and algorithm are reasonable and effective.
Keywords :
natural language interfaces; statistical analysis; support vector machines; text analysis; LF-SVM; category representation; content analysis; convex structure; latent factor SVM; latent factor vectors; maximal margin hyper plane; nature language process; probability latent semantic analysis; text categorization; text datasets; Feature extraction; Semantics; Support vector machine classification; Text categorization; Training; Vectors; PLSA; SVM; Text categorization; latent semantic; text classification;
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
DOI :
10.1109/ICDMW.2014.9