DocumentCode
2550549
Title
Feature extended short text categorization based on theme ontology
Author
Zhan, Yan ; Chen, Hao
Author_Institution
Key Lab. of Machine Learning & Comput. Intell., Hebei Univ., Baoding, China
fYear
2012
fDate
29-31 May 2012
Firstpage
702
Lastpage
705
Abstract
Short text classification problem as text classification a branch, in addition to the same with traditional text classification to a certain degree, still need to face some special problems to be solved, because of short text length, features sparse, measuring the words difficultly. Due to the nature of ontology emphasize related field concept, which has the defect of few characteristics in short text, it is necessary to emphasize the relationship between semantic. This paper uses the feature extended method based on theme Ontology. As considering the semantic relations, it can get better classification performance compared to the conventional method. Meanwhile, using Case-Base Maintenance learning via the GC (Generalization Capability) algorithm, which can reduce the case number into K-NN algorithm, can improve efficiency when indexing near neighbor in K-Nearest Neighbor algorithm. The numerical experiments prove the validity of this learning algorithm.
Keywords
learning (artificial intelligence); ontologies (artificial intelligence); pattern classification; text analysis; K-nearest neighbor algorithm; case-base maintenance learning; classification performance; feature extended short text categorization; field concept; generalization capability algorithm; short text length; text classification; theme ontology; Algorithm design and analysis; Classification algorithms; Educational institutions; Ontologies; Semantics; Text categorization; Training; CBM; Short Text Categorization; Theme Ontology;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location
Sichuan
Print_ISBN
978-1-4673-0025-4
Type
conf
DOI
10.1109/FSKD.2012.6234220
Filename
6234220
Link To Document