• DocumentCode
    2550549
  • Title

    Feature extended short text categorization based on theme ontology

  • Author

    Zhan, Yan ; Chen, Hao

  • Author_Institution
    Key Lab. of Machine Learning & Comput. Intell., Hebei Univ., Baoding, China
  • fYear
    2012
  • fDate
    29-31 May 2012
  • Firstpage
    702
  • Lastpage
    705
  • Abstract
    Short text classification problem as text classification a branch, in addition to the same with traditional text classification to a certain degree, still need to face some special problems to be solved, because of short text length, features sparse, measuring the words difficultly. Due to the nature of ontology emphasize related field concept, which has the defect of few characteristics in short text, it is necessary to emphasize the relationship between semantic. This paper uses the feature extended method based on theme Ontology. As considering the semantic relations, it can get better classification performance compared to the conventional method. Meanwhile, using Case-Base Maintenance learning via the GC (Generalization Capability) algorithm, which can reduce the case number into K-NN algorithm, can improve efficiency when indexing near neighbor in K-Nearest Neighbor algorithm. The numerical experiments prove the validity of this learning algorithm.
  • Keywords
    learning (artificial intelligence); ontologies (artificial intelligence); pattern classification; text analysis; K-nearest neighbor algorithm; case-base maintenance learning; classification performance; feature extended short text categorization; field concept; generalization capability algorithm; short text length; text classification; theme ontology; Algorithm design and analysis; Classification algorithms; Educational institutions; Ontologies; Semantics; Text categorization; Training; CBM; Short Text Categorization; Theme Ontology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
  • Conference_Location
    Sichuan
  • Print_ISBN
    978-1-4673-0025-4
  • Type

    conf

  • DOI
    10.1109/FSKD.2012.6234220
  • Filename
    6234220