• DocumentCode
    2386618
  • Title

    Co_NBM: A Semi-Supervised Categorization Algorithm Based TEF_WA Technique

  • Author

    Tang Huanling ; Lu Mingyu ; Liu Na

  • Author_Institution
    Dalian Maritime Univ., Dalian
  • fYear
    2007
  • fDate
    2-4 Nov. 2007
  • Firstpage
    271
  • Lastpage
    271
  • Abstract
    We propose a semi-supervised categorization algorithm Co_NBM incorporating co-training and TEF_WA technique. General co-training algorithm relies on the assumption that the features set can be split into two compatible and independent views. However, the assumption is usually violated to some degree in practice and sometimes the natural feature split does not exist. TEF_WA technique utilizes term evaluation functions to reduce dimensionality and adjust terms weight. Now, it is used to construct multiple views. Our experimental results show that utilizing unlabeled data Co_NBM can significantly decrease classification error, especially when labeled training data are sparse.
  • Keywords
    Bayes methods; document handling; learning (artificial intelligence); pattern classification; Co_NBM co-training algorithm; TEF_WA technique; document classification; naive Bayes classifier; semi-supervised document categorization algorithm; semi-supervised learning; term evaluation function-weight adjustment; Assembly; Constraint theory; Semisupervised learning; Smoothing methods; Text categorization; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing, 2007. GRC 2007. IEEE International Conference on
  • Conference_Location
    Fremont, CA
  • Print_ISBN
    978-0-7695-3032-1
  • Type

    conf

  • DOI
    10.1109/GrC.2007.14
  • Filename
    4403108