• DocumentCode
    2432772
  • Title

    Machine Learning Algorithms with Co-occurrence Based Term Association for Text Mining

  • Author

    Yi, YunFei ; Lijun Liu ; Li, Cheng Hua ; Song, Wei ; Liu, Shuai

  • Author_Institution
    Dept. of Comput. Sci. & Inf., He Chi Univ., YiZhou, China
  • fYear
    2012
  • fDate
    3-5 Nov. 2012
  • Firstpage
    958
  • Lastpage
    962
  • Abstract
    In this paper, an effective method for computing term association from a text corpus is presented. Two machine learning algorithms are employed to evaluate the effectiveness of the proposed method for text mining. The co-occurrence based term association method is to overcome the problem of lack of relationship between words for keyword based text mining and improve the performance of text mining. The experiments are conducted on the standard Reuter-21578 data set and 20 news group data set. Different number of associated terms are compared in the experiments. Experimental results show that the proposed method can achieve better results on different machine learning algorithms when measured by F measure.
  • Keywords
    data mining; learning (artificial intelligence); text analysis; F measure; Reuter-21578 data set; cooccurrence based term association; keyword based text mining; machine learning algorithm; text corpus; Educational institutions; Information retrieval; Text categorization; Text mining; Thesauri; Training; Vectors; Machine learning algorithms; Term association; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Communication Networks (CICN), 2012 Fourth International Conference on
  • Conference_Location
    Mathura
  • Print_ISBN
    978-1-4673-2981-1
  • Type

    conf

  • DOI
    10.1109/CICN.2012.141
  • Filename
    6375257