• DocumentCode
    353854
  • Title

    Discovering and filtering text information from Internet based on inductive learning

  • Author

    Bo, Yang ; Fei, Liu ; Yiqi, Sun

  • Author_Institution
    Shandong Univ. of Building Mater., Jinan, China
  • Volume
    4
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    2749
  • Abstract
    The paper aims at the discovering and filtering of text information from the Internet. Based on the theories of probability and statistics, an essential key word list is established by the statistics of functional words in the sample text and whether or not the text is suspicious. A probability-based inductive learning algorithm is presented to show how to set up and optimize the key words and further more classify the text by the parameter estimation of the text-class attributes. A knowledge-based network information classification and filtering scheme is studied by using the algorithm to different sets of sample text belongs to several domains. The theoretical basis and algorithm implementation are discussed in detail, the experimental results and application examples are also given in the paper
  • Keywords
    Internet; data mining; learning by example; multi-agent systems; parameter estimation; probability; Internet; essential key word list; knowledge-based network information classification; probability-based inductive learning algorithm; text information; text-class attribute; Building materials; Filtering algorithms; Information filtering; Information filters; Internet; Parameter estimation; Probability; Statistics; Sun;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control and Automation, 2000. Proceedings of the 3rd World Congress on
  • Conference_Location
    Hefei
  • Print_ISBN
    0-7803-5995-X
  • Type

    conf

  • DOI
    10.1109/WCICA.2000.862559
  • Filename
    862559