DocumentCode
353854
Title
Discovering and filtering text information from Internet based on inductive learning
Author
Bo, Yang ; Fei, Liu ; Yiqi, Sun
Author_Institution
Shandong Univ. of Building Mater., Jinan, China
Volume
4
fYear
2000
fDate
2000
Firstpage
2749
Abstract
The paper aims at the discovering and filtering of text information from the Internet. Based on the theories of probability and statistics, an essential key word list is established by the statistics of functional words in the sample text and whether or not the text is suspicious. A probability-based inductive learning algorithm is presented to show how to set up and optimize the key words and further more classify the text by the parameter estimation of the text-class attributes. A knowledge-based network information classification and filtering scheme is studied by using the algorithm to different sets of sample text belongs to several domains. The theoretical basis and algorithm implementation are discussed in detail, the experimental results and application examples are also given in the paper
Keywords
Internet; data mining; learning by example; multi-agent systems; parameter estimation; probability; Internet; essential key word list; knowledge-based network information classification; probability-based inductive learning algorithm; text information; text-class attribute; Building materials; Filtering algorithms; Information filtering; Information filters; Internet; Parameter estimation; Probability; Statistics; Sun;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Control and Automation, 2000. Proceedings of the 3rd World Congress on
Conference_Location
Hefei
Print_ISBN
0-7803-5995-X
Type
conf
DOI
10.1109/WCICA.2000.862559
Filename
862559
Link To Document