DocumentCode
2176013
Title
Rule-based text categorization using hierarchical categories
Author
Sasaki, Minoru ; Kita, Kenji
Author_Institution
Fac. of Eng., Tokushima Univ., Japan
Volume
3
fYear
1998
fDate
11-14 Oct 1998
Firstpage
2827
Abstract
Document categorization, which is defined as the classification of text documents into one of several fixed classes or categories, has become important with the explosive growth of the World Wide Web. The goal of the work described here is to automatically categorize Web documents in order to enable effective retrieval of Web information. In this paper, based on the rule learning algorithm RIPPER (for Repeated Incremental Pruning to Produce Error Reduction), we propose an efficient method for hierarchical document categorization
Keywords
classification; data mining; information resources; information retrieval; knowledge based systems; learning (artificial intelligence); Web information retrieval; World Wide Web; document categorization; hierarchical categories; hierarchical document categorization; rule learning algorithm RIPPER; rule-based text categorization; text documents classification; Data mining; Explosives; Filtering; Humans; Learning systems; Painting; Partitioning algorithms; Text categorization; Web sites; World Wide Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on
Conference_Location
San Diego, CA
ISSN
1062-922X
Print_ISBN
0-7803-4778-1
Type
conf
DOI
10.1109/ICSMC.1998.725090
Filename
725090
Link To Document