Title :
Using Multiple Sets of Attributes for Text Categorization
Author :
Bi, Ya-xin ; Zhang, Qiang ; Wu, Sheng-li ; Guan, Ji-wen
Author_Institution :
Sch. of Comput. & Math., Ulster Univ., Newtownabbey
Abstract :
This paper investigates how multiple sets of attributes can be generated using a rough sets-based inductive learning method and how they can be combined for improving classification decisions, particularly in the context of text categorization, by using Dempster\´s rule of combination. We first propose a boosting-like technique for generating multiple sets of attributes based on rough set theory, and a method for transforming multiple sets of attributes to multiple sets of rules, and then model classification decisions inferred by the rules as pieces of evidence. The various experiments have been carried out on 10 out of the 20-newsgroups - a benchmark data collection ndividually and in combination. Our experimental results support the claim that "decisions made by multiple experts would be more effective than any one if their individual judgments are appropriately combined"
Keywords :
inference mechanisms; learning (artificial intelligence); pattern classification; rough set theory; text analysis; Dempster combination rule; ensemble method; inductive learning; information fusion; multiple attribute set; pattern classification; rough set theory; text categorization; Bismuth; Boosting; Computer science; Cybernetics; Educational institutions; Electronic mail; Learning systems; Machine learning; Mathematics; Pattern recognition; Set theory; Text categorization; Inductive learning; ensemble methods; information fusion; text categorization;
Conference_Titel :
Machine Learning and Cybernetics, 2006 International Conference on
Conference_Location :
Dalian, China
Print_ISBN :
1-4244-0061-9
DOI :
10.1109/ICMLC.2006.258668