DocumentCode :
2452390
Title :
Improve text classification accuracy based on classifier fusion methods
Author :
Danesh, Ali ; Moshiri, Behzad ; Fatemi, Omid
Author_Institution :
Tehran Univ., Tehran
fYear :
2007
fDate :
9-12 July 2007
Firstpage :
1
Lastpage :
6
Abstract :
Naive-Bayes and k-NN classifiers are two machine learning approaches for text classification. Rocchio is the classic method for text classification in information retrieval. Based on these three approaches and using classifier fusion methods, we propose a novel approach in text classification. Our approach is a supervised method, meaning that the list of categories should be defined and a set of training data should be provided for training the system. In this approach, documents are represented as vectors where each component is associated with a particular word. We proposed voting methods and OWA operator and decision template method for combining classifiers. Experimental results show that these methods decrese the classification error 15 percent as measured on 2000 training data from 20 newsgroups dataset.
Keywords :
Bayes methods; classification; decision theory; information retrieval; learning (artificial intelligence); sensor fusion; text analysis; OWA operator; Rocchio algorithm; decision template method; information retrieval; k-NN classifier; machine learning; naive-Bayes classifier; text classification; voting method; Classification tree analysis; Intelligent control; Linear discriminant analysis; Machine learning; Neural networks; Open wireless architecture; Process control; Text categorization; Training data; Voting; Classifier Fusion; Decision Template; K-NN; Naïve-Bayes; OWA; Rocchio; TFIDF; Text Classification; Voting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Fusion, 2007 10th International Conference on
Conference_Location :
Quebec, Que.
Print_ISBN :
978-0-662-45804-3
Electronic_ISBN :
978-0-662-45804-3
Type :
conf
DOI :
10.1109/ICIF.2007.4408196
Filename :
4408196
Link To Document :
بازگشت