Title :
Two layer algorithm for data classification based on rough set and Bayesian network classifiers
Author :
Mirzakhani, Marzieh ; Moghadam, AmirMasoud Eftekhari
Author_Institution :
Dept. of Comput. Eng., Islamic Azad Univ., Qazvin, Iran
Abstract :
Data classification, especially text classification has been one of the key subjects in intelligent information processing due to the enormous growth of digital content available on-line. Owing to the high feature space dimensions in most of data types, reduction of feature space and improving classification accuracy is important and difficult problem. A rough set theory is a powerful tool to deal with uncertainty, so it is a good tool for feature reduction. Bayesian networks are also one of the most powerful tools in design of expert systems located in an uncertainty framework. In this paper, we proposed an algorithm for data classification that first, it uses rough set theory and conditional entropy for feature selection and then through rough membership degree concept, it classifies objects with high membership degree, certainly. For classification of other objects, we use Bayesian network classifiers for instance Tree Augmented Naïve Bayes and general Bayesian classifier with two different search approaches. Finally, proposed algorithm is evaluated on Reuters-21578 collection and 4 UCI data sets.
Keywords :
belief networks; entropy; expert systems; feature extraction; pattern classification; rough set theory; text analysis; uncertainty handling; Bayesian network classifiers; Bayesian networks; classification accuracy; conditional entropy; data classification; data types; digital content; expert systems; feature reduction; feature selection; feature space dimensions; feature space reduction; general Bayesian classifier; intelligent information processing; rough membership degree concept; rough set theory; text classification; tree augmented naïve Bayes classifier; two layer algorithm; uncertainty framework; Bayesian Network; General Bayesian Classifier; Rough Set Theory; Text Classification; Tree Augmented Naïve Bayes;
Conference_Titel :
Fuzzy Systems (IFSC), 2013 13th Iranian Conference on
Conference_Location :
Qazvin
Print_ISBN :
978-1-4799-1227-8
DOI :
10.1109/IFSC.2013.6675587