• DocumentCode
    649813
  • Title

    Two layer algorithm for data classification based on rough set and Bayesian network classifiers

  • Author

    Mirzakhani, Marzieh ; Moghadam, AmirMasoud Eftekhari

  • Author_Institution
    Dept. of Comput. Eng., Islamic Azad Univ., Qazvin, Iran
  • fYear
    2013
  • fDate
    27-29 Aug. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Data classification, especially text classification has been one of the key subjects in intelligent information processing due to the enormous growth of digital content available on-line. Owing to the high feature space dimensions in most of data types, reduction of feature space and improving classification accuracy is important and difficult problem. A rough set theory is a powerful tool to deal with uncertainty, so it is a good tool for feature reduction. Bayesian networks are also one of the most powerful tools in design of expert systems located in an uncertainty framework. In this paper, we proposed an algorithm for data classification that first, it uses rough set theory and conditional entropy for feature selection and then through rough membership degree concept, it classifies objects with high membership degree, certainly. For classification of other objects, we use Bayesian network classifiers for instance Tree Augmented Naïve Bayes and general Bayesian classifier with two different search approaches. Finally, proposed algorithm is evaluated on Reuters-21578 collection and 4 UCI data sets.
  • Keywords
    belief networks; entropy; expert systems; feature extraction; pattern classification; rough set theory; text analysis; uncertainty handling; Bayesian network classifiers; Bayesian networks; classification accuracy; conditional entropy; data classification; data types; digital content; expert systems; feature reduction; feature selection; feature space dimensions; feature space reduction; general Bayesian classifier; intelligent information processing; rough membership degree concept; rough set theory; text classification; tree augmented naïve Bayes classifier; two layer algorithm; uncertainty framework; Bayesian Network; General Bayesian Classifier; Rough Set Theory; Text Classification; Tree Augmented Naïve Bayes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (IFSC), 2013 13th Iranian Conference on
  • Conference_Location
    Qazvin
  • Print_ISBN
    978-1-4799-1227-8
  • Type

    conf

  • DOI
    10.1109/IFSC.2013.6675587
  • Filename
    6675587