• DocumentCode
    653931
  • Title

    Relationship between Naïve Bayes error and max-dependency criterion in feature selection problems

  • Author

    Sedaghat, Nafiseh ; Fathy, Mahmood ; Modarressi, Mohammad-Hossein

  • Author_Institution
    Comput. Eng. Dept., Iran Univ. of Sci. & Technol., Tehran, Iran
  • fYear
    2013
  • fDate
    Oct. 31 2013-Nov. 1 2013
  • Firstpage
    262
  • Lastpage
    266
  • Abstract
    Feature selection of the raw data is a fundamental step in the most pattern recognition and machine learning applications. The primary problem of feature selection is the criterion which evaluates a feature set. In the context of classification problems, optimal criterion would be the Bayesian error rate for selected subset of features. The Bayesian error rate bounds to some values that are related to mutual information. This interval shrinks as the mutual information increases. In this paper, we investigated the relationship between dependency and the Naïve Bayes error; dependency of the selected features is calculated as mutual information between the selected features and class. We designed some experiments to examine it about a two classes and two binary features problem. We found that in binary feature selection problem, the Naïve Bayes error increases as dependency increases; however, we showed that there are some states that the Naïve Bayes classifier is optimal while its default assumption is strongly violated (dependency is more than 0.8).
  • Keywords
    Bayes methods; belief networks; pattern classification; Bayesian error rate; Naive Bayes Error; binary feature selection problem; classification problems; max-dependency criterion; Bioinformatics; Blogs; Computers; Entropy; Genomics; Bayes classifier; Error rate; Feature selection; Information theory; Naïve Bayes classifier; dependency; mutual information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Knowledge Engineering (ICCKE), 2013 3th International eConference on
  • Conference_Location
    Mashhad
  • Print_ISBN
    978-1-4799-2092-1
  • Type

    conf

  • DOI
    10.1109/ICCKE.2013.6682868
  • Filename
    6682868