• DocumentCode
    508308
  • Title

    On Improving Discretization Quality by a Bagging Technique

  • Author

    Qureshi, Taimur ; Zighed, Djamel A.

  • Author_Institution
    Lab. ERIC, Univ. of Lyon 2, Bron, France
  • Volume
    1
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    226
  • Lastpage
    231
  • Abstract
    Most of the real data often comes in a mixed format (discrete or continuous), however many supervised induction algorithms require discrete data. Quality discretization of continuous attributes is an important problem that has effects on accuracy, complexity, variance and understandability of the induction models. Most of the existing discretization methods, partition the attribute range into two or several intervals using a single or a set of cut points. In this paper, we introduce a resampling based bagging technique (using bootstrap) to generate a set of candidate discretization points and thus, improving the discretization quality by providing a better estimation towards the entire population. Thus, the goal of this paper is to observe whether this type of bagging variant can lead to better discretization points.
  • Keywords
    data mining; learning (artificial intelligence); statistical analysis; bagging technique; bootstrap technique; continuous attribute discretization; discretization quality; resampling method; supervised induction algorithms; Bagging; Bayesian methods; Classification algorithms; Classification tree analysis; Data mining; Decision trees; Entropy; Induction generators; Laboratories; Partitioning algorithms; Baggin; Discretization; Resampling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation, 2009. ICNC '09. Fifth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3736-8
  • Type

    conf

  • DOI
    10.1109/ICNC.2009.706
  • Filename
    5366557