• DocumentCode
    1241454
  • Title

    Filter-Based Data Partitioning for Training Multiple Classifier Systems

  • Author

    Dara, Rozita A. ; Makrehchi, Masoud ; Kamel, Mohamed S.

  • Author_Institution
    R&D, Res. In Motion, Ltd., Guelph, ON, Canada
  • Volume
    22
  • Issue
    4
  • fYear
    2010
  • fDate
    4/1/2010 12:00:00 AM
  • Firstpage
    508
  • Lastpage
    522
  • Abstract
    Data partitioning methods such as bagging and boosting have been extensively used in multiple classifier systems. These methods have shown a great potential for improving classification accuracy. This study is concerned with the analysis of training data distribution and its impact on the performance of multiple classifier systems. In this study, several feature-based and class-based measures are proposed. These measures can be used to estimate statistical characteristics of the training partitions. To assess the effectiveness of different types of training partitions, we generated a large number of disjoint training partitions with distinctive distributions. Then, we empirically assessed these training partitions and their impact on the performance of the system by utilizing the proposed feature-based and class-based measures. We applied the findings of this analysis and developed a new partitioning method called "Clustering, Declustering, and Selection" (CDS). This study presents a comparative analysis of several existing data partitioning methods including our proposed CDS approach.
  • Keywords
    pattern classification; pattern clustering; class based measures; clustering; declustering; disjoint training partitions; feature based measures; filter based data partitioning; multiple classifier systems training; selection; training data distribution; Multiple classifier system; class-based.; combining method; distance; feature-based; filter-based data partitioning; wrapper-based data partitioning;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2009.80
  • Filename
    4815241