• DocumentCode
    1957806
  • Title
    Soft partitions lead to better learned ensembles
  • Author
    Eschrich, Steven; Hall, Lawrence O.
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    406
  • Lastpage
    411
  • Abstract
    Ensembles of classifiers often provide better classification accuracy than a single classifier. One approach to creating ensembles is to create different subsets of the training data. We present a method of creating ensembles of classifiers by partitioning the dataset into regions using clustering. Learners are assigned to each region and the ensemble classification occurs by querying the learned classifier. The first strategy considered for partitioning the training set is to generate a hard, non-overlapping partition. This approach is shown to perform worse than a single classifier using the entire training set. However, the use of soft partitions significantly improves the overall ensemble performance. Three different methods of creating soft partitions are considered: a simple distance ratio, and both the fuzzy c-means and possibilistic c-means membership functions. All three methods are found to improve overall classifier performance beyond hard partitioning and often perform better than the base classifier using the entire training set. Experiments on six datasets illustrate the improved accuracy from creating ensembles on soft partitions of data.
  • Keywords
    divide and conquer methods; fuzzy logic; learning (artificial intelligence); pattern classification; classification accuracy; clustering; divide and conquer strategies; ensembles of classifiers; membership functions; soft partitions; Bagging; Computer science; Fuzzy logic; Machine learning; Neural networks; Neurons; Training data; Voting;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2002 Annual Meeting of the North American Fuzzy Information Processing Society. Proceedings. NAFIPS
  • Print_ISBN
    0-7803-7461-4
  • Type
    conf
  • DOI
    10.1109/NAFIPS.2002.1018094
  • Filename
    1018094
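
The abstract mentions fuzzy c-means membership functions as one way to form soft partitions. The following is a minimal sketch of that idea, not the authors' implementation: it computes the standard fuzzy c-means membership of a point in each cluster and places the point in every region whose membership reaches a cutoff. The function names, the `threshold` parameter, and the rule of always keeping the highest-membership region are assumptions made for illustration; the abstract does not say how memberships are turned into overlapping training subsets.

```python
import math


def fcm_memberships(point, centers, m=2.0):
    """Standard fuzzy c-means membership of one point in each cluster.

    The memberships sum to 1. `m` > 1 is the usual fuzzifier.
    """
    dists = [math.dist(point, c) for c in centers]
    # A point that coincides with a center belongs fully to that cluster
    # (shared equally if several centers coincide with it).
    at_center = [d == 0.0 for d in dists]
    if any(at_center):
        n = sum(at_center)
        return [1.0 / n if hit else 0.0 for hit in at_center]
    exponent = 2.0 / (m - 1.0)
    return [
        1.0 / sum((d_i / d_j) ** exponent for d_j in dists)
        for d_i in dists
    ]


def soft_partition(points, centers, threshold=0.3, m=2.0):
    """Assign each point to every region whose membership >= threshold.

    `threshold` is a hypothetical cutoff chosen for illustration. The
    highest-membership region is always kept so no point is dropped.
    """
    regions = [[] for _ in centers]
    for p in points:
        u = fcm_memberships(p, centers, m)
        best = max(range(len(u)), key=u.__getitem__)
        for k, u_k in enumerate(u):
            if u_k >= threshold or k == best:
                regions[k].append(p)
    return regions


centers = [(0.0, 0.0), (4.0, 0.0)]
points = [(0.0, 0.0), (2.0, 0.0), (3.0, 0.0), (4.0, 0.0)]
regions = soft_partition(points, centers)
```

In this toy run the point midway between the two centers lands in both regions, so a learner trained on either region would see it. Raising `threshold` toward 1 shrinks the overlap and approaches a hard partition.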