• DocumentCode
    1000468
  • Title

    Ensembling local learners ThroughMultimodal perturbation

  • Author

    Zhou, Zhi-Hua ; Yu, Yang

  • Author_Institution
    Nat. Lab. for Novel Software Technol., Nanjing Univ., China
  • Volume
    35
  • Issue
    4
  • fYear
    2005
  • Firstpage
    725
  • Lastpage
    735
  • Abstract
    Ensemble learning algorithms train multiple component learners and then combine their predictions. In order to generate a strong ensemble, the component learners should be with high accuracy as well as high diversity. A popularly used scheme in generating accurate but diverse component learners is to perturb the training data with resampling methods, such as the bootstrap sampling used in bagging. However, such a scheme is not very effective on local learners such as nearest-neighbor classifiers because a slight change in training data can hardly result in local learners with big differences. In this paper, a new ensemble algorithm named Filtered Attribute Subspace based Bagging with Injected Randomness (FASBIR) is proposed for building ensembles of local learners, which utilizes multimodal perturbation to help generate accurate but diverse component learners. In detail, FASBIR employs the perturbation on the training data with bootstrap sampling, the perturbation on the input attributes with attribute filtering and attribute subspace selection, and the perturbation on the learning parameters with randomly configured distance metrics. A large empirical study shows that FASBIR is effective in building ensembles of nearest-neighbor classifiers, whose performance is better than that of many other ensemble algorithms.
  • Keywords
    data mining; learning (artificial intelligence); sampling methods; bootstrap sampling; data mining; ensemble learning algorithm; filtered attribute subspace; machine learning; multimodal perturbation; multiple component learner; nearest-neighbor classifier; resampling method; stable base learner; Bagging; Decision trees; Diversity reception; Educational programs; Filtering; Machine learning; Machine learning algorithms; Neural networks; Sampling methods; Training data; Data mining; ensemble learning; local learner; machine learning; multimodal perturbation; nearest-neighbor classifier; stable base learner; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Information Storage and Retrieval; Models, Statistical; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2005.845396
  • Filename
    1468246