• DocumentCode
    3549176
  • Title

    Combining variable selection with dimensionality reduction

  • Author

    Wolf, Lior ; Bileschi, Stan

  • Author_Institution
    The Center for Biol. & Computational Learning, Massachusetts Inst. of Technol., MA, USA
  • Volume
    2
  • fYear
    2005
  • fDate
    20-25 June 2005
  • Firstpage
    801
  • Abstract
    This paper bridges the gap between variable selection methods (e.g., Pearson coefficients, KS test) and dimensionality reduction algorithms (e.g., PCA, LDA). Variable selection algorithms encounter difficulties dealing with highly correlated data, since many features are similar in quality. Dimensionality reduction algorithms tend to combine all variables and cannot select a subset of significant variables. Our approach combines both methodologies by applying variable selection followed by dimensionality reduction. This combination makes sense only when using the same utility function in both stages, which we do. The resulting algorithm benefits from complex features as variable selection algorithms do, and at the same time enjoys the benefits of dimensionality reduction.
  • Keywords
    feature extraction; learning (artificial intelligence); statistical analysis; support vector machines; KS test; LDA; PCA; Pearson coefficients; SVM; correlated data; dimensionality reduction algorithm; feature selection; utility function; variable selection algorithms; Biology computing; Bridges; Data mining; Diversity reception; Input variables; Linear discriminant analysis; Principal component analysis; Support vector machines; Testing; Time measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on
  • ISSN
    1063-6919
  • Print_ISBN
    0-7695-2372-2
  • Type

    conf

  • DOI
    10.1109/CVPR.2005.103
  • Filename
    1467525