• DocumentCode
    2865904
  • Title

    Making logistic regression a core data mining tool with TR-IRLS

  • Author

    Komarek, Paul ; Moore, Andrew W.

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2005
  • fDate
    27-30 Nov. 2005
  • Abstract
    Binary classification is a core data mining task. For large datasets or real-time applications, desirable classifiers are accurate, fast, and need no parameter tuning. We present a simple implementation of logistic regression that meets these requirements. A combination of regularization, truncated Newton methods, and iteratively re-weighted least squares make it faster and more accurate than modern SVM implementations, and relatively insensitive to parameters. It is robust to linear dependencies and some scaling problems, making most data preprocessing unnecessary.
  • Keywords
    Newton method; data mining; least squares approximations; pattern classification; regression analysis; binary classification; data mining tool; iteratively reweighted least squares; logistic regression; regularization method; truncated Newton method; Character generation; Computer science; Data mining; Data preprocessing; Least squares methods; Logistics; Newton method; Sparse matrices; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, Fifth IEEE International Conference on
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2278-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2005.90
  • Filename
    1565757