DocumentCode
2710419
Title
Cost-Sensitive Parsimonious Linear Regression
Author
Goetschalckx, Robby ; Driessens, Kurt ; Sanner, Scott
Author_Institution
Katholieke Univ. Leuven, Leuven
fYear
2008
fDate
15-19 Dec. 2008
Firstpage
809
Lastpage
814
Abstract
We examine linear regression problems where some features may only be observable at a cost (e.g., in medical domains where features may correspond to diagnostic tests that take time and costs money). This can be important in the context of data mining, in order to obtain the best predictions from the data on a limited cost budget. We define a parsimonious linear regression objective criterion that jointly minimizes prediction error and feature cost. We modify least angle regression algorithms commonly used for sparse linear regression to produce the ParLiR algorithm, which not only provides an efficient and parsimonious solution as we demonstrate empirically, but it also provides formal guarantees that we prove theoretically.
Keywords
data mining; regression analysis; ParLiR algorithm; cost-sensitive parsimonious linear regression; data mining; parsimonious linear regression objective criterion; prediction error; sparse linear regression; Australia; Costs; Data mining; Linear regression; Machine learning; Machine learning algorithms; Measurement units; Medical diagnostic imaging; Medical tests; Sampling methods; Cost sensitivity; linear regression; regression; sparsity;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location
Pisa
ISSN
1550-4786
Print_ISBN
978-0-7695-3502-9
Type
conf
DOI
10.1109/ICDM.2008.76
Filename
4781183
Link To Document