DocumentCode :
2710419
Title :
Cost-Sensitive Parsimonious Linear Regression
Author :
Goetschalckx, Robby ; Driessens, Kurt ; Sanner, Scott
Author_Institution :
Katholieke Univ. Leuven, Leuven
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
809
Lastpage :
814
Abstract :
We examine linear regression problems where some features may only be observable at a cost (e.g., in medical domains where features may correspond to diagnostic tests that take time and costs money). This can be important in the context of data mining, in order to obtain the best predictions from the data on a limited cost budget. We define a parsimonious linear regression objective criterion that jointly minimizes prediction error and feature cost. We modify least angle regression algorithms commonly used for sparse linear regression to produce the ParLiR algorithm, which not only provides an efficient and parsimonious solution as we demonstrate empirically, but it also provides formal guarantees that we prove theoretically.
Keywords :
data mining; regression analysis; ParLiR algorithm; cost-sensitive parsimonious linear regression; data mining; parsimonious linear regression objective criterion; prediction error; sparse linear regression; Australia; Costs; Data mining; Linear regression; Machine learning; Machine learning algorithms; Measurement units; Medical diagnostic imaging; Medical tests; Sampling methods; Cost sensitivity; linear regression; regression; sparsity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3502-9
Type :
conf
DOI :
10.1109/ICDM.2008.76
Filename :
4781183
Link To Document :
بازگشت