DocumentCode :
3250717
Title :
On active learning for data acquisition
Author :
Zheng, Zhiqiang ; Padmanabhan, Balaji
Author_Institution :
Wharton Sch., Univ. of Pennsylvania, PA, USA
fYear :
2002
fDate :
2002
Firstpage :
562
Lastpage :
569
Abstract :
Many applications are characterized by having naturally incomplete data on customers - where data on only some fixed set of local variables is gathered However, having a more complete picture can help build better models. The naive solution to this problem - acquiring complete data for all customers s often impractical due to the costs of doing so. A possible alternative is to acquire complete data for "some" customers and to use this to improve the models built. The data acquisition problem is determining how many, and which, customers to acquire additional data from. In this paper we suggest using active learning based approaches for the data acquisition problem. In particular, we present initial methods for data acquisition and evaluate these methods experimentally on web usage data and UCI datasets. Results show that the methods perform well and indicate that active learning based methods for data acquisition can be a promising area for data mining research.
Keywords :
data acquisition; data mining; learning (artificial intelligence); UCI datasets; active learn; data acquisition; data mining; naturally incomplete data; web usage data; Companies; Costs; Credit cards; Data acquisition; Data mining; Information management; Learning systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
Type :
conf
DOI :
10.1109/ICDM.2002.1184002
Filename :
1184002
Link To Document :
بازگشت