DocumentCode :
2866295
Title :
On the tractability of rule discovery from distributed data
Author :
Scholz, Martin
Author_Institution :
Dept. of Comput. Sci., Dortmund Univ., Germany
fYear :
2005
fDate :
27-30 Nov. 2005
Abstract :
This paper analyses the tractability of rule selection for supervised learning in distributed scenarios. The selection of rules is usually guided by a utility measure such as predictive accuracy or weighted relative accuracy. A common strategy to tackle rule selection from distributed data is to evaluate rules locally on each dataset. While this works well for homogeneously distributed data, this work proves limitations of this strategy if distributions are allowed to deviate. The identification of those subsets for which local and global distributions deviate, poses a learning task of its own, which is shown to be at least as complex as discovering the globally best rules from local data.
Keywords :
data mining; distributed processing; learning (artificial intelligence); distributed data; predictive accuracy; rule discovery; rule selection; supervised learning; utility measure; weighted relative accuracy; Accuracy; Artificial intelligence; Computer science; Costs; Databases; Logic; Machine learning; Privacy; Supervised learning; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
ISSN :
1550-4786
Print_ISBN :
0-7695-2278-5
Type :
conf
DOI :
10.1109/ICDM.2005.110
Filename :
1565776
Link To Document :
بازگشت