DocumentCode
3724072
Title
Ensemble Kernel Mean Matching
Author
Yun-Qian Miao;Ahmed K. Farahat;Mohamed S. Kamel
Author_Institution
Univ. of Waterloo, Waterloo, ON, Canada
fYear
2015
Firstpage
330
Lastpage
338
Abstract
Kernel Mean Matching (KMM) is an elegant algorithm that estimates density ratios between training and test data by minimizing their maximum mean discrepancy in a kernel space. However, the applicability of KMM to large-scale problems is hindered by the quadratic complexity of computing and storing the kernel matrices over training and test data. To address this problem, this paper proposes a novel ensemble algorithm for KMM, which divides the test samples into smaller partitions, estimates a density ratio for each partition, and then fuses these local estimates with a weighted sum. Our theoretical analysis shows that the ensemble KMM has a lower error bound than the centralized KMM, which uses all the test data at once to estimate the density ratio. Because it is well suited to distributed implementation, the proposed algorithm is also favorable in terms of time and space complexity. Experiments on benchmark datasets confirm the superiority of the proposed algorithm in terms of estimation accuracy and running time.
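The partition, estimate, and fuse steps described in the abstract can be sketched roughly as follows. This is an illustrative sketch, not the authors' implementation. The abstract does not give the solver or the fusion weights, so two assumptions are made here: each local estimate uses a ridge-regularized, unconstrained closed-form solution of the KMM objective (clipped to `[0, B]` if `B` is given) in place of the constrained quadratic program, and each partition is weighted by its share of the test samples. The names `rbf_kernel`, `kmm_ridge`, `ensemble_kmm`, and the parameters `gamma`, `lam`, `B`, `n_parts`, and `seed` are hypothetical.

```python
import numpy as np


def rbf_kernel(A, B, gamma=0.5):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return np.exp(-gamma * np.maximum(sq, 0.0))


def kmm_ridge(X_tr, X_te, gamma=0.5, lam=1e-3, B=None):
    """Simplified KMM weights for the training points.

    Assumption: solves (K + lam*I) beta = kappa in closed form instead of
    the constrained QP, then optionally clips beta to [0, B].
    """
    n_tr, n_te = len(X_tr), len(X_te)
    K = rbf_kernel(X_tr, X_tr, gamma)
    # kappa_i = (n_tr / n_te) * sum_j k(x_tr_i, x_te_j)
    kappa = (n_tr / n_te) * rbf_kernel(X_tr, X_te, gamma).sum(axis=1)
    beta = np.linalg.solve(K + lam * np.eye(n_tr), kappa)
    if B is not None:
        beta = np.clip(beta, 0.0, B)
    return beta


def ensemble_kmm(X_tr, X_te, n_parts=4, gamma=0.5, lam=1e-3, B=None, seed=0):
    """Split the test set, estimate per partition, fuse with a weighted sum.

    Assumption: fusion weights are proportional to partition size.
    """
    rng = np.random.default_rng(seed)
    parts = np.array_split(rng.permutation(len(X_te)), n_parts)
    beta = np.zeros(len(X_tr))
    for idx in parts:
        weight = len(idx) / len(X_te)
        # Only an n_tr x len(idx) cross-kernel block is needed per partition.
        beta += weight * kmm_ridge(X_tr, X_te[idx], gamma, lam, B)
    return beta
```

Each iteration of the loop is independent of the others, so the local estimates could be computed on separate workers and only the length-`n_tr` weight vectors combined. Note that in this unconstrained simplification with `B=None` the fusion is linear, so the result coincides with the single-pass estimate; any accuracy difference between ensemble and centralized KMM arises only under the constrained formulation analyzed in the paper.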
Keywords
"Kernel","Training","Partitioning algorithms","Estimation","Complexity theory","Density functional theory","Algorithm design and analysis"
Publisher
IEEE
Conference_Titel
Data Mining (ICDM), 2015 IEEE International Conference on
ISSN
1550-4786
Type
conf
DOI
10.1109/ICDM.2015.127
Filename
7373337