DocumentCode :
148978
Title :
Balance learning to rank in big data
Author :
Guanqun Cao ; Ahmad, Ishtiaq ; Honglei Zhang ; Weiyi Xie ; Gabbouj, Moncef
Author_Institution :
Tampere Univ. of Technol., Tampere, Finland
fYear :
2014
fDate :
1-5 Sept. 2014
Firstpage :
1422
Lastpage :
1426
Abstract :
We propose a distributed learning to rank method, and demonstrate its effectiveness in web-scale image retrieval. With the increasing amount of data, it is not applicable to train a centralized ranking model for any large scale learning problems. In distributed learning, the discrepancy between the training subsets and the whole when building the models are non-trivial but overlooked in the previous work. In this paper, we firstly include a cost factor to boosting algorithms to balance the individual models toward the whole data. Then, we propose to decompose the original algorithm to multiple layers, and their aggregation forms a superior ranker which can be easily scaled up to billions of images. The extensive experiments show the proposed method outperforms the straightforward aggregation of boosting algorithms.
Keywords :
Big Data; Internet; image retrieval; learning (artificial intelligence); Web-scale image retrieval; balance learning; big data; boosting algorithm; centralized ranking model; distributed learning; large scale learning problem; Bagging; Big data; Boosting; Data models; Distributed databases; Training; Training data; Big Data; distributed learning; learning to rank;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
Conference_Location :
Lisbon
Type :
conf
Filename :
6952504
Link To Document :
بازگشت