DocumentCode :
3739375
Title :
Connecting Devices to Cookies via Filtering, Feature Engineering, and Boosting
Author :
Michael Sungjun Kim;Jiwei Liu;Xiaozhou Wang;Wei Yang
fYear :
2015
Firstpage :
1690
Lastpage :
1694
Abstract :
We present a supervised machine learning system capable of matching internet devices to web cookies through filtering, feature engineering, binary classification, and post processing. The system builds a reasonably sized training and testing data set through filtering and feature engineering. We build 415 features in total. Some of these features were engineered to be O(n) time, stand alone classifiers for this problem. Other features use various natural language processing (NLP) techniques. Meta features are created by ridge regression and Adaboost. Then binary classification through two different gradient boosting (XGBoost with logarithmic loss) models is performed. A post processing pipeline connects devices and cookies in a way that maximizes F_0.5 score. Our machine learning system obtained a private F_0.5 score of 0.849562 for a final rank of 12th/340 on the ICDM 2015: Drawbridge Cross-Device Connections challenge.
Keywords :
"IP networks","Training","Feature extraction","Pipelines","Boosting","Natural language processing","Conferences"
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2015 IEEE International Conference on
Electronic_ISBN :
2375-9259
Type :
conf
DOI :
10.1109/ICDMW.2015.236
Filename :
7395889
Link To Document :
بازگشت