Title
Connecting Devices to Cookies via Filtering, Feature Engineering, and Boosting.
Abstract
We present a supervised machine learning system capable of matching internet devices to web cookies through filtering, feature engineering, binary classification, and post processing. The system builds a reasonably sized training and testing data set through filtering and feature engineering. We build 415 features in total. Some of these features were engineered to be O(n) time, stand alone classifiers for this problem. Other features use various natural language processing (NLP) techniques. Meta features are created by ridge regression and Adaboost. Then binary classification through two different gradient boosting (XGBoost with logarithmic loss) models is performed. A post processing pipeline connects devices and cookies in a way that maximizes F_0.5 score. Our machine learning system obtained a private F_0.5 score of 0.849562 for a final rank of 12th/340 on the ICDM 2015: Drawbridge Cross-Device Connections challenge.
Year
DOI
Venue
2015
10.1109/ICDMW.2015.236
ICDM Workshops
Field
DocType
Citations 
Data mining,Binary classification,Computer science,Feature engineering,Artificial intelligence,AdaBoost,Pattern recognition,Filter (signal processing),Feature extraction,Boosting (machine learning),Test data,Machine learning,Gradient boosting
Conference
2
PageRank 
References 
Authors
0.37
2
4
Name
Order
Citations
PageRank
Michael Sungjun Kim120.37
Jiwei Liu263.48
Xiaozhou Wang320.70
Wei Yang42015.87