Abstract | ||
---|---|---|
We present a supervised machine learning system capable of matching internet devices to web cookies through filtering, feature engineering, binary classification, and post processing. The system builds a reasonably sized training and testing data set through filtering and feature engineering. We build 415 features in total. Some of these features were engineered to be O(n) time, stand alone classifiers for this problem. Other features use various natural language processing (NLP) techniques. Meta features are created by ridge regression and Adaboost. Then binary classification through two different gradient boosting (XGBoost with logarithmic loss) models is performed. A post processing pipeline connects devices and cookies in a way that maximizes F_0.5 score. Our machine learning system obtained a private F_0.5 score of 0.849562 for a final rank of 12th/340 on the ICDM 2015: Drawbridge Cross-Device Connections challenge. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1109/ICDMW.2015.236 | ICDM Workshops |
Field | DocType | Citations |
Data mining,Binary classification,Computer science,Feature engineering,Artificial intelligence,AdaBoost,Pattern recognition,Filter (signal processing),Feature extraction,Boosting (machine learning),Test data,Machine learning,Gradient boosting | Conference | 2 |
PageRank | References | Authors |
0.37 | 2 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Michael Sungjun Kim | 1 | 2 | 0.37 |
Jiwei Liu | 2 | 6 | 3.48 |
Xiaozhou Wang | 3 | 2 | 0.70 |
Wei Yang | 4 | 20 | 15.87 |