Identifying Implicit Enterprise Users from the Imbalanced Social Data. - Citegraph

Paper Info

Title
Identifying Implicit Enterprise Users from the Imbalanced Social Data.

Abstract
Identifying the implicit enterprise users in social media enables the improvement of data quality for many applications like user profiling and targeted advertisement, as they register as ordinary users but act like enterprise ones and hence become the noises in the data. The recognition of implicit enterprise users confronts two challenges: 1 it needs to be handled quickly with little cost due to the very nature of preprocessing, and 2 it is necessary to deal with the highly skewed distribution of implicit enterprise users and ordinary users, which is about 1:10 in a social media site Sina Weibo in China. To the best of our knowledge, this problem is so far unexplored. In this paper, we present an efficient class-imbalance learning framework which involves several types of new features from the users' profile. Specifically, a cost sensitive learning strategy is designed to overcome the problem arising from the skewed data, and a set of novel features are extracted from the profile rather than the main contents to greatly reduce the overhead of crawling and processing the microblogs. We conduct extensive experiments on a real data set consisting of 2200 users 2000 ordinary users and 200 implicit enterprise users, respectively in Sina Weibo. The results demonstrate that our method significantly outperforms the baselines by a large margin.

Year	DOI	Venue
2016	10.1007/978-3-319-48743-4_8	WISE
Keywords	Field	DocType
User classification, Implicit enterprise users, Feature extraction, Imbalanced data	Data mining,Social media,Data quality,Crawling,Profiling (computer programming),Computer science,Microblogging,Baseline (configuration management),Feature extraction,Preprocessor,Database	Conference
Volume	ISSN	Citations
10042	0302-9743	0
PageRank	References	Authors
0.34	10	4

Authors (4 rows)

Cited by (0 rows)

References (10 rows)

Name	Order	Citations	PageRank
Zhenni You	1	2	2.05
Tieyun Qian	2	177	28.81
Baochao Zhang	3	0	0.68
Shi Ying	4	334	31.11

1