Title
Inferring cellular user demographic information using homophily on call graphs
Abstract
Homophily refers to the phenomenon where people who are socially-connected share many characteristics including demographic and behavioral properties. The goal of this paper is to see whether homophily exists in call networks and if so, to what degree we can infer a cellphone user's demographic properties by knowing the demographic information of the people that s/he talks to. We focus on three types of demographic information: a) home location, b) age group, and c) income level. The novelty is two-folds. First, we use both communication metrics and structural properties of call graphs to identify those “important” friends for each user with whom (s)he is most likely to be in homophily. Second, we assess the importance of different time slices such as weekdays, or nights and weekends for capturing different user relationships. We conduct our study on a real data trace with 20M subscribers during one month from a nationwide cellular carrier. Our first contribution is that we quantify the extent of homophily on the call graph and identify the correlations between homophily and communication and structural features. As a second contribution, we develop effective methods to infer demographic information for a cellular user using linear regression to select the most homophily-like friend of her/him. We find that we can predict home location within 20km radius with 80% accuracy, and age group and income level with 78% and 72% accuracy, respectively.
Year
DOI
Venue
2013
10.1109/INFCOMW.2013.6562897
INFOCOM Workshops
Keywords
Field
DocType
cellular radio,demography,graph theory,regression analysis,age group,behavioral properties,call graph,call network,cellphone user demographic properties,cellular user demographic information,communication metrics,home location,homophily-like friend,income level,linear regression,nationwide cellular carrier,socially-connected people,structural feature,structural property,user relationship
Graph theory,Mobile computing,Graph,Internet privacy,Cellular radio,Information retrieval,Homophily,Computer science,Regression analysis,Computer network,Call graph,Novelty
Conference
ISSN
ISBN
Citations 
2159-4228
978-1-4799-0055-8
5
PageRank 
References 
Authors
0.43
9
3
Name
Order
Citations
PageRank
Yi Wang11149.03
Hui Zang2105277.25
Michalis Faloutsos35288586.88