Title
A Tale of Two (Similar) Cities - Inferring City Similarity through Geo-spatial Query Log Analysis.
Abstract
Understanding the backgrounds and interests of the people who are consuming a piece of content, such as a news story, video, or music, is vital for the content producer as well as the advertisers who rely on the content to provide a channel on which to advertise. We extend traditional search-engine query log analysis, which has primarily concentrated on analyzing either single or small groups of queries or users, to examining the complete query stream of very large groups of users: the inhabitants of 13,377 cities across the United States. Query logs can be a good representation of the interests of the city's inhabitants and a useful characterization of the city itself. Further, we demonstrate how query logs can be effectively used to gather city-level statistics sufficient for providing insights into the similarities and differences between cities. Cities that are found to be similar through the use of query analysis correspond well to the similar cities as determined through other large-scale and time-consuming direct measurement studies, such as those undertaken by the Census Bureau.
Year
2011
Venue
KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL
Keywords
Data mining, Spatial data mining, Log analysis, Large scale similarity measurement, Search engine queries, Query logs, Census data
Field
Data science, Population, Data mining, Computer science, Communication channel, Advertising campaign, Spatial query, Ethnic group, American Community Survey, Census, The Internet
DocType
Conference
Citations
2
PageRank
0.48
References
0
Authors
5
Name
Order
Citations
PageRank
Rohan Seth12087.69
Michele Covell270678.42
Deepak Ravichandran3116784.57
D. Sivakumar43515389.02
Shumeet Baluja54053728.83