Title
Characterizing JSON Traffic Patterns on a CDN
Abstract
Content delivery networks serve a major fraction of the Internet traffic, and their geographically deployed infrastructure makes them a good vantage point to observe traffic access patterns. We perform a large-scale investigation to characterize Web traffic patterns observed from a major CDN infrastructure. Specifically, we discover that responses with application/json content-type form a growing majority of all HTTP requests. As a result, we seek to understand what types of devices and applications are requesting JSON objects and explore opportunities to optimize CDN delivery of JSON traffic. Our study shows that mobile applications account for at least 52% of JSON traffic on the CDN and embedded devices account for another 12% of all JSON traffic. We also find that more than 55% of JSON traffic on the CDN is uncacheable, showing that a large portion of JSON traffic on the CDN is dynamic. By further looking at patterns of periodicity in requests, we find that 6.3% of JSON traffic is periodically requested and reflects the use of (partially) autonomous software systems, IoT devices, and other kinds of machine-to-machine communication. Finally, we explore dependencies in JSON traffic through the lens of ngram models and find that these models can capture patterns between subsequent requests. We can potentially leverage this to prefetch requests, improving the cache hit ratio.
Year
DOI
Venue
2019
10.1145/3355369.3355594
Proceedings of the Internet Measurement Conference
Keywords
Field
DocType
Content Delivery Networks (CDNs), JSON, Web
Computer science,Computer network,JSON
Conference
ISBN
Citations 
PageRank 
978-1-4503-6948-0
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Santiago Vargas141.40
Utkarsh Goel200.34
Moritz Steiner371544.39
Aruna Balasubramanian43034168.83