Title
Social Trove: A Self-Summarizing Storage Service for Social Sensing
Abstract
The increasing availability of smartphones, cameras, and wearables with instant data sharing capabilities, and the exploitation of social networks for information broadcast, herald a future of real-time information overload. With the growing excess of worldwide streaming data, such as images, geotags, text annotations, and sensory measurements, data summarization will become an increasingly common service. The objective of such a service will be to obtain a representative sampling of large data streams at a configurable granularity, in real time, for subsequent consumption by a range of data-centric applications. This paper describes a general-purpose self-summarizing storage service, called Social Trove, for social sensing applications. The service summarizes data streams from human sources, or sensors in their possession, by hierarchically clustering received information in accordance with an application-specific distance metric. It then serves a sampling of produced clusters at a configurable granularity in response to application queries. While Social Trove is a general service, we illustrate its functionality and evaluate it in the specific context of workloads collected from Twitter. Results show that Social Trove supports a high query throughput, while maintaining a low access latency to the produced real-time application-specific data summaries. As a specific application case study, we implement a fact-finding service on top of Social Trove.
Year: 2015
DOI: 10.1109/ICAC.2015.47
Venue: International Conference on Autonomic Computing
Keywords: Summarization, Social Sensing, Clustering, Storage
Field: Query throughput, Data modeling, Automatic summarization, World Wide Web, Data stream mining, Information overload, Social network, Computer science, Data sharing, Geotagging, Distributed computing
DocType: Conference
Citations: 8
PageRank: 0.47
References: 27
Authors (11)
Name                         Order  Citations  PageRank
Md Tanvir Amin               1      123        7.97
Shen Li                      2      166        12.23
Muntasir Raihan Rahman       3      455        21.21
Panindra Tumkur Seetharamu   4      8          0.47
ShiGuang Wang                5      398        20.97
Tarek Abdelzaher             6      10179      729.36
Indranil Gupta               7      1837       143.92
Mudhakar Srivatsa            8      1084       77.97
Raghu K. Ganti               9      1202       73.54
Reaz Ahmed                   10     481        29.69
Hieu Khac Le                 11     291        16.03