Title
Tweets as data: demonstration of TweeQL and Twitinfo
Abstract
Microblogs such as Twitter are a tremendous repository of user-generated content. Increasingly, we see tweets used as data sources for novel applications such as disaster mapping, brand sentiment analysis, and real-time visualizations. In each scenario, the workflow for processing tweets is ad-hoc, and a lot of unnecessary work goes into repeating common data processing patterns. We introduce TweeQL, a stream query processing language that presents a SQL-like query interface for unstructured tweets to generate structured data for downstream applications. We have built several tools on top of TweeQL, most notably TwitInfo, an event timeline generation and exploration interface that summarizes events as they are discussed on Twitter. Our demonstration will allow the audience to interact with both TweeQL and TwitInfo to convey the value of data embedded in tweets.
Year
DOI
Venue
2011
10.1145/1989323.1989470
SIGMOD Conference
Keywords
Field
DocType
sql-like query interface,downstream application,brand sentiment analysis,exploration interface,stream query processing language,disaster mapping,novel application,event timeline generation,common data,data source,stream processing,data processing,sentiment analysis,visualization,structured data,user generated content
Data mining,Data processing,Computer science,Processing,Timeline,Workflow,World Wide Web,Social media,Sentiment analysis,Microblogging,Stream processing,Data model,Database
Conference
Citations 
PageRank 
References 
18
0.92
8
Authors
6
Name
Order
Citations
PageRank
Adam Marcus1120362.74
Michael S. Bernstein28604393.80
Osama Badar333112.28
David R. Karger4193672233.64
Samuel Madden5161011176.38
Robert C. Miller64412326.00