Title
Twarql: tapping into the wisdom of the crowd
Abstract
Twarql is an infrastructure that translates microblog posts from Twitter into Linked Open Data in real time. The approach employed in Twarql can be summarized as follows: (1) extract content (e.g. entity mentions, hashtags and URLs) from microposts streamed from Twitter; (2) encode content in RDF using shared and well-known vocabularies (FOAF, SIOC, MOAT, etc.); (3) enable structured querying of microposts with SPARQL; (4) enable subscription to a stream of microposts that match a given query; and (5) enable scalable real-time delivery of streaming annotated data using sparqlPuSH. In this paper we use a brand tracking scenario to demonstrate how Twarql gives those interested in collectively analyzing microblog data for sensemaking a flexible way to handle information overload. The dataset produced is shared as Linked Data. Twarql is available as open source and can be easily deployed or extended for monitoring Twitter data in various contexts such as brand tracking, disaster relief management, and stock exchange monitoring.
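Steps (1) and (2) of the approach can be illustrated with a minimal sketch. It is not Twarql's actual implementation: the regular expressions, the function names, the example.org tag namespace, and the choice of sioc:topic and sioc:links_to to represent hashtags and URLs are all assumptions made for illustration. The sketch extracts hashtags and URLs from a micropost and serializes them as N-Triples using the SIOC vocabulary.

```python
import re

# Vocabulary IRIs. Which SIOC terms Twarql itself uses is an assumption here.
SIOC = "http://rdfs.org/sioc/ns#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Simplified patterns (assumptions): real extraction would be more robust.
HASHTAG_RE = re.compile(r"#(\w+)")
URL_RE = re.compile(r'https?://[^\s<>"{}|\\^`]+')


def extract(text):
    """Step 1: pull hashtags and URLs out of the raw micropost text."""
    urls = URL_RE.findall(text)
    # Strip URLs first so a fragment such as page#section is not read as a hashtag.
    hashtags = HASHTAG_RE.findall(URL_RE.sub(" ", text))
    return hashtags, urls


def escape_literal(text):
    """Escape characters that may not appear raw inside an N-Triples literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_ntriples(post_uri, text, tag_base="http://example.org/tag/"):
    """Step 2: encode the extracted content as N-Triples lines.

    tag_base is a hypothetical namespace used to mint one IRI per hashtag.
    """
    hashtags, urls = extract(text)
    triples = [
        f"<{post_uri}> <{RDF_TYPE}> <{SIOC}Post> .",
        f'<{post_uri}> <{SIOC}content> "{escape_literal(text)}" .',
    ]
    triples += [
        f"<{post_uri}> <{SIOC}topic> <{tag_base}{tag.lower()}> ." for tag in hashtags
    ]
    triples += [f"<{post_uri}> <{SIOC}links_to> <{url}> ." for url in urls]
    return triples
```

Calling to_ntriples("http://example.org/post/1", "Loving the new #iPad http://example.com/review") yields one triple typing the post, one carrying its text, one per hashtag, and one per URL. Loaded into an RDF store, such triples are what a SPARQL query in step (3) would match against.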
Year: 2010
DOI: 10.1145/1839707.1839762
Venue: I-SEMANTICS
Keywords: microblog data, scalable real-time delivery, twitter data, encode content, annotated data, microblog post, linked data, brand tracking scenario, linked open data, brand tracking, management, design, social media, stock exchange, information overload, rdf, real time, sparql
Field: Data mining, Information overload, World Wide Web, FOAF, Social media, Information retrieval, Computer science, Wisdom of the crowd, Microblogging, Linked data, SPARQL, RDF
DocType: Conference
Citations: 19
PageRank: 1.10
References: 7
Authors: 3
Name                Order  Citations  PageRank
Pablo N. Mendes     1      1070       51.09
Alexandre Passant   2      1006       69.16
Pavan Kapanipathi   3      125        10.66