Title
High performance mining of social media data
Abstract
News and disaster-related applications may benefit from real-time processing of large-volume, up-to-the-minute social media data. Our geo-mining algorithm finds local place references (of street, building, toponym and place abbreviation) in Twitter messages so that those messages can be put on a map. The ability to map is significant because it can present a timely overview of a situation. Our current research demonstrates that our prototype desktop algorithm that geo-locates Twitter messages with an F statistic of .90 accuracy for location identification will be viable on a large scale and in real time, for actual applications. We present methods of managing external resources, threading the algorithm and balancing the data load, that allow us to scale up the application without significantly re-writing the code.
Year
DOI
Venue
2012
10.1145/2335755.2335818
Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment: Bridging from the eXtreme to the campus and beyond
Keywords
DocType
Citations 
high performance mining,data load,prototype desktop algorithm,geo-mining algorithm,present method,local place reference,twitter message,geo-locates twitter message,up-to-the-minute social media data,large scale,place abbreviation,parallel computer,real time processing,multicore,real time,social media,geo location
Conference
1
PageRank 
References 
Authors
0.36
5
2
Name
Order
Citations
PageRank
Judith Gelernter11019.77
Gang Wu24213.30