Abstract | ||
---|---|---|
Microblogging sites such as Twitter and Weibo are increasingly being used to enhance situational awareness during various natural and man-made disaster events such as floods, earthquakes, and bomb blasts. During any such event, thousands of microblogs (tweets) are posted in short intervals of time. Typically, only a small fraction of these tweets contribute to situational awareness, while the majority merely reflect the sentiment or opinion of people. Real-time extraction of tweets that contribute to situational awareness is especially important for relief operations when time is critical. However, automatically differentiating such tweets from those that reflect opinion / sentiment is a non-trivial challenge, mainly because of the very small size of tweets and the informal way in which tweets are written (frequent use of emoticons, abbreviations, and so on). This study applies Natural Language Processing (NLP) techniques to address this challenge. We extract low-level syntactic features from the text of tweets, such as the presence of specific types of words and parts-of-speech, to develop a classifier to distinguish between tweets which contribute to situational awareness and tweets which do not. Experiments over tweets related to four diverse disaster events show that the proposed features identify situational awareness tweets with significantly higher accuracy than classifiers based on standard bag-of-words models. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1109/COMSNETS.2015.7098720 | Communication Systems and Networks |
Keywords | Field | DocType |
emergency management,natural language processing,pattern classification,social networking (online),text analysis,NLP techniques,Twitter,Weibo,bomb blasts,classifier accuracy,earthquakes,floods,low-level syntactic feature extraction,man-made disaster events,microblogging sites,natural disaster,natural language processing,parts-of-speech,real-time tweet extraction,relief operations,situational awareness enhancement,situational awareness extraction,tweet text | Speech enhancement,Pragmatics,Computer science,Situation awareness,Computer network,Natural language processing,Artificial intelligence,Classifier (linguistics),Syntax,Social media,Microblogging,Feature extraction,Speech recognition | Conference |
ISSN | Citations | PageRank |
2155-2487 | 9 | 0.50 |
References | Authors | |
12 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Anirban Sen | 1 | 18 | 2.66 |
Koustav Rudra | 2 | 78 | 9.08 |
Saptarshi Ghosh | 3 | 594 | 53.82 |