Title
Sentiment-Based Identification of Radical Authors (SIRA).
Abstract
As violent extremists continue to surface in online discussion forums, counter-extremism agencies search for new and innovative ways of uncovering their digital indicators. Using a sample of approximately 1 million posts and 26,000 unique users across four Islamic-based discussion forums, this study proposed a method of identifying the most radical users on the Dark Web. Several characteristics of each user's postings were analyzed using Parts of Speech (POS) tagging, a custom openNLP based tagger, sentiment analysis, and a novel algorithm called \"Sentiment-based Identification of Radical Authors\" (SIRA). POS tagging was used to develop a list of the 400 most frequently cited nouns across the discussion forums. With this list, sentiment analysis provided the context surrounding users' posts, and each post was assigned a polarity value. Radical scores were calculated using SIRA, which is an algorithm that accounts for a user's percentile score for average sentiment score, volume of negative posts, severity of negative posts, and duration of negative posts. Results did not suggest that a simple typology or typologies best described the most radical users in the Dark Web, however, the findings indicated that SIRA was flexible enough to evaluate several combinations of online activity that could identify the most radical users in the discussion forums. In addition, SIRA identified the same user across two separate discussion forums as the most radical, thus providing validation for the algorithm. This particular user was linked to an extremist website that supported terrorists. Lastly, the results revealed that the Gawaher and Islamic Awakening web forums hosted the highest volume of most radical users in the sample.
Year
DOI
Venue
2015
10.1109/ICDMW.2015.64
ICDM Workshops
Keywords
Field
DocType
Sentiment Analysis, Extremism, Discussion Forums
Data mining,World Wide Web,Computer science,Sentiment analysis,Noun,Typology,Part of speech,Percentile rank,Deep Web,Online discussion
Conference
Citations 
PageRank 
References 
2
0.41
9
Authors
4
Name
Order
Citations
PageRank
Ryan Scrivens120.74
Garth Davies261.57
Richard Frank3205.61
Joseph Mei470.90