Title
Extremist Propaganda Tweet Classification with Deep Learning in Realistic Scenarios.
Abstract
In this work, we tackled the problem of the automatic classification of the extremist propaganda on Twitter, focusing on the Islamic State of Iraq and al-Sham (ISIS). We built and published several datasets, obtained by mixing 15,684 ISIS propaganda tweets with a variable number of neutral tweets, related to ISIS, and random ones, accounting for imbalances up to 1%. We considered three state-of-the-art, deep learning techniques, representative of the main current approaches to text classification, and two strong linear machine learning baselines. We compared their performance when varying the composition of the training and test sets, in order to explore different training strategies, and to evaluate the results when approaching realistic conditions. We demonstrated that a Recurrent-Convolutional Neural Network, based on pre-trained word embeddings, can reach an excellent F1 score of 0.9 on the most challenging test condition (1%-imbalance).
Year
DOI
Venue
2019
10.1145/3292522.3326050
WebSci '19: 11th ACM Conference on Web Science Boston Massachusetts USA June, 2019
Field
DocType
ISBN
Data science,Astronomy,Computer science,Artificial intelligence,Deep learning
Conference
978-1-4503-6202-3
Citations 
PageRank 
References 
1
0.36
0
Authors
4
Name
Order
Citations
PageRank
Leonardo Nizzoli121.38
Marco Avvenuti226724.14
Cresci, S.323521.79
Maurizio Tesconi428132.06