Detection of Possible Illicit Messages Using Natural Language Processing and Computer Vision on Twitter and Linked Websites. - Citegraph

Paper Info

Title
Detection of Possible Illicit Messages Using Natural Language Processing and Computer Vision on Twitter and Linked Websites.

Abstract
Human trafficking is a global problem that strips away the dignity of millions of victims. Currently, social networks are used to spread this crime through the online environment by using covert messages that serve to promote these illegal services. In this context, since law enforcement resources are limited, it is vital to automatically detect messages that may be related to this crime and could also serve as clues. In this paper, we identify Twitter messages that could promote these illegal services and exploit minors by using natural language processing. The images and the URLs found in suspicious messages were processed and classified by gender and age group, so it is possible to detect photographs of people under 14 years of age. The method that we used is as follows. First, tweets with hashtags related to minors are mined in real-time. These tweets are preprocessed to eliminate noise and misspelled words, and then the tweets are classified as suspicious or not. Moreover, geometric features of the face and torso are selected using Haar models. By applying Support Vector Machine (SVM) and Convolutional Neural Network (CNN), we are able to recognize gender and age group, taking into account torso information and its proportional relationship with the head, or even when the face details are blurred. As a result, using the SVM model with only torso features has a higher performance than CNN.

Year	DOI	Venue
2020	10.1109/ACCESS.2020.2976530	IEEE ACCESS
Keywords	DocType	Volume
Twitter,Natural language processing,Tagging,Support vector machines,Torso,Computer vision,CNN,features detection,image classification,natural language processing,SVM	Journal	8
ISSN	Citations	PageRank
2169-3536	0	0.34
References	Authors
0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Sergio L. Granizo	1	0	0.34
Ángel Leonardo Valdivieso Caraguay	2	76	6.09
Lorena Isabel Barona López	3	0	0.34
Myriam Hernández Alvarez	4	0	0.34

1