Abstract | ||
---|---|---|
Detrimental online behavior such as harassment and cyberbullying is becoming a serious, large-scale problem damaging people's lives. This phenomenon is creating a need for automated, data-driven techniques for analyzing and detecting such behaviors. We propose a machine learning method for simultaneously inferring user roles in harassment-based bullying and new vocabulary indicators of bullying. The learning algorithm considers social structure and infers which users tend to bully and which tend to be victimized. To address the elusive nature of cyberbullying, the learning algorithm only requires weak supervision. Experts provide a small seed vocabulary of bullying indicators, and the algorithm uses a large, unlabeled corpus of social media interactions to extract bullying roles of users and additional vocabulary indicators of bullying. The model estimates whether each social interaction is bullying based on who participates and based on what language is used, and it tries to maximize the agreement between these estimates, i.e., participant-vocabulary consistency (PVC). We evaluate PVC on three social media data sets, demonstrating quantitatively and qualitatively its effectiveness in cyberbullying detection.
|
Year | DOI | Venue |
---|---|---|
2017 | 10.1145/3110025.3110049 | ASONAM '17: Advances in Social Networks Analysis and Mining 2017
Sydney
Australia
July, 2017 |
Keywords | Field | DocType |
harassment-based bullying,social structure,seed vocabulary,bullying indicators,social media interactions,bullying roles,social interaction,participant-vocabulary consistency,social media data sets,cyberbullying detection,weakly supervised machine learning,detrimental online behavior,automated data-driven techniques | Social relation,Data mining,Social media,Computer science,Artificial intelligence,Phenomenon,Sequential model,Vocabulary,Machine learning,Harassment | Conference |
ISSN | ISBN | Citations |
2473-9928 | 978-1-4503-4993-2 | 10 |
PageRank | References | Authors |
0.57 | 22 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Elaheh Raisi | 1 | 15 | 3.73 |
Bert Huang | 2 | 563 | 39.09 |