Title
Detecting Trending Terms in Cybersecurity Forum Discussions.
Abstract
We present a lightweight method for identifying currently trending terms in relation to a known prior of terms, using a weighted log-odds ratio with an informative prior. We apply this method to a dataset of posts from an English-language underground hacking forum, spanning over ten years of activity, with posts containing misspellings, orthographic variation, acronyms, and slang. Our statistical approach supports analysis of linguistic change and discussion topics over time, without a requirement to train a topic model for each time interval for analysis. We evaluate the approach by comparing the results to TF-IDF using the discounted cumulative gain metric with human annotations, finding our method outperforms TF-IDF on information retrieval.
Year
DOI
Venue
2020
10.18653/v1/2020.wnut-1.15
W-NUT@EMNLP
DocType
Citations 
PageRank 
Conference
1
0.35
References 
Authors
0
5
Name
Order
Citations
PageRank
Jack Hughes161.78
Seth Aycock210.35
Andrew Caines346.13
Paula Buttery4379.83
alice hutchings5145.81