Abstract | ||
---|---|---|
Fake news is news-like content that has been produced without following journalism principles. Fake news try to mimic the look and feel of real news to intentionally disinform the reader. This phenomenon can have a strong influence on society, thus being potentially a severe problem. To address this phenomenon, systems to detect fake news have been developed, but most of them build upon fact-checking approaches, which are unfit to detect misinformation when a news piece, rather than completely false, is distorted, exaggerated, or even decontextualized. We aim to detect Portuguese fake news by following a forensic linguistics approach. Contrary to previous approaches, we build upon methods of linguistic and stylistic analysis that have been tried and tested in forensic linguists. After collecting corpora from multiple fake news outlets and from a genuine news source, we formulate the task as a text classification problem and demonstrate the effectiveness of the proposed features when training different classifiers for telling fake from genuine news. Furthermore, we perform an ablation study with subsets of features and find that the proposed feature sets are complementary. The highest results reported are very promising, achieving 97% of accuracy and a macro F1-score of 91%. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1007/978-3-030-86230-5_62 | PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2021) |
Keywords | DocType | Volume |
Fake news detection, Forensic linguistics, Natural language processing, Text classification, Disinformation, Misinformation | Conference | 12981 |
ISSN | Citations | PageRank |
0302-9743 | 0 | 0.34 |
References | Authors | |
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ricardo Moura | 1 | 0 | 0.34 |
Rui Sousa-Silva | 2 | 0 | 0.68 |
Henrique Lopes Cardoso | 3 | 223 | 34.02 |