Title
Automatically assessing the quality of Wikipedia contents.
Abstract
With the development of Web 2.0 technologies, people have gone from being mere content users to content generators. In this context, the evaluation of the quality of (potential) information available online has become a crucial issue. Nowadays, one of the biggest online resources that users rely on as a knowledge base is Wikipedia. The collaborative aspect at the basis of Wikipedia can let to the possible creation of low-quality articles or even misinformation if the process of monitoring the generation and the revision of articles is not performed in a precise and timely way. For this reason, in this paper, the problem of automatically evaluating the quality of Wikipedia contents is considered, by proposing a supervised approach based on Machine Learning to perform the classification of articles on qualitative bases. With respect to prior literature, a wider set of features connected to Wikipedia articles has been taken into account, as well as previously unconsidered aspects connected to the generation of a labeled dataset to train the model, and the use of Gradient Boosting, which produced encouraging results.
Year
DOI
Venue
2019
10.1145/3297280.3297357
SAC
Keywords
Field
DocType
Wikipedia, information quality, machine learning, social media
Social media,Information retrieval,Computer science,Misinformation,Software,Knowledge base,Information quality,Gradient boosting
Conference
ISBN
Citations 
PageRank 
978-1-4503-5933-7
1
0.39
References 
Authors
0
2
Name
Order
Citations
PageRank
Elias Bassani112.08
Marco Viviani214318.95