Automatically assessing the quality of Wikipedia contents. - Citegraph

Paper Info

Title
Automatically assessing the quality of Wikipedia contents.

Abstract
With the development of Web 2.0 technologies, people have gone from being mere content users to content generators. In this context, evaluating the quality of (potential) information available online has become a crucial issue. Nowadays, one of the biggest online resources that users rely on as a knowledge base is Wikipedia. The collaborative nature of Wikipedia can lead to the creation of low-quality articles or even misinformation if the generation and revision of articles are not monitored in a precise and timely way. For this reason, this paper considers the problem of automatically evaluating the quality of Wikipedia contents and proposes a supervised Machine Learning approach that classifies articles by quality. With respect to prior literature, a wider set of features connected to Wikipedia articles has been taken into account, along with previously unconsidered aspects of generating a labeled dataset to train the model, and the use of Gradient Boosting, which produced encouraging results.

Year	2019
DOI	10.1145/3297280.3297357
Venue	SAC
Keywords	Wikipedia, information quality, machine learning, social media
Field	Social media, Information retrieval, Computer science, Misinformation, Software, Knowledge base, Information quality, Gradient boosting
DocType	Conference
ISBN	978-1-4503-5933-7
Citations	1
PageRank	0.39
References	0
Authors	2

Authors (2 rows)

Name	Order	Citations	PageRank
Elias Bassani	1	1	2.08
Marco Viviani	2	143	18.95
