Assessing ranking metrics in top-N recommendation. - Citegraph

Paper Info

Title
Assessing ranking metrics in top-N recommendation.

Abstract
The evaluation of recommender systems is an area with unsolved questions at several levels. Choosing the appropriate evaluation metric is one of such important issues. Ranking accuracy is generally identified as a prerequisite for recommendation to be useful. Ranking metrics have been adapted for this purpose from the Information Retrieval field into the recommendation task. In this article, we undertake a principled analysis of the robustness and the discriminative power of different ranking metrics for the offline evaluation of recommender systems, drawing from previous studies in the information retrieval field. We measure the robustness to different sources of incompleteness that arise from the sparsity and popularity biases in recommendation. Among other results, we find that precision provides high robustness while normalized discounted cumulative gain offers the best discriminative power. In dealing with cold users, we also find that the geometric mean is more robust than the arithmetic mean as aggregation function over users.

Year	DOI	Venue
2020	10.1007/s10791-020-09377-x	Information Retrieval Journal
Keywords	DocType	Volume
Recommender systems, Top-N recommendation, Evaluation, Ranking metrics, Robustness, Discriminative power	Journal	23
Issue	ISSN	Citations
4	1386-4564	1
PageRank	References	Authors
0.35	0	4

Authors (4 rows)

Cited by (1 rows)

References (0 rows)

Name	Order	Citations	PageRank
Daniel Valcarce	1	54	8.51
Alejandro BellogíN	2	638	46.83
Javier Parapar	3	2	1.37
Pablo Castells	4	1845	108.79

1