Controversy detection in Wikipedia using semantic dissimilarity. - Citegraph

Paper Info

Title
Controversy detection in Wikipedia using semantic dissimilarity.

Abstract
The advent of search engines and wikis has made access to information easy and almost free. Wikipedia is the efficacious outcome of an enormous collaboration, and its peer review-like methods of creation, maintenance, and evolution of contents, ensure high quality and reliability. However, the “anyone-can-edit” policy of Wikipedia has created many problems such as trolling, vandalism, controversies, and doubts about the content and reliability of the information provided due to non-expert involvement. People have tried to identify and rank controversies in Wikipedia articles through various techniques that use quantitative data, ignoring the semantic significance of conflicts among authors. In this paper, we have addressed the problem of identifying controversy using natural language processing techniques for the first time. The proposed method spots the impact on existing meanings of the text due to new editing processes along with their relationship to the topic of the article. The experimental results for precision (0.901), recall (0.901), accuracy (0.908), and F-measure (0.901) demonstrate the effectiveness of the proposed method. The technique is deemed useful for automatic identification of conflicts newly introduced into existing article contents, and could prove helpful in making decisions for inclusion or exclusion of controversies under the same topic.

Year	DOI	Venue
2017	10.1016/j.ins.2017.08.037	Information Sciences
Keywords	Field	DocType
Wikipedia,Controversy,Semantic dissimilarity,Sentence similarity,Natural language processing,Edit similarity	Search engine,Information retrieval,Computer science,Artificial intelligence,Access to information,Recall,Machine learning	Journal
Volume	ISSN	Citations
418	0020-0255	0
PageRank	References	Authors
0.34	32	5

Authors (5 rows)

Cited by (0 rows)

References (32 rows)

Name	Order	Citations	PageRank
M. Zeeshan Jhandir	1	0	0.34
Tenvir Ali	2	1	1.02
Byung-Won On	3	329	28.76
Ingyu Lee	4	52	8.90
Gyu Sang Choi	5	121	20.20

1