Title
Controversy detection in Wikipedia using semantic dissimilarity.
Abstract
The advent of search engines and wikis has made access to information easy and almost free. Wikipedia is the efficacious outcome of an enormous collaboration, and its peer review-like methods of creation, maintenance, and evolution of contents, ensure high quality and reliability. However, the “anyone-can-edit” policy of Wikipedia has created many problems such as trolling, vandalism, controversies, and doubts about the content and reliability of the information provided due to non-expert involvement. People have tried to identify and rank controversies in Wikipedia articles through various techniques that use quantitative data, ignoring the semantic significance of conflicts among authors. In this paper, we have addressed the problem of identifying controversy using natural language processing techniques for the first time. The proposed method spots the impact on existing meanings of the text due to new editing processes along with their relationship to the topic of the article. The experimental results for precision (0.901), recall (0.901), accuracy (0.908), and F-measure (0.901) demonstrate the effectiveness of the proposed method. The technique is deemed useful for automatic identification of conflicts newly introduced into existing article contents, and could prove helpful in making decisions for inclusion or exclusion of controversies under the same topic.
Year
DOI
Venue
2017
10.1016/j.ins.2017.08.037
Information Sciences
Keywords
Field
DocType
Wikipedia,Controversy,Semantic dissimilarity,Sentence similarity,Natural language processing,Edit similarity
Search engine,Information retrieval,Computer science,Artificial intelligence,Access to information,Recall,Machine learning
Journal
Volume
ISSN
Citations 
418
0020-0255
0
PageRank 
References 
Authors
0.34
32
5
Name
Order
Citations
PageRank
M. Zeeshan Jhandir100.34
Tenvir Ali211.02
Byung-Won On332928.76
Ingyu Lee4528.90
Gyu Sang Choi512120.20