Title
IdentityRank: Named entity disambiguation in the news domain
Abstract
News companies produce news items that describe events that happen in the world. These news items usually contain mentions of persons, organizations, locations and other types of named entities that are involved in the events being described. These named entities may have an ambiguous meaning, which impacts the performance of free-text information retrieval systems. This paper describes the IdentityRank algorithm, designed to address the problem of named entity disambiguation in news items. It has been developed as part of the EU-funded project News Engine Web Services (NEWS) and is specifically designed to operate within the editorial environment of a news company. The algorithm was implemented and evaluated using several corpora of actual news items, achieving an average accuracy of around 96%.
Year: 2012
DOI: 10.1016/j.eswa.2012.02.084
Venue: Expert Syst. Appl.
Keywords: news company, identityrank algorithm, news item, eu-funded project, average accuracy, news domain, news engine web services, entity disambiguation, ambiguous meaning, actual news item, editorial environment, named entity disambiguation, natural language processing
Field: Entity linking, Data mining, World Wide Web, Information retrieval, Semantic annotation, Computer science, Web service
DocType: Journal
Volume: 39
Issue: 10
ISSN: 0957-4174
Citations: 8
PageRank: 0.46
References: 37
Authors: 4
Name, Order, Citations, PageRank
Norberto Fernández, 1, 14, 1.61
Jesús Arias Fisteus, 2, 41, 3.11
Luis Sánchez Fernández, 3, 173, 19.82
Gonzalo Lopez, 4, 102, 7.61