Scalable Disambiguation System Capturing Individualities of Mentions. - Citegraph

Paper Info

Title
Scalable Disambiguation System Capturing Individualities of Mentions.

Abstract
Entity disambiguation, or mapping a phrase to its canonical representation in a knowledge base, is a fundamental step in many natural language processing applications. Existing techniques based on global ranking models fail to capture the individual peculiarities of the words and hence, struggle to meet the accuracy-time requirements of many real-world applications. In this paper, we propose a new system that learns specialized features and models for disambiguating each ambiguous phrase in the English language. We train and validate the hundreds of thousands of learning models for this purpose using a Wikipedia hyperlink dataset with more than 170 million labelled annotations. The computationally intensive training required for this approach can be distributed over a cluster. In addition, our approach supports fast queries, efficient updates and its accuracy compares favorably with respect to other state-of-the-art disambiguation systems.

Year	DOI	Venue
2017	10.1007/978-3-319-59888-8_31	Lecture Notes in Artificial Intelligence
Keywords	Field	DocType
Entity linking,Entity disambiguation,Wikification,Word-sense disambiguation	Entity linking,English language,Ranking,Computer science,Phrase,Canonical form,Artificial intelligence,Hyperlink,Natural language processing,Knowledge base,Scalability	Conference
Volume	ISSN	Citations
10318	0302-9743	0
PageRank	References	Authors
0.34	19	5

Authors (5 rows)

Cited by (0 rows)

References (19 rows)

Name	Order	Citations	PageRank
Tiep Mai	1	1	1.12
Bichen Shi	2	40	5.13
Patrick K. Nicholson	3	88	14.10
Deepak Ajwani	4	188	22.30
Alessandra Sala	5	21	4.29

1