Title | ||
---|---|---|
Supporting Natural Language Processing with Background Knowledge: Coreference Resolution Case |
Abstract | ||
---|---|---|
Systems based on statistical and machine learning methods have been shown to be extremely effective and scalable for the analysis
of large amount of textual data. However, in the recent years, it becomes evident that one of the most important directions
of improvement in natural language processing (NLP) tasks, like word sense disambiguation, coreference resolution, relation
extraction, and other tasks related to knowledge extraction, is by exploiting semantics. While in the past, the unavailability
of rich and complete semantic descriptions constituted a serious limitation of their applicability, nowadays, the Semantic
Web made available a large amount of logically encoded information (e.g. ontologies, RDF(S)-data, linked data, etc.), which
constitutes a valuable source of semantics. However, web semantics cannot be easily plugged into machine learning systems.
Therefore the objective of this paper is to define a reference methodology for combining semantic information available in
the web under the form of logical theories, with statistical methods for NLP. The major problems that we have to solve to
implement our methodology concern (i) the selection of the correct and minimal knowledge among the large amount available
in the web, (ii) the representation of uncertain knowledge, and (iii) the resolution and the encoding of the rules that combine
knowledge retrieved from Semantic Web sources with semantics in the text. In order to evaluate the appropriateness of our
approach, we present an application of the methodology to the problem of intra-document coreference resolution, and we show
by means of some experiments on the standard dataset, how the injection of knowledge leads to the improvement of this task
performance.
|
Year | DOI | Venue |
---|---|---|
2010 | 10.1007/978-3-642-17746-0_6 | International Semantic Web Conference |
Keywords | Field | DocType |
coreference resolution case,natural language processing,uncertain knowledge,minimal knowledge,large amount,knowledge extraction,intra-document coreference resolution,methodology concern,textual data,reference methodology,web semantics,coreference resolution,machine learning,linked data,semantic web,relation extraction | Data mining,Semantic Web Stack,Computer science,Semantic Web,Artificial intelligence,Natural language processing,Social Semantic Web,Semantic Web Rule Language,Semantic computing,Semantic compression,Semantic search,Information retrieval,Semantic analytics,Database | Conference |
Volume | ISSN | ISBN |
6496 | 0302-9743 | 3-642-17745-X |
Citations | PageRank | References |
12 | 0.82 | 26 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Volha Bryl | 1 | 180 | 14.46 |
Claudio Giuliano | 2 | 488 | 33.00 |
Luciano Serafini | 3 | 2230 | 204.36 |
Kateryna Tymoshenko | 4 | 180 | 11.39 |