Title
A Bag-of-entities Approach to Document Focus Time Estimation.
Abstract
Detecting the focus time of a document, defined as the time its content refers to, is an important task in support of temporal information retrieval systems. In this paper we propose a novel approach to focus time estimation based on a bag-of-entities representation. In particular, we are interested in understanding whether, and to what extent, existing open data sources can be leveraged to achieve focus time estimation. We use state-of-the-art Named Entity Extraction tools and exploit links to Wikipedia and DBpedia to derive temporal information relevant to entities, namely years and intervals of years. We then estimate the focus time as the point in time that is most relevant to the entity set associated with a document. Our method does not rely on explicit temporal expressions in the documents, so it is applicable to a general context. We tested our methodology on two datasets of historical events and evaluated it against a state-of-the-art approach, measuring an improvement in average estimation error.
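The estimation step described in the abstract can be illustrated with a minimal sketch. It assumes entities have already been extracted and each one mapped to a list of year intervals. The function name `estimate_focus_year`, the one-vote-per-entity-per-year scoring, the tie-breaking rule, and the sample data are all hypothetical choices made for illustration; the abstract does not specify the paper's actual relevance measure.

```python
from collections import Counter


def estimate_focus_year(entity_intervals):
    """Return the year covered by the most entities (ties: earliest year).

    entity_intervals maps an entity name to a list of (start_year, end_year)
    tuples. This simple voting scheme is an assumption, not the paper's
    scoring function.
    """
    votes = Counter()
    for intervals in entity_intervals.values():
        years = set()
        for start, end in intervals:
            years.update(range(start, end + 1))
        votes.update(years)  # each entity votes at most once per year
    if not votes:
        return None
    best = max(votes.values())
    return min(year for year, count in votes.items() if count == best)


# Hypothetical entity-to-interval data, made up for illustration only.
doc_entities = {
    "Entity A": [(1939, 1945)],
    "Entity B": [(1944, 1944)],
    "Entity C": [(1940, 1955)],
}
print(estimate_focus_year(doc_entities))
```

Because the estimate depends only on the intervals attached to the entities, the sketch needs no explicit temporal expression in the document text, which matches the property the abstract highlights.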
Year: 2017
Venue: CEUR Workshop Proceedings-Series
Keywords: focus time, temporal mining, information retrieval, bag-of-entities, linked data, wikipedia, dbpedia
DocType: Conference
Volume: 1959
ISSN: 1613-0073
Citations: 0
PageRank: 0.34
References: 0
Authors: 2
Name                     Order  Citations  PageRank
Christian Morbidoni      1      289        37.76
Alessandro Cucchiarelli  2      226        36.38