Title
Multi-step classification approaches to cumulative citation recommendation
Abstract
Knowledge bases have become indispensable sources of information. It is therefore critical that they rely on the latest information available and get updated every time new facts surface. Knowledge base acceleration (KBA) systems seek to help humans expand knowledge bases like Wikipedia by automatically recommending edits based on incoming content streams. A core step in this process is that of identifying relevant content, i.e., filtering documents that would imply modifications to the attributes or relations of a given target entity. We propose two multi-step classification approaches for this task that consist of two and three binary classification steps, respectively. Both methods share the same initial component, which is concerned with the identification of entity mentions in documents, while subsequent steps involve identification of documents being relevant and/or central to a given entity. Using the evaluation platform of the TREC 2012 KBA track and a rich feature set developed for this particular task, we show that both approaches deliver state-of-the-art performance.
Year
2013
Venue
OAIR
Keywords
knowledge base acceleration, multi-step classification approach, binary classification step, knowledge base, particular task, relevant content, cumulative citation recommendation, kba track, latest information, incoming content stream, target entity
Field
Data mining, Knowledge base acceleration, Binary classification, Information retrieval, Computer science, Citation, Filter (signal processing), Feature set
DocType
Conference
ISBN
978-2-905450-09-8
Citations
28
PageRank
1.51
References
40
Authors (4)
Name               Order  Citations  PageRank
Krisztian Balog    1      1797       113.68
Heri Ramampiaro    2      154        20.46
Naimdjon Takhirov  3      48         7.79
Kjetil Nørvåg      4      1311       79.26