Abstract | ||
---|---|---|
Knowledge bases have become indispensable sources of information. It is therefore critical that they rely on the latest information available and get updated every time new facts surface. Knowledge base acceleration (KBA) systems seek to help humans expand knowledge bases like Wikipedia by automatically recommending edits based on incoming content streams. A core step in this process is that of identifying relevant content, i.e., filtering documents that would imply modifications to the attributes or relations of a given target entity. We propose two multi-step classification approaches for this task that consist of two and three binary classification steps, respectively. Both methods share the same initial component, which is concerned with the identification of entity mentions in documents, while subsequent steps involve identification of documents being relevant and/or central to a given entity. Using the evaluation platform of the TREC 2012 KBA track and a rich feature set developed for this particular task, we show that both approaches deliver state-of-the-art performance.
|
Year | Venue | Keywords |
---|---|---|
2013 | OAIR | knowledge base acceleration,multi-step classification approach,binary classification step,knowledge base,particular task,relevant content,cumulative citation recommendation,kba track,latest information,incoming content stream,target entity |
Field | DocType | ISBN |
Data mining,Knowledge base acceleration,Binary classification,Information retrieval,Computer science,Citation,Filter (signal processing),Feature set | Conference | 978-2-905450-09-8 |
Citations | PageRank | References |
28 | 1.51 | 40 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Krisztian Balog | 1 | 1797 | 113.68 |
Heri Ramampiaro | 2 | 154 | 20.46 |
Naimdjon Takhirov | 3 | 48 | 7.79 |
Kjetil Nørvåg | 4 | 1311 | 79.26 |