Abstract | ||
---|---|---|
Acquiring high-quality (temporal) facts for knowledge bases is a labor-intensive process. Although there has been recent progress in the area of semi-supervised fact extraction, these approaches still have limitations, including a restricted corpus, a fixed set of relations to be extracted or a lack of assessment capabilities. In this paper we introduce PRAVDA-live, a framework that overcomes these limitations and supports the entire pipeline of interactive knowledge harvesting. To this end, our demo exhibits fact extraction from ad-hoc corpus creation, via relation specification, labeling and assessment all the way to ready-to-use RDF exports. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1145/2396761.2398722 | CIKM |
Keywords | Field | DocType |
restricted corpus,ad-hoc corpus creation,entire pipeline,interactive knowledge harvesting,acquiring high-quality,rdf export,semi-supervised fact extraction,knowledge base,fact extraction,assessment capability,design | Data mining,World Wide Web,Information retrieval,Label propagation,Computer science,Fact extraction,RDF | Conference |
Citations | PageRank | References |
4 | 0.40 | 10 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yafang Wang | 1 | 134 | 13.56 |
Maximilian Dylla | 2 | 109 | 5.93 |
Zhaochun Ren | 3 | 511 | 31.69 |
Marc Spaniol | 4 | 897 | 61.13 |
Gerhard Weikum | 5 | 12710 | 2146.01 |