Title
ClausIE: clause-based open information extraction
Abstract
We propose ClausIE, a novel, clause-based approach to open information extraction, which extracts relations and their arguments from natural language text. ClausIE fundamentally differs from previous approaches in that it separates the detection of ``useful'' pieces of information expressed in a sentence from their representation in terms of extractions. In more detail, ClausIE exploits linguistic knowledge about the grammar of the English language to first detect clauses in an input sentence and to subsequently identify the type of each clause according to the grammatical function of its constituents. Based on this information, ClausIE is able to generate high-precision extractions; the representation of these extractions can be flexibly customized to the underlying application. ClausIE is based on dependency parsing and a small set of domain-independent lexica, operates sentence by sentence without any post-processing, and requires no training data (whether labeled or unlabeled). Our experimental study on various real-world datasets suggests that ClausIE obtains higher recall and higher precision than existing approaches, both on high-quality text as well as on noisy text as found in the web.
Year
DOI
Venue
2013
10.1145/2488388.2488420
WWW
Keywords
Field
DocType
clause-based open information extraction,input sentence,noisy text,high-quality text,dependency parsing,higher precision,higher recall,clause-based approach,english language,natural language text,information extraction,relation extraction
Data mining,Computer science,Noisy text,Dependency grammar,Grammar,Information extraction,Natural language,Artificial intelligence,Natural language processing,Small set,Sentence,Relationship extraction
Conference
ISBN
Citations 
PageRank 
978-1-4503-2035-1
123
3.10
References 
Authors
12
2
Search Limit
100123
Name
Order
Citations
PageRank
Luciano Del Corro11476.91
Rainer Gemulla2108153.54