Unsupervised Discovery of Compound Entities for Relationship Extraction - Citegraph

Paper Info

Title
Unsupervised Discovery of Compound Entities for Relationship Extraction

Abstract
In this paper we investigate unsupervised population of a biomedical ontology via information extraction from biomedical literature. Relationships in text seldom connect simple entities. We therefore focus on identifying compound entities rather than mentions of simple entities. We present a method based on rules over grammatical dependency structures for unsupervised segmentation of sentences into compound entities and relationships. We complement the rule-based approach with a statistical component that prunes structures with low information content, thereby reducing false positives in the prediction of compound entities, their constituents and relationships. The extraction is manually evaluated with respect to the UMLS Semantic Network by analyzing the conformance of the extracted triples with the corresponding UMLS relationship type definitions.

Year	DOI	Venue
2008	10.1007/978-3-540-87696-0_15	EKAW
Keywords	Field	DocType
compound entities,umls semantic network,biomedical ontology,simple entity,compound entity,unsupervised segmentation,biomedical literature,unsupervised population,unsupervised discovery,relationship extraction,low information content,information extraction,corresponding umls relationship type,false positive,information content,semantic network,rule based	Ontology,Data mining,Population,Information retrieval,Segmentation,Computer science,Semantic network,Information extraction,Unified Medical Language System,False positive paradox,Relationship extraction	Conference
Volume	ISSN	Citations
5268	0302-9743	9
PageRank	References	Authors
0.78	18	4

Authors (4 rows)

Cited by (9 rows)

References (18 rows)

Name	Order	Citations	PageRank
Cartic Ramakrishnan	1	655	43.01
Pablo N. Mendes	2	1070	51.09
Shaojun Wang	3	468	38.96
Amit P. Sheth	4	10950	1885.56

1