Abstract | ||
---|---|---|
We describe a methodology for automatically extracting ‘evidence fragments’ from a set of biomedical experimental research articles. These fragments provide the primary description of evidence that is presented in the papers9 figures. They elucidate the goals, methods, results and interpretations of experiments that support the original scientific contributions the study being reported. Within this paper, we describe our methodology and showcase an example data set based on the European Bioinformatics Institute9s INTACT database (http:www.ebi.ac.uk/intact/). Using figure codes as anchors, we linked evidence fragments to INTACT data records as an example of distant supervision so that we could use INTACT9s preexisting, manually-curated structured interaction data to act as a gold standard for machine reading experiments. We report preliminary baseline event extraction measures from this collection based on a publicly available, machine reading system (REACH). We use semantic web standards for our data and provide open access to all source code. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1101/192856 | SemSci@ISWC |
Keywords | Field | DocType |
Machine Reading,Molecular Interactions,Biomedical In-formatics,Discourse Analysis | Molecular interactions,Information retrieval,Computer science,Source code,Semantic Web,Bioinformatics,Data records,Machine reading | Conference |
Citations | PageRank | References |
0 | 0.34 | 9 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Gully A. P. C. Burns | 1 | 172 | 12.17 |
Pradeep Dasigi | 2 | 131 | 12.09 |
Eduard H. Hovy | 3 | 7450 | 663.27 |