Title | ||
---|---|---|
Managing expectations: assessment of chemistry databases generated by automated extraction of chemical structures from patents |
Abstract | ||
---|---|---|
In our comparison of automatically generated vs. manually curated patent chemistry databases, the former successfully provided approximately 60 % of links between chemical structure and patents. It needs to be stressed that only a very limited number of patents and compound-patent pairs were used for our comparison. Nevertheless, our results will hopefully help to manage expectations of users of patent chemistry databases of this type and provide a useful framework for more studies like ours as well as guide future developments of the workflows used for the automated extraction of chemical structures from patents. The challenges we have encountered whilst performing this study highlight that more needs to be done to make such assessments easier. Above all, more adequate, preferably open access to relevant 'gold standards' is required. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1186/s13321-015-0097-z | Journal of Cheminformatics |
Keywords | Field | DocType |
IBM SIIP,Patent chemistry databases,Patents,SureChEMBL | Data science,Data mining,IBM,Computer science,Intellectual property,Public disclosure,Bioinformatics,Database | Journal |
Volume | Issue | ISSN |
7 | 1 | 1758-2946 |
Citations | PageRank | References |
3 | 0.44 | 4 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Stefan Senger | 1 | 13 | 2.71 |
Luca Bartek | 2 | 3 | 0.44 |
George Papadatos | 3 | 325 | 16.97 |
Anna Gaulton | 4 | 1028 | 68.35 |