Title
Managing expectations: assessment of chemistry databases generated by automated extraction of chemical structures from patents
Abstract
In our comparison of automatically generated vs. manually curated patent chemistry databases, the former successfully provided approximately 60 % of links between chemical structure and patents. It needs to be stressed that only a very limited number of patents and compound-patent pairs were used for our comparison. Nevertheless, our results will hopefully help to manage expectations of users of patent chemistry databases of this type and provide a useful framework for more studies like ours as well as guide future developments of the workflows used for the automated extraction of chemical structures from patents. The challenges we have encountered whilst performing this study highlight that more needs to be done to make such assessments easier. Above all, more adequate, preferably open access to relevant 'gold standards' is required.
Year: 2015
DOI: 10.1186/s13321-015-0097-z
Venue: Journal of Cheminformatics
Keywords: IBM SIIP, Patent chemistry databases, Patents, SureChEMBL
Field: Data science, Data mining, IBM, Computer science, Intellectual property, Public disclosure, Bioinformatics, Database
DocType: Journal
Volume: 7
Issue: 1
ISSN: 1758-2946
Citations: 3
PageRank: 0.44
References: 4
Authors (4)
Name              Order  Citations  PageRank
Stefan Senger     1      13         2.71
Luca Bartek       2      3          0.44
George Papadatos  3      325        16.97
Anna Gaulton      4      1028       68.35