Title
Automated Extraction of Information from the Literature on Chemical-CYP3A4 Interactions.
Abstract
A text mining system is presented for automatically extracting information from the literature on chemical-CYP3A4 interactions (i.e., substrate, induction, inhibition). The system identifies chemicals and CYP3A4 forms according to a combination of name dictionaries and context features. In addition, it transforms sentences into multiple simple clauses each containing a single event and extracts information on chemical-CYP3A4 interactions using a simple but effective pattern matching method based on the order of three keywords (chemicals, CYP3A4, key verbs). Using this system, 2990 relations including 2700 identified interactions with CYP3A4 for 600 chemicals were extracted from a corpus of 2900 PubMed abstracts. In an evaluation test using 100 randomly selected abstracts, it achieved 87.4% recall and 92.3% precision for identification of the. chemical name and 85.2% recall and 92.0% precision for the extraction of chemical-CYP3A4 interactions, respectively. This system will be applicable to interactions of chemicals with any functional proteins, such as enzymes and transporters, simply by changing the list of key verbs.
Year
DOI
Venue
2007
10.1021/ci700091m
JOURNAL OF CHEMICAL INFORMATION AND MODELING
Field
DocType
Volume
Text mining,Chemical nomenclature,Computer science,Natural language processing,Artificial intelligence,Pattern matching,Recall
Journal
47
Issue
ISSN
Citations 
6
1549-9596
2
PageRank 
References 
Authors
0.37
0
3
Name
Order
Citations
PageRank
Chunlai Feng151.17
Fumiyoshi Yamashita2153.67
Mitsuru Hashida3102.71