Title | ||
---|---|---|
Extracting drug-drug interactions from literature using a rich feature-based linear kernel approach. |
Abstract | ||
---|---|---|
Identifying unknown drug interactions is of great benefit in the early detection of adverse drug reactions. Despite existence of several resources for drug-drug interaction (DDI) information, the wealth of such information is buried in a body of unstructured medical text which is growing exponentially. This calls for developing text mining techniques for identifying DDIs. The state-of-the-art DDI extraction methods use Support Vector Machines (SVMs) with non-linear composite kernels to explore diverse contexts in literature. While computationally less expensive, linear kernel-based systems have not achieved a comparable performance in DDI extraction tasks. In this work, we propose an efficient and scalable system using a linear kernel to identify DDI information. The proposed approach consists of two steps: identifying DDIs and assigning one of four different DDI types to the predicted drug pairs. We demonstrate that when equipped with a rich set of lexical and syntactic features, a linear SVM classifier is able to achieve a competitive performance in detecting DDIs. In addition, the one-against-one strategy proves vital for addressing an imbalance issue in DDI type classification. Applied to the DDIExtraction 2013 corpus, our system achieves an F1 score of 0.670, as compared to 0.651 and 0.609 reported by the top two participating teams in the DDIExtraction 2013 challenge, both based on non-linear kernel methods. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1016/j.jbi.2015.03.002 | Journal of Biomedical Informatics |
Keywords | Field | DocType |
Drug–drug interaction,Biomedical literature,Linear kernel approach | Kernel (linear algebra),Data mining,F1 score,Text mining,Computer science,Support vector machine,Artificial intelligence,Kernel method,Classifier (linguistics),Syntax,Semantics,Machine learning | Journal |
Volume | Issue | ISSN |
55 | C | 1532-0480 |
Citations | PageRank | References |
34 | 1.15 | 672 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sun I Kim | 1 | 473 | 53.77 |
Haibin Liu | 2 | 34 | 1.15 |
Lana Yeganova | 3 | 34 | 1.15 |
John Wilbur | 4 | 627 | 35.35 |