The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight. - Citegraph

Paper Info

Title
The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight.

Abstract
In biomedical informatics, assigning drug codes to categories is a common step in the analysis pipeline. Unfortunately, incomplete mappings are the norm rather than the exception with coverage values less than 85% not uncommon. Here, we perform this linking task on a nationwide insurance claims database with over 13 million members who were dispensed, according to National Drug Codes (NDCs), over 50,000 unique product forms of medication. The chosen approach employs Cerner Multum's VantageRx and the U.S. National Library of Medicine's RxMix. As a result, 94.0% of the NDCs were successfully mapped to categories used by common drug terminologies, e.g., Anatomical Therapeutic Chemical (ATC). Implemented as an SQL database and scripts, the approach is generic and can be setup for a new data set in a few hours. Thus, the method is a viable option for large-scale drug classification.

Year	Venue	Field
2016	CRI	Data mining,Information retrieval,Sql database,Data classification,Health informatics,Drug,Medicine,Drug classification,Scripting language
DocType	Volume	Citations
Conference	2016	0
PageRank	References	Authors
0.34	0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Mark L. Homer	1	0	0.34
Nathan Palmer	2	0	1.01
Olivier Bodenreider	3	2715	226.05
Aurel Cami	4	0	0.68
Laura Chadwick	5	0	0.34
Kenneth D. Mandl	6	275	67.17

1