Abstract | ||
---|---|---|
One of the primary challenges of healthcare delivery is aggregating disparate, asynchronous data sources into meaningful indicators of individual health. We combine natural language word embedding and network modeling techniques to learn meaningful representations of medical concepts by using the weighted network adjacency matrix in the GloVe algorithm, which we call Code2Vec. We demonstrate that using our learned embeddings improve neural network performance for disease prediction. However, we also demonstrate that popular deep learning models for disease prediction are not meaningfully better than simpler, more interpretable classifiers such as XGBoost. Additionally, our work adds to the current literature by providing a comprehensive survey of various machine learning algorithms on disease prediction tasks. |
Year | Venue | Field |
---|---|---|
2018 | ICHI | Adjacency matrix,Asynchronous communication,Computer science,Weighted network,Natural language,Artificial intelligence,Word embedding,Deep learning,Artificial neural network,Machine learning,Network model |
DocType | Citations | PageRank |
Conference | 0 | 0.34 |
References | Authors | |
0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Tanner Christensen | 1 | 0 | 1.01 |
Abraham Frandsen | 2 | 0 | 0.68 |
Seth Glazier | 3 | 0 | 0.34 |
Jeffrey Humpherys | 4 | 30 | 9.59 |
David Kartchner | 5 | 3 | 0.76 |