Title
CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison.
Abstract
Large, labeled datasets have driven deep learning methods to achieve expert-level performance on a variety of medical imaging tasks. We present CheXpert, a large dataset that contains 224,316 chest radiographs of 65,240 patients. We design a labeler to automatically detect the presence of 14 observations in radiology reports, capturing uncertainties inherent in radiograph interpretation. We investigate different approaches to using the uncertainty labels for training convolutional neural networks that output the probability of these observations given the available frontal and lateral radiographs. On a validation set of 200 chest radiographic studies which were manually annotated by 3 board-certified radiologists, we find that different uncertainty approaches are useful for different pathologies. We then evaluate our best model on a test set composed of 500 chest radiographic studies annotated by a consensus of 5 board-certified radiologists, and compare the performance of our model to that of 3 additional radiologists in the detection of 5 selected pathologies. On Cardiomegaly, Edema, and Pleural Effusion, the model ROC and PR curves lie above all 3 radiologist operating points. We release the dataset to the public as a standard benchmark to evaluate performance of chest radiograph interpretation models.
Year
2019
Venue
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE
Field
Chest radiograph, Pleural effusion, Pattern recognition, Convolutional neural network, Computer science, Medical imaging, Radiography, Artificial intelligence, Deep learning, Machine learning, Test set
DocType
Journal
Volume
abs/1901.07031
Citations
10
PageRank
0.56
References
0
Authors
20
Name                    Order  Citations  PageRank
Jeremy Irvin            1      72         3.60
Pranav Rajpurkar        2      555        4.99
Michael Ko              3      16         2.87
Yifan Yu                4      10         0.56
Silviana Ciurea-Ilcus   5      10         0.56
Chris Chute             6      10         0.56
Henrik Marklund         7      10         0.56
Behzad Haghgoo          8      10         0.89
Robyn L. Ball           9      14         0.97
Katie S. Shpanskaya     10     72         3.94
Jayne Seekins           11     10         0.56
David Mong              12     18         1.40
Safwan Halabi           13     10         1.23
Jesse K. Sandberg       14     28         1.64
Ricky Jones             15     10         0.56
David B. Larson         16     10         0.56
Curtis P. Langlotz      17     213        26.80
Bhavik N. Patel         18     10         0.89
Matthew P. Lungren      19     88         7.34
Andrew Y. Ng            20     26065      1987.54