Title
Deep Mining External Imperfect Data for Chest X-Ray Disease Screening
Abstract
Deep learning approaches have demonstrated remarkable progress in automatic chest X-ray (CXR) analysis. Because deep models are data-driven, their training data must cover a broad distribution, so it is important to integrate knowledge from multiple datasets, especially for medical images. However, learning a disease classification model with extra CXR data remains challenging. Recent studies have shown that a performance bottleneck exists in joint training on different CXR datasets, and few efforts have been made to address it. In this paper, we argue that incorporating an external CXR dataset leads to imperfect training data, which gives rise to these challenges. Specifically, the imperfection is twofold: domain discrepancy, as image appearance varies across datasets; and label discrepancy, as different datasets are only partially labeled. To this end, we formulate multi-label thoracic disease classification as a set of weighted independent binary tasks, one per category. For common categories shared across domains, we adopt task-specific adversarial training to alleviate the feature differences. For categories that exist in only a single dataset, we present uncertainty-aware temporal ensembling of model predictions to further mine information from the missing labels. In this way, our framework simultaneously models and tackles the domain and label discrepancies, enabling superior knowledge mining ability. We conduct extensive experiments on three datasets with more than 360,000 chest X-ray images. Our method outperforms other competing models and sets state-of-the-art performance on the official NIH test set with 0.8349 AUC, demonstrating its effectiveness in utilizing the external dataset to improve the internal classification.
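The two ideas the abstract names for handling partial labels, per-category binary tasks and temporal ensembling of predictions, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of `None` to mark a missing label, the weight values, and the smoothing factor `alpha` are all assumptions made for illustration, and the uncertainty-aware weighting and adversarial training described in the paper are not modeled here.

```python
import math


def masked_weighted_bce(probs, labels, weights, eps=1e-7):
    """Weighted binary cross-entropy averaged over labeled categories only.

    probs   : predicted probabilities, one per disease category
    labels  : 1 (positive), 0 (negative), or None (category not annotated
              in the dataset this image came from; an assumed convention)
    weights : per-category task weights (hypothetical values)
    """
    total, weight_sum = 0.0, 0.0
    for p, y, w in zip(probs, labels, weights):
        if y is None:
            # Missing label: this category contributes nothing to the loss.
            continue
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        bce = -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
        total += w * bce
        weight_sum += w
    return total / weight_sum if weight_sum > 0 else 0.0


def update_ensemble(ensemble, probs, alpha=0.6):
    """Exponential moving average of per-category predictions across epochs.

    A plain temporal ensemble; the smoothed values could serve as soft
    targets for categories whose labels are missing.
    """
    return [alpha * e + (1.0 - alpha) * p for e, p in zip(ensemble, probs)]


if __name__ == "__main__":
    # Three categories; the third is unlabeled for this image.
    loss = masked_weighted_bce([0.9, 0.2, 0.5], [1, 0, None], [1.0, 1.0, 1.0])
    print(f"masked loss: {loss:.4f}")
    print(update_ensemble([0.0, 1.0], [1.0, 1.0], alpha=0.5))
```

Treating each category as its own binary task is what makes the masking possible: an image from a dataset that does not annotate a given disease simply skips that term, while the shared categories still receive a gradient from both datasets.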
Year: 2020
DOI: 10.1109/TMI.2020.3000949
Venue: IEEE Transactions on Medical Imaging
Keywords: Deep Learning; Radiography; Radiography, Thoracic; Thorax; X-Rays
DocType: Journal
Volume: 39
Issue: 11
ISSN: 0278-0062
Citations: 6
PageRank: 0.46
References: 0
Authors: 7
Name
Order
Citations
PageRank
Luyang Luo1203.73
Lequan Yu270639.80
Hao Chen3106058.15
Liu Quande4182.71
Xi Wang54712.80
Jiaqi Xu61069.50
Pheng-Ann Heng73565280.98