Title
Lesion-Harvester: Iteratively Mining Unlabeled Lesions and Hard-Negative Examples at Scale
Abstract
The acquisition of large-scale medical image data, necessary for training machine learning algorithms, is hampered by associated expert-driven annotation costs. Mining hospital archives can address this problem, but labels often incomplete or noisy, e.g., 50% of the lesions in DeepLesion are left unlabeled. Thus, effective label harvesting methods are critical. This is the goal of our work, where we introduce Lesion-Harvester-a powerful system to harvest missing annotations from lesion datasets at high precision. Accepting the need for some degree of expert labor, we use a small fully-labeled image subset to intelligently mine annotations from the remainder. To do this, we chain together a highly sensitive lesion proposal generator (LPG) and a very selective lesion proposal classifier (LPC). Using a new hard negative suppression loss, the resulting harvested and hard-negative proposals are then employed to iteratively finetune our LPG. While our framework is generic, we optimize our performance by proposing a new 3D contextual LPG and by using a global-local multi-view LPC. Experiments on DeepLesion demonstrate that Lesion-Harvester can discover an additional 9,805 lesions at a precision of 90%. We publicly release the harvested lesions, along with a new test set of completely annotated DeepLesion volumes. We also present a pseudo 3D IoU evaluation metric that corresponds much better to the real 3D IoU than current DeepLesion evaluation metrics. To quantify the downstream benefits of Lesion-Harvester we show that augmenting the DeepLesion annotations with our harvested lesions allows state-of-the-art detectors to boost their average precision by 7 to 10%.
Year
DOI
Venue
2021
10.1109/TMI.2020.3022034
IEEE Transactions on Medical Imaging
Keywords
DocType
Volume
Algorithms,Machine Learning
Journal
40
Issue
ISSN
Citations 
1
0278-0062
0
PageRank 
References 
Authors
0.34
0
8
Name
Order
Citations
PageRank
Jinzheng Cai17110.95
Adam P. Harrison210117.06
Youjing Zheng300.68
Ke Yan4429.14
Yuankai Huo59626.45
Jing Xiao675.78
Lin Yang71291116.88
Le Lu8129786.78