Title
Training Deep Nets with Progressive Batch Normalization on Multi-GPUs
Abstract
Batch normalization (BN) enables faster training of many deep neural networks. However, training accuracy degrades significantly as the per-device mini-batch size decreases. Accuracy can be improved by computing a global mean and variance over the entire input batch, but this requires communication across all devices in every BN layer, which greatly slows training. To address this problem, we propose progressive batch normalization, which achieves a good balance between model accuracy and efficiency in multi-GPU training. Experimental results show that our algorithm obtains a significant performance improvement over traditional BN without data synchronization across GPUs, achieving up to an 18.4% improvement when training DeepLab for the semantic segmentation task across 8 GPUs.
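The abstract contrasts per-device BN statistics with globally synchronized statistics. Below is a minimal PyTorch sketch of the fully synchronized baseline it refers to (not the paper's progressive BN): per-channel mean and variance are aggregated with an all-reduce in every BN layer, which is the communication cost the abstract describes. The function name sync_bn_forward is illustrative, and an already-initialized torch.distributed process group with one process per GPU is assumed.

```python
# Minimal sketch of fully synchronized BN statistics (the costly baseline
# described in the abstract, not the paper's progressive BN).
# Assumes torch.distributed is initialized with one process per GPU;
# the function name sync_bn_forward is illustrative.
import torch
import torch.distributed as dist


def sync_bn_forward(x: torch.Tensor, eps: float = 1e-5) -> torch.Tensor:
    """Normalize x (N, C, H, W) using mean/variance of the global batch.

    Every call performs an all-reduce, so placing this in each BN layer
    adds one communication round per layer per iteration.
    """
    # Per-channel sums and element count on the local GPU.
    count = torch.tensor([x.numel() / x.size(1)], device=x.device, dtype=x.dtype)
    total = x.sum(dim=(0, 2, 3))
    total_sq = (x * x).sum(dim=(0, 2, 3))

    # Aggregate statistics across all GPUs (this is the synchronization cost).
    for t in (count, total, total_sq):
        dist.all_reduce(t, op=dist.ReduceOp.SUM)

    mean = total / count
    var = total_sq / count - mean * mean  # biased variance of the global batch

    return (x - mean[None, :, None, None]) / torch.sqrt(
        var[None, :, None, None] + eps
    )
```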
Year
2019
DOI
10.1007/s10766-018-0615-5
Venue
International Journal of Parallel Programming
Keywords
Batch normalization, Data parallelism, Deep learning
Field
Normalization (statistics), Segmentation, Computer science, Parallel computing, Data synchronization, Data parallelism, Artificial intelligence, Deep learning, Computer engineering, Deep neural networks, Performance improvement
DocType
Journal
Volume
47
Issue
3
ISSN
1573-7640
Citations
0
PageRank
0.34
References
8
Authors
5
Name           Order  Citations  PageRank
Lianke Qin     1      0          0.34
Yifan Gong     2      13321      35.58
Tianqi Tang    3      342        19.66
Yutian Wang    4      0          4.06
Jiangming Jin  5      3          1.76