Title
Identity Mappings In Deep Residual Networks
Abstract
Deep residual networks have emerged as a family of extremely deep architectures showing compelling accuracy and nice convergence behaviors. In this paper, we analyze the propagation formulations behind the residual building blocks, which suggest that the forward and backward signals can be directly propagated from one block to any other block, when using identity mappings as the skip connections and after-addition activation. A series of ablation experiments support the importance of these identity mappings. This motivates us to propose a new residual unit, which makes training easier and improves generalization. We report improved results using a 1001-layer ResNet on CIFAR-10 (4.62% error) and CIFAR-100, and a 200-layer ResNet on ImageNet. Code is available at: https://github.com/KaimingHe/resnet-1k-layers.
Year
DOI
Venue
2016
10.1007/978-3-319-46493-0_38
COMPUTER VISION - ECCV 2016, PT IV
Keywords
DocType
Volume
Identity Mapping,Training Error,Residual Function,Grey Arrow,Residual Unit
Conference
9908
ISSN
Citations 
PageRank 
0302-9743
759
23.48
References 
Authors
18
4
Search Limit
100759
Name
Order
Citations
PageRank
Kaiming He121469696.72
Xiangyu Zhang213044437.66
Shaoqing Ren317051548.00
Jian Sun425842956.90