Title
The Heterogeneity Hypothesis: Finding Layer-Wise Differentiated Network Architectures
Abstract
In this paper, we tackle the problem of convolutional neural network design. Instead of focusing on the design of the overall architecture, we investigate a design space that is usually overlooked, i.e., adjusting the channel configurations of predefined networks. We find that this adjustment can be achieved by shrinking widened baseline networks and leads to superior performance. Based on that, we articulate the "heterogeneity hypothesis": with the same training protocol, there exists a layer-wise differentiated network architecture (LW-DNA) that can outperform the original network with regular channel configurations but with a lower level of model complexity. The LW-DNA models are identified without extra computational cost or training time compared with the original network. This constraint leads to controlled experiments that direct attention to the importance of layer-wise specific channel configurations. LW-DNA models come with advantages related to overfitting, i.e., the relative relationship between model complexity and dataset size. Experiments are conducted on various networks and datasets for image classification, visual tracking and image restoration. The resultant LW-DNA models consistently outperform the baseline models.
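The abstract's central idea, that a network whose layer widths are tuned individually can match or beat a uniform-width baseline at lower complexity, can be illustrated with a toy parameter count. This is a minimal sketch only: the layer chain, the uniform widths, and the "differentiated" widths below are hypothetical illustrations, not the configurations used in the paper.

```python
# Toy comparison: conv parameter count of a uniform (regular) channel
# configuration vs. a layer-wise differentiated one. All widths here
# are hypothetical, chosen only to illustrate the idea.

def conv_params(c_in, c_out, k=3):
    """Parameter count of one k x k convolution (weights + bias)."""
    return c_out * (c_in * k * k + 1)

def network_params(widths, c_in=3):
    """Total parameters of a chain of 3x3 conv layers with given widths."""
    total = 0
    for c_out in widths:
        total += conv_params(c_in, c_out)
        c_in = c_out
    return total

# Regular configuration: widths fixed per stage (VGG-like doubling).
regular = [64, 64, 128, 128, 256, 256]

# Layer-wise differentiated configuration: every layer's width chosen
# individually (illustrative numbers only).
lw_dna = [48, 72, 112, 144, 200, 280]

print("regular:", network_params(regular))
print("lw-dna: ", network_params(lw_dna))
```

Despite some layers being wider than the baseline and others narrower, the differentiated configuration ends up with fewer total parameters, which is the kind of trade-off the hypothesis concerns.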
Year
2021
DOI
10.1109/CVPR46437.2021.00218
Venue
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021
DocType
Conference
ISSN
1063-6919
Citations
0
PageRank
0.34
References
0
Authors
7
Name              Order  Citations  PageRank
Yawei Li          1      31         5.58
Wen Li            2      373        21.87
Martin Danelljan  3      1344       49.35
Kai Zhang         4      686        26.59
Shuhang Gu        5      701        28.25
Luc Van Gool      6      27566      1819.51
Radu Timofte      7      1880       118.45