Group Fisher Pruning for Practical Network Compression - Citegraph

Paper Info

Title
Group Fisher Pruning for Practical Network Compression

Abstract
Network compression has been widely studied since it is able to reduce the memory and computation cost during inference. However, previous methods seldom deal with complicated structures like residual connections, group/depth-wise convolution and feature pyramid network, where channels of multiple layers are coupled and need to be pruned simultaneously. In this paper, we present a general channel pruning approach that can be applied to various complicated structures. Particularly, we propose a layer grouping algorithm to find coupled channels automatically. Then we derive a unified metric based on Fisher information to evaluate the importance of a single channel and coupled channels. Moreover, we find that inference speedup on GPUs is more correlated with the reduction of memory(2) rather than FLOPs, and thus we employ the memory reduction of each channel to normalize the importance. Our method can be used to prune any structures including those with coupled channels. We conduct extensive experiments on various backbones, including the classic ResNet and ResNeXt, mobile-friendly MobileNetV2, and the NAS-based RegNet, both on image classification and object detection which is under-explored. Experimental results validate that our method can effectively prune sophisticated networks, boosting inference speed without sacrificing accuracy.

Year	Venue	DocType
2021	INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139	Conference
Volume	ISSN	Citations
139	2640-3498	0
PageRank	References	Authors
0.34	12	10

Authors (10 rows)

Cited by (0 rows)

References (12 rows)

Name	Order	Citations	PageRank
Liyang Liu	1	2	1.74
Shilong Zhang	2	0	0.34
Zhanghui Kuang	3	56	9.91
Jing-Hao Xue	4	15	10.05
Aojun Zhou	5	104	5.20
Xinjiang Wang	6	0	0.68
Yimin Chen	7	3	1.73
WM	8	221	34.28
QM	9	464	72.05
Wei Zhang	10	382	24.27

1