Title
One-Shot Layer-Wise Accuracy Approximation For Layer Pruning
Abstract
Recent advances in neural network pruning have made it possible to remove a large number of filters without any perceptible drop in accuracy. However, the gain in speed depends on the number of filters per layer. In this paper, we propose a one-shot layer-wise proxy classifier to estimate layer importance, which in turn allows us to prune whole layers. In contrast to existing filter pruning methods, which reduce the width of a dense model, our method reduces its depth and can thus guarantee an inference speed-up. In our proposed method, we first make a single pass through the training data to construct a proxy classifier for each layer using imprinting. Next, we prune the layers with the smallest accuracy difference from their preceding layer until a latency budget is met. Finally, we fine-tune the pruned model to recover accuracy. Experiments show a 43.70% latency reduction with a 1.27% accuracy increase on CIFAR100 for the pruned VGG19. On ImageNet with ResNet-50, we further achieve 16% and 25% latency reductions with a 0.58% accuracy increase and a 0.01% accuracy decrease, respectively. A major advantage of our method is that these latency reductions cannot be achieved with existing filter pruning methods, as they are bounded by the original model's depth. Code is available at https://github.com/selkerdawy/one-shot-layer-pruning
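For concreteness, the procedure the abstract outlines (build an imprinted proxy classifier per layer from one pass over the training data, rank layers by their accuracy gain over the preceding layer, prune the lowest-gain layers until the latency budget is met, then fine-tune) can be illustrated with a short PyTorch-style sketch. This is a rough illustration only, not the authors' released code: it assumes each layer's activations have already been globally pooled into one feature vector per sample, and the function names (imprint_proxy, proxy_accuracy, rank_layers_for_pruning) are hypothetical.

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def imprint_proxy(features, labels, num_classes):
    # Imprinting: each class template is the mean of that class's
    # L2-normalized feature vectors, re-normalized so that classification
    # reduces to cosine similarity (nearest class mean on the unit sphere).
    feats = F.normalize(features, dim=1)
    templates = torch.stack([feats[labels == c].mean(dim=0)
                             for c in range(num_classes)])
    return F.normalize(templates, dim=1)

@torch.no_grad()
def proxy_accuracy(features, labels, templates):
    # The proxy classifier predicts the class whose template has the
    # highest cosine similarity with the normalized feature vector.
    preds = (F.normalize(features, dim=1) @ templates.t()).argmax(dim=1)
    return (preds == labels).float().mean().item()

def rank_layers_for_pruning(layer_features, labels, num_classes):
    # Score each layer by its proxy accuracy gain over the preceding
    # layer; layers with the smallest gain contribute the least and are
    # the first candidates for removal.
    accs = [proxy_accuracy(f, labels, imprint_proxy(f, labels, num_classes))
            for f in layer_features]
    gains = [accs[0]] + [accs[i] - accs[i - 1] for i in range(1, len(accs))]
    return sorted(range(len(gains)), key=gains.__getitem__)
```

Under these assumptions, pruning then amounts to removing the lowest-ranked layers (plus any residual-connection bookkeeping the architecture requires) until the latency budget is met, followed by fine-tuning the pruned model.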
Year
2020
DOI
10.1109/ICIP40778.2020.9191238
Venue
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP)
Keywords
one-shot layer pruning, layer-wise proxy classifiers, inference speed up
DocType
Conference
ISSN
1522-4880
Citations
0
PageRank
0.34
References
0
Authors
5
Name              Order  Citations  PageRank
Sara Elkerdawy    1      0          1.35
Mostafa Elhoushi  2      0          1.35
Abhineet Singh    3      13         3.60
Hong Zhang        4      582        74.33
Nilanjan Ray      5      541        55.39