Imitation networks: Few-shot learning of neural networks from scratch. - Citegraph

Paper Info

Title
Imitation networks: Few-shot learning of neural networks from scratch.

Abstract
In this paper, we propose imitation networks, a simple but effective method for training neural networks with a limited amount of training data. Our approach inherits the idea of knowledge distillation that transfers knowledge from a deep or wide reference model to a shallow or narrow target model. The proposed method employs this idea to mimic predictions of reference estimators that are much more robust against overfitting than the network we want to train. Different from almost all the previous work for knowledge distillation that requires a large amount of labeled training data, the proposed method requires only a small amount of training data. Instead, we introduce pseudo training examples that are optimized as a part of model parameters. Experimental results for several benchmark datasets demonstrate that the proposed method outperformed all the other baselines, such as naive training of the target model and standard knowledge distillation.

Year	Venue	Field
2018	arXiv: Machine Learning	Scratch,Reference model,Effective method,Computer science,Distillation,Imitation,Artificial intelligence,Overfitting,Artificial neural network,Machine learning,Estimator
DocType	Volume	Citations
Journal	abs/1802.03039	0
PageRank	References	Authors
0.34	19	5

Authors (5 rows)

Cited by (0 rows)

References (19 rows)

Name	Order	Citations	PageRank
Akisato Kimura	1	244	28.03
Zoubin Ghahramani	2	10455	1264.39
Koh Takeuchi	3	59	11.29
Tomoharu Iwata	4	824	65.87
Naonori Ueda	5	1902	214.32

1