Title
Efficient Knowledge Distillation from an Ensemble of Teachers
Abstract
This paper describes the effectiveness of knowledge distillation using teacher-student training for building accurate and compact neural networks. We show that with knowledge distillation, information from multiple acoustic models, such as very deep VGG networks and Long Short-Term Memory (LSTM) models, can be used to train standard convolutional neural network (CNN) acoustic models for a variety of systems requiring a quick turnaround. We examine two strategies to leverage multiple teacher labels for training student models. In the first technique, the weights of the student model are updated by switching teacher labels at the minibatch level. In the second method, student models are trained on multiple streams of information from various teacher distributions via data augmentation. We show that standard CNN acoustic models can achieve recognition accuracy comparable to that of the teacher VGG and LSTM acoustic models with a much smaller number of model parameters. Additionally, we investigate the effectiveness of using broadband teacher labels as privileged knowledge for training better narrowband acoustic models within this framework. We show the benefit of this simple technique by training narrowband student models with broadband teacher soft labels on the Aurora 4 task.
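As a rough illustration of the first strategy mentioned in the abstract, the sketch below trains a student model on soft labels from one teacher chosen at random for each minibatch, so the supervising teacher switches at the minibatch level. This is not the authors' implementation: the use of PyTorch, the model objects, the data loader, the distillation temperature, and the optimizer settings are all assumptions made for this example.

```python
# Minimal sketch (assumed PyTorch): knowledge distillation with
# minibatch-level switching among an ensemble of frozen teachers.
import random
import torch
import torch.nn.functional as F


def distill_with_teacher_switching(student, teachers, loader,
                                   epochs=1, temperature=2.0,
                                   lr=1e-3, device="cpu"):
    """Update the student on soft labels from one teacher per minibatch;
    the teacher is picked at random, so labels switch at the minibatch level."""
    student.to(device).train()
    for teacher in teachers:
        teacher.to(device).eval()          # teachers are frozen
    optimizer = torch.optim.SGD(student.parameters(), lr=lr, momentum=0.9)

    for _ in range(epochs):
        for features, _ in loader:         # hard targets unused in this sketch
            features = features.to(device)
            teacher = random.choice(teachers)   # minibatch-level teacher switch
            with torch.no_grad():
                soft_targets = F.softmax(teacher(features) / temperature, dim=-1)
            student_log_probs = F.log_softmax(student(features) / temperature,
                                              dim=-1)
            # KL divergence between teacher and student output distributions,
            # scaled by T^2 as is conventional in knowledge distillation.
            loss = F.kl_div(student_log_probs, soft_targets,
                            reduction="batchmean") * temperature ** 2
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
    return student
```

Under the same assumptions, the second strategy would instead replicate each training stream once per teacher distribution (a form of data augmentation) and train the student on the combined streams rather than switching teachers per minibatch.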
Year
2017
DOI
10.21437/Interspeech.2017-614
Venue
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION
Keywords
Speech recognition, knowledge distillation, teacher-student, CNN, VGG, LSTM, bandwidth
Field
Computer science, Distillation, Natural language processing, Artificial intelligence, Machine learning
DocType
Conference
ISSN
2308-457X
Citations
7
PageRank
0.63
References
0
Authors
6
Name, Order, Citations, PageRank
Takashi Fukuda, 1, 10, 4.86
Masayuki Suzuki, 2, 23, 5.88
Gakuto Kurata, 3, 107, 19.06
Samuel Thomas, 4, 536, 46.88
Jia Cui, 5, 94, 6.26
Bhuvana Ramabhadran, 6, 1779, 153.83