Title
End-To-End Streaming Keyword Spotting
Abstract
We present a system for keyword spotting that, except for a front-end component for feature generation, it is entirely contained in a deep neural network (DNN) model trained "end-to-end" to predict the presence of the keyword in a stream of audio. The main contributions of this work are, first, an efficient memoized neural network topology that aims at making better use of the parameters and associated computations in the DNN by holding a memory of previous activations distributed over the depth of the DNN. The second contribution is a method to train the DNN, end-to-end, to produce the keyword spotting score. This system significantly outperforms previous approaches both in terms of quality of detection as well as size and computation.
Year
DOI
Venue
2018
10.1109/icassp.2019.8683557
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
Keywords
Field
DocType
deep neural networks, keyword spotting, audio processing, embedded speech recognition
End-to-end principle,Computer science,Keyword spotting,Artificial intelligence,Feature generation,Artificial neural network,Neural network topology,Machine learning,Computation
Journal
Volume
ISSN
Citations 
abs/1812.02802
1520-6149
0
PageRank 
References 
Authors
0.34
12
2
Name
Order
Citations
PageRank
Raziel Álvarez1303.84
Hyun-Jin Park200.34