End-To-End Contextual Speech Recognition Using Class Language Models And A Token Passing Decoder - Citegraph

Paper Info

Title
End-To-End Contextual Speech Recognition Using Class Language Models And A Token Passing Decoder

Abstract
End-to-end modeling ( E2E) of automatic speech recognition ( ASR) blends all the components of a traditional speech recognition system into a single, unified model. Although it simplifies the ASR systems, the unified model is hard to adapt when training and testing data mismatches. In this work, we focus on contextual speech recognition, which is particularly challenging for E2E models because contextual information is only available in inference time. To improve the performance in the presence of contextual information during training, we propose to use class-based language models ( CLM) that can populate context-dependent information during inference. To enable this approach to scale to a large number of class members and minimize search errors, we propose a token passing algorithm with an efficient token recombination for E2E systems. We evaluate the proposed system on general and contextual ASR tasks, and achieve relative 62% Word Error Rate ( WER) reduction for the contextual ASR task without hurting recognition performance for the general ASR task. We also show that the proposed method performs well without modification of the decoding hyper-parameters across tasks, making it a desirable solution for E2E ASR.

Year	Venue	Keywords
2018	2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)	End-to-end Speech Recognition, Weighted Finite State Transducer, Token Passing, Class-based Language Model
Field	DocType	Volume
Token passing,Inference,End-to-end principle,Computer science,Word error rate,Speech recognition,Test data,Decoding methods,Security token,Language model	Journal	abs/1812.02142
ISSN	Citations	PageRank
1520-6149	2	0.38
References	Authors
0	5

Authors (5 rows)

Cited by (2 rows)

References (0 rows)

Name	Order	Citations	PageRank
Zhehuai Chen	1	2	0.72
Mahaveer Jain	2	24	2.93
Yongqiang Wang	3	175	13.32
Michael L. Seltzer	4	1027	69.42
Christian Fuegen	5	9	6.58

1