Title
Neural-FST Class Language Model for End-to-End Speech Recognition
Abstract
We propose the Neural-FST Class Language Model (NFCLM) for end-to-end speech recognition, a novel method that combines neural network language models (NNLMs) and finite state transducers (FSTs) in a mathematically consistent framework. Our method uses a background NNLM, which models generic background text, together with a collection of domain-specific entities modeled as individual FSTs. Each output token is generated by a mixture of these components; the mixture weights are estimated with a separately trained neural decider. We show that NFCLM significantly outperforms an NNLM alone by 15.8% relative in terms of Word Error Rate. NFCLM achieves performance comparable to traditional shallow fusion of an NNLM and FSTs while being less prone to overbiasing and 12 times more compact, making it more suitable for on-device usage.
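The abstract describes a per-token mixture: the next-token distribution is a weighted combination of the background NNLM and the entity FSTs, with the weights produced by the neural decider. A minimal sketch of that combination, using stand-in components with hypothetical names (the fixed distributions and the toy vocabulary are illustrative assumptions, not the paper's models):

```python
import math

# Toy vocabulary for illustration only.
VOCAB = ["play", "some", "jazz", "acdc", "</s>"]

def background_nnlm(context):
    # Stand-in for the background NNLM: a fixed softmax over VOCAB.
    # A real NNLM would condition on the context.
    logits = {"play": 1.0, "some": 0.5, "jazz": 0.2, "acdc": -1.0, "</s>": 0.0}
    z = sum(math.exp(v) for v in logits.values())
    return {w: math.exp(v) / z for w, v in logits.items()}

def artist_fst(context):
    # Stand-in for one domain-entity FST (e.g. an artist list): probability
    # mass only on tokens the FST can accept next, zero elsewhere.
    return {"acdc": 1.0, "play": 0.0, "some": 0.0, "jazz": 0.0, "</s>": 0.0}

def decider(context):
    # Stand-in for the separately trained neural decider: mixture weights
    # over the components. A real decider would condition on the context.
    return {"background": 0.7, "artist": 0.3}

def next_token_probs(context):
    # P(y | context) = sum over components c of  w_c(context) * P_c(y | context).
    weights = decider(context)
    comps = {"background": background_nnlm(context), "artist": artist_fst(context)}
    return {w: sum(weights[c] * comps[c][w] for c in comps) for w in VOCAB}

probs = next_token_probs(["play"])
assert abs(sum(probs.values()) - 1.0) < 1e-9  # mixture is a valid distribution
```

Because each component is a proper distribution and the decider weights sum to one, the mixture is itself a proper distribution; the FST component boosts entity tokens (here "acdc") relative to the background model.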
Year
2022
DOI
10.1109/ICASSP43922.2022.9747573
Venue
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
DocType
Conference
Citations
0
PageRank
0.34
References
0
Authors
10
Name                 Order  Citations  PageRank
Antoine Bruguier     1      0          0.34
Duc-Vinh Le          2      45         15.88
Rohit Prabhavalkar   3      163        22.56
Dangna Li            4      0          0.34
Zhe Liu              5      0          0.34
Bo Wang              6      0          0.34
Eun Chang            7      0          0.34
Fuchun Peng          8      1378       85.75
Ozlem Kalinli        9      1          3.39
Michael L. Seltzer   10     0          0.34