Title
Multi-Stream End-to-End Speech Recognition.
Abstract
Attention-based methods and Connectionist Temporal Classification (CTC) network have been promising research directions for end-to-end (E2E) Automatic Speech Recognition (ASR). The joint CTC/Attention model has achieved great success by utilizing both architectures during multi-task training and joint decoding. In this article, we present a multi-stream framework based on joint CTC/Attention E2E A...
Year
DOI
Venue
2020
10.1109/TASLP.2019.2959721
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Keywords
Field
DocType
Decoding,Speech recognition,Acoustics,Computational modeling,Microphone arrays,Task analysis
Computer science,End-to-end principle,Word error rate,Speech recognition,Robustness (computer science),Encoder,Decoding methods,Microphone,Connectionism,Test set
Journal
Volume
Issue
ISSN
28
1
2329-9290
Citations 
PageRank 
References 
1
0.35
12
Authors
6
Name
Order
Citations
PageRank
ruizhi li15112.01
Xiaofei Wang2134.99
Sri Harish Reddy Mallidi3487.94
Shinji Watanabe41158139.38
Takaaki Hori540845.58
Hynek Hermansky63298510.27