End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training | 0 | 0.34 | 2022 |
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations | 0 | 0.34 | 2022 |
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning | 0 | 0.34 | 2021 |
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks Using Switching Tokens | 0 | 0.34 | 2021 |
Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation | 0 | 0.34 | 2021 |
Enrollment-Less Training for Personalized Voice Activity Detection | 0 | 0.34 | 2021 |
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss | 0 | 0.34 | 2021 |
MAPGN: Masked Pointer-Generator Network for Sequence-to-Sequence Pre-Training | 0 | 0.34 | 2021 |
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition | 0 | 0.34 | 2021 |
Hierarchical Transformer-Based Large-Context End-to-End ASR with Large-Context Knowledge Distillation | 0 | 0.34 | 2021 |
Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model | 0 | 0.34 | 2020 |
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition | 1 | 0.34 | 2020 |