reducing multilingual context confusion for end-to-end code-switching automatic speech recognition | 0 | 0.34 | 2022 |
Fully Automated End-to-End Fake Audio Detection. | 0 | 0.34 | 2022 |
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition | 0 | 0.34 | 2022 |
ADD 2022: the first Audio Deep Synthesis Detection Challenge. | 0 | 0.34 | 2022 |
Continual Learning for Fake Audio Detection. | 1 | 0.36 | 2021 |
Rnn-transducer With Language Bias For End-to-end Mandarin-English Code-switching Speech Recognition | 0 | 0.34 | 2021 |
Hierarchically Attending Time-Frequency and Channel Features for Improving Speaker Verification | 0 | 0.34 | 2021 |
Half-Truth - A Partially Fake Audio Detection Dataset. | 0 | 0.34 | 2021 |
DECOUPLING PRONUNCIATION AND LANGUAGE FOR END-TO-END CODE-SWITCHING AUTOMATIC SPEECH RECOGNITION | 0 | 0.34 | 2021 |
FSR - Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization. | 0 | 0.34 | 2021 |
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition | 0 | 0.34 | 2021 |
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition | 0 | 0.34 | 2020 |
Focal Loss for Punctuation Prediction. | 0 | 0.34 | 2020 |
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition | 0 | 0.34 | 2020 |
Self-Attention Transducers for End-to-End Speech Recognition | 1 | 0.36 | 2019 |
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting | 0 | 0.34 | 2019 |
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition | 1 | 0.34 | 2019 |