Self-Supervised Pre-Training for Attention-Based Encoder-Decoder ASR Model | 0 | 0.34 | 2022 |
DPT-FSNet: Dual-Path Transformer Based Full-Band and Sub-Band Fusion Network for Speech Enhancement | 0 | 0.34 | 2022 |
History Utterance Embedding Transformer LM for Speech Recognition | 0 | 0.34 | 2021 |
The THINKIT System for ICASSP2021 M2VoC Challenge | 1 | 0.39 | 2021 |
RNN-T Based Open-Vocabulary Keyword Spotting in Mandarin with Multi-Level Detection | 0 | 0.34 | 2021 |
Context-dependent Label Smoothing Regularization for Attention-based End-to-End Code-Switching Speech Recognition | 0 | 0.34 | 2021 |
Power Pooling: An Adaptive Pooling Function for Weakly Labelled Sound Event Detection | 0 | 0.34 | 2021 |
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Text Data | 0 | 0.34 | 2021 |
Keyword Search Using Attention-Based End-to-End ASR and Frame-Synchronous Phoneme Alignments | 0 | 0.34 | 2021 |
A dual-stream deep attractor network with multi-domain learning for speech dereverberation and separation | 1 | 0.35 | 2021 |
A Unified System For Multilingual Speech Recognition And Language Identification | 0 | 0.34 | 2021 |
D-MONA: A dilated mixed-order non-local attention network for speaker and language recognition | 0 | 0.34 | 2021 |
Non-autoregressive Deliberation-Attention based End-to-End ASR | 0 | 0.34 | 2021 |
Improved Guided Source Separation Integrated with a Strong Back-End for the CHiME-6 Dinner Party Scenario | 0 | 0.34 | 2020 |
Lingual-Agnostic Meta-Learning for Low-Resource Part-of-Speech Tagging | 0 | 0.34 | 2020 |
End-To-End Multilingual Speech Recognition System With Language Supervision Training | 0 | 0.34 | 2020 |
Speaker Diarization System Based on DPCA Algorithm for Fearless Steps Challenge Phase-2 | 0 | 0.34 | 2020 |
Domain Adaption for Fine-Grained Urban Village Extraction From Satellite Images | 1 | 0.35 | 2020 |
Domain Adaptation Using Class Similarity for Robust Speech Recognition | 0 | 0.34 | 2020 |
Weighted Feature Fusion Based Emotional Recognition For Variable-Length Speech Using DNN | 0 | 0.34 | 2019 |
Investigation of knowledge transfer approaches to improve the acoustic modeling of Vietnamese ASR system | 2 | 0.39 | 2019 |
Aluminum alloy microstructural segmentation method based on simple noniterative clustering and adaptive density-based spatial clustering of applications with noise | 0 | 0.34 | 2019 |
Long/short-term utility aware optimal selection of manufacturing service composition towards Industrial Internet platform | 1 | 0.35 | 2019 |
Consensus aware manufacturing service collaboration optimization under blockchain based Industrial Internet platform | 2 | 0.37 | 2019 |
Speaker-Phonetic I-Vector Modeling For Text-Dependent Speaker Verification With Random Digit Strings | 0 | 0.34 | 2019 |
Automatic Speech Recognition System With Output-Gate Projected Gated Recurrent Unit | 0 | 0.34 | 2019 |
A Novel Method for Automatic Heart Murmur Diagnosis Using Phonocardiogram | 0 | 0.34 | 2019 |
Tailoring an Interpretable Neural Language Model | 0 | 0.34 | 2019 |
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation | 0 | 0.34 | 2019 |
Self-Attention Based Prosodic Boundary Prediction For Chinese Speech Synthesis | 0 | 0.34 | 2019 |
Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning | 1 | 0.35 | 2019 |
Aluminum alloy microstructural segmentation in micrograph with hierarchical parameter transfer learning method | 0 | 0.34 | 2019 |
An Audio Scene Classification Framework With Embedded Filters And A DCT-Based Temporal Module | 0 | 0.34 | 2019 |
Character-Aware Sub-Word Level Language Modeling for Uyghur and Turkish ASR | 0 | 0.34 | 2019 |
Target Speaker Recovery and Recognition Network with Average x-Vector and Global Training | 0 | 0.34 | 2019 |
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition | 0 | 0.34 | 2019 |
Multi-Accent Adaptation Based on Gate Mechanism | 0 | 0.34 | 2019 |
Improve Multichannel Speech Recognition With Temporal And Spatial Information | 0 | 0.34 | 2018 |
Multichannel ASR with Knowledge Distillation and Generalized Cross Correlation Feature | 0 | 0.34 | 2018 |
Evaluating Modeling Units and Sub-word Features in Language Models for Turkish ASR | 0 | 0.34 | 2018 |
Space-Time Residual LSTM Architecture for Distant Speech Recognition | 0 | 0.34 | 2018 |
Multilingual Speech Recognition Training and Adaptation with Language-Specific Gate Units | 0 | 0.34 | 2018 |
An Improved Lexicon Generation Method For Mandarin Speech Recognition | 0 | 0.34 | 2017 |
Attention-Based LSTM With Multi-Task Learning For Distant Speech Recognition | 0 | 0.34 | 2017 |
Fast Variable-Frame-Rate Decoding Of Speech Recognition Based On Deep Neural Networks | 0 | 0.34 | 2017 |
Improved End-To-End Speech Recognition Using Adaptive Per-Dimensional Learning Rate Methods | 0 | 0.34 | 2016 |
Noise Robust IOA/CAS Speech Separation and Recognition System For The Third 'CHIME' Challenge | 2 | 0.39 | 2015 |
Using neural network front-ends on far field multiple microphones based speech recognition | 28 | 0.90 | 2014 |
Semi-supervised DNN training in meeting recognition | 10 | 0.70 | 2014 |
Optimization of Spoken Term Detection System | 0 | 0.34 | 2012 |