EMA2S: An End-to-End Multimodal Articulatory-to-Speech System | 0 | 0.34 | 2021 |
MetricGAN+ - An Improved Version of MetricGAN for Speech Enhancement. | 2 | 0.38 | 2021 |
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification | 0 | 0.34 | 2021 |
A Study of Incorporating Articulatory Movement Information in Speech Enhancement | 0 | 0.34 | 2021 |
Coupling a Generative Model With a Discriminative Learning Framework for Speaker Verification | 0 | 0.34 | 2021 |
UNSUPERVISED NEURAL ADAPTATION MODEL BASED ON OPTIMAL TRANSPORT FOR SPOKEN LANGUAGE IDENTIFICATION | 0 | 0.34 | 2021 |
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport. | 0 | 0.34 | 2021 |
Investigation of NICT Submission for Short-Duration Speaker Verification Challenge 2020. | 0 | 0.34 | 2020 |
Incorporating Broad Phonetic Information for Speech Enhancement | 0 | 0.34 | 2020 |
Joint Training End-to-End Speech Recognition Systems with Speaker Attributes. | 0 | 0.34 | 2020 |
Wavecrn: An Efficient Convolutional Recurrent Neural Network For End-To-End Speech Enhancement | 1 | 0.35 | 2020 |
Robust Unsupervised Neural Machine Translation with Adversarial Denoising Training. | 0 | 0.34 | 2020 |
Compensation on x-vector for Short Utterance Spoken Language Identification. | 0 | 0.34 | 2020 |
Optimal Classifier Parameter Status Selection Based on Bayes Boundary-ness for Multi-ProtoType and Multi-Layer Perceptron Classifiers. | 0 | 0.34 | 2019 |
Class-Wise Centroid Distance Metric Learning for Acoustic Event Detection | 0 | 0.34 | 2019 |
Incorporating Symbolic Sequential Modeling for Speech Enhancement. | 0 | 0.34 | 2019 |
Interactive Learning Of Teacher-Student Model For Short Utterance Spoken Language Identification | 1 | 0.34 | 2019 |
Incorporating Symbolic Sequential Modeling for Speech Enhancement. | 3 | 0.42 | 2019 |
Improving Transformer-Based Speech Recognition Systems with Compressed Structure and Speech Attributes Augmentation | 2 | 0.45 | 2019 |
End-to-End Articulatory Attribute Modeling for Low-Resource Multilingual Speech Recognition | 1 | 0.36 | 2019 |
Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric | 1 | 0.36 | 2019 |
Investigating Radical-Based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese | 2 | 0.39 | 2019 |
Speech Dereverberation Based on Integrated Deep and Ensemble Learning. | 0 | 0.34 | 2018 |
Improving Very Deep Time-Delay Neural Network With Vertical-Attention For Effectively Training CTC-Based ASR Systems. | 0 | 0.34 | 2018 |
Temporal Attentive Pooling for Acoustic Event Detection. | 0 | 0.34 | 2018 |
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks. | 9 | 0.58 | 2018 |
Study of articulators' contribution and compensation during speech by articulatory speech recognition. | 0 | 0.34 | 2018 |
Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement | 0 | 0.34 | 2018 |
Multi-Metrics Learning for Speech Enhancement. | 0 | 0.34 | 2017 |
A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation. | 10 | 0.53 | 2017 |
Maximum-a-Posteriori-Based Decoding for End-to-End Acoustic Models. | 1 | 0.35 | 2017 |
Raw Waveform-based Speech Enhancement by Fully Convolutional Networks. | 10 | 0.59 | 2017 |
Incremental training and constructing the very deep convolutional residual network acoustic models | 0 | 0.34 | 2017 |
Regularization of neural network model with distance metric learning for i-vector based spoken language identification. | 3 | 0.38 | 2017 |
Conditional Generative Adversarial Nets Classifier For Spoken Language Identification | 0 | 0.34 | 2017 |
Speaker Adaptive Training Localizing Speaker Modules In Dnn For Hybrid Dnn-Hmm Speech Recognizers | 0 | 0.34 | 2016 |
Automatic acoustic segmentation in N-best list rescoring for lecture speech recognition | 0 | 0.34 | 2016 |
Wavelet speech enhancement based on nonnegative matrix factorization | 3 | 0.37 | 2016 |
Comparison of regularization constraints in deep neural network based speaker adaptation | 0 | 0.34 | 2016 |
A pseudo-task design in multi-task learning deep neural network for speaker recognition | 0 | 0.34 | 2016 |
Incorporating Local Environment Information With Ensemble Neural Networks To Robust Automatic Speech Recognition | 0 | 0.34 | 2016 |
Combination of multiple acoustic models with unsupervised adaptation for lecture speech transcription. | 1 | 0.39 | 2016 |
Confidence estimation for speech recognition systems using conditional random fields trained with partially annotated data | 0 | 0.34 | 2016 |
Training data pseudo-shuffling and direct decoding framework for recurrent neural network based acoustic modeling | 1 | 0.35 | 2015 |
Sparse Representation With Temporal Max-Smoothing For Acoustic Event Detection | 1 | 0.37 | 2015 |
Improving denoising auto-encoder based speech enhancement with the speech parameter generation algorithm. | 3 | 0.39 | 2015 |
Speaker Adaptive Training For Deep Neural Networks Embedding Linear Transformation Networks | 3 | 0.39 | 2015 |
Ensemble Speaker Modeling Using Speaker Adaptive Training Deep Neural Network For Speaker Adaptation | 1 | 0.35 | 2015 |
Ensemble environment modeling using affine transform group. | 0 | 0.34 | 2015 |
Signal to noise ratio estimation based on an optimal design of subband voice activity detection | 0 | 0.34 | 2014 |