Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech | 0 | 0.34 | 2022 |
Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning with Spoofing Detection and Spoofing Type Classification | 0 | 0.34 | 2021 |
Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling | 0 | 0.34 | 2020 |
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances. | 1 | 0.34 | 2020 |
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification. | 0 | 0.34 | 2019 |
Self-Adaptive Soft Voice Activity Detection Using Deep Neural Networks for Robust Speaker Verification | 0 | 0.34 | 2019 |
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification. | 1 | 0.34 | 2019 |