AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios | 0 | 0.34 | 2022 |
A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation | 0 | 0.34 | 2022 |
Adaptive Logit Adjustment Loss for Long-Tailed Visual Recognition. | 0 | 0.34 | 2022 |
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior | 0 | 0.34 | 2022 |
Infergrad: Improving Diffusion Models for Vocoder by Considering Inference in Training | 0 | 0.34 | 2022 |
ProphetChat: Enhancing Dialogue Generation with Simulation of Future Conversation | 0 | 0.34 | 2022 |
Analyzing and Mitigating Interference in Neural Architecture Search. | 0 | 0.34 | 2022 |
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech. | 0 | 0.34 | 2022 |
ReLyMe: Improving Lyric-to-Melody Generation by Incorporating Lyric-Melody Relationships | 0 | 0.34 | 2022 |
Revisiting Over-Smoothness in Text to Speech | 0 | 0.34 | 2022 |
A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System. | 0 | 0.34 | 2022 |
DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial Vector-Quantized Auto-Encoders | 0 | 0.34 | 2022 |
Transformer-S2A: Robust and Efficient Speech-to-Animation | 0 | 0.34 | 2022 |
ADASPEECH 2: ADAPTIVE TEXT TO SPEECH WITH UNTRANSCRIBED DATA | 0 | 0.34 | 2021 |
MusicBERT - Symbolic Music Understanding with Large-Scale Pre-Training. | 0 | 0.34 | 2021 |
MIXSPEECH: DATA AUGMENTATION FOR LOW-RESOURCE AUTOMATIC SPEECH RECOGNITION | 0 | 0.34 | 2021 |
LIGHTSPEECH: LIGHTWEIGHT AND FAST TEXT TO SPEECH WITH NEURAL ARCHITECTURE SEARCH | 0 | 0.34 | 2021 |
A Tutorial on AI Music Composition | 0 | 0.34 | 2021 |
Uwspeech: Speech To Speech Translation For Unwritten Languages | 0 | 0.34 | 2021 |
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition. | 0 | 0.34 | 2021 |
AdaSpeech: Adaptive Text to Speech for Custom Voice | 0 | 0.34 | 2021 |
A Survey on Low-Resource Neural Machine Translation. | 0 | 0.34 | 2021 |
Speech-T: Transducer for Text to Speech and Beyond. | 0 | 0.34 | 2021 |
Mixing or Extracting? Further Exploring Necessity of Music Separation for Singer Identification | 0 | 0.34 | 2021 |
NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search | 2 | 0.42 | 2021 |
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network | 0 | 0.34 | 2021 |
Songmass: Automatic Song Writing With Pre-Training And Alignment Constraint | 0 | 0.34 | 2021 |
FastCorrect 2 - Fast Error Correction on Multiple Candidates for Automatic Speech Recognition. | 0 | 0.34 | 2021 |
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech | 0 | 0.34 | 2021 |
DENOISPEECH: DENOISING TEXT TO SPEECH WITH FRAME-LEVEL NOISE MODELING | 0 | 0.34 | 2021 |
PopMAG: Pop Music Accompaniment Generation | 1 | 0.48 | 2020 |
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation | 1 | 0.35 | 2020 |
LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition | 4 | 0.43 | 2020 |
DeepSinger: Singing Voice Synthesis with Data Mined From the Web | 2 | 0.40 | 2020 |
MPNet: Masked and Permuted Pre-training for Language Understanding | 0 | 0.34 | 2020 |
A Study of Non-autoregressive Model for Sequence Generation | 1 | 0.35 | 2020 |
DualLip: A System for Joint Lip Reading and Generation | 0 | 0.34 | 2020 |
XiaoiceSing: A High-Quality and Integrated Singing Voice Synthesis System | 0 | 0.34 | 2020 |
Neural Machine Translation with Error Correction | 1 | 0.36 | 2020 |
MultiSpeech: Multi-Speaker Text to Speech with Transformer | 0 | 0.34 | 2020 |
Semi-Supervised Neural Architecture Search | 0 | 0.34 | 2020 |
FastSpeech: Fast, Robust and Controllable Text to Speech. | 5 | 0.40 | 2019 |
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion. | 1 | 0.36 | 2019 |
Multilingual Neural Machine Translation with Knowledge Distillation. | 1 | 0.36 | 2019 |
Multilingual Neural Machine Translation with Language Clustering | 1 | 0.34 | 2019 |
Almost Unsupervised Text to Speech and Automatic Speech Recognition. | 0 | 0.34 | 2019 |
Tied Transformers: Neural Machine Translation with Shared Encoder and Decoder | 2 | 0.36 | 2019 |
Beyond Error Propagation: Language Branching Also Affects the Accuracy of Sequence Generation | 0 | 0.34 | 2019 |
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion. | 0 | 0.34 | 2019 |
Deliberation Learning for Image-to-Image Translation. | 0 | 0.34 | 2019 |