Sparse MLP for Image Recognition: Is Self-Attention Really Necessary? | 0 | 0.34 | 2022 |
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion. | 0 | 0.34 | 2022 |
When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism. | 0 | 0.34 | 2022 |
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration. | 0 | 0.34 | 2021 |
Self-Supervised Visual Representations Learning by Contrastive Mask Prediction. | 0 | 0.34 | 2021 |
Multi-Scale Group Transformer for Long Sequence Modeling in Speech Separation. | 0 | 0.34 | 2020 |