Can Audio Captions Be Evaluated With Image Caption Metrics? | 0 | 0.34 | 2022 |
INVESTIGATING LOCAL AND GLOBAL INFORMATION FOR AUTOMATED AUDIO CAPTIONING WITH TRANSFER LEARNING | 0 | 0.34 | 2021 |
TEXT-TO-AUDIO GROUNDING: BUILDING CORRESPONDENCE BETWEEN CAPTIONS AND SOUND EVENTS | 0 | 0.34 | 2021 |
Building Interpretable Interaction Trees For Deep Nlp Models | 0 | 0.34 | 2021 |
DEPA: Self-Supervised Audio Embedding for Depression Detection | 0 | 0.34 | 2021 |
Enriching Ontology with Temporal Commonsense for Low-Resource Audio Tagging | 0 | 0.34 | 2021 |
Audio Caption in a Car Setting with a Sentence-Level Loss | 0 | 0.34 | 2021 |
Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL. | 0 | 0.34 | 2021 |
Voice Activity Detection in the Wild via Weakly Supervised Sound Event Detection. | 0 | 0.34 | 2020 |
Audio Caption: Listen And Tell | 0 | 0.34 | 2019 |
Text-based Depression Detection: What Triggers An Alert. | 0 | 0.34 | 2019 |
What does a Car-ssette tape tell? | 0 | 0.34 | 2019 |
Detecting and Analysing Spatial-Temporal Aggregation of Flight Turbulence with the QAR Big Data | 0 | 0.34 | 2018 |
Perception of Cantonese tones by Mandarin speakers. | 0 | 0.34 | 2015 |