Abstract | ||
---|---|---|
Audio-visual matching aims to learn the intrinsic correspondence between image and audio clip. Existing works mainly concentrate on learning discriminative features, while ignore the cross-modal heterogeneous issue between audio and visual modalities. To deal with this issue, we propose a novel Adversarial-Metric Learning (AML) model for audio-visual matching. AML aims to generate a modality-indep... |
Year | DOI | Venue |
---|---|---|
2022 | 10.1109/TMM.2021.3050089 | IEEE Transactions on Multimedia |
Keywords | DocType | Volume |
Visualization,Task analysis,Measurement,Speech recognition,Videos,Location awareness,Image recognition | Journal | 24 |
ISSN | Citations | PageRank |
1520-9210 | 0 | 0.34 |
References | Authors | |
0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Aihua Zheng | 1 | 0 | 4.39 |
Menglan Hu | 2 | 0 | 0.34 |
Bo Jiang | 3 | 119 | 17.21 |
Yan Huang | 4 | 226 | 27.65 |
Yan Yan | 5 | 1 | 0.69 |
Bin Luo | 6 | 802 | 107.57 |