Title
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching
Abstract
Audio-visual matching aims to learn the intrinsic correspondence between an image and an audio clip. Existing works mainly concentrate on learning discriminative features while ignoring the cross-modal heterogeneity between the audio and visual modalities. To deal with this issue, we propose a novel Adversarial-Metric Learning (AML) model for audio-visual matching. AML aims to generate a modality-indep...
Year
2022
DOI
10.1109/TMM.2021.3050089
Venue
IEEE Transactions on Multimedia
Keywords
Visualization, Task analysis, Measurement, Speech recognition, Videos, Location awareness, Image recognition
DocType
Journal
Volume
24
ISSN
1520-9210
Citations
0
PageRank
0.34
References
0
Authors
6
Name          Order  Citations  PageRank
Aihua Zheng   1      0          4.39
Menglan Hu    2      0          0.34
Bo Jiang      3      119        17.21
Yan Huang     4      226        27.65
Yan Yan       5      1          0.69
Bin Luo       6      802        107.57