Visual Question Answering Dataset for Bilingual Image Understanding: A Study of Cross-Lingual Transfer Using Attention Maps. | 0 | 0.34 | 2018 |
TokyoTech-AIST at TRECVID 2017 - Multimedia Event Detection Using Deep CNNs and Zero-Shot Classiers. | 0 | 0.34 | 2017 |
TokyoTech at TRECVID 2016. | 0 | 0.34 | 2016 |