Abstract |
---|
Most existing work that grounds natural language phrases in images starts with the assumption that the phrase in question is relevant to the image. In this paper we address a more realistic version of the natural language grounding task where we must both identify whether the phrase is relevant to an image and localize the phrase. This can also be viewed as a generalization of object ... |
Year | DOI | Venue |
---|---|---|
2022 | 10.1109/TPAMI.2020.3029008 | IEEE Transactions on Pattern Analysis and Machine Intelligence |
Keywords | DocType | Volume |
---|---|---|
Task analysis, Grounding, Visualization, Feature extraction, Benchmark testing, Detectors, Vocabulary | Journal | 44 |
Issue | ISSN | Citations |
---|---|---|
4 | 0162-8828 | 0 |
PageRank | References | Authors |
---|---|---|
0.34 | 0 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Bryan A. Plummer | 1 | 76 | 8.15 |
Kevin J. Shih | 2 | 0 | 0.34 |
Yichen Li | 3 | 0 | 1.69 |
Ke Xu | 4 | 0 | 0.34 |
Svetlana Lazebnik | 5 | 7379 | 449.66 |
Stan Sclaroff | 6 | 5631 | 705.89 |
Kate Saenko | 7 | 4478 | 202.48 |