Name | Affiliation | Papers
QI WU | Univ Bath, Dept Comp Sci, Media Technol Res Ctr, Bath BA2 7AY, Avon, England | 94
Collaborators | Citations | PageRank
201 | 396 | 41.54
Referers | Referees | References
1130 | 1675 | 993
Title | Citations | PageRank | Year
TOAN: Target-Oriented Alignment Network for Fine-Grained Image Categorization With Few Labeled Samples | 1 | 0.40 | 2022
Collaborative Feature Learning for Gait Recognition Under Cloth Changes | 0 | 0.34 | 2022
ForeSI: Success-Aware Visual Navigation Agent | 0 | 0.34 | 2022
Maintaining Reasoning Consistency in Compositional Visual Question Answering. | 0 | 0.34 | 2022
Robust Learning From Noisy Web Images Via Data Purification for Fine-Grained Recognition | 0 | 0.34 | 2022
HOP: History-and-Order Aware Pretraining for Vision-and-Language Navigation | 0 | 0.34 | 2022
UniMiSS: Universal Medical Self-supervised Learning via Breaking Dimensionality Barrier | 0 | 0.34 | 2022
Show, Price and Negotiate: A Negotiator With Online Value Look-Ahead | 0 | 0.34 | 2022
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering | 0 | 0.34 | 2022
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation | 0 | 0.34 | 2022
Co-LDL: A Co-Training-Based Label Distribution Learning Method for Tackling Label Noise | 0 | 0.34 | 2022
Diagnosing Vision-and-Language Navigation: What Really Matters | 0 | 0.34 | 2022
A Simple and Robust Correlation Filtering Method for Text-Based Person Search. | 0 | 0.34 | 2022
V2C: Visual Voice Cloning | 0 | 0.34 | 2022
Recognizing Gaits Across Walking and Running Speeds | 0 | 0.34 | 2022
Visual Grounding Via Accumulated Attention | 0 | 0.34 | 2022
Neighbor-view Enhanced Model for Vision and Language Navigation | 0 | 0.34 | 2021
Towards Accurate Text-based Image Captioning with Content Diversity Exploration | 0 | 0.34 | 2021
R-GAN: Exploring Human-like Way for Reasonable Text-to-Image Synthesis via Generative Adversarial Networks | 0 | 0.34 | 2021
VLN BERT - A Recurrent Vision-and-Language BERT for Navigation. | 0 | 0.34 | 2021
The Road to Know-Where - An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. | 0 | 0.34 | 2021
Sketch, Ground, and Refine: Top-Down Dense Video Captioning | 0 | 0.34 | 2021
Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention. | 0 | 0.34 | 2021
Referring Expression Comprehension: A Survey of Methods and Datasets | 0 | 0.34 | 2021
Language-Guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning | 3 | 0.40 | 2021
Image editing with varying intensities of processing | 0 | 0.34 | 2021
How To Train Your Agent To Read And Write | 0 | 0.34 | 2021
Chop Chop BERT - Visual Question Answering by Chopping VisualBERT's Heads. | 0 | 0.34 | 2021
Optimistic Agent: Accurate Graph-Based Value Estimation For More Successful Visual Navigation | 0 | 0.34 | 2021
Jo-SRC: A Contrastive Approach for Combating Noisy Labels | 0 | 0.34 | 2021
Confidence-Aware Non-Repetitive Multimodal Transformers For Textcaps | 0 | 0.34 | 2021
Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation | 0 | 0.34 | 2021
Simple Is Not Easy: A Simple Strong Baseline For Textvqa And Textcaps | 0 | 0.34 | 2021
Learning Dual Encoding Model for Adaptive Visual Understanding in Visual Dialogue | 1 | 0.36 | 2021
Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression | 0 | 0.34 | 2021
CogTree - Cognition Tree Loss for Unbiased Scene Graph Generation. | 0 | 0.34 | 2021
MeisterMorxrc at SemEval-2020 Task 9 - Fine-Tune Bert and Multitask Learning for Sentiment Analysis of Code-Mixed Tweets. | 0 | 0.34 | 2020
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments | 2 | 0.43 | 2020
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning | 7 | 0.44 | 2020
Length-Controllable Image Captioning | 3 | 0.46 | 2020
DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue | 0 | 0.34 | 2020
Analysis of Plantar Pressure Image Based on Flexible Force-Sensitive Sensor Array | 0 | 0.34 | 2020
Sub-Instruction Aware Vision-and-Language Navigation. | 0 | 0.34 | 2020
Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering | 1 | 0.37 | 2020
AIML at VQA-Med 2020 - Knowledge Inference via a Skeleton-based Sentence Mapping Approach for Medical Domain Visual Question Answering. | 0 | 0.34 | 2020
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge | 0 | 0.34 | 2020
Image and Sentence Matching via Semantic Concepts and Order Learning. | 0 | 0.34 | 2020
Intelligent Home 3D: Automatic 3D-House Design from Linguistic Descriptions Only | 1 | 0.35 | 2020
Dualvd: An Adaptive Dual Encoding Model For Deep Visual Understanding In Visual Dialogue | 0 | 0.34 | 2020
Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-Modal Retrieval | 5 | 0.38 | 2020