Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training | 0 | 0.34 | 2022 |
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training | 0 | 0.34 | 2022 |
Unpaired Image Captioning With semantic-Constrained Self-Learning | 0 | 0.34 | 2022 |
Responsive Listening Head Generation: A Benchmark Dataset and Baseline. | 0 | 0.34 | 2022 |
Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection | 0 | 0.34 | 2022 |
3D Cascade RCNN: High Quality Object Detection in Point Clouds | 0 | 0.34 | 2022 |
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning. | 0 | 0.34 | 2022 |
Dynamic Temporal Filtering in Video Models. | 0 | 0.34 | 2022 |
Building GC-free Key-value Store on HM-SMR Drives with ZoneFS | 0 | 0.34 | 2022 |
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing | 0 | 0.34 | 2022 |
SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement. | 0 | 0.34 | 2022 |
Stand-Alone Inter-Frame Attention in Video Models | 0 | 0.34 | 2022 |
X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics | 0 | 0.34 | 2021 |
Boosting Video Representation Learning with Multi-Faceted Integration | 0 | 0.34 | 2021 |
ComQA: Compositional Question Answering via Hierarchical Graph Neural Networks | 0 | 0.34 | 2021 |
A Style and Semantic Memory Mechanism for Domain Generalization*. | 0 | 0.34 | 2021 |
CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising | 1 | 0.35 | 2021 |
Motion-Focused Contrastive Learning of Video Representations*. | 0 | 0.34 | 2021 |
Optimization Planning for 3D ConvNets | 0 | 0.34 | 2021 |
Condensing a Sequence to One Informative Frame for Video Recognition. | 0 | 0.34 | 2021 |
Transferrable Contrastive Learning for Visual Domain Adaptation | 0 | 0.34 | 2021 |
Scheduled Sampling In Vision-Language Pretraining With Decoupled Encoder-Decoder Network | 0 | 0.34 | 2021 |
Smart Director: An Event-Driven Directing System for Live Broadcasting | 1 | 0.39 | 2021 |
Multi-Lingual Question Generation with Language Agnostic Language Model. | 0 | 0.34 | 2021 |
Single Shot Video Object Detector | 3 | 0.37 | 2021 |
Noise Augmented Double-Stream Graph Convolutional Networks for Image Captioning | 7 | 0.46 | 2021 |
Representing Videos as Discriminative Sub-graphs for Action Recognition | 0 | 0.34 | 2021 |
Seco: Exploring Sequence Supervision For Unsupervised Representation Learning | 0 | 0.34 | 2021 |
Joint Contrastive Learning with Infinite Possibilities | 0 | 0.34 | 2020 |
Learning a Unified Sample Weighting Network for Object Detection | 0 | 0.34 | 2020 |
Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation | 0 | 0.34 | 2020 |
iDirector: An Intelligent Directing System for Live Broadcast | 1 | 0.35 | 2020 |
Exploring Depth Information for Spatial Relation Recognition | 0 | 0.34 | 2020 |
Deep Metric Learning With Density Adaptivity. | 1 | 0.35 | 2020 |
Coarse-to-Fine Localization of Temporal Action Proposals | 1 | 0.35 | 2020 |
MatrixKV - Reducing Write Stalls and Write Amplification in LSM-tree Based KV Stores with Matrix Container in NVM. | 0 | 0.34 | 2020 |
Neural Question Generation with Answer Pivot. | 0 | 0.34 | 2020 |
Transferring and Regularizing Prediction for Semantic Segmentation | 0 | 0.34 | 2020 |
X-Linear Attention Networks for Image Captioning | 12 | 0.54 | 2020 |
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation. | 2 | 0.40 | 2019 |
Hierarchy Parsing For Image Captioning | 15 | 0.57 | 2019 |
SEALDB: An Efficient LSM-tree based KV Store on SMR Drives with Sets and Dynamic Bands | 2 | 0.37 | 2019 |
Learning Spatio-Temporal Representation With Local And Global Diffusion | 8 | 0.44 | 2019 |
Pointing Novel Objects In Image Captioning | 4 | 0.41 | 2019 |
Editorial to Special Issue on Deep Learning for Intelligent Multimedia Analytics. | 0 | 0.34 | 2019 |
Deep Learning–Based Multimedia Analytics: A Review | 1 | 0.39 | 2019 |
vireoJD-MM at Activity Detection in Extended Videos. | 0 | 0.34 | 2019 |
daBNN: A Super Fast Inference Framework for Binary Neural Networks on ARM devices | 7 | 0.44 | 2019 |
Transferrable Prototypical Networks For Unsupervised Domain Adaptation | 20 | 0.53 | 2019 |
Exploring Object Relation In Mean Teacher For Cross-Domain Detection | 13 | 0.49 | 2019 |