Towards Zero-Shot Sign Language Recognition | 0 | 0.34 | 2023 |
Comparison of 2D and 3D attention mechanisms for human (collective) activity recognition | 0 | 0.34 | 2022 |
Top-down and bottom-up attentional multiple instance learning for still image action recognition | 0 | 0.34 | 2022 |
Merging Super Resolution and Attribute Learning for Low-Resolution Person Attribute Recognition | 0 | 0.34 | 2022 |
Using Independently Recurrent Networks For Reinforcement Learning Based Unsupervised Video Summarization | 0 | 0.34 | 2021 |
Multi-Stream Pose Convolutional Neural Networks For Human Interaction Recognition In Images | 0 | 0.34 | 2021 |
Red Carpet To Fight Club: Partially-Supervised Domain Transfer For Face Recognition In Violent Videos | 0 | 0.34 | 2021 |
Leveraging Auxiliary Image Descriptions For Dense Video Captioning | 0 | 0.34 | 2021 |
Collective Sports: A multi-task dataset for collective activity recognition | 2 | 0.38 | 2020 |
Mask Guided Fusion For Group Activity Recognition In Images | 0 | 0.34 | 2019 |
Image Captioning with Unseen Objects | 0 | 0.34 | 2019 |
Region based multi-stream convolutional neural networks for collective activity recognition | 0 | 0.34 | 2019 |
Zero-Shot Sign Language Recognition: Can Textual Data Uncover Sign Languages? | 0 | 0.34 | 2019 |
Zero-Shot Object Detection by Hybrid Region Embedding | 3 | 0.37 | 2018 |
Classification of human poses and orientations with deep learning | 0 | 0.34 | 2018 |
Space-Time Tree Ensemble for Action Recognition and Localization | 0 | 0.34 | 2018 |
Wildest Faces: Face Detection and Recognition in Violent Settings | 0 | 0.34 | 2018 |
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes | 2 | 0.36 | 2018 |
Data-driven image captioning via salient region discovery | 0 | 0.34 | 2017 |
Anomaly detection using improved background subtraction | 0 | 0.34 | 2017 |
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended Abstract) | 1 | 0.38 | 2017 |
Re-evaluating Automatic Metrics for Image Captioning | 2 | 0.35 | 2017 |
Using deep multiple instance learning for action recognition in still images | 1 | 0.35 | 2017 |
Low-level features for visual attribute recognition: An evaluation | 0 | 0.34 | 2016 |
Facial Descriptors for Human Interaction Recognition In Still Images | 8 | 0.53 | 2016 |
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures | 59 | 1.70 | 2016 |
Leveraging Captions in the Wild to Improve Object Detection | 0 | 0.34 | 2016 |
Two-person interaction recognition via spatial multiple instance embedding | 7 | 0.45 | 2015 |
Ensemble of multiple instance classifiers for image re-ranking | 1 | 0.34 | 2014 |
Action Recognition and Localization by Hierarchical Space-Time Segments | 46 | 0.97 | 2013 |
Comparison of clustering methods for pose based video summarization | 0 | 0.34 | 2013 |
Unsupervised learning of discriminative relative visual attributes | 9 | 0.55 | 2012 |
Multiple instance learning for re-ranking of web image search results | 0 | 0.34 | 2012 |
Web-Based Classifiers for Human Action Recognition | 12 | 0.60 | 2012 |
On recognizing actions in still images via multiple features | 11 | 1.39 | 2012 |
Object Recognition and Localization Via Spatial Instance Embedding | 1 | 0.35 | 2010 |
Object, scene and actions: combining multiple features for human action recognition | 144 | 4.39 | 2010 |
Recognizing actions from still images | 37 | 1.75 | 2008 |
Human Action Recognition With Line And Flow Histograms | 26 | 1.26 | 2008 |
Human action recognition using distribution of oriented rectangular patches | 56 | 3.20 | 2007 |
Searching Video for Complex Activities with Finite State Models | 48 | 2.30 | 2007 |