Self-Supervised Learning for Sentiment Analysis via Image-Text Matching. | 0 | 0.34 | 2022 |
SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification | 0 | 0.34 | 2021 |
Video Question Answering with Phrases via Semantic Roles | 0 | 0.34 | 2021 |
Every Pixel Counts ++: Joint Learning of Geometry and Motion with 3D Holistic Understanding | 5 | 0.52 | 2020 |
CPARR: Category-based Proposal Analysis for Referring Relationships | 0 | 0.34 | 2020 |
Video Object Grounding using Semantic Roles in Language Description | 0 | 0.34 | 2020 |
Deep, Landmark-Free FAME: Face Alignment, Modeling, and Expression Estimation | 8 | 0.50 | 2019 |
Zero-Shot Grounding Of Objects From Natural Language Queries | 6 | 0.42 | 2019 |
ExpNet: Landmark-Free, Deep, 3D Facial Expressions | 6 | 0.47 | 2018 |
Revisiting Temporal Modeling for Video-based Person ReID. | 3 | 0.37 | 2018 |
MSRC: Multimodal Spatial Regression with Semantic Context for Phrase Grounding. | 2 | 0.36 | 2018 |
RED: Reinforced Encoder-Decoder Networks for Action Anticipation. | 2 | 0.36 | 2017 |
Cascaded Boundary Regression for Temporal Action Detection. | 14 | 0.58 | 2017 |
Image Set Classification Via Template Triplets And Context-Aware Similarity Embedding | 0 | 0.34 | 2016 |
Pronet: Learning To Propose Object-Specific Boxes For Cascaded Neural Networks | 10 | 0.54 | 2016 |
Abstraction hierarchy and self annotation update for fine grained activity recognition. | 2 | 0.42 | 2016 |
Tag-Based Video Retrieval By Embedding Semantic Content In A Continuous Word Space | 3 | 0.67 | 2016 |
Learning Action Concept Trees And Semantic Alignment Networks From Image-Description Data | 0 | 0.34 | 2016 |
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images. | 36 | 1.10 | 2015 |
Beyond Pedestrians: A Hybrid Approach of Tracking Multiple Articulating Humans | 3 | 0.36 | 2015 |
Automatic Concept Discovery From Parallel Text and Visual Corpora | 23 | 0.78 | 2015 |
A Robust Adaptive Classifier for Detector Adaptation in a Video | 0 | 0.34 | 2015 |
DISCOVER: Discovering Important Segments for Classification of Video Events and Recounting | 19 | 0.63 | 2014 |
Multi-State Discriminative Video Segment Selection For Complex Event Classification | 0 | 0.34 | 2014 |
Multi-Target Tracking by Online Learning a CRF Model of Appearance and Motion Patterns | 35 | 0.84 | 2014 |
Semantic Aware Video Transcription Using Random Forest Classifiers | 8 | 0.48 | 2014 |
The 2014 SESAME Multimedia Event Detection and Recounting System. | 0 | 0.34 | 2014 |
Multi class boosted random ferns for adapting a generic object detector to a specific video | 3 | 0.36 | 2014 |
ISOMER: Informative Segment Observations for Multimedia Event Recounting | 6 | 0.46 | 2014 |
Video segmentation and feature co-occurrences for activity classification | 2 | 0.36 | 2014 |
Hierarchical abnormal event detection by real time and semi-real time multi-tasking video surveillance system | 14 | 0.58 | 2014 |
Large-scale web video event classification by use of Fisher Vectors | 40 | 1.15 | 2013 |
Efficient Detector Adaptation for Object Detection in a Video | 17 | 0.61 | 2013 |
Conditional Bayesian networks for action detection | 0 | 0.34 | 2013 |
Robust multi-pose face tracking by multi-stage tracklet association. | 2 | 0.38 | 2012 |
Efficient incremental learning of boosted classifiers for object detection | 1 | 0.35 | 2012 |
Online learned discriminative part-based appearance models for multi-human tracking | 19 | 0.82 | 2012 |
Simultaneous inference of activity, pose and object | 2 | 0.40 | 2012 |
AVSS 2011 demo session: A systems level approach to perimeter protection | 0 | 0.34 | 2011 |
Learning neighborhood cooccurrence statistics of sparse features for human activity recognition | 10 | 0.63 | 2011 |
Segmentation of objects in a detection window by Nonparametric Inhomogeneous CRFs | 5 | 0.43 | 2011 |
The SESAME MED System. | 0 | 0.34 | 2011 |
Learning affinities and dependencies for multi-target tracking using a CRF model | 58 | 1.73 | 2011 |
Simultaneous tracking and action recognition for single actor human actions | 8 | 0.46 | 2011 |
Action recognition in cluttered dynamic scenes using Pose-Specific Part Models | 23 | 0.86 | 2011 |
High performance object detection by collaborative learning of Joint Ranking of Granules features | 49 | 1.85 | 2010 |
Dynamics Based Trajectory Segmentation for UAV videos | 4 | 0.49 | 2010 |
Multiple pose context trees for estimating human pose in object context | 4 | 0.43 | 2010 |
Multi-target tracking by on-line learned discriminative appearance models | 138 | 4.35 | 2010 |
Learning 3d Action Models From A Few 2d Videos For View Invariant Action Recognition | 23 | 0.71 | 2010 |