Ego4D: Around the World in 3,000 Hours of Egocentric Video | 0 | 0.34 | 2022 |
Environment Predictive Coding for Visual Navigation | 0 | 0.34 | 2022 |
PONI: Potential Functions for ObjectGoal Navigation with Interaction-free Learning | 0 | 0.34 | 2022 |
Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation | 0 | 0.34 | 2022 |
Learning to Set Waypoints for Audio-Visual Navigation | 0 | 0.34 | 2021 |
Learning Dexterous Grasping with Object-Centric Visual Affordances | 0 | 0.34 | 2021 |
Semantic Audio-Visual Navigation | 0 | 0.34 | 2021 |
Modeling Fashion Influence From Photos | 0 | 0.34 | 2021 |
Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos | 0 | 0.34 | 2021 |
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency | 0 | 0.34 | 2021 |
Listen To Look: Action Recognition By Previewing Audio | 4 | 0.47 | 2020 |
From Paris to Berlin: Discovering Fashion Style Influences Around the World | 0 | 0.34 | 2020 |
You2Me: Inferring Body Pose in Egocentric Video via First and Second Person Interactions | 0 | 0.34 | 2020 |
Don't Judge an Object by Its Context: Learning to Overcome Contextual Bias | 0 | 0.34 | 2020 |
Learning Affordance Landscapes for Interaction Exploration in 3D Environments | 0 | 0.34 | 2020 |
Densifying Supervision for Fine-Grained Visual Comparisons | 0 | 0.34 | 2020 |
Ego-Topo: Environment Affordances From Egocentric Video | 0 | 0.34 | 2020 |
Co-Separating Sounds Of Visual Objects | 4 | 0.47 | 2019 |
Click Carving: Interactive Object Segmentation in Images and Videos with Point Clicks | 0 | 0.34 | 2019 |
Thinking Outside the Pool: Active Training Image Creation for Relative Attributes | 4 | 0.47 | 2019 |
Less Is More: Learning Highlight Detection From Video Duration | 5 | 0.39 | 2019 |
Extreme Relative Pose Estimation For RGB-D Scans Via Scene Completion | 2 | 0.36 | 2019 |
End-to-end policy learning for active visual categorization | 3 | 0.37 | 2019 |
Kernel Transformer Networks For Compact Spherical Convolution | 3 | 0.40 | 2018 |
BlockDrop: Dynamic Inference Paths in Residual Networks | 18 | 0.60 | 2018 |
Learning Image Representations Tied to Egomotion from Unlabeled Video | 6 | 0.68 | 2017 |
Pixel Objectness | 0 | 0.34 | 2017 |
Unsupervised learning through one-shot image-based shape reconstruction | 3 | 0.36 | 2017 |
Efficient Activity Detection in Untrimmed Video with Max-Subgraph Search | 4 | 0.43 | 2017 |
Learning Spherical Convolution for Fast Features from 360° Imagery | 11 | 0.49 | 2017 |
Flat2Sphere: Learning Spherical Convolution for Fast Features from 360° Imagery | 4 | 0.47 | 2017 |
Learning to look around | 0 | 0.34 | 2017 |
Guest Editorial: Best of CVPR 2015 | 0 | 0.34 | 2017 |
Learning Compressible 360° Video Isomers | 0 | 0.34 | 2017 |
Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing | 7 | 0.47 | 2017 |
Crowdsourcing in Computer Vision | 5 | 0.53 | 2016 |
Video Analysis for Body-worn Cameras in Law Enforcement | 3 | 0.39 | 2016 |
Active Image Segmentation Propagation | 11 | 0.50 | 2016 |
From One-Trick Ponies to All-Rounders: On-Demand Learning for Image Restoration | 0 | 0.34 | 2016 |
Object-Centric Representation Learning From Unlabeled Videos | 4 | 0.37 | 2016 |
Dense Supervision for Visual Comparisons via Synthetic Images | 0 | 0.34 | 2016 |
Pull the Plug? Predicting If Computers or Humans Should Segment Images | 1 | 0.35 | 2016 |
Detecting Engagement In Egocentric Video | 10 | 0.49 | 2016 |
Text detection in stores using a repetition prior | 4 | 0.45 | 2016 |
Learning Image Representations Tied to Ego-Motion | 58 | 2.01 | 2015 |
Action and Attention in First-person Vision | 0 | 0.34 | 2015 |
Predicting Important Objects for Egocentric Video Summarization | 45 | 1.17 | 2015 |
Just Noticeable Differences in Visual Attributes | 12 | 0.51 | 2015 |
Slow And Steady Feature Analysis: Higher Order Temporal Coherence In Video | 39 | 0.90 | 2015 |
Detecting Snap Points In Egocentric Video With A Web Photo Prior | 31 | 0.88 | 2014 |