Title
Augmented Autoencoders: Implicit 3D Orientation Learning for 6D Object Detection
Abstract
We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain Randomization. This so-called Augmented Autoencoder has several advantages over existing methods: It does not require real, pose-annotated training data, generalizes to various test sensors and inherently handles object and view symmetries. Instead of learning an explicit mapping from input images to object poses, it provides an implicit representation of object orientations defined by samples in a latent space. Our pipeline achieves state-of-the-art performance on the T-LESS dataset both in the RGB and RGB-D domain. We also evaluate on the LineMOD dataset where we can compete with other synthetically trained approaches. We further increase performance by correcting 3D orientation estimates to account for perspective errors when the object deviates from the image center and show extended results. Our code is available here https://github.com/DLR-RM/AugmentedAutoencoder.
Year
DOI
Venue
2020
10.1007/s11263-019-01243-8
International Journal of Computer Vision
Keywords
Field
DocType
6D object detection, Pose estimation, Domain randomization, Autoencoder, Synthetic data, Symmetries
Training set,Computer vision,Object detection,Autoencoder,Computer science,Pose,Synthetic data,Artificial intelligence,RGB color model,Denoising autoencoder
Journal
Volume
Issue
ISSN
128
3
0920-5691
Citations 
PageRank 
References 
9
0.70
0
Authors
4
Name
Order
Citations
PageRank
Martin Sundermeyer1103.08
Zoltan-Csaba Marton214813.78
Maximilian Durner3113.43
Rudolph Triebel471158.20