Title
A new benchmark for pose estimation with ground truth from virtual reality.
Abstract
The development of programming paradigms for industrial assembly currently gets fresh impetus from approaches in human demonstration and programmingby-demonstration. Major low-and mid-level prerequisites for machine vision and learning in these intelligent robotic applications are pose estimation, stereo reconstruction and action recognition. As a basis for the machine vision and learning involved, pose estimation is used for deriving object positions and orientations and thus target frames for robot execution. Our contribution introduces and applies a novel benchmark for typical multi-sensor setups and algorithms in the field of demonstration- based automated assembly. The benchmark platform is equipped with a multi-sensor setup consisting of stereo cameras and depth scanning devices (see Fig. 1). The dimensions and abilities of the platform have been chosen in order to reflect typical manual assembly tasks. Following the eRobotics methodology, a simulatable 3D representation of this platform was modelled in virtual reality. Based on a detailed camera and sensor simulation, we generated a set of benchmark images and point clouds with controlled levels of noise as well as ground truth data such as object positions and time stamps. We demonstrate the application of the benchmark to evaluate our latest developments in pose estimation, stereo reconstruction and action recognition and publish the benchmark data for objective comparison of sensor setups and algorithms in industry.
Year
DOI
Venue
2014
10.1007/s11740-014-0552-0
Production Engineering
Keywords
Field
DocType
Industrial assembly, Machine vision, Machine learning, Virtual reality
Computer vision,Stereo cameras,Virtual reality,Programming paradigm,Machine vision,Pose,Ground truth,Artificial intelligence,Engineering,Robot,Point cloud
Journal
Volume
Issue
ISSN
8
6
0944-6524
Citations 
PageRank 
References 
2
0.37
27
Authors
9