Title
Model-based object pose in 25 lines of code
Abstract
In this paper, we describe a method for finding the pose of an object from a single image. We assume that we can detect and match in the image four or more noncoplanar feature points of the object, and that we know their relative geometry on the object. The method combines two algorithms; the first algorithm,POS (Pose from Orthography and Scaling) approximates the perspective projection with a scaled orthographic projection and finds the rotation matrix and the translation vector of the object by solving a linear system; the second algorithm,POSIT (POS with ITerations), uses in its iteration loop the approximate pose found by POS in order to compute better scaled orthographic projections of the feature points, then applies POS to these projections instead of the original image projections. POSIT converges to accurate pose measurements in a few iterations. POSIT can be used with many feature points at once for added insensitivity to measurement errors and image noise. Compared to classic approaches making use of Newton's method, POSIT does not require starting from an initial guess, and computes the pose using an order of magnitude fewer floating point operations; it may therefore be a useful alternative for real-time operation. When speed is not an issue, POSIT can be written in 25 lines or less in Mathematica; the code is provided in an Appendix.
Year
DOI
Venue
1995
10.1007/BF01450852
International Journal of Computer Vision
Keywords
Field
DocType
Posit,Original Image,Feature Point,Initial Guess,Image Noise
Linear algebra,Computer vision,Rotation matrix,Orthographic projection,Iterative method,Computer science,Orthography,Artificial intelligence,Translation (geometry),Scaling,Source lines of code
Journal
Volume
Issue
ISSN
15
1-2
0920-5691
Citations 
PageRank 
References 
366
50.48
15
Authors
2
Search Limit
100366
Name
Order
Citations
PageRank
Daniel Dementhon11327139.94
Larry S. Davis2142012690.83