Title
Urban Localization With Street Views Using A Convolutional Neural Network For End-To-End Camera Pose Regression
Abstract
This paper presents an end-to-end real-time monocular absolute localization approach that uses Google Street View panoramas as a prior source of information to train a Convolutional Neural Network (CNN). We propose an adaptation of the PoseNet architecture [8] to a sparse database of panoramas. We show that we can expand the latter by synthesizing new images and consequently improve the accuracy of the pose regressor. The main advantage of our method is that it does not require a first passage of an equipped vehicle to build a map. Moreover, the offline data generation and CNN training are automatic and does not require the input of an operator. In the online phase, the approach only uses one camera for localization and regresses poses in a global frame. The conducted experiments show that augmenting the training set as presented in this paper drastically improves the accuracy of the CNN. The results, when compared to a handcrafted feature-based approach, are less accurate (around 7.5 to 8 m against 2.5 to 3 m) but also less dependent on the position of the camera inside the vehicle. Furthermore, our CNN-based method computes the pose approximately 40 times faster (75 ms per image instead of 3 s) than the handcrafted approach.
Year
DOI
Venue
2019
10.1109/IVS.2019.8813892
2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19)
Field
DocType
ISSN
Training set,Computer vision,Regression,Convolutional neural network,End-to-end principle,Computer science,Operator (computer programming),Artificial intelligence,Monocular,Test data generation
Conference
1931-0587
Citations 
PageRank 
References 
0
0.34
0
Authors
4
Name
Order
Citations
PageRank
Guillaume Bresson1205.83
Li Yu200.34
Cyril Joly334.46
Fabien Moutarde45415.26