On-Device Scalable Image-Based Localization via Prioritized Cascade Search and Fast One-Many RANSAC

Abstract

We present the design of an entire on-device system for large-scale urban localization using images. The proposed design integrates compact image retrieval and 2D–3D correspondence search to estimate the location in extensive city regions. Our design is GPS agnostic and does not require network connection. In order to overcome the resource constraints of mobile devices, we propose a system design that leverages the scalability advantage of image retrieval and accuracy of 3D model-based localization. Furthermore, we propose a new hashing-based cascade search for fast computation of 2D–3D correspondences. In addition, we propose a new one-many RANSAC for accurate pose estimation. The new one-many RANSAC addresses the challenge of repetitive building structures (e.g. windows and balconies) in urban localization. Extensive experiments demonstrate that our 2D–3D correspondence search achieves the state-of-the-art localization accuracy on multiple benchmark datasets. Furthermore, our experiments on a large Google street view image dataset show the potential of large-scale localization entirely on a typical mobile device.

Publication
In IEEE Transactions on Image Processing 2018
Anh-Dzung Doan
Anh-Dzung Doan
Postdoctoral Researcher

My research interests lie in the area of robotic vision, at the intersection of robotics, computer vision, and machine learning.