- Code for CVIP 2022 paper: Visual Localization using Capsule Networks- https://link.springer.com/chapter/10.1007/978-3-031-11346-8_15
- Attaching my master's thesis at IIT Madras on the same subject
- Project code inspired from: https://github.com/hazirbas/poselstm-pytorch
Abstract
Visual localization is the task of camera pose estimation, and is crucial for many technologies which involve localization such as mobile robots and augmented reality. Several convolutional neural network models have been proposed for the task against the more accurate geometry-based computer vision techniques. However, they have several shortcomings and to our knowledge, this was the first effort that explored the use of an alternative architecture based on capsule networks for the task. We achieved better results with capsules than with baseline-CNN PoseNet on a small NORB dataset, modified for the task of camera pose estimation. Feature visualizations for both networks produced more insights into their performance and behavior. We found that there is a scope for improvement and hence propose a few directions for future efforts.