Relative Pose Regression using Non-Square CNN Kernels: Estimation of translation, rotation and scaling between image pairs with custom layers

Publicerad

Typ

Examensarbete för masterexamen

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Localisation is a research field where handcrafted and complex engineering methods have so far given the best results. However, a general trend in computer science is that data-driven approaches often outperform classical methods. These datadriven approaches have been enabled by better high-resolution range sensors and vast amounts of data. End-to-end deep neural network approaches have much better scalability, but the state of the art localisation networks still use designs originally developed for image classification. These are necessarily not the best-suited design, as they aim to achieve a different task than relative pose regression. We show that custom input layers designed to predict translation, rotation or scaling between images are a possible solution. The networks tested in this thesis give good results, but do not outperform a baseline neural network with the same design as a neural network initially made for image classification.

Beskrivning

Ämne/nyckelord

Convolutional Neural Network, Kernel, Layer, Localisation, Machine Learning, Relative Pose Regression, Visual Odometry

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced