Cross-modal image feature matching between infrared and visual images. Adapting intra-modal feature matching models for cross-modal matching

Räjert, Tommy

Cross-modal image feature matching between infrared and visual images. Adapting intra-modal feature matching models for cross-modal matching

dc.contributor.author	Räjert, Tommy
dc.contributor.department	Chalmers tekniska högskola / Institutionen för elektroteknik	sv
dc.contributor.examiner	Zach, Christopher
dc.contributor.supervisor	Lochman, Yaroslava
dc.contributor.supervisor	Ringdahl, Viktor
dc.date.accessioned	2024-09-06T06:49:02Z
dc.date.available	2024-09-06T06:49:02Z
dc.date.issued	2024
dc.date.submitted
dc.description.abstract	Abstract Image feature matching is an essential part to various computer vision applications. Many modern solutions apply machine learning techniques to achieve state-of-theart results. A lesser studied problem is matching image features between images of different modalities. This thesis investigates this problem for the visual–LWIR (long-wave infrared) case by utilizing the matching capabilities of the pre-trained intra-modal models SuperPoint and SuperGlue. This is done by adding interfacing models and additional layers to mitigate problems such as catastrophic forgetting and data biasing in the pre-trained models. These techniques prove only marginally successful compared to the pre-trained models themselves. For training these models, a method for sparse pseudo ground truth point correspondence is proposed, and evaluation is done via pose estimation. This thesis provides insight into some specific methods of transfer learning for the SuperPoint and SuperGlue models, methods for ground truth estimation, and discusses the difficulties faced in this problem. Further studying of this problem may be able to construct improved models for LWIR–visual matching, which would enable more reliable methods for cross-modal camera calibration & registration, localization, and image retrieval, with numerous applications in the automotive, defense, and healthcare industries.
dc.identifier.coursecode	EENX30
dc.identifier.uri	https://hdl.handle.net/20.500.12380/308527
dc.language.iso	eng
dc.setspec.uppsok	Technology
dc.subject	Keywords: feature matching, deep learning, computer vision, pose estimation, multimodal, infrared imaging, graph neural networks.
dc.title	Cross-modal image feature matching between infrared and visual images. Adapting intra-modal feature matching models for cross-modal matching
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.degree	Master's Thesis	en
dc.type.uppsok	H
local.programme	Data science and AI (MPDSC), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: Master's Thesis gs.pdf
Size:: 9.6 MB
Format:: Adobe Portable Document Format

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Size:: 2.35 KB
Format:: Item-specific license agreed upon to submission
Description:

Ladda ner

Samlingar

Examensarbeten för masterexamen