Using LoRA to Improve Retrieval in Structured Document Collections

Wassenius, Marcus; Junayd, Kader

Using LoRA to Improve Retrieval in Structured Document Collections

Ladda ner

Master_Thesis_Marcus_Wassenius__Junayd_Kader_2026.pdf (6.69 MB)

Publicerad

2026

Författare

Wassenius, Marcus

Junayd, Kader

Typ

Examensarbete för masterexamen
Master's Thesis

Program

Data science and AI (MPDSC), MSc

Sammanfattning

This thesis explores different roles for Low-Rank Adaptation (LoRA) in a question answering (QA) system over structured document collections, and aims to shed light on benefits and limitations as well as rank requirement interactions with dataset size and confusability. Three roles are investigated on synthetic document collections mimicking company data. First, LoRA as parametric memory to internalize new knowledge directly into the model weights; second, LoRA as a tool for embedding adaptation in a retrieval-augmented generation (RAG) system to improve retrieval performance by improving the vector representations of chunks; third, LoRA as a tool for context routing in a RAG system to improve retrieval by narrowing the search space. The findings suggest that LoRA is a flexible tool that can improve QA performance in all three roles, but that each role comes with its own characteristics and trade-offs between for example performance, flexibility, and robustness. In the specific settings tested, parametric memory is shown to be effective and highly rank sensitive; embedding adaptation is shown to give robust retrieval and generation improvements already at low ranks with gradual benefits of increased rank; context routing is shown to give large retrieval benefits at low rank requirements but introduces certain brittleness in both the retrieval and generation step.

Ämne/nyckelord

Low-Rank Adaptation (LoRA), parameter-efficient fine-tuning (PEFT), Retrieval-Augmented Generation (RAG), embedding adaptation, rank scaling, structured document retrieval, context routing

URI

https://hdl.handle.net/20.500.12380/311220

Samlingar

Examensarbeten för masterexamen

Visa fullständig post

Using LoRA to Improve Retrieval in Structured Document Collections

Ladda ner

Publicerad

Författare

Typ

Program

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Beskrivning

Ämne/nyckelord

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

URI

Samlingar

Endorsement

Review

Supplemented By

Referenced By