Using LoRA to Improve Retrieval in Structured Document Collections

dc.contributor.authorWassenius, Marcus
dc.contributor.authorJunayd, Kader
dc.contributor.departmentChalmers tekniska högskola / Institutionen för matematiska vetenskapersv
dc.contributor.examinerZuyev, Sergei
dc.contributor.supervisorZuyev, Sergei
dc.date.accessioned2026-06-11T14:17:02Z
dc.date.issued2026
dc.date.submitted
dc.description.abstractThis thesis explores different roles for Low-Rank Adaptation (LoRA) in a question answering (QA) system over structured document collections, and aims to shed light on benefits and limitations as well as rank requirement interactions with dataset size and confusability. Three roles are investigated on synthetic document collections mimicking company data. First, LoRA as parametric memory to internalize new knowledge directly into the model weights; second, LoRA as a tool for embedding adaptation in a retrieval-augmented generation (RAG) system to improve retrieval performance by improving the vector representations of chunks; third, LoRA as a tool for context routing in a RAG system to improve retrieval by narrowing the search space. The findings suggest that LoRA is a flexible tool that can improve QA performance in all three roles, but that each role comes with its own characteristics and trade-offs between for example performance, flexibility, and robustness. In the specific settings tested, parametric memory is shown to be effective and highly rank sensitive; embedding adaptation is shown to give robust retrieval and generation improvements already at low ranks with gradual benefits of increased rank; context routing is shown to give large retrieval benefits at low rank requirements but introduces certain brittleness in both the retrieval and generation step.
dc.identifier.coursecodeMVEX03
dc.identifier.urihttps://hdl.handle.net/20.500.12380/311220
dc.language.isoeng
dc.setspec.uppsokPhysicsChemistryMaths
dc.subjectLow-Rank Adaptation (LoRA), parameter-efficient fine-tuning (PEFT), Retrieval-Augmented Generation (RAG), embedding adaptation, rank scaling, structured document retrieval, context routing
dc.titleUsing LoRA to Improve Retrieval in Structured Document Collections
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeData science and AI (MPDSC), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
Master_Thesis_Marcus_Wassenius__Junayd_Kader_2026.pdf
Size:
6.69 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Size:
2.35 KB
Format:
Item-specific license agreed upon to submission
Description: