Real-time Relevance: RAG with Dynamic Context for Improved Natural Language Responses
dc.contributor.author | Landgren, Malte | |
dc.contributor.author | Giljegård, Oskar | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering | en |
dc.contributor.examiner | Johansson, Richard | |
dc.contributor.supervisor | Johansson, Richard | |
dc.date.accessioned | 2024-10-17T13:49:18Z | |
dc.date.available | 2024-10-17T13:49:18Z | |
dc.date.issued | 2024 | |
dc.date.submitted | ||
dc.description.abstract | Today’s Retrieval Augmented Generation (RAG) systems often struggle when trying to answer questions that require complex multi-hop reasoning. In this thesis we investigate an autoregressive Large Language Model (LLM) architecture which can generate a real-time relevant dense search vector for every token generation step. To facilitate this we also develop a synthetic data generation technique to acquire search query vector labels on a token-by-token level, requiring only a generating LLM and a document database. We investigate the quality of the synthetic data, and provide an attention based relabeling method which decreases hallucinations, improving the correctness of the labels by 67%. The architecture is able to produce query vectors 27 times faster than a separate embedder at the cost of retrieval accuracy. Finally, we train and employ the model in an active retrieval question-answering setting. | |
dc.identifier.coursecode | DATX05 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/308927 | |
dc.language.iso | eng | |
dc.setspec.uppsok | Technology | |
dc.subject | LLM | |
dc.subject | RAG | |
dc.subject | active retrieval | |
dc.subject | synthetic data generation | |
dc.subject | master thesis | |
dc.title | Real-time Relevance: RAG with Dynamic Context for Improved Natural Language Responses | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | Computer science – algorithms, languages and logic (MPALG), MSc | |
local.programme | Data science and AI (MPDSC), MSc |