Real-time Relevance: RAG with Dynamic Context for Improved Natural Language Responses
| dc.contributor.author | Landgren, Malte | |
| dc.contributor.author | Giljegård, Oskar | |
| dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
| dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering | en |
| dc.contributor.examiner | Johansson, Richard | |
| dc.contributor.supervisor | Johansson, Richard | |
| dc.date.accessioned | 2024-10-17T13:49:18Z | |
| dc.date.available | 2024-10-17T13:49:18Z | |
| dc.date.issued | 2024 | |
| dc.date.submitted | ||
| dc.description.abstract | Today’s Retrieval Augmented Generation (RAG) systems often struggle when trying to answer questions that require complex multi-hop reasoning. In this thesis we investigate an autoregressive Large Language Model (LLM) architecture which can generate a real-time relevant dense search vector for every token generation step. To facilitate this we also develop a synthetic data generation technique to acquire search query vector labels on a token-by-token level, requiring only a generating LLM and a document database. We investigate the quality of the synthetic data, and provide an attention based relabeling method which decreases hallucinations, improving the correctness of the labels by 67%. The architecture is able to produce query vectors 27 times faster than a separate embedder at the cost of retrieval accuracy. Finally, we train and employ the model in an active retrieval question-answering setting. | |
| dc.identifier.coursecode | DATX05 | |
| dc.identifier.uri | http://hdl.handle.net/20.500.12380/308927 | |
| dc.language.iso | eng | |
| dc.setspec.uppsok | Technology | |
| dc.subject | LLM | |
| dc.subject | RAG | |
| dc.subject | active retrieval | |
| dc.subject | synthetic data generation | |
| dc.subject | master thesis | |
| dc.title | Real-time Relevance: RAG with Dynamic Context for Improved Natural Language Responses | |
| dc.type.degree | Examensarbete för masterexamen | sv |
| dc.type.degree | Master's Thesis | en |
| dc.type.uppsok | H | |
| local.programme | Computer science – algorithms, languages and logic (MPALG), MSc | |
| local.programme | Data science and AI (MPDSC), MSc |
