Real-time Relevance: RAG with Dynamic Context for Improved Natural Language Responses

Landgren, Malte; Giljegård, Oskar

Real-time Relevance: RAG with Dynamic Context for Improved Natural Language Responses

Ladda ner

CSE 24-42 ML OG.pdf (1.97 MB)

Publicerad

2024

Författare

Landgren, Malte

Giljegård, Oskar

Typ

Examensarbete för masterexamen
Master's Thesis

Program

Computer science – algorithms, languages and logic (MPALG), MSc
Data science and AI (MPDSC), MSc

Sammanfattning

Today’s Retrieval Augmented Generation (RAG) systems often struggle when trying to answer questions that require complex multi-hop reasoning. In this thesis we investigate an autoregressive Large Language Model (LLM) architecture which can generate a real-time relevant dense search vector for every token generation step. To facilitate this we also develop a synthetic data generation technique to acquire search query vector labels on a token-by-token level, requiring only a generating LLM and a document database. We investigate the quality of the synthetic data, and provide an attention based relabeling method which decreases hallucinations, improving the correctness of the labels by 67%. The architecture is able to produce query vectors 27 times faster than a separate embedder at the cost of retrieval accuracy. Finally, we train and employ the model in an active retrieval question-answering setting.

Ämne/nyckelord

LLM, RAG, active retrieval, synthetic data generation, master thesis

URI

http://hdl.handle.net/20.500.12380/308927

Samlingar

Examensarbeten för masterexamen

Visa fullständig post

Real-time Relevance: RAG with Dynamic Context for Improved Natural Language Responses

Ladda ner

Publicerad

Författare

Typ

Program

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Beskrivning

Ämne/nyckelord

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

URI

Samlingar

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced