Natural Language Processing and Large Language Models for Automation of Compliance Tracing

dc.contributor.authorForsell, Maximilian
dc.contributor.authorErlandsson Hollgren, Eric
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.departmentChalmers University of Technology / Department of Computer Science and Engineeringen
dc.contributor.examinerBerger, Christian
dc.contributor.supervisorBosch, Jan
dc.date.accessioned2025-11-19T07:02:05Z
dc.date.issued2025
dc.date.submitted
dc.description.abstractCompliance is a costly and time-consuming task that most, if not all, firms must perform. As such, automating parts of the compliance process could be highly valuable. This thesis aims to investigate challenges faced by European software-intensive firms in their compliance processes, identify automation opportunities, and develop a Natural Language Processing- and Large Language Model-based software artifact to automate compliance tracing between company guidelines and normative requirements. The thesis followed the Design Science Research approach, and as such, the research was conducted in close collaboration with industry practitioners. The challenges and automation opportunities were identified together with seven interviewees from four different companies, and the final software artifact, dubbed TraceAlign, was developed and evaluated in focus groups with a total of twelve unique participants from two companies. The identified challenges ranged from organizational- and management-related to specifics inherent to the specifications of normative requirements. Automation opportunities related mainly to the management of requirements, company guidelines, and compliance evidence, of which this thesis focuses specifically on the task of compliance tracing of company guidelines to normative requirements. The final software artifact, TraceAlign, was considered to be time- and cost-saving by the focus group participants, but could perhaps be made more accurate. We conclude that there are many challenges with compliance that could potentially be automated using Natural Language Processing and Large Language Models.
dc.identifier.coursecodeDATX05
dc.identifier.urihttp://hdl.handle.net/20.500.12380/310757
dc.language.isoeng
dc.relation.ispartofseriesCSE 25-63
dc.setspec.uppsokTechnology
dc.subjectCompliance, Design Science Research, Large language model, Natural language processing, Artificial Intelligence, Knowledge graph, Requirements tracing
dc.titleNatural Language Processing and Large Language Models for Automation of Compliance Tracing
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeData science and AI (MPDSC), MSc
local.programmeSoftware engineering and technology (MPSOF), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 25-63 MF EEH.pdf
Storlek:
4.35 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: