Safe Multi-Robot Planning Via Long-Run Averages

dc.contributor.authorEmbaye, Eyob
dc.contributor.authorDaun, Johan
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.departmentChalmers University of Technology / Department of Computer Science and Engineeringen
dc.contributor.examinerPiterman, Nir
dc.contributor.supervisorGautier, Anna Louise
dc.date.accessioned2026-07-02T13:16:14Z
dc.date.issued2026
dc.date.submitted
dc.description.abstractConstrained reinforcement learning in Markov decision processes (MDPs) has received increasing attention for its use in sequential decision making problems with safety requirements. This study investigated safe planning via long run average reward using MDPs. This thesis uses grid-world environments and builds on the Triple-QA framework [1]. Three approaches are evaluated: A single-agent baseline and two multi agent extensions, a trivial joint-state extension, and separate Q-table approach. The results show that the single agent algorithm reproduces the result found in the original framework and serves as a reliable baseline. The joint state space extension suffers from poor scalability due to exponential growth in the state action space, and therefore does not achieve comparable reward per agent as the baseline. In contrast, the separate Q-table approach scales significantly better and achieves a level comparable to the single agent case both in an environment with and without agent interaction. Although the result of two agents with trivial extension and separate Q-table was achieved to satisfy the constraint, the test with three agents did not satisfy for both algorithms.
dc.identifier.urihttps://hdl.handle.net/20.500.12380/311812
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectComputer, science, computer science, engineering, multi-agent, reinforce ment learning, project, thesis.
dc.titleSafe Multi-Robot Planning Via Long-Run Averages
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeComputer science -algorithms, languages and logic (MPALG), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 26-118 EE JD.pdf
Size:
1.61 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Size:
2.35 KB
Format:
Item-specific license agreed upon to submission
Description: