Safe Multi-Robot Planning Via Long-Run Averages

Embaye, Eyob; Daun, Johan

dc.contributor.author	Embaye, Eyob
dc.contributor.author	Daun, Johan
dc.contributor.department	Chalmers tekniska högskola / Institutionen för data och informationsteknik	sv
dc.contributor.department	Chalmers University of Technology / Department of Computer Science and Engineering	en
dc.contributor.examiner	Piterman, Nir
dc.contributor.supervisor	Gautier, Anna Louise
dc.date.accessioned	2026-07-02T13:16:14Z
dc.date.issued	2026
dc.date.submitted
dc.description.abstract	Constrained reinforcement learning in Markov decision processes (MDPs) has received increasing attention for its use in sequential decision-making problems with safety requirements. This thesis investigates safe planning via long-run average reward in MDPs, using grid-world environments and building on the Triple-QA framework [1]. Three approaches are evaluated: a single-agent baseline and two multi-agent extensions, namely a trivial joint-state extension and a separate Q-table approach. The results show that the single-agent algorithm reproduces the result of the original framework and serves as a reliable baseline. The joint-state extension scales poorly because of exponential growth in the state-action space, and it therefore does not achieve a reward per agent comparable to the baseline. In contrast, the separate Q-table approach scales significantly better and achieves a per-agent reward comparable to the single-agent case, both in environments with and without agent interaction. With two agents, both the trivial extension and the separate Q-table approach satisfied the constraint; with three agents, neither algorithm did.
dc.identifier.uri	https://hdl.handle.net/20.500.12380/311812
dc.language.iso	eng
dc.setspec.uppsok	Technology
dc.subject	Computer, science, computer science, engineering, multi-agent, reinforcement learning, project, thesis.
dc.title	Safe Multi-Robot Planning Via Long-Run Averages
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.degree	Master's Thesis	en
dc.type.uppsok	H
local.programme	Computer science - algorithms, languages and logic (MPALG), MSc

Original bundle

Name: CSE 26-118 EE JD.pdf
Size: 1.61 MB
Format: Adobe Portable Document Format
