Multi-Agent Deep Reinforcement Learning in a Three-Species Predator-Prey Ecosystem

Karlsson, Tobias

Multi-Agent Deep Reinforcement Learning in a Three-Species Predator-Prey Ecosystem

dc.contributor.author	Karlsson, Tobias
dc.contributor.department	Chalmers tekniska högskola / Institutionen för data och informationsteknik	sv
dc.contributor.examiner	Axelson-Fisk, Marina
dc.contributor.supervisor	Strannegård, Claes
dc.date.accessioned	2021-07-01T13:04:08Z
dc.date.available	2021-07-01T13:04:08Z
dc.date.issued	2021	sv
dc.date.submitted	2020
dc.description.abstract	In computational biology, population dynamics in simulated ecosystems is one im portant topic. Standard mathematical tools of population dynamics such as systems of differential equations are typically incapable of accounting for a variety of impor tant attributes, such as the intelligent and adaptive behavior of individual agents in complex environments. Thus, they are often insufficient to simulate dynamics in real-world ecosystems. In this thesis, a three-species predator-prey simulated ecosystem was implemented in the Unity game engine. Agents in the ecosystem were trained through multi-agent reinforcement learning. The population dynam ics were then analysed with respect to the Lotka-Volterra predator-prey equations which are described by several parameters and assumptions regarding the responses of the parameters to changing population densities. The responses of the parameters were estimated through simulation experiments. It was found that the population dynamics of an ecosystem with trained agents exhibited Lotka-Volterra cycles where a random policy agent ecosystem failed to do so. Further, it was shown that the observed responses of the parameters did not fulfill the Lotka-Volterra assumptions, but rather showed properties that could be argued to be more realistic. For the reinforcement learning, a reward system was introduced as the happiness network, which incorporated both the external and internal state of the animat, inspired by behavioral science of real-world animals. This reward system was shown to perform better than a simple reward system with a positive reward for eating food and a negative for dying, in some environments and was argued to have benefits in more complex ecosystems.	sv
dc.identifier.coursecode	MPDSC	sv
dc.identifier.uri	https://hdl.handle.net/20.500.12380/302922
dc.language.iso	eng	sv
dc.setspec.uppsok	Technology
dc.subject	animats	sv
dc.subject	multi-agent reinforcement learning	sv
dc.subject	ecosystems	sv
dc.subject	lotka-volterra	sv
dc.subject	predator-prey	sv
dc.title	Multi-Agent Deep Reinforcement Learning in a Three-Species Predator-Prey Ecosystem	sv
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.uppsok	H

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: CSE 21-56 Karlsson.pdf
Storlek:: 8.3 MB
Format:: Adobe Portable Document Format
Beskrivning:

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Storlek:: 1.51 KB
Format:: Item-specific license agreed upon to submission
Beskrivning:

Ladda ner

Samlingar

Examensarbeten för masterexamen