Multi-agent Reinforcement Learning for predator-prey ecosystem

Palak, Michal

Multi-agent Reinforcement Learning for predator-prey ecosystem

Ladda ner

Primär fil Master_Thesis Michal Palak_2026.pdf (2.06 MB)

Publicerad

2026

Författare

Palak, Michal

Typ

Examensarbete för masterexamen
Master's Thesis

Program

Engineering mathematics and computational science (MPENM), MSc

Sammanfattning

Agent-based ecosystem models provide a flexible way to study population dynamics by simulating the behaviour of individual organisms. However, manually specifying realistic animal behaviour can be difficult, especially when agents must balance multiple needs and interact with other species. This thesis investigates the use of multi-agent reinforcement learning for simulating a simplified predator-prey ecosystem in a two-dimensional grid-based environment. A custom ecosystem environment was developed in which prey and predator agents interact with spatially distributed resources, including multiple grass types and water reservoirs. The model includes survival constraints based on energy, thirst, age, movement, reproduction, predation, and resource consumption. In contrast to simpler predator-prey simulations, the environment requires prey agents to balance grazing and drinking, creating a simple form of migration between food and water resources. Age-dependent movement speed was also introduced as a way to model increased vulnerability among young and old individuals. Several learning configurations were evaluated. In particular, the thesis compares a standard survival reward with a homeostatic reward based on internal energy and water levels, as well as two training setups for handling agent death. The results indicate that the homeostatic reward might improve early learning however the results were not statistically significant. The respawn based training environment significantly improves training efficiency compared with the standard setup. Comparisons between PPO, TRPO, TQC and hand coded agents showed broadly similar performance after extended training. Trained agents were also evaluated on satellitederived terrain maps, where no significant reduction in performance measure was observed compared with Perlin noise-generated maps. The results suggest that multi-agent reinforcement learning can be used to generate stable predator-prey dynamics in a spatially structured ecosystem while producing useful simulation statistics such as population trends, survival distributions, reproduction patterns, and spatial movement heatmaps.

Ämne/nyckelord

multi-agent reinforcement learning, predator-prey, lotka-volterra

URI

https://hdl.handle.net/20.500.12380/311134

Samlingar

Examensarbeten för masterexamen

Visa fullständig post

Multi-agent Reinforcement Learning for predator-prey ecosystem

Ladda ner

Publicerad

Författare

Typ

Program

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Beskrivning

Ämne/nyckelord

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

URI

Samlingar

Endorsement

Review

Supplemented By

Referenced By