Multi-agent Reinforcement Learning for predator-prey ecosystem
Hämtar...
Publicerad
Författare
Typ
Examensarbete för masterexamen
Master's Thesis
Master's Thesis
Modellbyggare
Tidskriftstitel
ISSN
Volymtitel
Utgivare
Sammanfattning
Agent-based ecosystem models provide a flexible way to study population dynamics
by simulating the behaviour of individual organisms. However, manually specifying
realistic animal behaviour can be difficult, especially when agents must balance
multiple needs and interact with other species. This thesis investigates the use of
multi-agent reinforcement learning for simulating a simplified predator-prey ecosystem
in a two-dimensional grid-based environment.
A custom ecosystem environment was developed in which prey and predator agents
interact with spatially distributed resources, including multiple grass types and water
reservoirs. The model includes survival constraints based on energy, thirst, age,
movement, reproduction, predation, and resource consumption. In contrast to simpler
predator-prey simulations, the environment requires prey agents to balance
grazing and drinking, creating a simple form of migration between food and water
resources. Age-dependent movement speed was also introduced as a way to model
increased vulnerability among young and old individuals.
Several learning configurations were evaluated. In particular, the thesis compares a
standard survival reward with a homeostatic reward based on internal energy and
water levels, as well as two training setups for handling agent death. The results
indicate that the homeostatic reward might improve early learning however the results
were not statistically significant. The respawn based training environment
significantly improves training efficiency compared with the standard setup. Comparisons
between PPO, TRPO, TQC and hand coded agents showed broadly similar
performance after extended training. Trained agents were also evaluated on satellitederived
terrain maps, where no significant reduction in performance measure was
observed compared with Perlin noise-generated maps.
The results suggest that multi-agent reinforcement learning can be used to generate
stable predator-prey dynamics in a spatially structured ecosystem while producing
useful simulation statistics such as population trends, survival distributions, reproduction
patterns, and spatial movement heatmaps.
Beskrivning
Ämne/nyckelord
multi-agent reinforcement learning, predator-prey, lotka-volterra
