Playtesting Match 3 Games with PPO
Loading...
Date
Authors
Type
Examensarbete för masterexamen
Master's Thesis
Master's Thesis
Programme
Model builders
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The training of proximal policy optimization agents with action masking on stochastic match-3 environments is explored in this thesis. A performant, feature-rich
match-3 simulator is developed, and experiments demonstrate improved performance over a random policy on both seen and unseen levels. Furthermore, the
best generalization performance is achieved when training is done by sampling levels from a subset of levels.
Description
Keywords
Reinforcement learning, match-3
