Learning to Play Games from Multiple Imperfect Teachers

Karlsson, John

Learning to Play Games from Multiple Imperfect Teachers

Ladda ner

Primär fil 203067.pdf (595.92 KB)

Publicerad

2014

Författare

Karlsson, John

Typ

Examensarbete för masterexamen
Master Thesis

Program

Complex adaptive systems (MPCAS), MSc

Sammanfattning

This project evaluates the modularity of a recent Bayesian Inverse Reinforcement Learning approach [1] by inferring the sub-goals correlated with winning board games from observations of a set of agents. A feature based architecture is proposed together with a method for generating the reward function space, making inference tractable in large state spaces and allowing for the combination with models that approximate stateaction values. Further, a policy prior is suggested that allows for least squares policy evaluation using sample trajectories. The model is evaluated on randomly generated environments and on Tic-tac-toe, showing that a combination of the intentions inferred from all agents can generate strategies that outperform the corresponding strategies from each individual agent.

Ämne/nyckelord

Data- och informationsvetenskap, Computer and Information Science

URI

https://hdl.handle.net/20.500.12380/203067

Samlingar

Examensarbeten för masterexamen

Visa fullständig post

Learning to Play Games from Multiple Imperfect Teachers

Ladda ner

Publicerad

Författare

Typ

Program

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Beskrivning

Ämne/nyckelord

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

URI

Samlingar

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced