Combinatorial Optimization with Reinforcement Learning

Persson Hijazi, Aladdin; Persson, Sanna

Combinatorial Optimization with Reinforcement Learning

Ladda ner

CSE 23-137 AP SP.pdf (784.63 KB)

Publicerad

2023

Författare

Persson Hijazi, Aladdin

Persson, Sanna

Typ

Examensarbete för masterexamen
Master's Thesis

Program

Data science and AI (MPDSC), MSc
Computer science – algorithms, languages and logic (MPALG), MSc

Sammanfattning

This master’s thesis delves into the topic of solving combinatorial optimization problems with methods based on reinforcement learning, and specifically, we explore the potential of iterative route decoding and gradient updates in enhancing the performance of route decoding. In this context, route decoding refers to determining the most efficient route for a set of destinations, a combinatorial optimization problem often encountered in logistics and transportation planning. We introduce two methods for iteratively updating solutions for the heterogeneous capacitated vehicle routing problems. They are built upon a reinforcement learning algorithm with an attention graph encoder and use previously computed routes for an instance to improve solution quality. Our results show improved performance, in particular, on out-of-distribution data, which suggests the practical applicability of the methods. In particular, our results show that a pre-trained route planner can, with a few gradient updates with a policy gradient method, significantly improve on out-ofdistribution data.

Ämne/nyckelord

Combinatorial optimization, reinforcement learning

URI

http://hdl.handle.net/20.500.12380/307496

Samlingar

Examensarbeten för masterexamen

Visa fullständig post

Combinatorial Optimization with Reinforcement Learning

Ladda ner

Publicerad

Författare

Typ

Program

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Beskrivning

Ämne/nyckelord

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

URI

Samlingar

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced