Guiding Column Generation using Deep Reinforcement Learning Trainee and Training Device Optimization

Lindén, Anton

Guiding Column Generation using Deep Reinforcement Learning Trainee and Training Device Optimization

dc.contributor.author	Lindén, Anton
dc.contributor.department	Chalmers tekniska högskola / Institutionen för fysik	sv
dc.contributor.department	Chalmers University of Technology / Department of Physics	en
dc.contributor.examiner	Granath, Mats
dc.contributor.supervisor	Wojciechowski, Adam
dc.date.accessioned	2023-09-20T07:29:08Z
dc.date.available	2023-09-20T07:29:08Z
dc.date.issued	2023
dc.date.submitted	2023
dc.description.abstract	Many optimization problems can be formulated as a Integer Linear Program (ILP), which is an optimization problem that involves minimizing or maximizing a linear objective function subject to linear constraints and integrality requirements. Some examples include train scheduling, airline crew scheduling and production planning. ILP models with an exponentially growing set of variables are often solved using an algorithm known as Column Generation (CG). CG iteratively improves the objective function value by generating new variables, or columns, without considering every possible variable in the ILP model. This thesis was performed together with Jeppe sen, a Boeing company, and investigated the possibility of using Deep Reinforcement Learning (DRL) to generate new variables in CG for a scheduling problem for air line pilots. Results show that it is possible to teach an agent a policy that slightly improves the quality of the generated variables in this specific problem. However, it is still unclear whether or not the benefits of using DRL outweighs the extra effort of setting up and training an agent.
dc.identifier.coursecode	TIFX05
dc.identifier.uri	http://hdl.handle.net/20.500.12380/307063
dc.language.iso	eng
dc.setspec.uppsok	PhysicsChemistryMaths
dc.subject	Scheduling Problem, Integer Linear Programming, Column Generation, Graph Neural Networks, Deep Reinforcement Learning.
dc.title	Guiding Column Generation using Deep Reinforcement Learning Trainee and Training Device Optimization
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.degree	Master's Thesis	en
dc.type.uppsok	H
local.programme	Complex adaptive systems (MPCAS), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: masters_thesis_Anton_Linden.pdf
Storlek:: 1.72 MB
Format:: Adobe Portable Document Format

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Storlek:: 2.35 KB
Format:: Item-specific license agreed upon to submission
Beskrivning:

Ladda ner

Samlingar

Examensarbeten för masterexamen