A Version Oriented Parallel Asynchronous Evolution Strategy for Deep Learning

JANG, MYEONG-JIN

A Version Oriented Parallel Asynchronous Evolution Strategy for Deep Learning

dc.contributor.author	JANG, MYEONG-JIN
dc.contributor.department	Chalmers tekniska högskola / Institutionen för data och informationsteknik	sv
dc.contributor.examiner	Papatriantafilou, Marina
dc.contributor.supervisor	Tsigas, Philippas
dc.date.accessioned	2021-11-16T13:35:36Z
dc.date.available	2021-11-16T13:35:36Z
dc.date.issued	2021	sv
dc.date.submitted	2020
dc.description.abstract	In this work we propose a new parallel asynchronous Evolution Strategy (ES) that outperforms the existing ESs, including the canonical ES and steady-state ES. ES has been considered a competitive alternative solution for optimizing neural networks in deep reinforcement learning, instead of using an optimizer and a backpropagation function. In this thesis, three different ES systems were implemented to compare the performances of each ES implementation. Two ES systems were implemented based on existing ES systems, which are the canonical ES and steady-steady ES, respectively. Lastly, the last ES system is the proposed ES system called Version Oriented Parallel Asynchronous Evolution Strategy (VOPAES). The canonical ES replaces all population individuals at each generation, whereas the steady-state ES replaces only the weakest population with the newly created one. By replacing all population individuals, the canonical ES could optimize the network faster than the steady-state ES. However, it requires synchronization which might increase CPU idle time. On the contrary, a parallel steady-state ES does not require synchronization, but its learning speed could be slower than the parallel canonical ES one. Therefore, we suggest VOPAES as an advanced ES solution that takes the benefits of both the parallel canonical ES and the parallel steady-state ES system. The test results of this work demonstrated that the canonical ES system can be implemented asynchronously using versions. Moreover, by merging the benefits, VOPAES could decrease CPU idle time and maintain high optimization accuracy and speed as the parallel canonical ES system. In conclusion, VOPAES achieved the fastest training speed among the implemented ES systems.	sv
dc.identifier.coursecode	DATX05	sv
dc.identifier.uri	https://hdl.handle.net/20.500.12380/304364
dc.language.iso	eng	sv
dc.setspec.uppsok	Technology
dc.subject	Reinforcement Learning	sv
dc.subject	Parallelism	sv
dc.subject	Evolution Strategy	sv
dc.subject	Back-propagation	sv
dc.subject	Asynchronous	sv
dc.subject	Optimization	sv
dc.title	A Version Oriented Parallel Asynchronous Evolution Strategy for Deep Learning	sv
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.uppsok	H
local.programme	Computer systems and networks (MPCSN), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: CSE 21-153 Jang.pdf
Storlek:: 2.09 MB
Format:: Adobe Portable Document Format
Beskrivning:

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Storlek:: 1.51 KB
Format:: Item-specific license agreed upon to submission
Beskrivning:

Ladda ner

Samlingar

Examensarbeten för masterexamen