Utilizing heterogeneity to allocate ML tasks for increased efficiency
dc.contributor.author | Carling, Lukas | |
dc.contributor.author | Villing, Max | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering | en |
dc.contributor.examiner | Pathan, Risat | |
dc.contributor.supervisor | Petersen Moura Trancoso, Pedro | |
dc.date.accessioned | 2023-12-20T09:41:10Z | |
dc.date.available | 2023-12-20T09:41:10Z | |
dc.date.issued | 2023 | |
dc.date.submitted | 2023 | |
dc.description.abstract | There is a growing interest in using heterogeneous hardware and resource allocation to boost the efficiency of software applications. Proper use of both imposes additional burdens on software development. We look at characterizing some common machine learning tasks with regards to CPU-GPU systems, specifically for the NVIDIA Orin, in order to try and predict what conditions will give the highest performance and energy-efficiency. We then take an iterative approach for allocating said tasks to hardware and selecting resources based on our characterization, with either performance or energy-efficiency as a goal. We find that while there is room for improvement on per-task predictions there are various possibilities to gain significant benefits to performance and energy by properly utilizing hardware heterogeneity and resource allocation. Additional exploration of domain specific accelerators such as tensor cores shows significant potential for accelerating convolutions. | |
dc.identifier.coursecode | DATX05 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/307447 | |
dc.language.iso | eng | |
dc.setspec.uppsok | Technology | |
dc.subject | heterogeneous hardware | |
dc.subject | resource allocation | |
dc.subject | convolutional neural network | |
dc.subject | NVIDIA | |
dc.subject | Jetson ORIN | |
dc.subject | tensor cores | |
dc.subject | machine learning | |
dc.title | Utilizing heterogeneity to allocate ML tasks for increased efficiency | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | High-performance computer systems (MPHPC), MSc |