Debloating Machine Learning Systems
dc.contributor.author | Sildnik, Mihkel | |
dc.contributor.author | Wang, Yan | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.examiner | Crnkovic, Ivica | |
dc.contributor.supervisor | Ali-Eldin Hassan, Ahmed | |
dc.contributor.supervisor | Leitner, Philipp | |
dc.date.accessioned | 2021-06-29T08:05:06Z | |
dc.date.available | 2021-06-29T08:05:06Z | |
dc.date.issued | 2021 | sv |
dc.date.submitted | 2020 | |
dc.description.abstract | The size and complexity of software systems tend to grow over time. As a side effect, this growth can lead to the accumulation of unused code, also known as bloat. In this study, we assess the prevalence of bloat in Machine Learning (ML) systems, give an overview of a selection of existing debloating tools, and study their applicability to workloads in this field. To assess the tools, we run a number of experiments on five different ML models that are written using the PyTorch library. The debloating target is a Docker image containing the ML library and other dependencies required besides the model itself and the dataset. Cimplifier is the only tool we tested that was able to generate working images. While the literature in the field of debloating suggests a possible reduction in metrics such as memory usage or power consumption, our testing only shows a reduction in storage size. Most of the removed files are parts of the Nvidia CUDA toolkit and the Intel Math Kernel Library. To summarize, Cimplifier gives promising results when it comes to storage reductions (around 50%) but is unable to impact other metrics such as GPU usage, power consumption or workload runtime. | sv |
dc.identifier.coursecode | MPALG | sv |
dc.identifier.uri | https://hdl.handle.net/20.500.12380/302760 | |
dc.language.iso | eng | sv |
dc.setspec.uppsok | Technology | |
dc.subject | Computer | sv |
dc.subject | science | sv |
dc.subject | computer science | sv |
dc.subject | machine learning | sv |
dc.subject | bloat | sv |
dc.subject | debloating | sv |
dc.subject | project | sv |
dc.subject | thesis | sv |
dc.title | Debloating Machine Learning Systems | sv |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.uppsok | H |