Auto-tuning RISC-V Vectorized convolutions in oneDNN
dc.contributor.author | Jönsson, Samuel | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering | en |
dc.contributor.examiner | Pericàs, Miquel | |
dc.contributor.supervisor | Pericàs, Miquel | |
dc.date.accessioned | 2025-02-25T13:32:42Z | |
dc.date.available | 2025-02-25T13:32:42Z | |
dc.date.issued | 2024 | |
dc.date.submitted | ||
dc.description.abstract | Convolutional Neural Networks are an essential part in deep learning tasks such as image recognition and object detection. The open-source RISC-V ISA with its vector extension offers a great opportunity for optimizing convolution algorithms on high-performance systems. The objective of this thesis was to implement previously vectorized convolution algorithms in a deep learning library called oneDNN and develop a state-of-the-art auto-tuner to further optimize the performance. The algorithms im2col+GEMM were executed and tested on the QEMU emulator running RISC-V Vector Extension 1.0. Four different machine learning models were evaluated for their accuracy and predictive power to auto-tune the algorithms. The results show significant improvement in both execution time and instruction efficiency compared to naive implementations and implementations that didn’t use auto-tuning. The thesis concludes that using a random forest model to auto-tune the convolution algorithm configurations generates the most accurate predictions. Furthermore, the thesis demonstrates the effectiveness of utilizing RISC-V Vector instructions and auto-tuning to optimize im2col+GEMM in oneDNN. | |
dc.identifier.coursecode | DATX05 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/309160 | |
dc.language.iso | eng | |
dc.setspec.uppsok | Technology | |
dc.subject | Convolutions | |
dc.subject | Neural Networks | |
dc.subject | Auto-tuning | |
dc.subject | Vectorized instructions | |
dc.subject | QEMU | |
dc.subject | RISC-V | |
dc.subject | Thesis | |
dc.subject | Computer Engineering | |
dc.subject | Computer Science | |
dc.title | Auto-tuning RISC-V Vectorized convolutions in oneDNN | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | High-performance computer systems (MPHPC), MSc |