Processing-in-Memory based CNN Acceleration: Application Characterization and Simulation
dc.contributor.author | Tang, Lexuan | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering | en |
dc.contributor.examiner | Larsson-Edefors, Per | |
dc.contributor.supervisor | Petersen Moura Trancoso, Pedro | |
dc.contributor.supervisor | Wang, Xu | |
dc.date.accessioned | 2025-05-21T12:13:48Z | |
dc.date.issued | 2024 | |
dc.date.submitted | ||
dc.description.abstract | Convolutional neural networks (CNN) are widely used in different machine learning tasks, especially computer vision. Large amounts of computation in CNN cause intensive data movement, which makes traditional compute-centric architectures (CPU and GPU) less capable. There is a demand for new memory-centric architecture, such as processing-in-memory (PIM). Two subcategories of PIM, processing near memory (PNM) and processing using memory (PUM) are discussed in this report. Starting from the application level, we characterize the layers. Then we use the PNM simulator, ramulator-pim, to simulate CPU and PNM architecture and use the PUM simulator, DNN+NeuroSim, to simulate PUM architecture. The CNN models we use are VGG8, UNet, ResNet18 and MobileNetV3. Based on the characteristics of software and advantages of hardware, we aim to accelerate CNN with PIM technologies and estimate the performance improvement by using PIM technologies. By observing the behavior of different layers on CPU and PNM architecture, we find the correlation between application characteristics and the performance improvement we can get from PNM. The characteristics are memory footprints, number of floating point operations and arithmetic intensity. PUM is another promising technology that provides high speedup compared to CPU with a cost of energy. | |
dc.identifier.coursecode | DATX05 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/309319 | |
dc.language.iso | eng | |
dc.relation.ispartofseries | CSE 24-136 | |
dc.setspec.uppsok | Technology | |
dc.subject | Processing-in-memory, processing near memory, processing using memory, CNN accelerator | |
dc.title | Processing-in-Memory based CNN Acceleration: Application Characterization and Simulation | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | Embedded electronic system design (MPEES), MSc |