Energy Efficiency of Convolutional Neural Network Inference on FPGAs and Accelerated GPUs

ÜNALACAK, SEDAT; JOTHI SINGAARAVADIVELU, JOSHYKA

dc.contributor.author	ÜNALACAK, SEDAT
dc.contributor.author	JOTHI SINGAARAVADIVELU, JOSHYKA
dc.contributor.department	Chalmers tekniska högskola / Institutionen för data och informationsteknik	sv
dc.contributor.examiner	Larsson-Edefors, Per
dc.contributor.supervisor	Petersen Moura Trancoso, Pedro
dc.date.accessioned	2021-09-21T10:11:36Z
dc.date.available	2021-09-21T10:11:36Z
dc.date.issued	2021	sv
dc.date.submitted	2020
dc.description.abstract	The energy efficiency of convolutional neural networks (CNNs) can be improved by using low-precision data types. FPGAs and GPUs are widely used to implement CNN inference because of their parallel processing capabilities. Some GPU-based SoCs include accelerator cores that perform low-precision operations efficiently for certain data types, whereas FPGAs can be configured to carry out operations of arbitrary bit width. This thesis examines and compares the energy efficiency of FPGAs and accelerated GPUs for low-precision CNN inference applications. We implemented convolution, fully connected and pooling building blocks for CNN inference on both platforms, verified their functionality, and measured their performance, comparing the two platforms with each other and with the state of the art. The accelerator cores on our GPU-based SoC improved energy efficiency for some design cases at the expense of increased latency and base power consumption. Depending on the design parameters and the type of layer, the FPGA provided up to 23.11 times better energy efficiency, 28.31 times lower power consumption and 6.59 times lower latency than the accelerated GPU, while the GPU provided up to 1.64 times better operational energy efficiency. The FPGA achieved even higher energy efficiency for a variety of low bit-width data types that the accelerated GPU cannot process. The accelerated GPU delivered reasonable energy efficiency and required comparatively less design time. We also include a detailed analysis of the effects of the design parameters on energy efficiency.	sv
dc.identifier.coursecode	DATX05	sv
dc.identifier.uri	https://hdl.handle.net/20.500.12380/304178
dc.language.iso	eng	sv
dc.setspec.uppsok	Technology
dc.subject	Accelerator	sv
dc.subject	CNN	sv
dc.subject	Convolution	sv
dc.subject	Energy Efficiency	sv
dc.subject	FPGA	sv
dc.subject	Fully Connected	sv
dc.subject	GPU	sv
dc.subject	HLS	sv
dc.subject	Pooling	sv
dc.subject	TensorRT	sv
dc.title	Energy Efficiency of Convolutional Neural Network Inference on FPGAs and Accelerated GPUs	sv
dc.type.degree	Master's thesis (Examensarbete för masterexamen)	sv
dc.type.uppsok	H
local.programme	Embedded electronic system design (MPEES), MSc

Original bundle

Name: CSE 21-129 Sedat Joshyka.pdf
Size: 5.62 MB
Format: Adobe Portable Document Format

Collections

Master's theses (Examensarbeten för masterexamen)