FPGA-implementation av ett neuralt nätverk

dc.contributor.authorLarsson, Hanna
dc.contributor.authorJadeglans, Tim
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.examinerLundin, Peter
dc.date.accessioned2019-07-10T13:37:17Z
dc.date.available2019-07-10T13:37:17Z
dc.date.issued2019sv
dc.date.submitted2019
dc.description.abstractImage recognition is a quickly growing field where convolutional neural networks, CNN, are in the bleeding edge. Today fast GPUs are used which consume a lot of power. Field programmable gate arrays, FPGAs, are more energy efficient per calculation. This report describes an architecture of a convolutional neural network implemented in a field programmable gate array. The main purpose is to design an architecture and demonstrate its functionality in regards to power, speed and resource usage. In order to achieve the architecture, the project has followed general guidelines for a convolutional neural network, with filters that extend over the entire depth of the image. The parameters of the design were adapted for the FPGA used in the project. The dimensions of the memory were adjusted to reduce the number of times each data has to be loaded for each calculation, due to max-pooling. The final architecture, however, resulted in a flexible enough design that is adaptable to other FPGAs. When implemented, the calculations used both data and filters from a limited read-only memory, ROM, the design could use data from the main processor. The computing capacity of the architecture is far below the theoretical capacity of the FPGA. However, there are multiple possibilities for improvements which would improve the computing potential dramatically. To utilize the increased potential, the summing tree used in the architecture can be modified which will potentially double the calculations per clock cycle and optimize the critical data path to further increase the clock speed. Despite these limitations, the current architecture has higher performance-to-power ratio than a GTX 1060.sv
dc.identifier.coursecodeLMTX38sv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/300036
dc.language.isoswesv
dc.setspec.uppsokTechnology
dc.subjectConvolutional Neural Networksv
dc.subjectCNNsv
dc.subjectField Programmable Gate Arraysv
dc.subjectFPGAsv
dc.subjectImage Recognitionsv
dc.titleFPGA-implementation av ett neuralt nätverksv
dc.type.degreeExamensarbete på grundnivåsv
dc.type.uppsokM
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 19-19 CPL Larsson Jadeglans.pdf
Storlek:
2.07 MB
Format:
Adobe Portable Document Format
Beskrivning:
CSE Larsson Jadeglans
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.14 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: