Impact of Training Data Volume on Neural Network Training and Accuracy

REY ALONSO, ALICIA

Impact of Training Data Volume on Neural Network Training and Accuracy

dc.contributor.author	REY ALONSO, ALICIA
dc.contributor.department	Chalmers tekniska högskola / Institutionen för fysik	sv
dc.contributor.department	Chalmers University of Technology / Department of Physics	en
dc.contributor.examiner	Volpe, Giovanni
dc.contributor.supervisor	Björklund, Tomas
dc.date.accessioned	2023-06-21T08:30:41Z
dc.date.available	2023-06-21T08:30:41Z
dc.date.issued	2023
dc.date.submitted	2023
dc.description.abstract	This master thesis explores the impact of data volume on model training and accuracy in the context of neural networks. The study focuses on conducting experiments on two image-based networks performing classification tasks, namely ResNet50 and MobileNetV2. The objective is to investigate the behaviour and accuracy of these networks as they are trained on progressively smaller subsets of the original dataset. With this study, we aim to gain some insight into how neural networks perform under different data availability scenarios. This type of information can become key in decision making processes regarding data collection, model development, and output handling, particularly in situations where data volume is limited. The research begins by establishing a baseline performance of the networks when trained on the entire dataset. Subsequently, various subsets of the original dataset are created by progressively reducing the volume of training data. The performance of the networks is then evaluated using these reduced datasets. This process allows for a comprehensive analysis of the effect of data volume on model training and accuracy. Throughout all of this process, statistical studies will be carried out to verify the robustness of our results, as well as the possible influence the different subsets have on the results. More specifically, the experiments involve training ResNet50 and MobileNetV2 models on subsets of the ImageNet-1K dataset, containing over 1.2 million training images across 1000 categories. The study examines how the reduction in training data volume affects the convergence of the models, as well as their accuracy in classifying images. Furthermore, the evolution of the network’s confidence in its predictions evolves through training.
dc.identifier.coursecode	TIFX05
dc.identifier.uri	http://hdl.handle.net/20.500.12380/306340
dc.language.iso	eng
dc.setspec.uppsok	PhysicsChemistryMaths
dc.subject	data volume, model training, accuracy, neural networks, image-based networks, ResNet50, MobileNetV2.
dc.title	Impact of Training Data Volume on Neural Network Training and Accuracy
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.degree	Master's Thesis	en
dc.type.uppsok	H
local.programme	Physics (MPPHS), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: ReyAlonsoAlicia_MasterThesisReport.pdf
Storlek:: 5.18 MB
Format:: Adobe Portable Document Format

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Storlek:: 2.35 KB
Format:: Item-specific license agreed upon to submission
Beskrivning:

Ladda ner

Samlingar

Examensarbeten för masterexamen