Evaluation of Conditional Recurrent Generative Adversarial Networks forMultivariate Time-Series Augmentation

Carlsson, Anna

Evaluation of Conditional Recurrent Generative Adversarial Networks forMultivariate Time-Series Augmentation

dc.contributor.author	Carlsson, Anna
dc.contributor.department	Chalmers tekniska högskola / Institutionen för matematiska vetenskaper	sv
dc.contributor.examiner	Lundh, Torbjörn
dc.contributor.supervisor	Lundh, Torbjörn
dc.contributor.supervisor	Dammert, Patrik
dc.contributor.supervisor	Warston, Håkan
dc.date.accessioned	2020-06-30T13:42:11Z
dc.date.available	2020-06-30T13:42:11Z
dc.date.issued	2020	sv
dc.date.submitted	2020
dc.description.abstract	A successful application of any machine learning algorithmis dependent on a sufficiently large training dataset, preferably class-balanced and correctly labeled. However, in many applications, the collection and labeling of data is time-consuming, expensive, and might require special security precautions if the data is of a sensitive nature. Therefore, different types of augmentation methods are commonly used. For time-series data, traditional augmentation methods such as rotation, translation, and flipping are not applicable. In applications where the dataset consists of time-series data, other augmentation methods are therefore of interest. In this thesis, the usage of generative adversarial networks (GANs) as an augmentation method for univariate and multivariate time-series data is investigated. Both recurrent and conditional recurrent GANs are examined. Apart from constructing architectures for time-series generation, the thesis focuses on finding suitable methods for evaluating the quality of the generated data. To monitor the training progress and select a suitable generator model to simulate synthetic data from, two distance-based kernel metrics are used: maximum mean discrepancy (MMD) and energy distance (ED). To evaluate the sample quality and diversity of the generated data, several experiments are performed where a classifier is trained on real, tested on synthetic data (TRTS), trained on synthetic, tested on real data (TSTR), and lastly trained and tested on a mixture of real and synthetic data (TMTM). Furthermore, experiments aiming to examine the usage of synthetic samples from conditional recurrent GANs to augment a real dataset are performed. The results indicate that the GANs successfully generates highly realistic samples, both of simpler time-series and more complexmultivariate time-series. However, the time-series seem to not aid a classifier to any large extent when added to real data, even when larger proportions of synthetic data are added. A possible explanation for this is that the synthetic data, although consisting of realistic samples, suffers from loss of in-class diversity and boundary distortion.	sv
dc.identifier.coursecode	MVEX03	sv
dc.identifier.uri	https://hdl.handle.net/20.500.12380/301113
dc.language.iso	eng	sv
dc.setspec.uppsok	PhysicsChemistryMaths
dc.subject	deep learning, generative adversarial networks, generative models,multivariate timeseries classification, maximummean discrepancy, energy distance, covariate shift, boundary distortion	sv
dc.title	Evaluation of Conditional Recurrent Generative Adversarial Networks forMultivariate Time-Series Augmentation	sv
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.uppsok	H
local.programme	Engineering mathematics and computational science (MPENM), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: Master's Thesis Anna Carlsson.pdf
Storlek:: 4.46 MB
Format:: Adobe Portable Document Format
Beskrivning:

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Storlek:: 1.14 KB
Format:: Item-specific license agreed upon to submission
Beskrivning:

Ladda ner

Samlingar

Examensarbeten för masterexamen