Unsupervised Outlier Detection in Software Engineering

Larsson, Henrik; Lindqvist, Erik

Unsupervised Outlier Detection in Software Engineering

Ladda ner

Primär fil 216772.pdf (1.05 MB)

Publicerad

2014

Författare

Larsson, Henrik

Lindqvist, Erik

Typ

Examensarbete för masterexamen
Master Thesis

Program

Software engineering and technology (MPSOF), MSc

Sammanfattning

The increasing complexity of software systems has lead to increased demands on the tools and methods used when developing software systems. To determine if a tool or method is more efficient or accurate than others empirical studies are used. The data used in empirical studies might be affected by outliers i.e. data points that deviates significantly from the rest of the data set. Hence, the statistical analysis might be distorted by these outliers as well. This study investigates if outliers are present within Empirical Software Engineering (ESE) studies using unsupervised methods for detection. It also tries to assess if the statistical analyses performed in ESE studies are affected by outliers by removing them and performing a re-analysis. The subjects used in this study comes from a narrow literature review of recently published papers within Software Engineering (SE). While collecting the samples needed for this study the current state of practise regarding data availability and analysis reproducibility is investigated. This study's results shows that outliers can be found in ESE studies and it also identifies issues regarding data availability within the same field. Finally, this study presents guidelines for how to improve the way outlier detection is presented within ESE studies as well as guidelines for publishing data.

Ämne/nyckelord

Data- och informationsvetenskap, Computer and Information Science

URI

https://hdl.handle.net/20.500.12380/216772

Samlingar

Examensarbeten för masterexamen

Visa fullständig post

Unsupervised Outlier Detection in Software Engineering

Ladda ner

Publicerad

Författare

Typ

Program

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Beskrivning

Ämne/nyckelord

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

URI

Samlingar

Endorsement

Review

Supplemented By

Referenced By