The Effects of Identifier Characteristics on Software Product Quality

dc.contributor.authorBoström, Henrik
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.examinerHebig, Regina
dc.contributor.supervisorStaron, Miroslaw
dc.date.accessioned2020-07-06T08:05:01Z
dc.date.available2020-07-06T08:05:01Z
dc.date.issued2020sv
dc.date.submitted2020
dc.description.abstractBackground: Software projects consist of many identifiers, and how those identifiers should be named to improve software product quality is an opinionated subject. Some studies have tried to find the relation between identifiers characteristics and software product quality, but these studies have been conducted on small sets of software projects (N 12). Consequently, a need exists of large scale studies that investigate how identifier characteristics effect and can be used as predictors of software product quality. Aim: This study evaluates if source file averages of the identifier characteristics length, number of containing words, number of containing digits, and casing consistency can predict the software product quality of source files. Method: With identifier and software product quality data found in 60,315 source files from 1,000 open-source Java software repositories, linear regression models are fitted on the data. The models can then be evaluated in terms of accuracy and coefficient importance to find if identifier characteristics could be used as predictors of software product quality. Results: Bayesian linear regression models with identifier characteristics and the size of source files as independent variables had R-squared accuracies ranging from 0.008-0.545 of predicting the software product quality of source files. But, none of the models’ identifier characteristics were accepted as important for the change of the prediction based on the ROPE + HDI decision rule with a [-0.1,0.1] ROPE interval. Conclusions: The identifier characteristics investigated in the study could not be used to predict the study’s measurement of software product quality. Further work is needed in investigating the same identifier characteristics relation to other measures of software product quality and with source files written in other programming languages than Java.sv
dc.identifier.coursecodeDATX05sv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/301336
dc.language.isoengsv
dc.setspec.uppsokTechnology
dc.subjectIdentifiersv
dc.subjectcharacteristicssv
dc.subjectsoftwaresv
dc.subjectproductsv
dc.subjectqualitysv
dc.subjectempiricalsv
dc.subjectsoftwaresv
dc.subjectengineeringsv
dc.titleThe Effects of Identifier Characteristics on Software Product Qualitysv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 20-37 Boström.pdf
Storlek:
909.85 KB
Format:
Adobe Portable Document Format
Beskrivning:

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.14 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: