A Software Engineering Perspective on Data Quality Processes in Environmental Research - Recommendations Based on Software Engineering Practices Applied for Improving of Open Data Practices and Communication in Environmental Research
dc.contributor.author | MOEN, MARKUS | |
dc.contributor.author | NORÉN, MAX | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering | en |
dc.contributor.examiner | Penzenstadler, Birgit | |
dc.contributor.supervisor | Heyn, Hans-Martin | |
dc.date.accessioned | 2024-09-18T16:16:20Z | |
dc.date.available | 2024-09-18T16:16:20Z | |
dc.date.issued | 2024 | |
dc.date.submitted | ||
dc.description.abstract | The many fields within environmental research have been on the path towards open science and, most importantly, open data. With the increase in available data, there are opportunities to apply data-driven and data-intensive methods, including recent developments such as machine learning. However, the success of applying machine learning depends significantly on the quality of the available training data. The purpose of this thesis was to investigate the field of environmental research in regards to current views, practices and communication of data quality and to identify software engineering principles and practices that can form possible recommendations to progress data quality in environmental research. This process identified six challenges and proposed eight recommendations. The result shows a great deal of effort towards open data, with the FAIR principles as the main arbiter to achieve it. Most identified challenges are based on data quality handling, communication, and difficulties in achieving open science. We found suitable software engineering practices for four of the six challenges, with two key perspectives being derived from open source software and requirements engineering practices. Our results demonstrate that there is a willingness among environmental researchers to investigate and adopt software engineering practices in environmental research. Importantly, there is a broad agreement that open science is an improvement over to previous methods, and the stated challenges and recommendations need to preserve those advancements. The recommendations should be regarded as a first design iteration of these recommendations, and they should be explored further in terms of their applicability to different fields within environmental research. | |
dc.identifier.coursecode | DATX05 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/308697 | |
dc.language.iso | eng | |
dc.setspec.uppsok | Technology | |
dc.subject | Software engineering | |
dc.subject | requirements engineering | |
dc.subject | data quality | |
dc.subject | environmental research | |
dc.subject | open science | |
dc.subject | open data | |
dc.subject | data-intensive | |
dc.subject | big data | |
dc.subject | FAIR | |
dc.subject | thesis | |
dc.title | A Software Engineering Perspective on Data Quality Processes in Environmental Research - Recommendations Based on Software Engineering Practices Applied for Improving of Open Data Practices and Communication in Environmental Research | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | Software engineering and technology (MPSOF), MSc |