Developing a Cooperative Data Cleaning Tool

dc.contributor.authorChatterjee, Devosmita
dc.contributor.departmentChalmers tekniska högskola / Institutionen för matematiska vetenskapersv
dc.contributor.examinerSagitov, Serik
dc.contributor.supervisorJohansson, Anton
dc.date.accessioned2021-04-27T12:45:01Z
dc.date.available2021-04-27T12:45:01Z
dc.date.issued2021sv
dc.date.submitted2020
dc.description.abstractAbstract Presently, large amount of data generated by organizations drives their business decisions. The data is usually inconsistent, inaccurate and incomplete. Poor data quality may lead to incorrect decisions for the organizations and hence, negatively affect them. Thus, high quality data is of utmost priority to draw good and valid business decisions and strategies. Data cleaning is the ultimate way to solve the data quality issues. But, data cleaning is really a time consuming task. Thus, tools which can help with the task are needed. This demands data cleaning tools for systematically examining data for errors and automatically cleaning them using algorithms. These data cleaning tools helps organizations save time and increase their efficiency. In this thesis, we develop a cooperative, free and open source data cleaning standalone application ‘DataCleaningTool’ in order to achieve the task of data cleaning. This tool is able to identify the potential data problems and report results such that the users can take informed decisions to clean data effectively.sv
dc.identifier.coursecodeMVEX03sv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/302324
dc.language.isoengsv
dc.setspec.uppsokPhysicsChemistryMaths
dc.subjectData Cleaning, Noisy Data, Missing Data, MissForest Method, Outliers, Data Transformation, Interactive Data Visualizationsv
dc.titleDeveloping a Cooperative Data Cleaning Toolsv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH
local.programmeEngineering mathematics and computational science (MPENM), MSc
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
Masters Thesis_Devosmita Chatterjee_210427.pdf
Storlek:
18.51 MB
Format:
Adobe Portable Document Format
Beskrivning:
Developing a Cooperative Data Cleaning Tool
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.14 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: