Machine learning to predict enzymes’ optimal catalytic temperature

dc.contributor.authorUlfenborg, Josefin
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.examinerDamaschke, Peter
dc.contributor.supervisorEngqvist, Martin
dc.contributor.supervisorKemp, Graham
dc.date.accessioned2020-07-08T11:01:31Z
dc.date.available2020-07-08T11:01:31Z
dc.date.issued2020sv
dc.date.submitted2020
dc.description.abstractEnzymes are proteins which operate as biological catalysts in chemical processes, for instance in biofuel production. The efficiency and sustainability of these processes may be greatly improved by knowing the optimal catalytic temperature (Topt) of the enzymes. However, determining these temperatures experimentally is timeconsuming and instead a machine learning approach for predicting Topt is suggested. In a previous approach, sequential features were used to predict Topt. In this thesis, new structural features which account for various structural properties in the enzymes were used alongside the sequential features. Test scores from the models show that structural features combined with sequential features improve previous R2 scores from 0.4 to 0.48. Furthermore, in the case where there is a pair of similar enzymes, but one has a colder and one a hotter temperature, the models correctly predicts the temperature order of the enzymes 83% of the time. By gathering more data and fine-tuning the structural features, it is anticipated that scores will improve even further.sv
dc.identifier.coursecodeDATX05sv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/301399
dc.language.isoengsv
dc.setspec.uppsokTechnology
dc.subjectStructural bioinformaticssv
dc.subjectenzymessv
dc.subjectmachine learningsv
dc.subjectfeature engineeringsv
dc.titleMachine learning to predict enzymes’ optimal catalytic temperaturesv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 20-20 Ulfenborg.pdf
Storlek:
3.65 MB
Format:
Adobe Portable Document Format
Beskrivning:
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.14 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: