A method for detecting horizontal gene transfer events of antibiotic resistance genes using phylogenetic trees

dc.contributor.authorBenson, Leo
dc.contributor.departmentChalmers tekniska högskola / Institutionen för matematiska vetenskapersv
dc.contributor.examinerKristiansson, Erik
dc.contributor.supervisorKristiansson, Erik
dc.contributor.supervisorBerglund, Fanny
dc.contributor.supervisorLund, David
dc.date.accessioned2025-07-01T10:58:27Z
dc.date.issued2025
dc.date.submitted
dc.description.abstractAn algorithm, named the HGT-score algorithm, to computationally assess horizontal gene transfer of antibiotic resistance genes spread was developed using a methodology based on phylogenetic trees and host taxonomy labels. Applying the HGT-score algorithm on validation data showed that the algorithm detects horizontal gene transfer events. This work serves as a good starting point for further investigation and improvements. The results are two-fold, consisting of the developed methods themselves, and the results that the methods generated when applied on real data. Antibiotic resistance gene (ARG)-based phylogenetic trees were cross-sectioned at different lengths from the root, named cutoffs. A cutoff generates flat clusters of leaves, encoding ARG similarity between hosts within each cluster. Distantly related host bacteria may end up in the same cluster if their genomes contain identical or similar antibiotic resistance genes. HGT-scores were given to clusters depending on the taxonomic similarity of member hosts, with higher dissimilarity resulting in a higher HGT-score. The method was validated on a labelled dataset of ARGs, where the labels sorted ARGs into HGT-prone genes and non-HGT-prone genes. The HGT-prone data scored higher than non-HGT labelled data, particularly in specific cutoff regions. It is therefore plausible that the method scores HGT events higher, and that the cutoff is an important factor to account for in drawing conclusions as to when the events occurred. Lastly, there are several directions to explore in order to improve the method. The HGT-score method is dependent on three key parameters: cutoff, cluster-level statistic, and p. These parameters should all be investigated and calibrated for different use case scenarios. A variant of the method should be investigated, using well-conserved genes rather than taxonomy as the base of gene tree-host species dissimilarity. This opens up the possibility of detecting HGT events across more closely related bacteria. Another interesting direction is fine-tuning cutoffs. Finer cutoffs could provide better resolution in critical parts of the phylogenetic gene trees where many bifurcations occur.
dc.identifier.coursecodeMVEX03
dc.identifier.urihttp://hdl.handle.net/20.500.12380/309814
dc.language.isoeng
dc.setspec.uppsokPhysicsChemistryMaths
dc.subjectAntibacterial resistance, horizontal gene transfer, phylogenetic trees, bioinformatics.
dc.titleA method for detecting horizontal gene transfer events of antibiotic resistance genes using phylogenetic trees
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeEngineering mathematics and computational science (MPENM), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
Master_Thesis_Leo Benson_2025.pdf
Storlek:
5.79 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: