Short Tandem String Duplication Sequences
dc.contributor.author | DERVISEVIC, BELMIN | |
dc.contributor.author | RASPUDIC, MATEO | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.examiner | Coquand, Thierry | |
dc.contributor.supervisor | Damaschke, Peter | |
dc.date.accessioned | 2022-10-04T12:41:31Z | |
dc.date.available | 2022-10-04T12:41:31Z | |
dc.date.issued | 2022 | sv |
dc.date.submitted | 2020 | |
dc.description.abstract | It has been shown that tandem repeats are involved in several neurological and genetic disorders. Additionally, tandem repeats and the Tandem Duplication (TD) operation have been studied in other areas, such as information theory and formal languages. Given a string AXB, where A, X, and B denote substrings, a tandem duplication operation on the substring X copies it and places its copy adjacently, which produces the string AXXB. Contemporary work shows that the Tandem Duplication Distance Problem (TDDP) is NP-complete for alphabets of size five. Therefore, we study the idea of overlapping squares and introduce a new algorithm, which solves the k-TD problem in time O((n log n)k), where n denotes the length of the target string T, and k denotes the number of contraction. Primarily, we prove that the Tandem Duplication Distance Problem is fixed-parameter tractable in the natural parameter d. We introduce a dynamic programming algorithm that locally exhausts independent disjoint substrings and calculates the optimal edit distance through minimization. The algorithm has a running time of O(nd3(d log d)d), where n denotes the length of the target string T and d denotes the length difference of the two strings S and T. | sv |
dc.identifier.coursecode | MPALG | sv |
dc.identifier.uri | https://hdl.handle.net/20.500.12380/305682 | |
dc.language.iso | eng | sv |
dc.setspec.uppsok | Technology | |
dc.subject | algorithms | sv |
dc.subject | tandem duplication | sv |
dc.subject | squares | sv |
dc.subject | tandem repeats | sv |
dc.subject | edit distance | sv |
dc.subject | dynamic programming | sv |
dc.subject | FPT algorithms | sv |
dc.title | Short Tandem String Duplication Sequences | sv |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.uppsok | H | |
local.programme | Computer science – algorithms, languages and logic (MPALG), MSc |