Learning Meaningful Representations of Cells
dc.contributor.author | Andrekson, Leo | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för life sciences | sv |
dc.contributor.department | Chalmers University of Technology / Department of Life Sciences | en |
dc.contributor.examiner | Bengtsson-Palme, Johan | |
dc.contributor.supervisor | Oropeza Mercado, Rocío | |
dc.date.accessioned | 2024-06-04T07:45:09Z | |
dc.date.available | 2024-06-04T07:45:09Z | |
dc.date.issued | 2024 | |
dc.date.submitted | ||
dc.description.abstract | Batch effects are a significant concern in single-cell RNA sequencing (scRNA-Seq) data analysis, where variations in the data can be attributed to factors unrelated to cell types. This can make downstream analysis a challenging task. In this study, a neural network model is designed utilizing contrastive learning and a novel loss function for learning a generalizable embedding space from scRNA-Seq data. When benchmarked against multiple established methods for scRNA-Seq integration, the model outperforms existing methods in learning a generalizable embedding space on multiple datasets. A downstream application that was investigated for the embedding space was cell type annotation. When compared against multiple well-established cell type classifiers, the model in this study displayed a performance competitive with top-performing methods across multiple metrics, such as accuracy, balanced accuracy, and F1 score. These findings aim to quantify the “meaningfulness” of the embedding space learned by the model, and highlight the potential applications of these learned cellular representations. The model is currently being structured into an open-source Python package, simplifying and streamlining its usage. | |
dc.identifier.coursecode | BBTX60 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/307710 | |
dc.language.iso | eng | |
dc.setspec.uppsok | LifeEarthScience | |
dc.subject | scRNA-Seq | |
dc.subject | Deep learning | |
dc.subject | Contrastive learning | |
dc.subject | Bioinformatics | |
dc.subject | Cell type annotation | |
dc.subject | Novel cell type detection | |
dc.subject | Cell type representations | |
dc.subject | Machine learning | |
dc.subject | AI | |
dc.subject | Transformer | |
dc.title | Learning Meaningful Representations of Cells | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | Biotechnology (MPBIO), MSc |
Download
Original bundle
- Name:
- Learning Meaningful Representations of Cells.pdf
- Size:
- 13.99 MB
- Format:
- Adobe Portable Document Format