Sample-efficient machine learning with auxiliary information

Tan, Xinxin

Sample-efficient machine learning with auxiliary information

dc.contributor.author	Tan, Xinxin
dc.contributor.department	Chalmers tekniska högskola / Institutionen för data och informationsteknik	sv
dc.contributor.department	Chalmers University of Technology / Department of Computer Science and Engineering	en
dc.contributor.examiner	Dubhashi, Devdatt
dc.contributor.supervisor	Johansson, Fredrik
dc.date.accessioned	2025-09-10T11:53:01Z
dc.date.issued	2024
dc.date.submitted
dc.description.abstract	Our thesis proposes a Learning using Privileged Mediating Information (LuPI) algorithm based on a directed Gaussian graphical model, and analyzes that LuPI outperforms the Ordinary Least Squares (OLS) model in terms of statistical properties under known causality by constructing a causal directed acyclic graph (DAG) containing mediating variables. Using the Rao-Blackwell theorem, it is shown theoretically that LuPI can efficiently decrease the mean square error (MSE) and the expected risk. In the experimental part, the improvement of LuPI over OLS is verified on a synthetic dataset under different noise levels and sample sizes, especially under high noise and small sample conditions. In addition, the experiments also investigate the impact of graph estimation bias on the performance of the algorithm, and the results show that appropriate removal of redundant edges in the causal graph can help reduce the variance, which in turn improves the overall performance of the model. Finally, the experiments based on real datasets further demonstrate the superiority of the LuPI algorithm under small sample sizes and validate its application value in complex causal data.
dc.identifier.coursecode	DATX05
dc.identifier.uri	http://hdl.handle.net/20.500.12380/310450
dc.language.iso	eng
dc.relation.ispartofseries	CSE-24-177
dc.setspec.uppsok	Technology
dc.subject	Learning using Privileged Information, Directed Gaussian Graphical Model, Linear Regression, Causal Analysis
dc.title	Sample-efficient machine learning with auxiliary information
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.degree	Master's Thesis	en
dc.type.uppsok	H
local.programme	Engineering mathematics and computational science (MPENM), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: CSE 24-177 XT.pdf
Size:: 1.87 MB
Format:: Adobe Portable Document Format

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Size:: 2.35 KB
Format:: Item-specific license agreed upon to submission
Description:

Ladda ner

Samlingar

Examensarbeten för masterexamen