Bayesian Network Fisher Kernel for Categorical Feature Spaces
dc.contributor.author | Ebberstein, Victor | |
dc.contributor.author | Holmberg, Martin | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.examiner | Dubhashi, Devdatt | |
dc.contributor.supervisor | Haghir Chehreghani, Morteza | |
dc.date.accessioned | 2019-10-03T14:21:10Z | |
dc.date.available | 2019-10-03T14:21:10Z | |
dc.date.issued | 2019 | sv |
dc.date.submitted | 2019 | |
dc.description.abstract | Similarity measures between categorical feature vectors are non-intuitive and difficult to compute, since no definitive way of representing distances between two such vectors exists. The Fisher kernel provides a method for computing similarities by considering an underlying statistical model, which circumvents the problem of computing distances between categorical vectors. A promising probabilistic model is the Bayesian network, which is able to capture local dependencies between variables. In this thesis, the Fisher kernel based on discrete Bayesian networks is explored in a categorical setting. This new similarity measure between categorical vectors is primarily evaluated using the task of clustering. In addition, Bayesian networks are evaluated on the task of imputation in order to address the possibility of incomplete datasets. By breaking down the structure of the Bayesian network into basic segments, the connection between the network structure and the produced Fisher similarities was investigated. The Fisher kernel was found to have great potential, provided that a suitable network structure was considered. However, this structure did not necessarily coincide with structures learnt using conventional learning methods for Bayesian networks. | sv |
dc.identifier.coursecode | DATX05 | sv |
dc.identifier.uri | https://hdl.handle.net/20.500.12380/300393 | |
dc.language.iso | eng | sv |
dc.setspec.uppsok | Technology | |
dc.subject | Bayesian network | sv |
dc.subject | Fisher kernel | sv |
dc.subject | kernel | sv |
dc.subject | clustering | sv |
dc.subject | imputation | sv |
dc.subject | categorical | sv |
dc.subject | similarity | sv |
dc.subject | machine learning | sv |
dc.title | Bayesian Network Fisher Kernel for Categorical Feature Spaces | sv |
dc.type.degree | Examensarbete för masterexamen (Master's thesis) | sv |
dc.type.uppsok | H |