Quantum information theory for machine learning

dc.contributor.authorWadman, Rikard
dc.contributor.departmentChalmers tekniska högskola / Institutionen för matematiska vetenskapersv
dc.contributor.examinerPersson, Daniel
dc.date.accessioned2019-12-11T09:01:03Z
dc.date.available2019-12-11T09:01:03Z
dc.date.issued2019sv
dc.date.submitted2019
dc.description.abstractThe remarkable successes of machine learning and of deep learning in particular during the last decade have caused an explosive growth of interest in the field. Meanwhile, there are still significant gaps in our understanding of the processes involved, making the area a very promising topic for theoretical investigation. A particularly interesting idea that has received a lot of attention recently is the claim that the successive transformations performed by deep neural networks behave similarly to the renormalization group flows of statistical mechanics. In the light of this it is natural also to consider numerical renormalization algorithms as interesting candidates for performing general machine learning. It turns out that both the DMRG and the more recent Entanglement Renormalization algorithm from numerical quantum mechanics are quite well suited for this purpose. Both of these algorithms are most naturally described using the language of tensor networks, which are graph based representations of multilinear tensors, typically used for the description of quantum states. This thesis discusses machine learning with tensor networks from a holistic perspective and makes a review of some of the recent work on the subject. Also of significant interest is the study of expressive power of neural networks. A recent proposal suggests employing quantum entanglement entropy as a measure of a models ability to represent complex correlations between input regions. We study the interpretability and implications of such a measure as well as its relations to the quantum version of the max- ow/min-cut theorem, which relates the entanglement entropy of a tensor network state to the minimal cut in its graph. A generalization of said theorem is found, leading us to alternate, and very simple, proofs of some already known scaling laws of quantum entanglement in Boltzmann machines and convolutional arithmetic circuits, which are derivative of standard convolutional neural networks.sv
dc.identifier.coursecodeMVEX03sv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/300585
dc.language.isoengsv
dc.setspec.uppsokPhysicsChemistryMaths
dc.subjecttensor networks, machine learning, quantum information, multilinear algebra, network theory, Convolutional arithmetic circuits, Boltzmann machines, entanglement, DMRG, MERAsv
dc.titleQuantum information theory for machine learningsv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
Masters thesis Rikard Wadman.pdf
Storlek:
1023.8 KB
Format:
Adobe Portable Document Format
Beskrivning:
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.14 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: