Deep Reinforcement Learning for Quantum Error Correction

dc.contributor.author: Becker, Marvin
dc.contributor.author: Chiu Falck, Karl-Rehan
dc.contributor.department: Chalmers University of Technology / Department of Physics
dc.contributor.examiner: Granath, Mats
dc.contributor.supervisor: Granath, Mats
dc.date.accessioned: 2021-06-17T11:21:01Z
dc.date.available: 2021-06-17T11:21:01Z
dc.date.issued: 2021
dc.date.submitted: 2020
dc.description.abstract: Recent advances in quantum computing have reinforced its promise of computational abilities that, for certain problems, far exceed those of classical computers. However, one of the main difficulties in realising larger quantum computers lies in the instability of quantum bits, which often results in erroneous states. While there are schemes in place to encode logical qubit states, such as repetition codes, the very quantum nature of these qubits makes it impossible to directly observe their quantum states without causing them to decohere. To circumvent this, parity measurements of localized groups of qubits, so-called syndrome measurements, can be performed, as is the case in the surface code. The encoding leads to improved stability and error resistance of the encoded logical qubit. The challenge of decoding the surface code in the presence of potential errors still remains: multiple possible error configurations can lead to the same observed syndrome state, making the decoding task non-trivial. This has been addressed with different approaches, such as decoder agents trained via deep learning. In this study, we deploy techniques from the realm of reinforcement learning, such as Deep Q-learning, Proximal Policy Optimization, and Hindsight learning, to train agents capable of decoding errors on the surface code. To better reflect a realistic setting, we also allow for erroneous syndrome measurements as part of the training. The entirety of these syndrome measurements constitutes the observable state, and introducing measurement errors breaks the Markov assumption that is central to many reinforcement learning algorithms, leaving us with a partially observable Markov decision process (POMDP). To deal with this, we introduce the dimension of time to our problem formulation by keeping track of the evolution of both qubits and syndromes. Our work includes a new state formulation for the syndrome evolution and utilizes different strategies to train agents on this newly formulated problem.
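The abstract's key move against the POMDP problem is to stack consecutive syndrome measurement rounds along a time axis, so that measurement errors show up as inconsistencies between rounds and the agent's observation becomes (approximately) Markovian again. The following Python snippet is a minimal sketch of such a time-stacked state, not code from the thesis: the class name SyndromeHistory, the buffer depth T, and the simplified d x d syndrome grid are illustrative assumptions.

```python
import numpy as np

d = 5  # code distance (simplified here to a d x d grid of stabilizer outcomes)
T = 4  # number of stacked syndrome measurement rounds fed to the agent

class SyndromeHistory:
    """Rolling buffer of the last T (possibly faulty) syndrome measurements.

    Handing the full T x d x d volume to a decoder network lets it see the
    syndrome evolution over time; a measurement error appears as a defect
    that is inconsistent across consecutive rounds.
    """

    def __init__(self, d: int, T: int):
        self.buffer = np.zeros((T, d, d), dtype=np.int8)

    def push(self, syndrome: np.ndarray) -> np.ndarray:
        # Drop the oldest round, append the newest, return the stacked state.
        self.buffer = np.roll(self.buffer, shift=-1, axis=0)
        self.buffer[-1] = syndrome
        return self.buffer.copy()

# Usage: each round, a new (here randomly faked) syndrome is measured and the
# stacked volume serves as the observation for a Q-network or policy network.
history = SyndromeHistory(d, T)
rng = np.random.default_rng(0)
for _ in range(T):
    noisy_syndrome = rng.integers(0, 2, size=(d, d), dtype=np.int8)
    state = history.push(noisy_syndrome)
print(state.shape)  # (T, d, d) observation tensor for the decoder agent
```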
dc.identifier.coursecode: TIFX05
dc.identifier.uri: https://hdl.handle.net/20.500.12380/302593
dc.language.iso: eng
dc.setspec.uppsok: PhysicsChemistryMaths
dc.subject: Deep Learning
dc.subject: Reinforcement Learning
dc.subject: Quantum Error Correction
dc.subject: Surface Code
dc.subject: Quantum Bits
dc.title: Deep Reinforcement Learning for Quantum Error Correction
dc.type.degree: Master's thesis
dc.type.uppsok: H
local.programme: Complex adaptive systems (MPCAS), MSc
Original bundle
Name: Master_Thesis_Marvin_Rehan.pdf
Size: 2.09 MB
Format: Adobe Portable Document Format