Review Article

Quantum error correction for beginners

Simon J Devitt, William J Munro and Kae Nemoto

Published 20 June 2013 © 2013 IOP Publishing Ltd
Citation: Simon J Devitt et al 2013 Rep. Prog. Phys. 76 076001. DOI: 10.1088/0034-4885/76/7/076001


Abstract

Quantum error correction (QEC) and fault-tolerant quantum computation represent one of the most vital theoretical aspects of quantum information processing. It was well known from the early developments of this exciting field that the fragility of coherent quantum systems would be a catastrophic obstacle to the development of large-scale quantum computers. The introduction of quantum error correction in 1995 showed that active techniques could be employed to mitigate this fatal problem. However, quantum error correction and fault-tolerant computation now constitute a much larger field, and many new codes, techniques and methodologies have been developed to implement error correction for large-scale quantum algorithms. In response, we have attempted to summarize the basic aspects of quantum error correction and fault-tolerance, not as a detailed guide, but rather as a basic introduction. The development in this area has been so pronounced that many in the field of quantum information, specifically researchers who are new to quantum information or people focused on the many other important issues in quantum computation, have found it difficult to keep up with the general formalisms and methodologies employed in this area. Rather than introducing these concepts from a rigorous mathematical and computer science framework, we instead examine error correction and fault-tolerance largely through detailed examples, which are more relevant to experimentalists today and in the near future.


1. Introduction

The micro-computer revolution of the late 20th Century has arguably had a greater impact on the world than any other technological revolution in history. The advent of transistors, integrated circuits, and the modern microprocessor has spawned literally hundreds of devices, from pocket calculators to the iPod, all now integrated through an extensive worldwide communications system. However, as we enter the 21st Century, ever increasing computational power is driving us quickly to the realm where quantum physics dominates. The component size of individual transistors on modern microprocessors is becoming so small that quantum effects will soon begin to dominate. Unfortunately, quantum mechanical behaviour will tend to result in unpredictable and unwanted operation in classical microprocessor designs. We therefore have two choices: keep trying to suppress quantum effects in classically fabricated electronics, or move to the field of quantum information processing (QIP) where we exploit them. This leads to a paradigm shift in the way we view and process information and has commanded considerable interest from physicists, engineers, computer scientists and mathematicians. The counter-intuitive and strange rules of quantum physics offer enormous possibilities for information processing and the development of a large-scale quantum computer is the holy grail for many researchers worldwide.

While the advent of Shor's algorithm [Sho97] certainly spawned great interest in QIP, demonstrating that quantum algorithms could be far more efficient than those used in classical computing, there was a great deal of debate surrounding the practicality of building a large scale, controllable, quantum system. It was well known even before the introduction of quantum information that coherent quantum states were extremely fragile and many believed that to maintain large, multi-qubit, coherent quantum states for a long enough time to complete any quantum algorithm was unrealistic [Unr95]. Additionally, classical error correction techniques are intrinsically based on a digital framework, unlike quantum computing where the readout of qubits is digital but actual manipulations are analogue. Therefore, how can we adapt the vast knowledge gained from classical coding theory to the quantum regime?

Starting in 1995, several papers appeared, in rapid succession, proposing codes which were appropriate to perform error correction on quantum data [Sho95, Ste96a, CS96, LMPZ96, BSW96]. This was a key theoretical development needed to convince the general community that quantum computation was indeed a possibility. Since this initial introduction, the progress in this field has been extensive.

Initial research on error correction focused heavily on developing quantum codes [Ste96c, Ste96b, Got96, PVK97, KL97, Kni96], introducing a more rigorous theoretical framework for the structure, properties and operation of QEC [BSW96, KL00, CRSS98, Got97, CG97, KLV00]. Additionally, the introduction of concepts such as fault-tolerant quantum computation [Sho96, DS96, Got98, KLZ98, Got00] led directly to the threshold theorem for concatenated QEC [KLZ96, ABO97]. In more recent years, QEC protocols have been developed for various systems, such as continuous variables [LS98, Bra98, vL08, ATK+09], ion-traps and other systems containing motional degrees of freedom [ST00], adiabatic computation [JFS06] and globally controlled quantum computers [BBK03]. Work also continues on developing better (and in some ways, more technologically useful) protocols such as subsystem codes [KLP05, KLPL06, Bac06] and topological codes [Kit97, DKLP02, BMD06, BMD07b, BMD07a, RHG07, FSG09]. Advanced techniques to analyse fault-tolerant thresholds [AGP08, PR12, BMD08, BAO+12] have also been developed, along with techniques to implement error correction in a fault-tolerant manner with concatenated codes [Ste97a, Ste02, DA07, Kni05] and topological codes [BMD07b, KBAMD10, Bom11].

Along with QEC, other methods of protecting quantum information were also developed. These other techniques would technically be placed in a separate category of error suppression rather than error correction. The distinction between error suppression and error correction is also known as passive and active QEC, the latter terminology being much more prevalent in the theoretical community. The most well known techniques of error suppression are protocols such as decoherence free subspaces (DFS) [PSE96, DG97, LCW98, DG98b, ZR97b, ZR97a, DG98a, LW03]. DFS protocols are passive, encoding information in a subspace which is invariant under the dynamics that generate errors. Active methods for error suppression include applying repeated rotations to qubits such that these rotations, combined with the natural dynamics that induce errors, partially cancel. These methods were initially referred to as Bang-Bang control [VL98, VT99, Zan99] and developed primarily for NMR systems. In recent years these techniques have been further generalized and optimized for use in essentially all QIP systems [VKL99, FLP04, VK03, VK05, KL05, Uhr07]. These more advanced techniques are now routinely categorized as dynamical decoupling protocols. As with QEC, this field is vast, incorporating well established ideas from quantum control to create specially designed control sequences to suppress errors before more resource intensive encoding is applied.

This review deals exclusively with the concepts of QEC and fault-tolerant quantum computation. Many papers have reviewed error correction and fault-tolerance [Got97, NC00, Got02, KLA+02, Ste01, Got09]; however, to cater to a large audience, we attempt to describe QEC and fault-tolerance in a more basic manner, largely through examples. Rather than providing a more rigorous review of error correction, we focus on the more practical issues that arise when working with these ideas. For those who have recently begun investigating QIP, or those who are focused on other important theoretical and/or experimental aspects related to quantum computing, searching through this enormous collection of work is daunting, especially if a basic working knowledge of QEC is all that is required. This review of the basic aspects of QEC and fault-tolerance is designed to allow those with little knowledge of the field to quickly become accustomed to the various techniques and tricks that are commonly used. The examples in this review are selected due to their relative simplicity. For individuals seriously interested in examining new ideas for QEC and fault-tolerant computation we strongly advise that the references cited throughout are consulted.

We begin the discussion in section 2 where we describe some preliminary concepts on the required properties of any quantum error correcting protocol. In section 3 we review some basic noise models in the context of how they influence quantum algorithms. Section 4 introduces quantum error correction through the traditional example of the 3-qubit code, illustrating the circuits used for encoding and correction and why the principle of redundant encoding suppresses the failure of encoded qubits. Sections 5 and 6 introduce the first examples of complete quantum codes, Shor's 9-qubit code and a 4-qubit error detection code. Sections 7 and 8 introduce the stabilizer formalism, demonstrating how QEC circuits are synthesized once the structure of the code is known. In section 9 we briefly return to noise models and relate the abstract analysis of QEC, where errors are assumed to be discrete and probabilistic, to some of the physical mechanisms which can cause errors. Sections 10 and 11 introduce the concept of fault-tolerant error correction, the threshold theorem and how logical gate operations can be applied directly to encoded data. We then move on to circuit synthesis in section 12, presenting a fault-tolerant circuit design for state preparation using the 7-qubit Steane code as a representative example. Finally, in section 14, we review specific codes for qubit loss and examine two more modern techniques for error correction. We briefly examine quantum subsystem codes [KLP05, KLPL06, Bac06] and topological surface codes [DKLP02, FSG09] due to both their theoretical elegance and their increasing relevance in quantum architecture designs [IFI+02, DFS+09, SJ09, MLFY10].

2. Preliminaries

Before discussing quantum errors and how they can be corrected, we first introduce the basics of qubits and quantum gates. We assume a basic working knowledge of quantum information [EJ96, NC00] and this brief discussion is used simply to define our notation for the remainder of this review.

2.1. Basic structure of the qubit

The fundamental unit of quantum information is the qubit. Unlike the classical bit, the qubit can exist in coherent superpositions of its two states, denoted as |0〉 and |1〉. These basis states can be, for example, photonic polarization, atomic spin states, electronic states of an ion or charge states of superconducting systems. An arbitrary state of an individual qubit, |φ〉, can be expressed as

$|\phi\rangle = \alpha|0\rangle + \beta|1\rangle, \qquad (1)$

where |0〉 and |1〉 are the two orthonormal basis states of the qubit and |α|2 + |β|2 = 1. Quantum gate operations are represented by unitary operations acting on the Hilbert space of a collection of qubits. Unlike classical information processing, conservation of probability for quantum states requires that all operations be reversible and hence unitary.

When describing a quantum gate on an individual qubit, any dynamical operation, G, is a member of the unitary group U(2), which consists of all 2 × 2 unitary matrices, satisfying G† = G−1. Up to a global (and unphysical) phase factor, any single qubit operation can be expressed as a linear combination of the identity and the generators of SU(2) as,

$G = c_I\sigma_I + c_x\sigma_x + c_y\sigma_y + c_z\sigma_z, \qquad (2)$

where

$\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma_y = \begin{pmatrix} 0 & -\mathrm{i} \\ \mathrm{i} & 0 \end{pmatrix}, \qquad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \qquad (3)$

are the Pauli matrices, σI is the 2 × 2 identity matrix, cI is a real number, and (cx, cy, cz) are complex numbers satisfying,

$c_I^2 + |c_x|^2 + |c_y|^2 + |c_z|^2 = 1, \qquad c_I\,\Re(c_k) + \big[\Re(\vec{c})\times\Im(\vec{c})\big]_k = 0 \quad \text{for } k = x, y, z, \qquad (4)$ with $\vec{c} = (c_x, c_y, c_z)$.

Here ℜ and ℑ are the real and imaginary components, respectively.
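To see equation (2) in practice, the Pauli coefficients of a given gate can be extracted with the trace inner product, $c_k = {\rm Tr}(\sigma_k G)/2$. The following minimal numpy sketch is our own illustration (the helper name `pauli_coefficients` is hypothetical, not from the text); it decomposes the Hadamard gate and checks the normalization condition of equation (4):

```python
import numpy as np

# Pauli basis
sI = np.eye(2, dtype=complex)
sX = np.array([[0, 1], [1, 0]], dtype=complex)
sY = np.array([[0, -1j], [1j, 0]], dtype=complex)
sZ = np.array([[1, 0], [0, -1]], dtype=complex)

def pauli_coefficients(G):
    """Coefficients (c_I, c_x, c_y, c_z) with G = c_I*I + c_x*X + c_y*Y + c_z*Z,
    obtained from the trace inner product c_k = Tr(sigma_k G)/2."""
    return [np.trace(s @ G) / 2 for s in (sI, sX, sY, sZ)]

# Example: the Hadamard gate, H = (X + Z)/sqrt(2)
H = np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2)
coeffs = pauli_coefficients(H)
print(np.round(coeffs, 3))                 # ~ [0, 0.707, 0, 0.707]
print(sum(abs(c)**2 for c in coeffs))      # normalization condition = 1
```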

2.2. Some general requirements of quantum error correction

Although the field of QEC is largely based on classical coding theory, there are several issues that need to be considered when transferring classical error correction techniques to the quantum regime.

First, coding based on data-copying, which is extensively used in classical error correction, cannot be used due to the no-cloning theorem of quantum mechanics [WZ82]. This result implies that there exists no transformation resulting in the following mapping,

$|\psi\rangle \otimes |0\rangle \;\rightarrow\; |\psi\rangle \otimes |\psi\rangle \quad \text{for arbitrary } |\psi\rangle, \qquad (5)$

i.e. it is impossible to perfectly copy an unknown quantum state. This means that quantum data cannot be protected from errors by simply making multiple copies. Secondly, direct measurement cannot be used to effectively protect against errors, since this will act to destroy any quantum superposition that is being used for computation. Error correction protocols must therefore be employed which can detect and correct errors without determining any information regarding the qubit state. Unlike classical information, qubits are susceptible not only to traditional bit errors, |0〉 ↔ |1〉, but also to phase errors, |0〉 → |0〉, |1〉 → −|1〉. Hence any error correction procedure needs to be able to simultaneously correct for both. Finally, errors in quantum information are intrinsically continuous (i.e. qubits do not experience only full bit or phase flips, but rather an angular shift of the qubit state by any angle). This issue will be discussed in more depth in section 9.

At its most basic level, QEC utilizes the idea of redundant encoding. This is where the total size of the Hilbert space is expanded beyond what is needed to store a single qubit of information. This way, errors on individual qubits are mapped to a large set of mutually orthogonal subspaces, the size of which is determined by the number of qubits utilized in the code. Finally, the error correction protocol cannot allow us to gain information regarding the coefficients, α and β of the encoded state, as doing so would collapse the system.

3. Quantum errors: cause and effect

Before we begin discussing the details of QEC, we first examine some of the common sources of errors in QIP and contextualize what they imply for computation. We will consider several important sources of errors and how they influence two trivial, single qubit quantum algorithms.

The first algorithm is a computation consisting of a single qubit, initialized in the |0〉 state undergoing N identity operations. If this algorithm is performed perfectly the final state is,

$|\psi\rangle_{\rm final} = \prod_{i=1}^{N} I\,|0\rangle = |0\rangle, \qquad (6)$

where I ≡ σI is the identity gate. Measurement of the qubit in the |0〉, |1〉 basis will consequently yield the result 0 with a probability of unity.

The second illustration is an algorithm of three gates,

$|\psi\rangle_{\rm final} = HIH|0\rangle = |0\rangle. \qquad (7)$

This algorithm will ideally yield a result |0〉 with probability of 1 when measured in the |0〉, |1〉 basis. This algorithm implements two H ≡ Hadamard operations separated by a wait stage, represented by the Identity gate.

We examine, independently, several common sources of error through the effect they have on these two quantum algorithms. Hopefully, this introductory section will show that while quantum errors are complicated physical effects, in QIP the relevant measure is the theoretical success probability of a given quantum algorithm.

Errors that exist on any quantum system are highly dependent on the specific physical mechanisms that govern the system. The situations that follow are only basic illustrative examples which can often arise in discussions related to errors and QEC. It is important to stress that the types of errors and the way in which they are analysed need to be considered in the context of the physical system under consideration.

3.1. Coherent quantum errors: gates which are incorrectly applied

The first possible source of error is coherent, systematic control errors. This type of error is typically associated with incorrect knowledge of the system dynamics: otherwise perfect control is applied assuming a system governed by a Hamiltonian H′, when the actual system dynamics are governed by H. As qubit systems must be characterized prior to being used [PCZ97, CSG+05], finite experimental resolution in these processes will lead to coherent control errors. As these errors are coherent inaccuracies applied to the qubit, they can be mitigated using coherent techniques such as composite pulse sequences [LF79, Lev86, BHC04, Jon09]. Other control errors (e.g. arising from fluctuations in control fields) are not coherent processes and are dealt with in a similar way to the environmental coupling described further on.

As this source of error is coherent and systematic it causes one to apply the same, undesired gate operation and can be analysed without moving to the density matrix formalism. With respect to the first of our algorithms, we are able to model this in several different ways. To keep things simple, we assume that incorrect characterization of the control dynamics leads to a gate which is not σI, but instead introduces a small rotation around the X-axis of the Bloch sphere. This results in the state,

$|\psi\rangle_{\rm final} = \left(\mathrm{e}^{\mathrm{i}\epsilon\sigma_x}\right)^N|0\rangle = \cos(N\epsilon)|0\rangle + \mathrm{i}\sin(N\epsilon)|1\rangle, \qquad (8)$ where each application of the faulty identity gate enacts $U = \exp(\mathrm{i}\epsilon\sigma_x)$ with $\epsilon \ll 1$.

We now measure the system in the |0〉, |1〉 basis. Due to these errors, the probability of measuring the system in the |0〉 or |1〉 state is,

$P(|0\rangle) = \cos^2(N\epsilon) \approx 1 - (N\epsilon)^2, \qquad P(|1\rangle) = \sin^2(N\epsilon) \approx (N\epsilon)^2. \qquad (9)$

Hence, the probability of error in this trivial quantum algorithm is given by $p_{\rm error} \approx (N\epsilon)^2$, which will be small given that $N\epsilon \ll 1$. The systematic error in this system is quadratic in both the small systematic over-rotation angle and the total number of applied identity operations. This is expected, as this rotational error acts in the same direction every time it is applied.
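This scaling is easy to verify numerically. The following short numpy sketch (the parameter values are arbitrary examples of ours) applies the over-rotated 'identity' of equation (8) N times and compares the resulting error probability with $(N\epsilon)^2$:

```python
import numpy as np

eps, N = 0.01, 10                                    # arbitrary example values
sX = np.array([[0, 1], [1, 0]], dtype=complex)
U = np.cos(eps) * np.eye(2) + 1j * np.sin(eps) * sX  # exp(i*eps*sigma_x)

psi = np.array([1, 0], dtype=complex)                # |0>
for _ in range(N):
    psi = U @ psi                                    # N faulty identity gates

p_error = abs(psi[1])**2                             # probability of reading |1>
print(p_error, (N * eps)**2)                         # sin^2(N*eps) ~ (N*eps)^2
```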

3.2. Environmental decoherence

Environmental decoherence is another important source of errors in quantum systems. We will take a simple decoherence model and examine how our second algorithm behaves. Later in section 9 we will illustrate a more complicated decoherence model that arises from standard mechanisms.

Consider a very simple environment, which is another two level quantum system. This environment has two basis states, |e0〉 and |e1〉, that satisfy the completeness relations,

$\langle e_i|e_j\rangle = \delta_{ij}, \qquad |e_0\rangle\langle e_0| + |e_1\rangle\langle e_1| = I. \qquad (10)$

We will also assume that the environment couples to the qubit in a specific way. When the qubit is in the |1〉 state, the coupling flips the environmental state, while if the qubit is in the |0〉 state nothing happens to the environment. Finally, this model assumes the system/environment interaction only occurs during the wait stage of the algorithm (while the Identity gate is being applied). As with the first algorithm we should measure the state |0〉 with probability one. The reason for utilizing the second of our algorithms in this example is that this specific decoherence model acts to reduce coherence between the |0〉 and |1〉 states. Therefore, we require a coherent superposition to observe any effect from the environmental coupling. We assume the environment is initialized in the state, |E〉 = |e0〉, and then coupled to the system,

$|0\rangle|e_0\rangle \;\xrightarrow{H}\; \frac{1}{\sqrt{2}}\big(|0\rangle + |1\rangle\big)|e_0\rangle \;\xrightarrow{I}\; \frac{1}{\sqrt{2}}\big(|0\rangle|e_0\rangle + |1\rangle|e_1\rangle\big) \;\xrightarrow{H}\; \frac{1}{2}\big[|0\rangle\big(|e_0\rangle + |e_1\rangle\big) + |1\rangle\big(|e_0\rangle - |e_1\rangle\big)\big]. \qquad (11)$

As we are considering environmental decoherence, pure states will be transformed into classical mixtures, hence we now move into the density matrix representation for the state HIH|0〉|E〉,

$\rho = \frac{1}{4}\Big[\,|0\rangle\langle 0|\otimes\big(|e_0\rangle + |e_1\rangle\big)\big(\langle e_0| + \langle e_1|\big) + |0\rangle\langle 1|\otimes\big(|e_0\rangle + |e_1\rangle\big)\big(\langle e_0| - \langle e_1|\big) + |1\rangle\langle 0|\otimes\big(|e_0\rangle - |e_1\rangle\big)\big(\langle e_0| + \langle e_1|\big) + |1\rangle\langle 1|\otimes\big(|e_0\rangle - |e_1\rangle\big)\big(\langle e_0| - \langle e_1|\big)\Big]. \qquad (12)$

As we have no access to the environmental degrees of freedom, we trace over this part of the system, giving,

$\rho_{\rm S} = {\rm Tr}_E(\rho) = \frac{1}{2}\big(|0\rangle\langle 0| + |1\rangle\langle 1|\big) = \frac{I}{2}. \qquad (13)$

Measurement of the system will consequently return |0〉 50% of the time and |1〉 50% of the time. This final state is a complete mixture of the two qubit states and is consequently a classical system. The coupling to the environment removed all the coherences between the |0〉 and |1〉 states and the second Hadamard transform, intended to rotate $({\left\vert{0}\right\rangle}+{\left\vert{1}\right\rangle})/\sqrt{2} \rightarrow {\left\vert{0}\right\rangle}$ , has no effect on the qubit state.

As we assumed that the system/environment coupling during the wait stage causes the environmental degree of freedom to 'flip' (when the qubit is in the |1〉 state) this decoherence model implicitly incorporates a temporal effect. The temporal interval of the identity gate in the above algorithm is long enough to enact the controlled-flip operation. If we assumed a controlled rotation that was not a full flip on the environment, the final mixture would not be 50/50. Instead there would be a residual coherence between the qubit states and an increased probability of our algorithm returning a |0〉. Section 9 revisits the decoherence model and illustrates how time-dependence is explicitly incorporated.
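The qubit/environment model above is small enough to simulate directly. In the sketch below (our own illustration), the 'wait' stage is the controlled flip of the environment, i.e. a CNOT with the qubit as control, and tracing out the environment reproduces the maximally mixed state of equation (13). The ordering convention (qubit first, environment second) is our own:

```python
import numpy as np

H = np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2)
I2 = np.eye(2, dtype=complex)
# wait stage: flip the environment iff the qubit is |1>
CFLIP = np.array([[1, 0, 0, 0],
                  [0, 1, 0, 0],
                  [0, 0, 0, 1],
                  [0, 0, 1, 0]], dtype=complex)

psi = np.kron([1, 0], [1, 0]).astype(complex)   # |0>|e0>
psi = np.kron(H, I2) @ psi                      # first Hadamard on the qubit
psi = CFLIP @ psi                               # system/environment coupling
psi = np.kron(H, I2) @ psi                      # second Hadamard

rho = np.outer(psi, psi.conj())
rho_q = rho.reshape(2, 2, 2, 2).trace(axis1=1, axis2=3)  # trace out environment
print(np.round(rho_q.real, 3))                  # [[0.5, 0], [0, 0.5]] = I/2
```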

3.3. Simple models of loss, leakage, measurement and initialization

Other sources of error such as qubit initialization, measurement errors, qubit loss and qubit leakage can be modelled in a very similar manner. As with coherent control errors and environmental decoherence, the specifics of other error channels can be modelled in a coherent or incoherent manner, depending on the physical mechanisms. The examples shown below are commonplace within QEC analysis and adequately apply to a large number of systems.

Measurement errors can be modelled in the same way as environmental decoherence, acting incoherently on the system. Measurement errors can be described in two slightly different ways. The first is to use the following positive operator valued measures (POVMs),

$F_0 = (1-p_M)|0\rangle\langle 0| + p_M|1\rangle\langle 1|, \qquad F_1 = (1-p_M)|1\rangle\langle 1| + p_M|0\rangle\langle 0|, \qquad (14)$

where pM is the probability of measurement error. These are not measurement projectors as $F_0^2 \neq F_0$ and $F_1^2 \neq F_1$ . The second method is to apply the following mapping to the qubit,

$\rho \;\rightarrow\; \rho_M = (1-p_M)\,\rho + p_M\,\sigma_x\rho\sigma_x, \qquad (15)$

followed by a perfect measurement in the (|0〉, |1〉) basis. These two models give exactly the same probabilities for each outcome. Defining A0 = |0〉〈0| and A1 = |1〉〈1| as the measurement projectors onto the (|0〉, |1〉) basis we find

$P(0) = {\rm Tr}(F_0\rho) = (1-p_M)\langle 0|\rho|0\rangle + p_M\langle 1|\rho|1\rangle, \qquad (16)$

$P(0) = {\rm Tr}(A_0\rho_M) = (1-p_M)\langle 0|\rho|0\rangle + p_M\langle 1|\rho|1\rangle. \qquad (17)$

Hence, either method will result in the same probabilities for a given value of pM. The difference between these two models is the state the measured qubit is projected to.

When using the POVMs in equation (14), the collapsed state of the qubit is given by

$\rho \;\rightarrow\; \frac{M_0\,\rho\, M_0^{\dagger}}{{\rm Tr}\big(M_0\,\rho\, M_0^{\dagger}\big)}, \qquad (18)$

where

$M_0 = \sqrt{F_0} = \sqrt{1-p_M}\,|0\rangle\langle 0| + \sqrt{p_M}\,|1\rangle\langle 1|. \qquad (19)$

Therefore the resulting state, after measurement, will not be initialized in a known state. If the second model is used, then the system is initialized to either |0〉 or |1〉 depending on which projector is used. To see this explicitly, consider the state,

$|\psi\rangle = \frac{1}{\sqrt{2}}\big(|0\rangle + |1\rangle\big). \qquad (20)$

Using both models, the probability of measuring |0〉 is 1/2 (as the above state is invariant under a bit-flip error). If the POVM model is used (using the F0 and M0 operators), the state after measurement is,

$|\psi'\rangle = \frac{M_0|\psi\rangle}{\sqrt{\langle\psi|F_0|\psi\rangle}} = \sqrt{1-p_M}\,|0\rangle + \sqrt{p_M}\,|1\rangle. \qquad (21)$

The collapsed state, after measurement, is a superposition of |0〉 and |1〉, with amplitudes determined by $p_M$. If we use the second model (using the projector A0), the state after measurement is simply |0〉. As measurement is generally followed by either discarding the qubit or reinitializing it in a known state, the two models can be used interchangeably with no adverse consequences.
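Both the equivalence of the outcome probabilities and the difference in post-measurement states can be checked directly. Below is a short numpy sketch of ours, with operator names following equations (14)–(21):

```python
import numpy as np

p_M = 0.1
ket0, ket1 = np.array([1, 0.]), np.array([0, 1.])
sX = np.array([[0, 1], [1, 0.]])
A0 = np.outer(ket0, ket0)                            # perfect projector |0><0|
F0 = (1 - p_M) * np.outer(ket0, ket0) + p_M * np.outer(ket1, ket1)
M0 = np.sqrt(1 - p_M) * np.outer(ket0, ket0) + np.sqrt(p_M) * np.outer(ket1, ket1)

psi = (ket0 + ket1) / np.sqrt(2)                     # the state of equation (20)
rho = np.outer(psi, psi)

p0_povm = np.trace(F0 @ rho)                         # model 1: noisy POVM
rho_M = (1 - p_M) * rho + p_M * sX @ rho @ sX        # model 2: flip, then ...
p0_flip = np.trace(A0 @ rho_M)                       # ... measure perfectly
print(p0_povm, p0_flip)                              # both 0.5

post = M0 @ psi                                      # POVM post-measurement state
print(post / np.linalg.norm(post))                   # sqrt(1-p_M)|0> + sqrt(p_M)|1>
```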

Qubit loss is modelled in a slightly different manner. When a qubit is lost, it is effectively removed from the system. This channel can be modelled by simply tracing out the lost qubit, Tri(ρ), where i is the index of the lost qubit. This reduces the dimensionality of the qubit space by a factor of two.

The loss of the physical object means that it cannot be directly measured or coupled to any other ancillary system. Even though this model of qubit loss (with regards to the information stored on the qubit) is equivalent to incoherent processes such as environmental coupling, correcting this type of error requires additional machinery on top of standard QEC protocols. Standard correction protocols can protect against the loss of information on a qubit, provided that the physical object still exists in the computer. Therefore, an initial non-demolition detection must be employed (which determines if the qubit is actually present without performing a projective measurement on the computational state) before standard correction can correct the error. Provided that these non-demolition protocols are employed, loss actually becomes a preferred channel in error correction. If a loss event is detected and the qubit is replaced, the error is heralded: we know which qubit experienced it. This additional information can be used within error correction protocols to increase performance.

Initialization of the qubit can be modelled either using a coherent systematic or incoherent process. If an incoherent model is employed, initialization can be modelled in essentially the same way as imperfect measurement. If we have a probability pI of initialization error (where pI encapsulates the internal mechanisms that introduce errors) and we initialize the qubit in the |0〉 state, the initial state of the system is given by the mixture,

$\rho_{\rm i} = (1-p_I)|0\rangle\langle 0| + p_I|1\rangle\langle 1|. \qquad (22)$

In contrast, we could consider an initialization model which is achieved via a coherent unitary operation where the target is the desired initial state. In this case, the initial state is pure, but contains a non-zero amplitude of the undesired target, for example,

$|\psi\rangle_{\rm i} = \alpha|0\rangle + \beta|1\rangle, \qquad (23)$

where |α|2 + |β|2 = 1 and |β|2 ≪ 1. The interpretation of these two types of initialization models is identical to the coherent and incoherent models presented. Again, the effect of these types of errors relates to the probabilities of measuring the system in an erred state.

One final type of error that we can briefly mention is qubit leakage. Qubit leakage manifests due to the fact that most systems utilized for qubits do not consist of just two levels. For example, figure 1 (from [Ste97b]) illustrates the energy level structure for a 43Ca+ ion utilized for ion trap quantum computing. The qubits in this system are defined with two electronic states, however the system itself contains many more levels (including some which are used for qubit readout and initialization). As with systematic errors, leakage can occur when improper control is applied to such a system. In the case of ion-traps, qubit transitions are performed by focusing finely tuned lasers resonant on the relevant transitions. If the laser frequency fluctuates or additional levels are not sufficiently detuned from the qubit resonance, the following transformation could occur,

$\alpha|0\rangle + \beta|1\rangle \;\rightarrow\; \alpha|0\rangle + \beta\cos(\theta)|1\rangle + \beta\sin(\theta)|2\rangle, \qquad (24)$

where the state |2〉 is a third level, which is now populated due to improper control. The actual effect of this type of error can manifest in several different ways. As quantum circuits and algorithms are fundamentally designed assuming the computational array is a collection of 2-level systems, the transformation shown in equation (24) (which in this case operates over a 3-level space) will naturally induce unwanted dynamics. Another important implication of applying non-qubit operations is how these levels interact with the environment and hence how decoherence affects the system. For example, in the above case, the unwanted level, |2〉, may be extremely short lived, leading to the emission of a photon and the system relaxing back to the ground state. For these reasons, leakage is one of the most problematic error channels to correct using QEC. In general, leakage induced errors need to be corrected in a similar manner to loss, which can also be thought of as a leakage process into a vacuum state. Non-demolition techniques are first required to determine if the system is still confined to the qubit subspace [Pre98, GBP97, VWW05], after which standard correction protocols can be utilized. As with qubit loss, if these additional protocols are employed, leakage errors are heralded and the performance of the resulting error correction will increase.


Figure 1. (From [Ste97b]) Energy level structure for the 43Ca+ ion investigated by the Oxford ion-trapping group. The structure of this ion is clearly not a 2-level quantum system. Hence leakage into non-qubit states is an important factor to consider.


Another well known method is to use a complicated pulse sequence which acts to re-focus an improperly confined operation back to the qubit subspace [WBL02, BLWZ05]. This can be advantageous as it does not require additional qubit resources.

Leakage effects arise from multiple sources, and the best method for correction is often heavily dependent on the error mechanisms. One example is when internal fluctuations in the system change otherwise perfect qubit dynamics. Unfortunately this leakage source generally cannot be predicted and/or characterized, and so dedicated leakage protection will need to be employed. A second source is imprecise fabrication of an otherwise perfect qubit system. This source of leakage can, in principle, be engineered away, thereby eliminating the need for specialized leakage correction. In the context of mass manufacturing of qubit systems, leakage would ideally be quantified immediately after the fabrication of a device [DSO+07] to eliminate these systematic leakage errors. If a particular system was found to be improperly confined to the qubit subspace, it would simply be discarded. Employing characterization at this stage could potentially eliminate the need to implement a large amount of leakage protection in the computer, shortening gate times and ultimately reducing error rates in the computer.

In this section we have presented a very basic set of examples to explain some of the ideas of quantum errors and how they affect the success of a quantum algorithm. Section 9 will return to a more realistic set of error models. For those interested in a more complete treatment of quantum errors we encourage readers to refer to [Got97, NC00, Got02, KLA+02, Ste01, Got09].

4. The 3-qubit code: a good starting point for quantum error correction

The 3-qubit bit-flip code is traditionally used as a basic introduction to the concept of quantum error correction. It should be emphasized that the 3-qubit code does not represent a full quantum code. This is due to the fact that the code cannot simultaneously correct for both bit and phase flips (section 9). This code is a repetition code extended by Shor [Sho95] to construct the 9-qubit quantum code, demonstrating that QEC was possible.

The 3-qubit code encodes a single logical qubit into three physical qubits with the property that it can correct for a single, σx, bit-flip error. The two logical basis states |0〉L and |1〉L are defined as

$|0\rangle_L \equiv |000\rangle, \qquad |1\rangle_L \equiv |111\rangle, \qquad (25)$

such that an arbitrary single qubit state |ψ〉 = α|0〉 + β|1〉 is mapped to,

$\alpha|0\rangle + \beta|1\rangle \;\rightarrow\; \alpha|000\rangle + \beta|111\rangle. \qquad (26)$

Figure 2 illustrates the quantum circuit required to encode a single logical qubit via the initialization of two ancilla qubits and two CNOT gates. The reason that this code is able to correct for a single bit-flip error is the binary distance between the two codeword states. Notice that three individual bit flips are required to take |0〉L ↔ |1〉L, hence if we assume |ψ〉 = |0〉L, a single bit flip on any qubit leaves the final state closer to |0〉L than |1〉L. The distance between two codeword states, d, defines the number of errors that can be corrected, t, as t = ⌊(d − 1)/2⌋. In this case, d = 3, hence t = 1.


Figure 2. Quantum circuit to encode a logical qubit with the 3-qubit code, where an arbitrary single qubit state, |ψ〉, is coupled to two freshly initialized ancilla qubits via CNOT gates to prepare |ψ〉L.


How are we able to correct errors using this code without directly measuring or obtaining information about the logical state? Two additional ancilla qubits are introduced, which are used to extract syndrome information (information regarding possible errors) from the data block without discriminating the exact state of any qubit. The encoding and correction circuit is illustrated in figure 3. Correction proceeds by introducing two ancilla qubits and performing a sequence of CNOT gates, which checks the parity of the three qubit data block. For the sake of simplicity we assume that all gate operations are perfect and the only place where the qubits are susceptible to error is the region between encoding and correction (as illustrated in figure 3). We will return to this issue in section 10 when we discuss fault-tolerance. We also assume that at most a single, complete bit-flip error occurs on one of the three data qubits. Table 1 summarizes the state of the whole system, for each possible error, just prior to measurement.


Figure 3. Circuit required to encode and correct for a single σx-error. We assume that after encoding a single bit-flip occurs on one of the three qubits (or no error occurs). Two initialized ancilla are then coupled to the data block which only checks the parity between qubits. These ancilla are then measured, with the measurement result indicating where (or if) an error has occurred, without directly measuring any of the data qubits. Using this syndrome information, the error can be corrected with a classically controlled σx gate.


Table 1. Final state of the five qubit system prior to the syndrome measurement for no error or a single X error on one of the qubits. The last two qubits represent the state of the ancilla. Note that each possible error will result in a unique measurement result (syndrome) of the ancilla qubits. This allows for a σx correction gate to be applied to the data block which is classically controlled from the syndrome result.

Error location Final state, |data〉|ancilla〉
No error α|000〉|00〉 + β|111〉|00〉
Qubit 1 α|100〉|11〉 + β|011〉|11〉
Qubit 2 α|010〉|10〉 + β|101〉|10〉
Qubit 3 α|001〉|01〉 + β|110〉|01〉

For each possible situation, either no error or a single bit-flip error, the ancilla qubits are flipped to a unique state based on the parity of the data block. These qubits are then measured to obtain the classical syndrome result. The result of the measurement will then dictate if a correction gate needs to be applied. This is illustrated in table 2.

Table 2. Ancilla measurements for a single σx error with the 3-qubit code. Each of the four possible results correspond to either no error or a bit flip on one of the three qubits.

Ancilla measurement Collapsed state Consequence
00 α|000〉 + β|111〉 No error
01 α|001〉 + β|110〉 σx on qubit 3
10 α|010〉 + β|101〉 σx on qubit 2
11 α|100〉 + β|011〉 σx on qubit 1

Provided that only a single error has occurred, the data block is restored. At no point during correction do we gain any information regarding the coefficients α and β, hence superpositions of the computational state will remain intact during correction.
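The full encode/error/correct cycle is simple to simulate. In the sketch below (our own illustration; helper names and 0-based qubit indexing are ours), the two syndrome bits are the parities checked by the ancilla, implemented as the operators Z₁Z₂ and Z₁Z₃ so as to reproduce the syndromes of table 1. Since the corrupted state is an eigenstate of both checks, the measurement outcomes are deterministic:

```python
import numpy as np

I2 = np.eye(2)
X = np.array([[0., 1.], [1., 0.]])
Z = np.diag([1., -1.])

def kron(*ops):
    out = np.array([[1.0]])
    for o in ops:
        out = np.kron(out, o)
    return out

alpha, beta = 0.6, 0.8
psi = np.zeros(8); psi[0], psi[7] = alpha, beta      # alpha|000> + beta|111>

Z1Z2, Z1Z3 = kron(Z, Z, I2), kron(Z, I2, Z)          # parity checks (table 1)
X_on = [kron(X, I2, I2), kron(I2, X, I2), kron(I2, I2, X)]

corrupted = X_on[1] @ psi                            # bit-flip on qubit 2

# eigenvalue +1 -> syndrome bit 0, eigenvalue -1 -> syndrome bit 1
syndrome = tuple(int(round((1 - corrupted @ K @ corrupted) / 2))
                 for K in (Z1Z2, Z1Z3))
correction = {(0, 0): None, (1, 1): 0, (1, 0): 1, (0, 1): 2}[syndrome]
if correction is not None:
    corrupted = X_on[correction] @ corrupted
print(syndrome, np.allclose(corrupted, psi))         # (1, 0)  True
```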

This code will only work if a maximum of one error occurs. The 3-qubit code has d = 3; if more than t = 1 errors occur, the resulting state becomes 'closer' to the wrong logical state. This can be seen in table 3 when tracking multiple errors through the correction circuit. In each case, the total number of errors is greater than d/2, therefore the correction step takes us to the wrong logical state. In every case where more than one error has occurred, the mis-correction induces a logical bit flip, causing the code to fail.

Table 3. Ambiguity of syndrome results when multiple errors occur.

Error location Final state, |data〉|ancilla〉 Assumed error
Qubit 1 and 2 α|110〉|01〉 + β|001〉|01〉 σx on qubit 3
Qubit 2 and 3 α|011〉|11〉 + β|100〉|11〉 σx on qubit 1
Qubit 1 and 3 α|101〉|10〉 + β|010〉|10〉 σx on qubit 2
Qubit 1, 2 and 3 α|111〉|00〉 + β|000〉|00〉 No error

To be absolutely clear on how QEC acts to restore the system and protect against errors let us now consider a different and slightly more physically realistic example. We will assume that the errors acting on the qubits are coherent rotations of the form $U = \exp(\mathrm{i}\epsilon\sigma_x)$ on each qubit, with $\epsilon \ll 1$. We choose coherent rotations so that we can remain in the state vector representation. This is not a necessary requirement, however more general incoherent mappings would require us to move to density matrices.

We assume that each qubit experiences the same error, hence the error operator acting on the state is

$E = U \otimes U \otimes U, \qquad (27)$

where

$U = \exp(\mathrm{i}\epsilon\sigma_x) = \cos(\epsilon)\,\sigma_I + \mathrm{i}\sin(\epsilon)\,\sigma_x. \qquad (28)$

Now let us examine the transformation that occurs when we run the error correction circuit in figure 3. This is represented via the unitary transformation, UQEC, over both the data and ancilla qubits,

$U_{\rm QEC}\,E|\psi\rangle_L|00\rangle = c_0|\psi\rangle_L|00\rangle + c_1\big(X_1|\psi\rangle_L|11\rangle + X_2|\psi\rangle_L|10\rangle + X_3|\psi\rangle_L|01\rangle\big) + c_2\big(X_1X_2|\psi\rangle_L|01\rangle + X_2X_3|\psi\rangle_L|11\rangle + X_1X_3|\psi\rangle_L|10\rangle\big) + c_3\,X_1X_2X_3|\psi\rangle_L|00\rangle, \qquad (29)$ where $X_i$ denotes $\sigma_x$ acting on qubit $i$ and $c_0 = \cos^3(\epsilon)$, $c_1 = \mathrm{i}\cos^2(\epsilon)\sin(\epsilon)$, $c_2 = -\cos(\epsilon)\sin^2(\epsilon)$, $c_3 = -\mathrm{i}\sin^3(\epsilon)$.

Once again, the ancilla block is measured and the appropriate correction operator is applied, yielding the results (up to renormalization) shown in table 4. In each case, after correction (based on the syndrome result), we are left with approximately the same state: a superposition of a 'clean state' with the logically flipped state, σxσxσx|ψ〉. Let us now examine in detail the amplitudes associated with each term in these states. If we consider the unitary U acting on a single, unencoded qubit, the rotation takes the state |ψ〉 to,

$U|\psi\rangle = \cos(\epsilon)|\psi\rangle + \mathrm{i}\sin(\epsilon)\,\sigma_x|\psi\rangle. \qquad (30)$

Table 4. Resulting logical state after a round of error correction with the 3-qubit code.

Ancilla measurement Collapsed state (with correction)
00 c0|ψ〉L + c3σxσxσx|ψ〉L
01 c1|ψ〉L + c2σxσxσx|ψ〉L
10 c1|ψ〉L + c2σxσxσx|ψ〉L
11 c1|ψ〉L + c2σxσxσx|ψ〉L

Consequently, the fidelity of the single qubit state is,

$F = |\langle\psi|U|\psi\rangle|^2 = \cos^2(\epsilon) + \sin^2(\epsilon)\,\langle\psi|\sigma_x|\psi\rangle^2 \geq \cos^2(\epsilon) \approx 1 - \epsilon^2. \qquad (31)$

In contrast, the worst case fidelity of the encoded qubit state after a cycle of error correction is,

$F_{\rm QEC} = \frac{|c_0|^2}{|c_0|^2 + |c_3|^2} = \frac{\cos^6(\epsilon)}{\cos^6(\epsilon) + \sin^6(\epsilon)} \approx 1 - \epsilon^6, \qquad (32)$

with probability $1 - 3\epsilon^2 + O(\epsilon^4)$, and

$F_{\rm QEC} = \frac{|c_1|^2}{|c_1|^2 + |c_2|^2} = \cos^2(\epsilon) \approx 1 - \epsilon^2, \qquad (33)$

with probability $3\epsilon^2 + O(\epsilon^4)$. This is the general idea of how QEC suppresses errors at the logical level. During a round of error correction, if no error is detected, the error on the resulting state is suppressed from $O(\epsilon^2)$ to $O(\epsilon^6)$. If a single error is detected, the fidelity of the resulting state remains the same. As the 3-qubit code is a single error correcting code, if one error has already been corrected then the failure rate of the logical qubit is conditional on experiencing one further error (which will be proportional to $\epsilon^2$). As $\epsilon \ll 1$ the majority of correction cycles will detect no error, and the fidelity of the resulting encoded state is higher than when unencoded.
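These fidelities and syndrome probabilities follow directly from the coefficients defined below equation (29), and are easy to check numerically. A brief sketch of ours (ε = 0.05 is an arbitrary choice):

```python
import numpy as np

eps = 0.05
c0, c1 = np.cos(eps)**3, np.cos(eps)**2 * np.sin(eps)   # |c_0|, |c_1|
c2, c3 = np.cos(eps) * np.sin(eps)**2, np.sin(eps)**3   # |c_2|, |c_3|

p_no_error = c0**2 + c3**2           # syndrome 00 (approx. 1 - 3*eps^2)
F_no_error = c0**2 / p_no_error      # equation (32) (approx. 1 - eps^6)
p_single = 3 * (c1**2 + c2**2)       # any single-error syndrome (approx. 3*eps^2)
F_single = c1**2 / (c1**2 + c2**2)   # equation (33) = cos^2(eps)

print(p_no_error, 1 - 3 * eps**2)
print(F_no_error, 1 - eps**6)
print(p_single, 3 * eps**2)
print(F_single, np.cos(eps)**2)
```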

It should be stressed that no error correction scheme will, in general, fully restore a corrupted state to the original logical state. The resulting state will generally contain a superposition or a mixture of a clean state and a logically erred state, depending on whether the error process is coherent or incoherent. The point is that the fidelity of the corrupted states, at the logical level, is greater than the corresponding fidelity for unencoded qubits. Consequently the probability of measuring the correct result at the end of a specific algorithm increases when the system is encoded. The example shown here is somewhat unphysical, i.e. it assumes perfect gate operations and errors consisting only of σx-rotations at a specific point in the circuit. In the coming sections we will introduce the concepts necessary to relax these assumptions.

5. The 9-qubit code: the first full quantum code

The 9-qubit error correcting code was first developed by Shor [Sho95] in 1995 and is based largely on the 3-qubit repetition code. The Shor code is a degenerate single error correcting code, able to correct a logical qubit from one bit-flip, one phase-flip or one of each, on any of the nine physical qubits. This code is therefore sufficient to correct for an arbitrary single qubit error (section 9). The two basis states for the code are

$|0\rangle_L = \frac{1}{2\sqrt{2}}\big(|000\rangle + |111\rangle\big)\otimes\big(|000\rangle + |111\rangle\big)\otimes\big(|000\rangle + |111\rangle\big), \qquad |1\rangle_L = \frac{1}{2\sqrt{2}}\big(|000\rangle - |111\rangle\big)\otimes\big(|000\rangle - |111\rangle\big)\otimes\big(|000\rangle - |111\rangle\big), \qquad (34)$

and the circuit to perform the encoding is shown in figure 4. Correction for X errors, for each block of three qubits encoded to $({\left\vert{000}\right\rangle}\pm {\left\vert{111}\right\rangle})/\sqrt{2}$, is identical to the 3-qubit code shown earlier. By performing the correction circuit shown in figure 3 for each block of three qubits, single σx ≡ X errors can be detected and corrected. Phase errors (σz ≡ Z) are corrected by examining the sign differences between the three blocks. The circuit shown in figure 5 achieves this. The first set of six CNOT gates compares the sign of blocks one and two, and the second set of CNOT gates compares the sign of blocks two and three. Note that a phase flip on any one qubit in a block of three has the same effect; this is why the 9-qubit code is referred to as a degenerate code. In other error correcting codes, such as the 5- or 7-qubit codes [Ste96a, LMPZ96], there is a one-to-one mapping between correctable errors and unique states. In degenerate codes such as the 9-qubit code, the mapping is not unique. Hence provided we know in which block the error occurs it does not matter which qubit we apply the correction operator to.


Figure 4. Circuit required to encode a single qubit with Shor's 9-qubit code.


Figure 5. Circuit required to perform Z-error correction for the 9-qubit code.


As the 9-qubit code can correct for a single X error in any one block of three and a single phase error on any of the nine qubits, this code is a full quantum error correcting code. Even if a bit and phase error occurs on the same qubit, the X correction circuit will detect and correct for bit flips, while the Z correction circuit will detect and correct for phase flips. As mentioned, the X error correction does have the ability to correct for up to three individual bit flips (provided each bit flip occurs in a different block of three). However, in general, the 9-qubit code is only a single error correcting code as it cannot handle multiple errors if they occur in certain locations.

The 9-qubit code is in fact related to a useful class of error correcting codes known as Bacon–Shor codes [Bac06]. These codes have the property that certain subgroups of error operators do not corrupt the logical space. For example, in the 9-qubit code, specific pairs of phase errors do not corrupt the logical states. Bacon–Shor codes are very nice codes from a computer architectural point of view. Error correction circuits and gates are generally simpler, allowing for circuit structures more amenable to the physical restrictions of a computer architecture [AC07]. Additionally, Bacon–Shor codes correcting larger numbers of errors share a similar structure; therefore, dynamical switching between codes can be performed in a fault-tolerant manner. This allows us to adapt the amount of error correction to better reflect the noise present at a physical level [SEDH08]. We will return and revisit these codes later in section 14.1.

6. Quantum error detection

So far we have focused on the ability not only to detect errors, but also to correct them. Another approach is to not enforce the correction requirement. Post-selected quantum computation, developed by Knill [Kni05], demonstrated that large-scale quantum computing could be achieved with much higher noise rates when error detection is employed instead of more costly correction protocols. The basic idea in post-selected schemes is to encode a large number of ancilla qubits with error detecting circuits. Sets of encoded qubits which pass error detection are selected and further utilized as encoded ancillas for error correction. In general, error detection is faster and requires fewer qubits than performing active error correction. By producing and verifying large numbers of encoded ancillas which are post-selected after verification, error correction can be performed without data qubits waiting as long for appropriate ancilla to be prepared, decreasing the number of errors that need to be corrected. One of the downsides to these types of schemes is that although they lead to large tolerable error rates, the resource requirements are much higher.

The simplest error detecting circuit is the 4-qubit code [GBP97]. This encodes two logical qubits onto four physical qubits with the ability to detect a single error on either of the two logical qubits. The four basis states for the code are

$|00\rangle_L = \tfrac{1}{\sqrt{2}}\big(|0000\rangle + |1111\rangle\big), \quad |01\rangle_L = \tfrac{1}{\sqrt{2}}\big(|0011\rangle + |1100\rangle\big), \quad |10\rangle_L = \tfrac{1}{\sqrt{2}}\big(|0101\rangle + |1010\rangle\big), \quad |11\rangle_L = \tfrac{1}{\sqrt{2}}\big(|0110\rangle + |1001\rangle\big). \qquad (35)$

Figure 6 illustrates the error detection circuit that can be utilized to detect a single bit and/or phase flip on one of these encoded qubits. If a single bit and/or phase flip occurs on one of the four qubits, then the corresponding ancilla qubit will be measured in the state |1〉. For example, let us consider the cases when a single bit flip occurs on one of the four qubits. The state of the system, just prior to the measurement of the ancilla, is given in table 5. Regardless of the location of the bit flip, the ancilla system is measured in the state |10〉. Similarly, if one considers a single phase error on any of the four qubits, the ancilla measurement will return |01〉. In both cases no information is obtained regarding where the error has occurred, hence it is not possible to correct the state. Instead the circuit must reset and re-run.


Figure 6. Circuit required to detect errors in the 4-qubit error detection code. If both ancilla measurements return |0〉, then the code state is error free. If either measurement returns |1〉, an error has occurred. Unlike the 9-qubit code, the detection of an error does not give sufficient information to correct the state.


Table 5. Qubit and ancilla state, just prior to measurement for the 4-qubit error detection code when a single bit-flip has occurred on at most one of the four qubits.

Error location Final state, |data〉|ancilla〉
No error |ψ〉L|00〉
Qubit 1 X1|ψ〉L|10〉
Qubit 2 X2|ψ〉L|10〉
Qubit 3 X3|ψ〉L|10〉
Qubit 4 X4|ψ〉L|10〉

7. Stabilizer formalism

So far we have described error correcting codes from the state vector representation of their encoded states. This is a rather inefficient method for describing the codes as the state representations and circuits will differ from code to code. Consequently we would like a representation that has a generalized method for error correction and circuit construction, regardless of the code used. The majority of error correcting codes that are used within the literature are members of a class known as stabilizer codes. Stabilizer codes are very useful to work with. The general formalism applies broadly and there exist general rules to construct preparation circuits, correction circuits and fault-tolerant logical gate operations once the stabilizer structure of the code is known.

The stabilizer formalism was first introduced by Daniel Gottesman [Got97] and uses the Heisenberg representation of quantum mechanics, describing quantum states in terms of operators rather than state vectors. A state |ψ〉 is defined to be stabilized by some operator, K, if it is a +1 eigenstate of K, i.e.

$K|\psi\rangle = |\psi\rangle. \qquad (36)$

For example, the single qubit state |0〉 is stabilized by the operator K = σz, i.e.

$\sigma_z|0\rangle = |0\rangle. \qquad (37)$

We can define multi-qubit states in the same way, by examining some of the group properties of multi-qubit operators.

Within the group of all possible, single qubit operators, there exists a subgroup, denoted the Pauli group, $\mathcal{P}$ , which contains the following elements,

$\mathcal{P} = \{\pm\sigma_I,\, \pm\mathrm{i}\sigma_I,\, \pm\sigma_x,\, \pm\mathrm{i}\sigma_x,\, \pm\sigma_y,\, \pm\mathrm{i}\sigma_y,\, \pm\sigma_z,\, \pm\mathrm{i}\sigma_z\}. \qquad (38)$

By utilizing the commutation and anti-commutation rules for the Pauli set, {σi}i=x,y,z,

$[\sigma_i, \sigma_j] = 2\mathrm{i}\,\epsilon_{ijk}\,\sigma_k, \qquad \{\sigma_i, \sigma_j\} = 2\,\delta_{ij}\,\sigma_I, \qquad (39)$

where

$\epsilon_{ijk} = \begin{cases} +1 & \text{if } (i,j,k) \text{ is an even permutation of } (x,y,z), \\ -1 & \text{if } (i,j,k) \text{ is an odd permutation of } (x,y,z), \\ 0 & \text{otherwise}, \end{cases} \qquad (40)$

and

$\delta_{ij} = \begin{cases} 1 & \text{if } i = j, \\ 0 & \text{if } i \neq j, \end{cases} \qquad (41)$

it is easy to show that $\mathcal{P}$ forms a group under matrix multiplication.

The Pauli group extends over N-qubits by simply taking the N fold tensor product of all elements of $\mathcal{P}$ , denoted as,

$\mathcal{P}_N = \mathcal{P}^{\otimes N}. \qquad (42)$

An N-qubit stabilizer state, |ψ〉N, is then defined by the N generators of an Abelian (all elements commute) subgroup, $\mathcal{G}$ , of the N-qubit Pauli group,

$K^i|\psi\rangle_N = |\psi\rangle_N \quad \forall\; i \in \{1, \dots, N\}, \qquad \mathcal{G} = \langle K^1, \dots, K^N\rangle. \qquad (43)$

The state |ψ〉N can be equivalently defined either through the state vector representation or by specifying the generators of the stabilizer group, $\mathcal{G}$ . As each stabilizer is both Hermitian and unitary, it squares to the identity, K · K = I, and hence has eigenvalues ±1; each stabilized state, |ψ〉, satisfies K|ψ〉 = |ψ〉.

Many extremely useful multi-qubit states are stabilizer states, including two-qubit Bell states, Greenberger–Horne–Zeilinger (GHZ) states [GHZ89, GHSZ90], Cluster states [BR01, RB01] and codeword states for QEC. As an example, consider a 3-qubit GHZ state,

$|{\rm GHZ}\rangle_3 = \frac{1}{\sqrt{2}}\big(|000\rangle + |111\rangle\big). \qquad (44)$

This state can be expressed via any three linearly independent generators of the |GHZ〉3 stabilizer group,

$K^1 = \sigma_x\otimes\sigma_x\otimes\sigma_x \equiv XXX, \qquad K^2 = \sigma_z\otimes\sigma_z\otimes\sigma_I \equiv ZZI, \qquad K^3 = \sigma_I\otimes\sigma_z\otimes\sigma_z \equiv IZZ, \qquad (45)$

where the right-hand side of each equation is the shorthand representation of stabilizers. Similarly, the four orthogonal Bell states,

$|B_{a,b}\rangle = \frac{1}{\sqrt{2}}\big(|0\rangle|b\rangle + (-1)^a|1\rangle|1\oplus b\rangle\big), \qquad (46)$

are stabilized by the operators, K1 = (−1)aXX, and K2 = (−1)bZZ, where [a, b] ∈ {0, 1}. Each of the four Bell states corresponds to one of the four ±1 eigenstate combinations of these two operators,

$XX|B_{a,b}\rangle = (-1)^a|B_{a,b}\rangle, \qquad ZZ|B_{a,b}\rangle = (-1)^b|B_{a,b}\rangle. \qquad (47)$
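Stabilizer relations like these are straightforward to verify numerically. The following sketch of ours checks the three GHZ generators of equation (45) (the tensor-product helper is our own):

```python
import numpy as np

I2 = np.eye(2)
X = np.array([[0., 1.], [1., 0.]])
Z = np.diag([1., -1.])

def kron(*ops):
    out = np.array([[1.0]])
    for o in ops:
        out = np.kron(out, o)
    return out

ghz = np.zeros(8)
ghz[0] = ghz[7] = 1 / np.sqrt(2)             # (|000> + |111>)/sqrt(2)

for K in (kron(X, X, X), kron(Z, Z, I2), kron(I2, Z, Z)):
    print(np.allclose(K @ ghz, ghz))         # True for all three generators
```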

8. Quantum error correction with stabilizer codes

The use of the stabilizer formalism to describe quantum error correction codes is extremely useful since it allows easy synthesis of correction circuits and also allows for quick determination of what logical operations can be applied directly on encoded data. The link between stabilizer codes and stabilizer states comes about by defining a relevant coding subspace within the larger Hilbert space of a multi-qubit system.

To illustrate this reduction let us examine a simple two qubit example. A 2-qubit system has a Hilbert space of dimension four; however, if we require that this two qubit state is stabilized by the XX operator, then there are only two orthogonal basis states which satisfy this condition,

$|0\rangle_L = \frac{1}{\sqrt{2}}\big(|00\rangle + |11\rangle\big), \qquad |1\rangle_L = \frac{1}{\sqrt{2}}\big(|01\rangle + |10\rangle\big), \qquad (48)$

which can be used to define an effective logical qubit. Hence by using stabilizers we can reduce the size of the Hilbert space for a multi-qubit system to an effective single qubit system. In the context of QEC, the stabilizers that are used to define the logical subspace are utilized to detect and correct for errors. A stabilizer code is therefore a subspace defined via stabilizer operators for a multi-qubit system.

Returning to QEC, the example we focus on is the most well known quantum code: the 7-qubit Steane code, first proposed in 1996 [Ste96a]. The 7-qubit code is defined as a [[n, k, d]] = [[7, 1, 3]] quantum code, where n = 7 physical qubits encode k = 1 logical qubit with a distance between basis states d = 3, correcting t = ⌊(d − 1)/2⌋ = 1 error. As the code contains a single logical qubit, the code space contains two valid codewords, the |0〉L and |1〉L basis states, which in state vector notation are

$|0\rangle_L = \frac{1}{\sqrt{8}}\big(|0000000\rangle + |1010101\rangle + |0110011\rangle + |1100110\rangle + |0001111\rangle + |1011010\rangle + |0111100\rangle + |1101001\rangle\big),$
$|1\rangle_L = \frac{1}{\sqrt{8}}\big(|1111111\rangle + |0101010\rangle + |1001100\rangle + |0011001\rangle + |1110000\rangle + |0100101\rangle + |1000011\rangle + |0010110\rangle\big). \qquad (49)$

For a general 7-qubit state, the total dimension of the Hilbert space is $2^7$. For a single logically encoded qubit, we must restrict this to a two-dimensional subspace spanned by the states in equation (49). This reduction can be clearly seen via stabilizers.

The stabilizer set for the 7-qubit code is fully specified by the six operators,

$\begin{aligned} K^1 &= IIIXXXX, \qquad & K^2 &= XIXIXIX, \qquad & K^3 &= IXXIIXX, \\ K^4 &= IIIZZZZ, \qquad & K^5 &= ZIZIZIZ, \qquad & K^6 &= IZZIIZZ. \end{aligned} \qquad (50)$

Each of the valid codestates for the 7-qubit code is stabilized by these operators. As the dimensionality of a 7-qubit system is $2^7$ and there are six stabilizers, the total dimension of the subspace defined by the stabilizer group is $2^{7-6} = 2$. The stabilizer set now defines an effective two-dimensional subspace, and thus a single encoded qubit.

The final operator fixes the encoded state to one of the two codewords. For the Steane code this operator is $\bar{Z} = ZZZZZZZ=Z^{\otimes 7}$ , where $\bar{Z}{\left\vert{0}\right\rangle}_L = {\left\vert{0}\right\rangle}_L$ and $\bar{Z}{\left\vert{1}\right\rangle}_L = -{\left\vert{1}\right\rangle}_L$ . Notice that these generators for the stabilizer set separate into elements consisting solely of X or Z operators. This defines the code as a Calderbank–Shor–Steane (CSS) code. CSS codes are useful since they allow for the straightforward application of several logical gate operations directly to the encoded data (section 11).
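This dimension counting can be checked by brute force: multiplying the projectors (I + K)/2 for the six generators of equation (50) and taking the trace yields the dimension of the code space. A numpy sketch of ours (exponential in the number of qubits, so only practical for small codes; the string-based tensor helper is our own):

```python
import numpy as np

P = {'I': np.eye(2), 'X': np.array([[0., 1.], [1., 0.]]), 'Z': np.diag([1., -1.])}

def kron_str(s):
    out = np.array([[1.0]])
    for c in s:
        out = np.kron(out, P[c])
    return out

steane = ['IIIXXXX', 'XIXIXIX', 'IXXIIXX',       # K1-K3 (X-type)
          'IIIZZZZ', 'ZIZIZIZ', 'IZZIIZZ']       # K4-K6 (Z-type)

proj = np.eye(2**7)
for s in steane:
    proj = proj @ (np.eye(2**7) + kron_str(s)) / 2   # project onto +1 eigenspace

# rank of the joint projector = dimension of the code space
print(int(round(np.trace(proj))))                # 2: exactly one logical qubit
```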

Although the 7-qubit code is the most well-known stabilizer code, there are other stabilizer codes which encode multiple logical qubits and correct for more errors [Got97]. Despite these advantages, larger codes generally require more complicated error correction circuits. As the complexity of the error correction circuits increases, adapting such codes to physical computer architectures becomes more difficult. Tables 6 and 7 show the stabilizer structure of two other well-known codes. The 9-qubit code [Sho95] examined earlier, and the 5-qubit code [LMPZ96] which represents the smallest possible quantum code that corrects for a single error.

Table 6. The eight stabilizers for the 9-qubit Shor code, encoding nine physical qubits into one logical qubit to correct for a single X and/or Z error.

K1 Z Z I I I I I I I
K2 Z I Z I I I I I I
K3 I I I Z Z I I I I
K4 I I I Z I Z I I I
K5 I I I I I I Z Z I
K6 I I I I I I Z I Z
K7 X X X X X X I I I
K8 X X X I I I X X X

Table 7. The four stabilizers for the [[5,1,3]] quantum code, encoding five physical qubits into one logical qubit to correct for a single X, Y or Z error. Unlike the 7- and 9-qubit codes, the [[5,1,3]] code is a non-CSS code, since the stabilizer set does not separate into X and Z sectors. Additionally, this code can only correct a single error, whereas the 7- and 9-qubit codes can correct for a maximum of two errors (a single X error and a single Z error, which may occur on the same or on different qubits).

K1 X Z Z X I
K2 I X Z Z X
K3 X I X Z Z
K4 Z X I X Z

8.1. State preparation

Using the stabilizer structure for QEC codes, the logical state preparation and error correcting procedure is straightforward. Recall that valid codeword states are defined as simultaneous +1 eigenstates of each generator of the stabilizer group. In order to prepare a logical state from an arbitrary input, we need to project qubits into eigenstates of each of these operators.

Consider the circuit shown in figure 7. For an arbitrary input state, |ψ〉I, an ancilla which is initialized in the |0〉 state is used as a control qubit for a unitary and Hermitian operation (U = U†, U2 = I) on |ψ〉I. After the second Hadamard gate is performed, the state of the system is,

$\frac{1}{2}|0\rangle\big(|\psi\rangle_I + U|\psi\rangle_I\big) + \frac{1}{2}|1\rangle\big(|\psi\rangle_I - U|\psi\rangle_I\big). \qquad (51)$

The ancilla qubit is then measured in the computational basis. If the result is |0〉, the input state is projected to (neglecting normalization),

$|\psi\rangle_F = |\psi\rangle_I + U|\psi\rangle_I, \qquad (52)$

Since U is unitary and Hermitian, U|ψ〉F = |ψ〉F, hence |ψ〉F is a +1 eigenstate of U. If the ancilla is measured to be |1〉, then the input is projected to the state,

$|\psi\rangle_F = |\psi\rangle_I - U|\psi\rangle_I, \qquad (53)$

which is the −1 eigenstate of U. Therefore, provided U is Hermitian, the general circuit of figure 7 will project an arbitrary input state to a ±1 eigenstate of U. This procedure is well known and is referred to as either a 'parity' or 'operator' measurement [NC00].


Figure 7. Quantum circuit required to project an arbitrary state, |ψ〉I, into a ±1 eigenstate of the Hermitian operator, U = U†. The measurement result of the ancilla determines which eigenstate |ψ〉I is projected to; since the operator U is both Hermitian and unitary, it can only have the two eigenvalues ±1.

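Because the two branches of this circuit are $(I \pm U)|\psi\rangle_I/2$, the operator measurement can be simulated without explicitly constructing the ancilla. A minimal sketch of ours (the function name `operator_measurement` is hypothetical), here projecting a random two-qubit state into an eigenstate of XX:

```python
import numpy as np

def operator_measurement(psi, U, rng):
    """Simulate figure 7 for a Hermitian, unitary U: returns (outcome m,
    normalized post-measurement state), which has eigenvalue (-1)^m."""
    branch = [(psi + U @ psi) / 2, (psi - U @ psi) / 2]   # ancilla |0> / |1>
    p0 = np.vdot(branch[0], branch[0]).real               # probability of outcome 0
    m = 0 if rng.random() < p0 else 1
    return m, branch[m] / np.linalg.norm(branch[m])

rng = np.random.default_rng(7)
X = np.array([[0., 1.], [1., 0.]])
XX = np.kron(X, X)

psi = rng.normal(size=4) + 1j * rng.normal(size=4)        # random 2-qubit state
psi /= np.linalg.norm(psi)
m, proj = operator_measurement(psi, XX, rng)
print(m, np.allclose(XX @ proj, (-1)**m * proj))          # eigenvalue check: True
```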

From this construction it should be clear how QEC state preparation proceeds. Taking the [[7, 1, 3]] code as an example (the following technique is not unique or optimal for preparing and correcting the 7-qubit code [Ste97a, Ste02], but once this idea is understood, readers should have little difficulty understanding other correction techniques), seven qubits are first initialized in the state |0〉⊗7. The circuit shown in figure 7 is applied three times with U = K1, K2, K3, projecting the input state into a simultaneous ±1 eigenstate of each X generator of the stabilizer group for the [[7, 1, 3]] code. The result of each operator measurement is then used to classically control a single qubit Z gate which is applied to one of the seven qubits at the end of the preparation. This single Z gate converts any −1 projected eigenstates into +1 eigenstates. Notice that the final three stabilizers do not need to be measured, since the input state, |0〉⊗7, is already a +1 eigenstate of K4, K5 and K6. Additionally, as the state |0〉⊗7 is also a +1 eigenstate of the logical operator $\bar{Z}$, the prepared state will be the |0〉L state. Figure 8 illustrates the final circuit, where instead of one ancilla, three are utilized to speed up the state preparation by performing each operator measurement in parallel.


Figure 8. Quantum circuit to prepare the [[7, 1, 3]] logical |0〉 state. The input state |0〉⊗7 is projected into an eigenstate of each of the X stabilizers shown in equation (50). After the ancilla measurements, the classical results are used to apply a single qubit Z gate to qubit $i = 4M_1 + M_2 + 2M_3$, where $M_j \in \{0, 1\}$ is the measurement result for $K^j$; this converts any −1 eigenstates of (K1, K2, K3) into +1 eigenstates.


As a quick aside, let us detail exactly how the relevant logical basis states can be derived from the stabilizer structure of the code by utilizing this preparation procedure. We will use the stabilizer set shown in table 7 to calculate the |0〉L state for the 5-qubit code, as we have not yet explicitly shown the state vectors for the two logical states. The four stabilizer generators are given by,

$K^1 = XZZXI, \qquad K^2 = IXZZX, \qquad K^3 = XIXZZ, \qquad K^4 = ZXIXZ. \qquad (54)$

In an identical way to the 7-qubit code, projecting an arbitrary state into a +1 eigenstate of these operators defines the two logical basis states, |0〉L and |1〉L. The logical operator, $\bar{Z} = ZZZZZ$ , then fixes the state to either |0〉L or |1〉L. Therefore, calculating |0〉L from some initial unencoded state requires us to project the initial state into a +1 eigenstate of these operators. If we take the initial, unencoded state as |00000〉, then it is already a +1 eigenstate of $\bar{Z}$ . Therefore, to find |0〉L we simply calculate

$|0\rangle_L = \prod_{i=1}^{4}(I + K_i)|00000\rangle, \qquad (55)$

up to normalization. Expanding out this product, we find,

Equation (56)

Note that the above state vector does not match those given in [LMPZ96]. However, these vectors are equivalent up to local rotations on each qubit.
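The product in equation (55) can be evaluated numerically in a few lines. The sketch below assumes the commonly quoted cyclic generators XZZXI, IXZZX, XIXZZ, ZXIXZ for the 5-qubit code; if table 7 uses a different set or ordering, the resulting state vector will differ by a relabelling.

```python
import numpy as np
from functools import reduce

# Sketch of equation (55): project |00000> with (I + K_i) for each generator.
# The cyclic generators below are an assumption; table 7's set may differ.
I = np.eye(2)
X = np.array([[0, 1], [1, 0]])
Z = np.array([[1, 0], [0, -1]])
pauli = {'I': I, 'X': X, 'Z': Z}

def op(s):  # tensor product built from a Pauli string such as 'XZZXI'
    return reduce(np.kron, [pauli[c] for c in s])

gens = ['XZZXI', 'IXZZX', 'XIXZZ', 'ZXIXZ']
state = np.zeros(2**5)
state[0] = 1.0                                  # |00000>
for g in gens:
    state = (np.eye(2**5) + op(g)) @ state      # project onto the +1 eigenspace
state /= np.linalg.norm(state)

# Verify: a +1 eigenstate of every generator and of Zbar = ZZZZZ
for g in gens + ['ZZZZZ']:
    assert np.allclose(op(g) @ state, state)
print(np.nonzero(state)[0])  # the 16 computational basis states in |0>_L
```

The printed indices list the computational basis states appearing in |0〉L, which can be compared against the expansion in equation (56).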

8.2. Error correction

Error correction using stabilizer codes is an extension of the state preparation. Consider an arbitrary single qubit state that has been encoded as

$|\psi\rangle_L = \alpha|0\rangle_L + \beta|1\rangle_L. \qquad (57)$

Now assume that an error occurs on one (or multiple) qubits, described via the operator E, where E is a combination of X and/or Z errors over the N physical qubits of the logical state (and therefore an element of the N-qubit Pauli group, $\mathcal{P}_N$ ). By definition of stabilizer codes, Ki|ψ〉L = |ψ〉L, i ∈ [1, ..., N − k], for a code encoding k logical qubits. Hence the erred state, E|ψ〉L, satisfies,

$K_iE|\psi\rangle_L = (-1)^m EK_i|\psi\rangle_L = (-1)^m E|\psi\rangle_L, \qquad (58)$

where m is defined as m = 0, if [E, Ki] = 0 and m = 1, if {E, Ki} = 0 (E and K are Pauli group operators and all Pauli group operators either commute or anti-commute). Therefore, if the error operator commutes with the stabilizer, the state remains a +1 eigenstate of Ki, if the error operator anti-commutes with the stabilizer then the logical state is flipped to a −1 eigenstate of Ki.

The general procedure for error correction is identical to state preparation. Each of the code stabilizers is sequentially measured. Since an error free state is already a +1 eigenstate of all the stabilizers, errors which anti-commute with any of the stabilizers describing the code will flip the relevant eigenstate and consequently measuring the parity of these stabilizers will return a result of |1〉. Taking the [[7, 1, 3]] code as an example, if the error operator is E = Xi (where i = 1, ..., 7), representing a bit-flip on any one of the 7 physical qubits, then regardless of the location, E will anti-commute with a unique combination of K4, K5 and K6. Hence the classical results of measuring these three operators will indicate if and where a single X error has occurred. Similarly, if E = Zi, then the error operator will anti-commute with a unique combination of, K1, K2 and K3. Consequently, the first three stabilizers for the [[7, 1, 3]] code correspond to Z error correction while the second three stabilizers correspond to X error correction. Note that correction for Pauli Y errors is achieved by correcting in the X and Z sector since a Y error on a single qubit is equivalent to both an X and Z error on the same qubit, i.e. Y = iXZ.
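The one-to-one mapping between single errors and syndromes is easy to check directly. The sketch below assumes the standard Steane generators K1 = IIIXXXX, K2 = IXXIIXX and K3 = XIXIXIX for the X sector (the exact ordering in equation (50) may differ); a Z error anti-commutes with a stabilizer precisely when the stabilizer applies X to the erred qubit.

```python
# Syndrome uniqueness for the [[7, 1, 3]] code, assuming the generators
# K1 = IIIXXXX, K2 = IXXIIXX, K3 = XIXIXIX (this ordering is an assumption).
x_stabs = ['IIIXXXX', 'IXXIIXX', 'XIXIXIX']

for q in range(7):
    # A single Z error on qubit q anti-commutes with a stabilizer iff
    # that stabilizer acts with X on qubit q.
    syndrome = tuple(int(s[q] == 'X') for s in x_stabs)
    print(f'Z error on qubit {q + 1}: syndrome {syndrome}')
# All seven syndromes are distinct and non-zero, so the three measurement
# results uniquely locate any single Z error; the Z-type generators locate
# X errors in exactly the same way.
```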

Figure 9 illustrates the circuit for full error correction with the [[7, 1, 3]] code. As you can see, it is simply an extension of the preparation circuit shown in figure 8, where all six stabilizers are measured across the data block. Even though we have specifically used the [[7, 1, 3]] code as an example, this procedure for error correction and state preparation will work for all stabilizer codes (although other correction procedures are possible [Ste97a, Ste02, Kni05]).


Figure 9. Quantum circuit to correct for a single X and/or Z error using the [[7, 1, 3]] code. Each of the six stabilizers is measured, with the first three detecting and correcting for Z errors, while the last three detect and correct for X errors.


9. Digitization of quantum errors

Up until now we have remained fairly abstract regarding the analysis of quantum errors. Specifically, we have examined QEC from the standpoint of a discrete set of Pauli errors occurring at certain locations within a larger quantum circuit. In this section we examine how this analysis of errors relates to some of the more realistic processes such as environmental decoherence and systematic gate errors.

Digitization of quantum noise is often assumed when people examine the stability of quantum circuit design or attempt to calculate thresholds for concatenated error correction (section 10.4). However, the link between discrete Pauli errors and more general, continuous noise only makes sense when we consider the stabilizer nature of the correction procedure. Recall from section 7 that correction is performed by re-projecting a potentially corrupt data block into +1 eigenstates of the code stabilizers. A general continuous mapping from a 'clean' codeword state to a corrupt one will not produce an eigenstate of the code stabilizers; instead, the corrupt state will be a superposition of eigenstates. Measuring the parity of each of the stabilizers acts to digitize the quantum noise. We will first introduce how a coherent systematic error, caused by imperfect implementation of quantum gates, is digitized during correction, after which we will briefly discuss environmental decoherence from the standpoint of a Markovian decoherence model.

9.1. Systematic gate errors

We have already shown a primitive example of how systematic gate errors are digitized into a discrete set of Pauli operators in section 3. However, in that case we only considered a very restrictive type of error, namely the coherent operator U = exp(iεX). We can easily extend this analysis to cover all types of systematic gate errors. Consider an N qubit unitary operation, UN, which is valid on encoded data. Assume that UN is applied inaccurately such that the resultant operation is actually $\mathcal{U}_N$ . Given a general encoded state |ψ〉L, the final state can be expressed as

$\mathcal{U}_N|\psi\rangle_L = U_EU_N|\psi\rangle_L = \sum_j \alpha_j E_jU_N|\psi\rangle_L = \sum_j \alpha_j E_j|\psi'\rangle_L, \qquad (59)$

where |ψ'〉L = UN|ψ〉L is the result of the perfectly applied N qubit gate (the stabilizer group for |ψ'〉L remains invariant under the operation UN (see section 11)) and UE is a coherent error operator which is expanded in terms of the N qubit Pauli group, Ej ∈ $\mathcal{P}_N$ . Now append two ancilla blocks, |A0〉X and |A0〉Z. These are generally initialized to the state that corresponds to no detected errors (but can be other initial states up to a redefinition of the measurement results) and are used for X and Z correction respectively. We then run the syndrome extraction procedure, which we represent by the unitary operator, UQEC. It will be assumed that |ψ〉L is encoded with a QEC code which can correct for a single error (both X and/or Z) and that the error operators Ej have at most weight one (i.e. Ej contains at most one non-identity term, e.g. E1 = X1 ⊗ I⊗(N−1)), hence there is a one-to-one mapping between the error operators, Ej, and the orthogonal basis states of the ancilla blocks,

$U_{\rm QEC}\Big(\sum_j \alpha_j E_j|\psi'\rangle_L\Big)|A_0\rangle_X|A_0\rangle_Z = \sum_j \alpha_j E_j|\psi'\rangle_L|A_j\rangle_X|A_j\rangle_Z. \qquad (60)$

The above assumes a non-degenerate quantum code. The ancilla blocks are then measured, projecting the data block into the state Ej|ψ'〉L with probability |αj|2. After measurement, the correction $E_j^{\dagger}$ is applied based on the syndrome result. As the error operation Ej is simply an element of $\mathcal{P}_N$ , correcting for X and Z independently is sufficient to correct for all error operators (as Y errors are corrected when a bit and phase error is detected and corrected on the same qubit).

For well designed gates, very small systematic inaccuracies lead to the expansion coefficient α0 ≈ 1, with all other coefficients, αj≠0 ≪ 1. Hence during correction there will be a very high probability that no error is detected. This is the digitization effect of QEC. Since codeword states are eigenstates of the stabilizers, re-projecting the state when each stabilizer is measured forces any continuous noise operator to collapse. The strength of the error is then related to the probability that the data block collapses to a specific Pauli error acting on the codestate. The physical mechanisms which are used to construct quantum gates are what determine the form of UE and hence the form of the error operators, Ej and their respective coefficients, αj.
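As a worked instance of this digitization, consider again the over-rotation U = exp(iεX); the value ε = 0.05 in the sketch below is an arbitrary illustration.

```python
import numpy as np

# U = exp(i*eps*X) = cos(eps) I + i sin(eps) X, so in the Pauli expansion
# alpha_0 = cos(eps) and alpha_x = i sin(eps). Syndrome extraction then
# collapses the state to 'no error' with probability cos(eps)**2 and to a
# discrete X error with probability sin(eps)**2.
eps = 0.05
p_no_error = np.cos(eps) ** 2
p_x_error = np.sin(eps) ** 2
print(p_no_error, p_x_error)   # ~0.9975 and ~0.0025
```

A small coherent miscalibration therefore appears, after correction, as a rare stochastic X error.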

9.2. Environmental decoherence

A complete analysis of environmental decoherence in relation to quantum information is a lengthy topic. Rather than a detailed review, we will present a simplified example to highlight how QEC relates to environmental effects.

The Lindblad formalism [Gar91, NC00, DWM03] provides an elegant method for analysing the effect of decoherence on open quantum systems. This model does have several assumptions, most notably that the environmental bath couples weakly to the system (Born approximation), that the system and environment are initially in a separable state and that each qubit experiences temporally un-correlated noise (Markovian approximation). While these assumptions are utilized for a variety of systems [BHPC03, BM03, BKD04], it is known that they may not hold in some cases [HMCS00, MCM+05, APN+05, ALKH02]. This is particularly important in superconducting systems, where decoherence can be caused by small numbers of fluctuating charges, which induces coloured noise and therefore violates the Markovian assumption. In such cases more specific decoherence models need to be considered.

Using this formalism, the evolution of the density matrix can be written as

$\frac{\partial\rho}{\partial t} = -\frac{\rmi}{\hbar}[H,\rho] + \sum_k \Gamma_k\mathcal{L}_k[\rho], \qquad (61)$

where H is the Hamiltonian, representing coherent, dynamical evolution of the system and $\mathcal{L}_k[\rho]=([L_k,\rho L_k^{\dagger}]+[L_k\rho, L_k^{\dagger}])/2$ represents the incoherent evolution. The operators Lk are known as the Lindblad quantum jump operators and are used to model specific decoherence channels, with each operator associated with some rate Γk ⩾ 0. This differential equation is known as the quantum Liouville equation or more generally, the density matrix master equation.

To link Markovian decoherence to QEC, consider a special set of decoherence channels which represent a single qubit undergoing dephasing, spontaneous emission and spontaneous absorption. This helps to simplify the calculation. Dephasing of a single qubit is modelled by the Lindblad operator L1 = Z while spontaneous emission/absorption are modelled by the operators L2 = |0〉〈1| and L3 = |1〉〈0|, respectively. For the sake of simplicity, we assume that absorption/emission occur at the same rate, Γ. Although this is unphysical, it does simplify the calculation significantly for this example. The density matrix evolution is given by,

Equation (62)

If we assume that the qubit is not undergoing any coherent evolution (H = 0), i.e. a memory stage within a quantum algorithm, then equation (62) can be solved by re-expressing the density matrix in the Bloch formalism. Setting ρ(t) = I/2 + x(t)X + y(t)Y + z(t)Z, equation (62) with H = 0 reduces to ∂tS(t) = AS(t), with S(t) = (x(t), y(t), z(t))T and

Equation (63)

This differential equation is easy to solve, leading to

Equation (64)

where,

Equation (65)

If this single qubit is part of an encoded data block, then each term represents a single error on the qubit experiencing decoherence. Two blocks of ancilla qubits, initialized to the state corresponding to no detected error, are added to the system and the error correction protocol is run. Once the ancilla qubits are measured, the state will collapse to no error with probability 1 − p(t), or to a single X, Y or Z error with probabilities px(t), py(t) and pz(t), respectively. Although this example is somewhat artificial, it should allow the reader to perform their own calculations of expected QEC error rates given a more complete, physically motivated error model.

We can also see how temporal effects are incorporated into the error correction model. The temporal integration window, t, of the master equation influences the probability that an error is detected for a fixed rate Γ: the longer the time between correction cycles, the more probable it is that the qubit experiences an error.
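As a simple, self-contained special case (an assumption on top of the text, which treats dephasing, emission and absorption together), keeping only the dephasing channel L1 = Z at rate Γ with H = 0 gives Bloch equations x' = −2Γx, y' = −2Γy, z' = 0, which is exactly the discrete channel ρ → (1 − p)ρ + pZρZ:

```python
import numpy as np

# Pure dephasing as a digitized channel: for L1 = Z at rate Gamma and
# H = 0, the off-diagonal Bloch components decay as exp(-2*Gamma*t),
# equivalent to a Z error occurring with probability p(t).
def p_z(gamma, t):
    return (1 - np.exp(-2 * gamma * t)) / 2

for t in (0.1, 1.0, 10.0):
    print(t, p_z(0.01, t))  # longer windows between corrections -> larger p
```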

9.3. More general mappings

Both systematic gate errors and errors induced by environmental decoherence illustrate the digitization effect of QEC. However, we can generalize digitization to other mappings of the density matrix. In this case, consider a more general Kraus map on a multi-qubit density matrix,

$\rho \rightarrow \sum_k A_k\rho A_k^{\dagger}, \qquad (66)$

where $\sum A_k^{\dagger}A_k = I$ . For the sake of simplicity let us choose a simple mapping where $A_1 = (Z_1+\rmi Z_2)/\sqrt{2}$ and Ak = 0 for k ≠ 1. This mapping essentially represents dephasing on two qubits. However, this type of mapping (when considered in the context of error correction) represents independent Z errors on either qubit one or two.

The above discussion assumes that we are working with a non-degenerate quantum code. If the code is degenerate, i.e. if the errors Z1 and Z2 have the same effect on the codestates, then the correction protocol does not simplify this error mapping in the manner described below.

To illustrate, first expand out the density matrix (neglecting normalization),

$\rho \rightarrow Z_1\rho Z_1 + Z_2\rho Z_2 + \rmi Z_2\rho Z_1 - \rmi Z_1\rho Z_2. \qquad (67)$

Note that only the first two terms in this expansion, on their own, represent physical mixtures. However, the last two off-diagonal terms are removed by the process of syndrome extraction and are irrelevant in the context of QEC. To illustrate, we assume that ρ represents a protected qubit, where Z1 and Z2 are physical errors on qubits comprising the code block. As we are only considering phase errors in this example, we will ignore X correction (but the analysis automatically generalizes if the error mapping contains X terms). A fresh ancilla block, represented by the density matrix $\rho^z_0$ , is coupled to the system and the unitary UQEC is run,

$U_{\rm QEC}(\rho\otimes\rho^z_0)U_{\rm QEC}^{\dagger} = Z_1\rho Z_1\otimes|Z_1\rangle\langle Z_1| + Z_2\rho Z_2\otimes|Z_2\rangle\langle Z_2| + \rmi Z_2\rho Z_1\otimes|Z_2\rangle\langle Z_1| - \rmi Z_1\rho Z_2\otimes|Z_1\rangle\langle Z_2|, \qquad (68)$

where |Z1〉 and |Z2〉 represent the two orthogonal syndrome states of the ancilla that are used to detect phase errors on qubits one and two respectively. The important part of the above expression is that when the syndrome qubits are measured, the state collapses to,

$Z_1\rho Z_1\otimes|Z_1\rangle\langle Z_1| + Z_2\rho Z_2\otimes|Z_2\rangle\langle Z_2|. \qquad (69)$

The two cross terms in the above expression are never observed. In this mapping the only two possible states that exist after the measurement of the ancilla system are,

$\rho_1 = Z_1\rho Z_1 \quad {\rm or} \quad \rho_2 = Z_2\rho Z_2. \qquad (70)$

Therefore, not only are the cross terms eliminated via error correction but the final density matrix again collapses to a single error perturbation of 'clean' codeword states with no correlated errors.

Consequently, it is common in standard QEC analysis to assume that discrete X and/or Z errors are applied stochastically to each qubit after each elementary gate operation, measurement, initialization and memory step with some probability p. The QEC protocol itself digitizes the continuous noise, either coherent or incoherent errors, into a discrete set of bit and/or phase flips. The set of discrete errors is determined by the set of errors that are distinguishable by the quantum code used. The magnitude of the continuous error is translated to the probability of detecting a discrete error. In this way error correction can be analysed by assuming perfect gate operations and discrete, probabilistic errors. The probability of these errors occurring can then be independently calculated via analysis of the physical mechanisms which produce errors. While a local, stochastic error model is generally the most common, depending on the physical system under consideration and the type of quantum codes being used a more complicated analysis may be needed [Pre98].
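In simulations this stochastic model is typically implemented by sampling discrete errors after each operation. A minimal sketch follows; the equal X/Y/Z split is one common choice, not the only one.

```python
import random

# Sample a stochastic Pauli error pattern for one time step: each qubit
# independently suffers an error with probability p, chosen uniformly
# from {X, Y, Z} (a depolarizing split).
def sample_errors(n_qubits, p, rng=None):
    rng = rng or random.Random()
    return {q: rng.choice('XYZ')
            for q in range(n_qubits) if rng.random() < p}

print(sample_errors(n_qubits=7, p=0.01))  # usually {}, occasionally e.g. {3: 'Z'}
```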

10. Fault-tolerant quantum error correction and the threshold theorem

Section 7 detailed the protocols required to correct for quantum errors; however, in the discussion so far we have implicitly assumed the following,

  • 1.  
    Errors only occur during 'memory' regions, i.e. when quantum operations or error correction are not being performed and ancilla qubits are error free.
  • 2.  
    The quantum gates themselves do not induce any systematic errors within the logical data block.

Clearly these are two very unrealistic assumptions, and fault-tolerant QEC is how we address them. As the name suggests, fault-tolerance is a design methodology for tolerating faults so that QEC remains effective. Combining error correction techniques with fault-tolerance allows us to design correction procedures and logical gate operations that still function when the above assumptions are relaxed.

10.1. Error propagation

Before discussing the nature of fault-tolerant computation, we first examine how errors can propagate in quantum circuits. There are essentially two dominant channels through which errors can be copied. The first is quantum gates: for obvious reasons, gates operating on single qubits do not copy errors, but quantum gates which couple qubits can cause errors to propagate.

For example, take the two qubit state Ej|ψ〉, where Ej ∈ {X1I2, Y1I2, Z1I2, I1X2, I1Y2, I1Z2} is a single qubit error (acting on either qubit one or two). We now perform a CNOT operation, U = CNOT, where the control is qubit one. This gives,

$UE_j|\psi\rangle = UE_jU^{\dagger}U|\psi\rangle = E'_jU|\psi\rangle, \qquad (71)$

using the identity U†U = I. The effect of the gate is to transform the error Ej to E'j = UEjU†. For the six single qubit errors, this gives

$X_1I_2 \rightarrow X_1X_2, \qquad I_1X_2 \rightarrow I_1X_2,$
$Y_1I_2 \rightarrow Y_1X_2, \qquad I_1Y_2 \rightarrow Z_1Y_2,$
$Z_1I_2 \rightarrow Z_1I_2, \qquad I_1Z_2 \rightarrow Z_1Z_2. \qquad (72)$

Therefore, the CNOT gate copies X errors from the control qubit to the target and copies Z errors from the target to the control. Error propagation rules can be calculated in the same way for an arbitrary multi-qubit gate, U, and error, E.
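These propagation rules can be verified directly by conjugation, e.g. with NumPy:

```python
import numpy as np

# Verify two entries of equation (72): E -> CNOT E CNOT (CNOT is real and
# self-inverse, so conjugation needs no explicit dagger).
I = np.eye(2)
X = np.array([[0, 1], [1, 0]])
Z = np.array([[1, 0], [0, -1]])
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]])

print(np.allclose(CNOT @ np.kron(X, I) @ CNOT, np.kron(X, X)))  # X1I2 -> X1X2
print(np.allclose(CNOT @ np.kron(I, Z) @ CNOT, np.kron(Z, Z)))  # I1Z2 -> Z1Z2
```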

The above example assumes a perfect 2-qubit gate operation; however, the gates themselves can also introduce multiple errors when they fail. In the case of coherent errors, an arbitrary 2-qubit unitary, U, can be written in the form U = UEUp, where Up is the perfectly applied gate and UE is some coherent error operator. This error operator can be expressed as a linear combination of the 2-qubit Pauli group,

$U_E = \sum_j \alpha_j E_j, \qquad E_j \in \mathcal{P}_2. \qquad (73)$

Therefore inaccurate application of the gate Up (including complete failure) can introduce a single X, Y or Z error to each of the two qubits or it can introduce an error to both qubits (the details of what kind of errors can be introduced by faulty gates are dependent on the mechanisms causing the failure). Therefore, not only can gates coupling qubits copy pre-existing errors but a single failure event can introduce errors on both qubits involved in the interaction. Correlated errors also occur in the density matrix representation when incoherent noise causes gate inaccuracies. This generalizes for higher order operations, e.g. errors on three qubit gates can introduce errors on all three qubits, etc.

A second process which can cause errors to cascade is the classical correction of quantum data conditioned on a measurement result. Take the circuit shown in figure 10. The measurement result on the top qubit is used to determine whether the gate, X, is applied to the second qubit; in this case the gate is applied if the upper qubit is measured in the state |1〉. If we now assume that this measurement is faulty, then with some probability the 'classical' information extracted from the measurement device is wrong, and the correction that is applied to the second qubit should not have been applied. Not only has a bit-flip error now occurred on qubit one (due to the inaccurate measurement result), but we have also accidentally introduced a bit-flip error on the second qubit due to this inaccurate information.


Figure 10. Error propagation due to measurement error. If the measurement result for qubit one is faulty, we inadvertently introduce an error to the second qubit.


Hence both gates which couple qubits and measurement errors (when the results subsequently control further quantum gates) can copy errors from qubit to qubit.

10.2. Concatenation

Concatenation is where a group of encoded qubits is further encoded (not necessarily with the same error correction code). This forms a second level encoded qubit, which figure 11 illustrates for the [[7, 1, 3]] code. Starting with a group of unencoded qubits, we first encode into level-1 logical qubits, each containing a block of seven. Each of these logical qubits is now protected with a distance three quantum code. If we now take seven of these level-1 logical qubits and perform the same encoding operation (using valid encoded gate operations, discussed in the following sections) we form a level-2 logical qubit. This is now a block containing 49 physical qubits and has a code distance of $3^2 = 9$. A level-2 logical qubit can therefore be faithfully recovered if four or fewer errors occur on any of the 49 physical qubits. This process can be repeated indefinitely: for g levels of concatenation, the encoded state will consist of $7^g$ physical qubits and have a code distance of $3^g$, allowing for the correction of $(3^g - 1)/2$ individual errors. Hence for a concatenation scheme, both the number of physical qubits required and the number of errors that can be corrected scale exponentially (with the number of qubits growing faster than the number of correctable errors).
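The resource scaling is a one-line calculation:

```python
# Concatenation scaling for the [[7, 1, 3]] code: at level g a logical
# qubit occupies 7**g physical qubits, has distance 3**g, and corrects
# (3**g - 1)//2 errors.
for g in range(1, 5):
    print(f'level {g}: {7**g} qubits, distance {3**g}, '
          f'corrects {(3**g - 1) // 2} errors')
```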


Figure 11. Illustration explaining the concept of concatenation with the [[7, 1, 3]] code. 49 physical qubits are first encoded into seven level-1 logical qubits (seven blocks of seven qubits). These seven logical qubits are then encoded into a level-2 logical qubit. This level-2 qubit can correct up to a maximum of $(3^2 - 1)/2 = 4$ errors on the 49 physical qubits forming the encoded block.


10.3. Fault-tolerance

The concept of fault-tolerance in computation is not a new idea; it was first developed in relation to classical computing [Neu55, G83, Avi87]. However, in recent years the precise manufacturing of digital circuitry has made large-scale error correction and fault-tolerant circuits largely unnecessary in the classical domain.

The basic principle of fault-tolerance is that the circuits used for gate operations and error correction procedures should not cause errors to cascade. As shown in the previous section, gates which couple multiple qubits and measurements which are used to control a quantum operation can cause errors to be copied. How do we design operations to avoid cascading errors in quantum algorithms?

This can be seen more clearly by looking at a simple circuit of CNOT operations (figure 12), in which a sequence of three CNOT gates takes the state |111〉|000〉 → |111〉|111〉. In figure 12(a) we consider a single X error which occurs on the topmost qubit prior to the first CNOT. This single error will cascade through each of the three gates such that the X error has now propagated to four qubits. Figure 12(b) shows a slightly modified design that implements the same operation, but the single X error now only propagates to two of the six qubits. If we consider each block of three as a single logical qubit, then the staggered circuit will only induce a total of one error in each logical block, given a single X error occurring somewhere during the circuit. Therefore, one of the standard definitions of fault-tolerance is

fault-tolerant circuit element: A single error will cause at most one error in the output for each logical qubit block.

It should be stressed that the idea of fault-tolerance is a discrete definition: either a certain quantum operation is fault-tolerant or it is not. What is defined to be fault-tolerant can change depending on the error correction code used. For example, for a single error correcting code, the above definition is the only one available, since any more than one error in a logical qubit will result in the error correction procedure failing. However, if the quantum code employed is able to correct multiple errors, then the definition of fault-tolerance can be relaxed, i.e. if the code can correct three errors then circuits may be designed such that a single failure results in at most three errors in the output (which is then correctable). In general, for a code correcting t = ⌊(d − 1)/2⌋ errors, fault-tolerance requires that ⩽ t errors during an operation do not result in > t errors in the output for each logical qubit.


Figure 12. Two circuits to implement the transformation |111〉|000〉 → |111〉|111〉. (a) shows a version where a single X error can cascade into four errors while (b) shows an equivalent circuit where the error only propagates to a second qubit.


10.4. Threshold theorem

The threshold theorem is truly a remarkable result in quantum information and is a consequence of fault-tolerant circuit design and the ability to perform dynamical error correction. Rather than present a detailed derivation of the theorem for a variety of noise models, we will instead take a very simple case where we utilize a quantum code that can only correct for a single error, using a model that assumes uncorrelated errors on individual qubits. For more rigorous derivations of the theorem see [ABO97, Got97, Ali07].

Consider a quantum computer where each physical qubit experiences either an X and/or Z error independently with probability p, per gate operation. Furthermore, it is assumed that logical gate operations and error correction circuits are designed according to the rules of fault-tolerance and that a cycle of error correction is performed after each elementary logical gate operation using a code that corrects for a single error ([[n, k, 3]] code). If an error occurs during a logical gate operation, then fault-tolerance ensures this error will only propagate to at most one error in each block, after which a cycle of error correction will remove the error. Hence if the failure probability of unencoded qubits per time step is p, then a single level of error correction will ensure that the logical step fails only when two (or more) errors occur. Hence the failure rate of each logical operation, to leading order, is now $p^1_L = cp^2$ , where $p^1_L$ is the failure rate (per logical gate operation) of a level-1 logical qubit and c is the upper bound for the number of possible 2-error combinations which can occur at a physical level within the sequence of operations consisting of a correction cycle, the logical gate operation and a second correction cycle [Ali07]. We now concatenate, encoding the computer further, such that a level-2 logical qubit is formed. We assume this is done using the same [[n, k, 3]] quantum code as the level-1 encoding. It is assumed that all error correcting procedures and gate operations at level-2 are self-similar to the level-1 operations (i.e. the circuit structures for the level-2 encoding are identical to the level-1 encoding). Therefore, if the level-1 failure rate per logical time step is $p^1_L$ , then by the same argument, the failure rate of a level-2 operation is given by, $p^2_L = c(p^1_L)^2 = c^3p^4$ . This iterative procedure is then repeated up to level-g, such that the logical failure rate, per time step, of a level-g encoded qubit is given by

$p^g_L = \frac{(cp)^{2^g}}{c}. \qquad (74)$

Equation (74) implies that for a finite physical error rate, p, per qubit, per time step, the failure rate of a level-g encoded qubit can be made arbitrarily small if cp < 1. This inequality defines the threshold: the physical error rate experienced by each qubit per time step must satisfy p < pth = 1/c to ensure that multiple levels of concatenated error correction reduce the failure rate of logical components.
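The behaviour on either side of the threshold is easy to see by iterating the recursion $p^g_L = c(p^{g-1}_L)^2$; the value c = 10⁴ below is purely illustrative (c is fixed by the code and circuit constructions).

```python
# Iterate the level-by-level failure rate p_g = c * p_{g-1}**2, which is
# equivalent to the closed form p_g = (c*p)**(2**g) / c of equation (74).
c = 1e4                      # illustrative number of 2-error combinations
for p in (5e-5, 2e-4):       # one rate below and one above p_th = 1/c
    p_g = p
    rates = []
    for g in range(1, 5):
        p_g = c * p_g ** 2
        rates.append(p_g)
    print(f'p = {p}:', ['%.1e' % r for r in rates])
# Below threshold the failure rate falls doubly exponentially with g;
# above threshold concatenation makes things worse.
```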

Hence, provided sufficient resources are available, an arbitrarily large quantum circuit can be implemented to arbitrary accuracy once the physical error rate is below threshold. The calculation of thresholds is therefore an extremely important aspect of quantum architecture design. Initial estimates of the threshold, which gave pth ≈ $10^{-4}$–$10^{-6}$ [Kit97, ABO97, Got97], did not sufficiently model the physical architectures. More recent results [SFH08, SDT07, SBF+06, MCT+04, BKSO05] consider more realistic quantum processor architectures, showing significant differences in threshold when architectural considerations are taken into account. Many of these estimates are summarized in table 8, with a brief description of the constraints considered by each model. We order the entries by the level of detail presented in each paper with regard to the physical implementation of fault-tolerant error correction protocols on realistic quantum architectures.

Table 8. Various threshold estimates, associated geometric constraints and architectural considerations when performing the calculation. We have ordered the table with respect to the level of architectural detail that has been developed compatible with the threshold result. Note that this is not an exhaustive list.

Relevant work | Code | Threshold | Geometric constraints | Architectural context
[ABO08] | [[7, 1, 3]] | O(10^−6) | A.I.^a | None
[Got97] | [[7, 1, 3]] | O(10^−4–10^−6) | A.I. | None
[KLZ96] | [[7, 1, 3]] | O(10^−6) | A.I. | None
[PR12] | [[23, 1, 7]] | O(10^−3) | A.I. | None
[Kni05] | 4- and 6-qubit detection | O(10^−2) | A.I. | None
[BAO+12] | Toric code | 18% | 2D NN^b array, P.B.C.^c | None, theoretical upper bound^d
[MCT+04] | [[7, 1, 3]]^e | O(10^−4) | A.I. with movement penalty | Specific to ion-traps
[SFH08] | [[7, 1, 3]] | O(10^−6) | Bilinear NN array | Kane P : Si system [Kan98]
[SBF+06] | [[7, 1, 3]] | O(10^−7) | Variable width NN array | General NN systems
[SDT07] | [[7, 1, 3]] | O(10^−5) | 2D NN arrays | General NN systems
[FTY+07] | [[7, 1, 3]] | O(10^−6) | Bilinear NN array | Superconducting qubits
[BKSO05] | [[7, 1, 3]] | O(10^−9) | A.I. with movement penalty | Ion-traps
[RHG07] | Topological cluster | O(10^−2–10^−3) | 3D NN array | Photonic qubits [DFS+09]
[WFSH10] | Surface codes | O(10^−2–10^−3) | 2D NN array | Quantum dots, diamond [JMF+12, YJG+12]

^a Arbitrary interactions. ^b Nearest neighbour. ^c Periodic boundary conditions. ^d Calculated assuming perfect quantum gates; known as the code capacity. ^e Only a preliminary estimate.

From the above table you can see the differences in threshold that occur once more specific architectural considerations are taken into account. This variation depends heavily on the error model assumed in the analysis, on whether qubit transport can be done directly (as in ion traps, where the qubits are physically moved throughout the computational array) or via the application of SWAP gates (required in condensed matter systems such as P : Si, quantum dots or NV defects in diamond), and on the structure of the QEC code ultimately used. In terms of concatenated QEC codes, the majority of the analysis has been done using the [[7, 1, 3]] Steane code, and only the results of Knill [Kni05] show a threshold of the order of 1%. However, the results of Knill have not yet been adapted to a physical architecture, and it remains unclear whether the constraints of a 1D, 2D or 3D nearest neighbour architecture will reduce this threshold to that of other concatenated codes and/or substantially increase the qubit resources needed to achieve a high threshold.

Currently, the most detailed architectural designs are based on topological cluster codes [RHG07] and surface codes [Kit97, DKLP02, FSG09]. The general framework for this type of error correction is discussed later in section 14.2. These codes are promising for large-scale computers as they exhibit significantly higher thresholds than concatenated codes and have an intrinsic structure that is compatible with 2D or 3D nearest neighbour architectures.

11. Fault-tolerant operations on encoded data

Sections 7, 8 and 10 showed how fault-tolerant QEC allows for any quantum algorithm to be run to arbitrary accuracy. However, the results of the threshold theorem assume that logical operations can be performed directly on the encoded data without the need for continual decoding and re-encoding. Using stabilizer codes, a large class of operations can be performed on logical data in an inherently fault-tolerant way.

If a given logical state, |ψ〉L, is stabilized by K, and the logical operation U is applied, the new state, U|ψ〉L, is stabilized by UKU†, i.e.

$UK|\psi\rangle_L = UKU^{\dagger}U|\psi\rangle_L = U|\psi\rangle_L. \qquad (75)$

In order for the codeword states to remain valid, the stabilizer set for the code, $\{G_i\} \in \mathcal{G}$ , must remain fixed through every operation. Hence for U to be a valid operation on the data, UGiU† = Gj, with $G_j \in \mathcal{G}$ , for all i. As a shorthand notation, we express this relationship as $U\mathcal{G}U^{\dagger} = \mathcal{G}$ .

11.1. Single qubit operations

For an elementary operation A, the equivalent logical operator $\bar{A}$ will now be formed via a sequence of elementary operations on the physical qubits comprising the code block. The logical $\bar{X}$ and $\bar{Z}$ operations on a single encoded qubit are the first examples of valid codeword operations. Taking the [[7, 1, 3]] code as an example, $\bar{X}$ and $\bar{Z}$ are given by,

$\bar{X} = X^{\otimes 7}, \qquad \bar{Z} = Z^{\otimes 7}. \qquad (76)$

Since the single qubit Pauli operators satisfy XZX = −Z and ZXZ = −X, we have $\bar{X}K^{i}\bar{X} = K^{i}$ and $\bar{Z}K^{i}\bar{Z} = K^{i}$ for each of the [[7, 1, 3]] stabilizers given in equation (50). The fact that each stabilizer has a weight of four guarantees that UKU† picks up an even number of −1 factors. Since the stabilizers remain fixed, the operations are valid. However, what transformations do the operators in equation (76) actually perform on encoded data?

For a single qubit, a bit-flip operation X takes |0〉 ↔ |1〉. Recall that for a single qubit Z|0〉 = |0〉 and Z|1〉 = −|1〉, hence for $\bar{X}$ to actually induce a logical bit-flip it must take, |0〉L ↔ |1〉L. For the [[7, 1, 3]] code, the final operator which fixes the logical state is K7 = Z⊗7, where K7|0〉L = |0〉L and K7|1〉L = −|1〉L. As $\bar{X}K^7\bar{X} = -K^7$ , any state stabilized by K7 becomes stabilized by −K7 (and vice-versa) after the operation of $\bar{X}$ . Therefore, $\bar{X}$ represents a logical bit flip. The same argument can be used for $\bar{Z}$ by considering the stabilizer properties of the states ${\left\vert{\pm}\right\rangle} = ({\left\vert{0}\right\rangle} \pm {\left\vert{1}\right\rangle})/\sqrt{2}$ . Hence, the logical bit- and phase-flip gates can be applied directly to logical data by simply using seven single qubit X or Z gates (figure 13).


Figure 13. Bit-wise application of single qubit gates in the [[7, 1, 3]] code. Logical X, Z, H and P gates can trivially be applied, fault-tolerantly, by using seven single qubit gates. Note that the application of seven P gates results in the logical $\bar{P^{\dagger}}$ being applied and vice versa.


Two other useful gates which can be applied in this manner are the Hadamard rotation and phase gates,

$H = \frac{1}{\sqrt{2}}\left(\begin{array}{cc} 1 & 1 \\ 1 & -1 \end{array}\right), \qquad P = \left(\begin{array}{cc} 1 & 0 \\ 0 & \rmi \end{array}\right). \qquad (77)$

These gates are useful since, when combined with the two-qubit CNOT gate, they can generate a subgroup of all multi-qubit gates known as the Clifford group (gates which map Pauli group operators back to the Pauli group). Again, using the stabilizers of the [[7, 1, 3]] code and the fact that for single qubits,

$HXH = Z, \qquad HZH = X, \qquad PXP^{\dagger} = \rmi XZ, \qquad PZP^{\dagger} = Z, \qquad (78)$

a seven qubit bit-wise Hadamard gate will switch X with Z and will therefore simply exchange {K1, K2, K3} with {K4, K5, K6}; it is hence a valid operation. The bit-wise application of the P gate will leave any Z stabilizer invariant, but takes X → iXZ. This is still valid since, provided each generator of the stabilizer group contains a multiple of four non-identity operators, the factors of i cancel. Hence the bit-wise application of seven P gates is valid for the [[7, 1, 3]] code.

What do these logical operators do to the logical state? For a single qubit, the Hadamard gate flips any Z stabilized state to an X stabilized state, i.e. |0, 1〉 ↔ |+, −〉. Looking at the transformation of K7, using $\bar{H} = H^{\otimes 7}$ we have $\bar{H}K^7\bar{H} = X^{\otimes 7}$ . Therefore, the bit-wise Hadamard gate, $\bar{H}$ , is the logical Hadamard operation. The single qubit P gate leaves a Z stabilized state invariant, while an X eigenstate becomes stabilized by iXZ. Now, $P^{\otimes 7}(X^{\otimes 7})P^{\dagger\otimes 7} = -\rmi(XZ)^{\otimes 7}$ , which is the action of a logical P†; hence the bit-wise gate $P^{\otimes 7}$ enacts the logical $\bar{P}^{\dagger}$ , while $\bar{P} = P^{\dagger \otimes 7}$ enacts the logical P gate (figure 13). Each of these fault-tolerant operations on a logically encoded block is commonly referred to as a transversal operation. A transversal operation can be defined as a logical operator which is formed by applying individual physical operators to each qubit in the code block (or between two equivalent physical qubits in the case of entangling operations such as the CNOT gate shown below). The physical operations forming a transversal logical operator need not be the same; for example, in figure 13 the $\bar{P}^{\dagger}$ gate is formed via individual P operations and vice versa. Hence, valid transversal gates may be constructed from a set of physical operations which are unique to each qubit.
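The transversal P identity quoted above can be checked numerically (a 2⁷-dimensional computation):

```python
import numpy as np
from functools import reduce

# Check P^{x7} X^{x7} P^{dagger x7} = (P X P^dagger)^{x7} = -i (XZ)^{x7},
# i.e. seven bit-wise P gates enact the logical P^dagger.
X = np.array([[0, 1], [1, 0]])
Z = np.array([[1, 0], [0, -1]])
P = np.array([[1, 0], [0, 1j]])

PXPd = P @ X @ P.conj().T                     # = iXZ on a single qubit
lhs = reduce(np.kron, [PXPd] * 7)
rhs = -1j * reduce(np.kron, [X @ Z] * 7)      # -i (XZ)^{x7}
print(np.allclose(lhs, rhs))                  # True
```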

11.2. Two-qubit gate

A two-qubit logical CNOT operation can also be applied in the same way, as a transversal bit-wise operation of individual CNOT gates between corresponding physical qubits. For unencoded qubits, a CNOT operation performs the following mapping on the two qubit stabilizer set,

$X\otimes I \rightarrow X\otimes X, \qquad I\otimes X \rightarrow I\otimes X, \qquad Z\otimes I \rightarrow Z\otimes I, \qquad I\otimes Z \rightarrow Z\otimes Z, \qquad (79)$

where the first operator corresponds to the control qubit and the second operator corresponds to the target. Now consider the bit-wise application of seven CNOT gates between logically encoded blocks of data (figure 14). First the stabilizer set must remain invariant,

$U(\mathcal{G}\otimes\mathcal{G})U^{\dagger} = \mathcal{G}\otimes\mathcal{G}, \qquad (80)$

i.e. each element in $\mathcal{G}$ must be mapped to another element of $\mathcal{G}$ . Table 9 details the transformation for all the stabilizer generators under seven bit-wise CNOT gates, demonstrating that this operation is valid on the [[7, 1, 3]] code. The transformations in equation (79) are trivially extended to the logical space, showing that seven bit-wise CNOT gates invoke a logical CNOT operation.

$\bar{X}\otimes\bar{I} \rightarrow \bar{X}\otimes\bar{X}, \qquad \bar{I}\otimes\bar{X} \rightarrow \bar{I}\otimes\bar{X}, \qquad \bar{Z}\otimes\bar{I} \rightarrow \bar{Z}\otimes\bar{I}, \qquad \bar{I}\otimes\bar{Z} \rightarrow \bar{Z}\otimes\bar{Z}. \qquad (81)$

Figure 14. Bit-wise application of a CNOT gate between two logical qubits. Since each CNOT only couples corresponding qubits in each block, this operation is inherently fault-tolerant.


Table 9. Transformations of the [[7, 1, 3]] stabilizer generators under the gate operation U = CNOT⊗7, where $\mathcal{G} \rightarrow U^{\dagger}\mathcal{G}U$ . Note that the transformation does not take any stabilizer outside the group generated by Ki ⊗ Kj (i, j) ∈ [1, .., 6], therefore U = CNOT⊗7 represents a valid operation on the codespace.

Ki ⊗ Kj | K1 | K2 | K3 | K4 | K5 | K6
K1 | K1 ⊗ I | K1 ⊗ K1K2 | K1 ⊗ K1K3 | K1K4 ⊗ K1K4 | K1K5 ⊗ K1K5 | K1K6 ⊗ K1K6
K2 | K2 ⊗ K1K2 | K2 ⊗ I | K2 ⊗ K2K3 | K2K4 ⊗ K2K4 | K2K5 ⊗ K2K5 | K2K6 ⊗ K2K6
K3 | K3 ⊗ K3K1 | K3 ⊗ K3K2 | K3 ⊗ I | K3K4 ⊗ K3K4 | K3K5 ⊗ K3K5 | K3K6 ⊗ K3K6
K4 | K4 ⊗ K1 | K4 ⊗ K2 | K4 ⊗ K3 | I ⊗ K4 | K4K5 ⊗ K5 | K4K6 ⊗ K6
K5 | K5 ⊗ K1 | K5 ⊗ K2 | K5 ⊗ K3 | K5K4 ⊗ K4 | I ⊗ K5 | K5K6 ⊗ K6
K6 | K6 ⊗ K1 | K6 ⊗ K2 | K6 ⊗ K3 | K6K4 ⊗ K4 | K6K5 ⊗ K5 | I ⊗ K6

The issue of fault-tolerance with these logical operations should be clear. The $\bar{X}$ , $\bar{Z}$ , $\bar{H}$ , $\bar{P}^{\dagger}$ and $\bar{P}$ gates are trivially fault-tolerant since the logical operation is performed through seven bit-wise single qubit gates. The logical CNOT is also fault-tolerant since each two-qubit gate only operates between counterpart qubits in each logical block. Hence if any gate is inaccurate then, at most, a single error will be introduced in each block.

In contrast to the [[7,1,3]] code, let us also take a quick look at the [[5,1,3]] code. Unlike the 7-qubit code, the full set of Clifford gates cannot be implemented in the same transversal manner. To see this clearly, we can examine how the generators of the stabilizer group for the code transform under a transversal Hadamard operation,

Equation (82)

The stabilizer group is not preserved under this transformation, and therefore a transversal Hadamard is not a valid logical operation for the [[5,1,3]] code. One thing to briefly note is that there are methods for performing logical Hadamard and phase gates on the [[5,1,3]] code [Got97]. However, they essentially involve performing a valid, transversal, three qubit gate and then measuring out two of the logical ancilla. Although it has been demonstrated that a valid set of transversal operations for universal computing is incompatible with a quantum code that corrects an arbitrary single error [EK09], it has not been shown that a non-CSS code cannot have a transversal set of operations generating the Clifford group. This example simply illustrates how an invalid logical operation transforms an encoded state.

While these gates can be conveniently implemented on error protected data, they do not represent a universal set for quantum computation. In fact it has been shown that by using the stabilizer formalism, these operations can be efficiently simulated on a classical device [Got98, AG04]. In order to achieve universality one of the following gates is generally added to the available set,

$T = \left(\begin{array}{cc} 1 & 0 \\ 0 & {\rm e}^{\rmi\pi/4} \end{array}\right), \qquad (83)$

or the Toffoli gate [Tof81]. Applying these gates in a similar transversal way will, for the majority of stabilizer codes, transform the stabilizers out of the group and is consequently not a valid operation. Circuits implementing these two gates in a fault-tolerant manner have been developed [NC00, GC99, SI05, SFH08] for various codes (most often the [[7, 1, 3]] code), but at this stage the circuits are complicated and resource intensive. This has practical implications for encoded operations: if universality is achieved by adding the T gate to the available set, arbitrary single qubit rotations must be approximated by long gate sequences (utilizing the Solovay–Kitaev theorem [Kit97, DN06]), and these sequences often require many T gates [Fow05]. Finding more efficient methods to achieve universality on encoded data therefore remains an active area of research.

12. Fault-tolerant circuit design for logical state preparation

Section 10 introduced the basic rules for fault-tolerant circuit design and how these rules lead to the threshold theorem for concatenated error correction. However, what does a full fault-tolerant quantum circuit look like? Here, we introduce a full fault-tolerant circuit to prepare the [[7, 1, 3]] logical |0〉 state. As the [[7, 1, 3]] code is a single error correcting code, we use the one-to-one definition of fault-tolerance and therefore only need to consider the propagation of a single error during the preparation (any more than one error during correction represents a higher order effect and is ignored).

As described in section 7, logical state preparation can be done by initializing an appropriate number of physical qubits and measuring each of the X stabilizers that describe the code. Therefore, a circuit which allows the measurement of a Hermitian operator in a fault-tolerant manner needs to be constructed. The general structure of the circuit used was first developed by Shor [Sho96], however it should be noted that several more recent methods for fault-tolerant state preparation and correction now exist [Ste97a, Ste02, DA07, Kni05].

The circuits shown in figures 15(a) and (b), which measure the stabilizer K1 = IIIXXXX, are not fault-tolerant, since a single ancilla is used to control all four CNOT gates. As CNOT gates can copy X errors (figure 12), a single X error on this ancilla can be copied to multiple qubits in the data block. Instead, four ancilla qubits are used, prepared in the state ${\left\vert{\mathcal{A}}\right\rangle} = ({\left\vert{0000}\right\rangle}+{\left\vert{1111}\right\rangle})/\sqrt{2}$ . This can be done by initializing four qubits in the |0〉 state and applying a Hadamard followed by a sequence of CNOT gates (figure 15(c)). Each of these four ancilla is used to control a separate CNOT gate, after which the ancilla state is decoded and measured. By ensuring that each CNOT is controlled via a separate ancilla, any X error will only propagate to a single qubit in the data block. However, during the preparation of the ancilla state there is the possibility that a single X error can propagate to multiple ancilla, which are then fed forward into the data block. To combat this, the ancilla block needs to be verified against possible X errors. Tracking through all the possible locations where a single X error can occur during ancilla preparation leads to the following unique states.

Equation (84)

From these possibilities, the first four states correspond to no error or at most one bit flip, while the last two states correspond to multiple errors caused by error propagation through the CNOT operations. In fact, only the last state, ${\left\vert{\mathcal{A}}\right\rangle}_6$ , needs to be identified, since two bit-flip errors exist on the ancilla state. To verify the ancilla state, a fifth ancilla is added, initialized and used to perform a parity check on the ancilla block. This fifth ancilla is then measured. If the result is |0〉, the ancilla block can be coupled to the data. If the result is |1〉, then either a single error has occurred on the verification qubit or the ancilla state has been prepared in either the ${\left\vert{\mathcal{A}}\right\rangle}_5$ or ${\left\vert{\mathcal{A}}\right\rangle}_6$ state. In either case, the entire ancilla block is reinitialized and prepared again. This is continued until the verification qubit is measured to be |0〉 (figure 16). The re-preparation of the ancilla block protects against multiple X errors, which can propagate forward through the CNOT gates. Z errors propagate in the other direction: any Z error which occurs in the ancilla block will propagate straight through to the final measurement. This results in a measurement that does not correspond to the actual errors which have occurred and can cause mis-correction once all stabilizers have been measured. To protect against this, each stabilizer is measured 2–3 times and a majority vote of the measurement results is taken. As any additional error represents a second order process, if the first or second measurement has been corrupted by a Z error, then the third measurement will only contain additional errors if a higher order error process has occurred. Therefore, we are free to ignore this possibility and assume that the third measurement is error free. The full circuit for [[7, 1, 3]] state preparation is shown in figure 17, where each stabilizer is measured 2–3 times. The total circuit requires a minimum of 12 qubits (seven data qubits and a 5-qubit ancilla block).
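The need for verification can be seen by enumerating single-fault locations in the ancilla preparation circuit. The sketch below assumes a Hadamard on the first qubit followed by CNOTs from it to each of the other three; figure 15(c)'s actual layout may differ, which would change the list of reachable states in equation (84) but not the conclusion.

```python
# Enumerate the X-flip patterns reachable from one X fault during the
# preparation of (|0000> + |1111>)/sqrt(2), assuming CNOTs 0->1, 0->2, 0->3.
cnots = [(0, 1), (0, 2), (0, 3)]

def propagate(flipped, remaining):
    flipped = set(flipped)
    for c, t in remaining:          # a CNOT copies X from control to target
        if c in flipped:
            flipped.add(t)
    return frozenset(flipped)

outcomes = set()
for k in range(len(cnots) + 1):     # fault just before CNOT k (k = 3: at the end)
    for q in range(4):
        outcomes.add(propagate({q}, cnots[k:]))
for flips in sorted(outcomes, key=sorted):
    print(sorted(flips))
# Flipping all four qubits maps the GHZ state to itself, so {0,1,2,3} is
# harmless; the pattern {0,3}, equivalent to a two-qubit flip, is the kind
# of multi-error state the verification qubit must catch.
```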


Figure 15. Three circuits which measure the stabilizer K1. (a) represents a generic operator measurement where a multi-qubit controlled gate is available. (b) decomposes this into single- and two-qubit gates, but in a non-fault-tolerant manner. (c) introduces four ancilla such that each CNOT is controlled via a separate qubit. This ensures fault-tolerance.


Figure 16. Circuit required to measure the stabilizer K1, fault-tolerantly. A four qubit GHZ state is used as ancilla with the state requiring verification against multiple X errors. After the state has passed verification it is coupled to the data block and a syndrome is extracted.


Figure 17. Circuit required to prepare the [[7, 1, 3]] logical |0〉 state fault-tolerantly. Each of the X stabilizers are sequentially measured using the circuit in figure 16. To maintain fault-tolerance, each stabilizer is measured 2–3 times with a majority vote taken.


As you can see, the circuit constructions for full fault-tolerant state preparation (and error correction) are not simple. However, they are straightforward to design in generic ways when employing stabilizer coding.

13. Loss protection

So far we have focused the discussion on correction techniques which assume that error processes maintain a qubit structure in the Hilbert space. As we noted in section 3.3, the loss of physical qubits within the computer violates this assumption and in general requires additional correction machinery beyond what we have already discussed. This section examines a basic correction technique for qubit loss. Specifically, we detail one scheme which was developed with single photon based architectures in mind.

Protecting against qubit loss requires a different approach than other general forms of quantum errors, such as environmental decoherence or systematic control imperfections. The cumbersome aspect of correcting qubit loss is detecting the presence of a qubit at the physical level. The specific machinery required for loss detection depends on the underlying physical architecture, but the basic principle is that the presence or absence of the physical qubit must be determined without discriminating the actual quantum state.

Certain systems allow for loss detection in a more convenient way than others. Electronic spin qubits, for example, can employ single electron transistors (SETs) to detect the presence or absence of the charge without performing measurement on the spin degree of freedom [DS00, CGJH05, AJW+01]. Optics in contrast requires more complicated non-demolition measurement [MW84, IHY85, POW+04, MNBS05]. This is due to the fact that typical photonic measurement is performed via photo-detectors which have the disadvantage of physically destroying the photon.

Once the presence or absence of the physical qubit has been determined, a freshly initialized qubit can be injected to replace a lost qubit. Once this has been done, the standard error correcting procedure can correct for the error. A freshly initialized qubit state, |0〉, can be represented as the projective collapse of a general qubit state, |ψ〉 ≠ |1〉, as

$|0\rangle \propto |0\rangle\langle 0|\psi\rangle = \tfrac{1}{2}(I+Z)|\psi\rangle. \qquad (85)$

If we consider this qubit as part of an encoded block, then the above corresponds to a 50% probability of a phase error on this qubit. Therefore, a loss event that is corrected via non-demolition detection, qubit replacement and standard QEC will, 50% of the time, produce a detection event in the following QEC cycle and, in the absence of other errors, be corrected. The loss rate needs to be comparable to the rate of standard errors, as the correction cycle after a loss detection event will, with high probability, detect an error.
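The 50% figure follows from the projector identity used in equation (85), which a two-line check confirms:

```python
import numpy as np

# Re-injecting |0> after a loss acts on the code block as the projector
# |0><0| = (I + Z)/2: an equal mixture of 'no error' and a Z error.
I = np.eye(2)
Z = np.array([[1, 0], [0, -1]])
proj0 = np.array([[1, 0], [0, 0]])
print(np.allclose(proj0, (I + Z) / 2))  # True
```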

If a loss event is detected and the qubit replaced, the error detection code shown in section 6 becomes a single error correction code. This is because, via the identification of the loss event, the errors have known locations. Consequently, error detection is sufficient to perform full correction, in contrast to the error channels we have already considered, where in general the location is unknown.

A second method for loss correction is related to systems that have high loss rates compared to systematic and environmental errors, for example in optical systems. Due to the high mobility of single photons and their relative immunity to environmental interactions, loss is a major error channel that generally dominates over other error sources. The use of error detection and correction codes for photon loss is undesirable due to the need for non-demolition detection of the lost qubit. While techniques for measuring the presence or absence of a photon without direct detection have been developed and implemented [POW+04], they require multiple ancilla photons and controlled interactions. Ultimately it may be more desirable to redesign the loss correction code such that it can be employed directly with photo-detection rather than more complicated non-demolition techniques [YNM06].

One such scheme was developed by Ralph, Hayes and Gilchrist in 2005 [RHG05]. This scheme was a more efficient extension of an original parity encoding method developed by Knill, Laflamme and Milburn [KLM01]. The general parity encoding for a logical qubit is an N photon GHZ state in the conjugate basis, i.e.

$|0\rangle_N = \frac{1}{\sqrt{2}}\big(|+\rangle^{\otimes N} + |-\rangle^{\otimes N}\big), \qquad |1\rangle_N = \frac{1}{\sqrt{2}}\big(|+\rangle^{\otimes N} - |-\rangle^{\otimes N}\big), \qquad (86)$

where ${\left\vert{\pm}\right\rangle} = ({\left\vert{0}\right\rangle}\pm {\left\vert{1}\right\rangle})/\sqrt{2}$ . The motivation for this type of encoding is that measuring any qubit in the |0, 1〉 basis simply removes it from the state, reducing the size of the encoded state by one qubit,

$P_{0,N}|0\rangle_N = |0\rangle_{N-1}|0\rangle, \qquad P_{1,N}|0\rangle_N = |1\rangle_{N-1}|1\rangle, \qquad (87)$

where P(0,1),N are the projectors corresponding to measurement of the N-th qubit in the |0, 1〉 basis (up to normalization). The effect on the |1〉L state is similar. Measuring the N-th qubit in the |0〉 state simply removes it from the encoded state, reducing the logical zero state by one qubit, while measuring the N-th qubit as |1〉 enacts a logical bit flip at the same time as reducing the size of the logical state.
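This reduction property can be verified for a small instance, e.g. N = 3 (the indexing below assumes the measured photon is the last tensor factor):

```python
import numpy as np
from functools import reduce

# Verify equation (87) for N = 3: measuring the last photon as |0> leaves
# |0>_2; measuring it as |1> leaves |1>_2 (a logical bit flip).
plus = np.array([1, 1]) / np.sqrt(2)
minus = np.array([1, -1]) / np.sqrt(2)

def logical(sign, n):  # |0>_n (sign = +1) or |1>_n (sign = -1)
    s = reduce(np.kron, [plus] * n) + sign * reduce(np.kron, [minus] * n)
    return s / np.linalg.norm(s)

zero3 = logical(+1, 3)
# Project the third qubit onto |0> (even indices) or |1> (odd indices)
out0 = zero3[0::2] / np.linalg.norm(zero3[0::2])
out1 = zero3[1::2] / np.linalg.norm(zero3[1::2])
print(np.allclose(abs(out0 @ logical(+1, 2)), 1))  # outcome 0 -> |0>_2
print(np.allclose(abs(out1 @ logical(-1, 2)), 1))  # outcome 1 -> |1>_2
```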

Instead of introducing the full scheme developed in [RHG05], we give the general idea of how such encoding allows for loss detection without non-demolition measurements. Photon loss in this model is assumed equivalent to measuring the photon in the |0〉, |1〉 basis, but not knowing the answer (section 3.3). Our ignorance of the measurement result could lead to a logical bit flip error on the encoded state; we therefore require the ability to protect against logical bit-flip errors on the above states. As already shown, the 3-qubit code allows us to achieve such correction. Therefore, the final step in this scheme is encoding the above states into a redundancy code (a generalized version of the 3-qubit code), where an arbitrary logical state, |ψ〉L, is now given by,

$|\psi\rangle_L = \alpha|0\rangle_N^{\otimes q} + \beta|1\rangle_N^{\otimes q}, \qquad (88)$

where |0〉N, |1〉N are the parity encoded states shown in equation (86) and the fully encoded state consists of q blocks of these parity states.

This form of encoding protects against the loss of qubits by first encoding the system into a code structure that allows for the removal of qubits without destroying the computational state and then protecting against logical errors that are induced by loss events. In effect, it maps errors un-correctable by standard QEC to error channels that are correctable, in this case qubit loss → qubit bit flip. Even though this code can be used to convert loss events into bit-flips, the size of the code will steadily decrease as qubits are lost.

This general technique is common for pathological error channels. If a specific type of error violates the standard 'qubit' assumption of QEC, additional correction techniques are required to map it to a correctable form; consequently, additional physical resources are usually needed.

14. Some modern developments in quantum error correction

Up until this stage we have restricted our discussion of error correction to the most basic principles and codes. The ideas and methodologies we have detailed represent some of the introductory techniques that were developed when error correction was first proposed. Readers who are only looking for a basic introduction to the field can quite easily skip the remainder of this paper, as we will now examine some of the more modern protocols that are utilized when considering the construction of a large-scale quantum computer.

Providing a fair and encompassing review of the more modern and advanced error correction techniques that have been developed is far outside our goal for this review. However, we would be remiss if we did not briefly examine some of the more advanced error correction techniques that have been proposed for large-scale QIP. For the remainder of this discussion we choose two closely related error correction techniques, subsystem coding and topological coding which have been receiving significant attention in the fields of architecture design and large-scale QIP. While some readers may disagree, we review these two modern error correction protocols because they are currently two of the most useful correction techniques when discussing the physical construction of a quantum computer.

We again attempt to keep the discussion of these techniques simple and provide specific examples when possible. However, it should be stressed that these error correcting protocols are far more complicated than the basic codes shown earlier. Topological error correction alone has, since its introduction, essentially become its own research topic within the broader error correction field. Hence we encourage the reader who is interested to refer to the cited articles below for more rigorous and detailed treatment of these techniques.

14.1. Subsystem codes: Bacon–Shor codes

Quantum subsystem codes [KLP05, KLPL06, Bac06] are among the newer and more flexible techniques for implementing quantum error correction. The traditional stabilizer codes that we have reviewed are more formally identified as subspace codes, where information is encoded in a relevant coding subspace of a larger multi-qubit system. In contrast, subsystem coding identifies multiple subspaces of the multi-qubit system as equivalent; specifically, multiple states are identified with the logical |0〉L and |1〉L states. Rather than review subsystem codes in general, we will focus on a subset of these codes known as Bacon–Shor codes [Bac06].

The primary benefit of utilizing Bacon–Shor (BS) codes is the general nature of their construction. The description of arbitrarily large error correcting codes is conceptually straightforward, error correction circuits are much simpler to construct [AC07], and the generality of the construction makes it possible to perform dynamical code switching in a fault-tolerant manner [SEDH08]. This final property gives BS coding significant flexibility, as the strength of error correction within a quantum computer can be changed fault-tolerantly during its operation.

As with the other codes presented in this review, BS codes are stabilizer codes, but now defined over a square lattice. The two lattice dimensions govern the X and Z error correction properties, and the size of the lattice in either dimension dictates the total number of errors the code can correct. In general, a $\mathcal{C}(n_1,n_2)$ BS code is defined over an $n_1 \times n_2$ square lattice and encodes one logical qubit into $n_1 n_2$ physical qubits, with the ability to correct at least $\lfloor (n_1-1)/2 \rfloor$ Z errors and at least $\lfloor (n_2-1)/2 \rfloor$ X errors. Again, keeping with the spirit of this review, we focus on a specific example, the $\mathcal{C}(3,3)$ BS code. This code, encoding one logical qubit in nine physical qubits, can correct one X and one Z error. In order to define the code structure we begin with a 3 × 3 lattice of qubits, where qubits are identified with the vertices of the lattice. Note that this 2D lattice represents the structure of the code; it does not imply that the physical array of qubits must be arranged in a 2D lattice. Figure 18 illustrates three sets of operators which are defined over the lattice. The first group, illustrated in figure 18(a), is the stabilizer group, $\mathcal{S}$, which is generated by the operators

$\mathcal{S} = \langle X_{i,*}X_{i+1,*},\ Z_{*,j}Z_{*,j+1} \rangle, \quad i,j \in {\mathbb Z}_2,$    (89)

where we retain the notation utilized in [AC07, SEDH08]: $U_{i,*}$ and $U_{*,j}$ represent an operator, U, acting on all qubits in a given row, i, or column, j, respectively, and ${\mathbb Z}_2=\{1,2\}$. The second relevant subsystem is known as the gauge group (figure 18(b)), $\mathcal{T}$, which is the non-Abelian group generated by the pairwise operators

$\mathcal{T} = \langle X_{i,j}X_{i+1,j},\ Z_{i,j}Z_{i,j+1} \rangle.$    (90)

The third relevant subsystem is the logical space (figure 18(c)), $\mathcal{L}$ , which can be defined through the logical Pauli operators

$X_L = X_{1,*}, \qquad Z_L = Z_{*,1}.$    (91)

Figure 18. Stabilizer structure for the $\mathcal{C}(3,3)$ code. (a) gives two of the four stabilizers from the group $\mathcal{S}$. (b) illustrates one of the four encoded sets of Pauli operators defined within the gauge group, $\mathcal{T}$. (c) gives the two logical operators from the group $\mathcal{L}$ which enact valid operations on the encoded qubit.


The stabilizer group $\mathcal{S}$ defines all relevant code states, i.e. every valid logical state is a +1 eigenstate of this set. For the $\mathcal{C}(3,3)$ code there are a total of nine physical qubits and four independent stabilizers in $\mathcal{S}$, hence there are five degrees of freedom left in the system, which can house $2^5$ states that are simultaneous eigenstates of $\mathcal{S}$. This is where the gauge group, $\mathcal{T}$, becomes relevant. As the gauge group is non-Abelian, there is no valid code state which is a simultaneous eigenstate of all operators in $\mathcal{T}$. However, closer examination reveals a total of four encoded sets of Pauli operations within $\mathcal{T}$; figure 18(b) illustrates two such sets. As all elements of $\mathcal{T}$ commute with all elements of $\mathcal{S}$, we can identify each of these four sets of valid 'logical' qubits as equivalent, i.e. we define {|0〉L, |1〉L} pairs which are eigenstates of $\mathcal{S}$ and of one Abelian subgroup of $\mathcal{T}$, and then ignore exactly which gauge state we are in (footnote 12). Therefore each of these gauge states represents a subsystem of the code, with each subsystem logically equivalent.

The final group we consider is the logical group $\mathcal{L}$. This is the set of two Pauli operators which enact a logical X or Z gate on the encoded qubit regardless of the gauge choice, and which consequently represent true logical operations on our encoded space.
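
To make these group relations concrete, the following short Python sketch (our own construction, not taken from the referenced papers; the qubit indexing and helper names are illustrative assumptions) builds the generators of $\mathcal{S}$, $\mathcal{T}$ and $\mathcal{L}$ for the $\mathcal{C}(3,3)$ code in the binary symplectic representation. Two Pauli operators commute exactly when the symplectic product of their binary representations is even, which is all the check relies on:

    # Toy check of the C(3,3) Bacon-Shor group structure (illustrative sketch).
    # Qubit (i, j) of the 3x3 lattice is bit index 3*i + j; a Pauli operator is
    # stored as a pair of 9-bit integers (x, z).

    def pauli(xs=(), zs=()):
        x = z = 0
        for (i, j) in xs: x |= 1 << (3 * i + j)
        for (i, j) in zs: z |= 1 << (3 * i + j)
        return (x, z)

    def commute(a, b):
        # Symplectic criterion: even total overlap of X-with-Z supports.
        overlap = bin(a[0] & b[1]).count('1') + bin(a[1] & b[0]).count('1')
        return overlap % 2 == 0

    row = lambda i: [(i, j) for j in range(3)]
    col = lambda j: [(i, j) for i in range(3)]

    # Stabilizer generators S (equation (89)): X on two adjacent rows, Z on two adjacent columns.
    S = [pauli(xs=row(i) + row(i + 1)) for i in range(2)] + \
        [pauli(zs=col(j) + col(j + 1)) for j in range(2)]

    # Gauge generators T (equation (90)): two-qubit XX (adjacent rows) and ZZ (adjacent columns).
    T = [pauli(xs=[(i, j), (i + 1, j)]) for i in range(2) for j in range(3)] + \
        [pauli(zs=[(i, j), (i, j + 1)]) for i in range(3) for j in range(2)]

    # Logical operators (equation (91)): X on the first row, Z on the first column.
    XL, ZL = pauli(xs=row(0)), pauli(zs=col(0))

    assert all(commute(a, b) for a in S for b in S + T + [XL, ZL])  # S commutes with everything
    assert any(not commute(a, b) for a in T for b in T)             # T is non-Abelian
    assert all(commute(g, XL) and commute(g, ZL) for g in T)        # L is gauge-invariant
    assert not commute(XL, ZL)                                      # XL, ZL anti-commute
    print("C(3,3) group relations verified")

Running the script simply confirms the relations quoted above: $\mathcal{S}$ is central, $\mathcal{T}$ is non-Abelian, and the logical operators commute with every gauge operator while anti-commuting with each other.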

In a more formal sense, the definition of these three group structures allows us to decompose the Hilbert space of the system. If we let $\mathcal{H}$ denote the Hilbert space of the physical system, $\mathcal{S}$ forms an Abelian group and hence can act as a stabilizer set defining subspaces of $\mathcal{H}$ . If we describe each of these subspaces by the binary vector, $\vec{e}$ , formed from the eigenvalues of the stabilizers, $\mathcal{S}$ , then each subspace splits into a tensor product structure

$\mathcal{H}_{\vec{e}} = \mathcal{H}_{\mathcal{T}} \otimes \mathcal{H}_{\mathcal{L}},$    (92)

where elements of $\mathcal{T}$ act only on the subsystem $\mathcal{H}_{\mathcal{T}}$ and the operators $\mathcal{L}$ act only on the subsystem $\mathcal{H}_{\mathcal{L}}$ . In the context of storing qubit information, a logical qubit is encoded into the two dimensional subsystem $\mathcal{H}_{\mathcal{L}}$ . As the system is already stabilized by operators in $\mathcal{S}$ and the operators in $\mathcal{T}$ act only on the space $\mathcal{H}_{\mathcal{T}}$ , qubit information is only altered when operators in the group $\mathcal{L}$ act on the system.
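
As a quick consistency check (our own counting, following the definitions above), the dimensions for the $\mathcal{C}(3,3)$ code work out as

    \dim\mathcal{H} = 2^{9} = \underbrace{2^{4}}_{\text{sectors}\ \vec{e}} \times \underbrace{2^{4}}_{\dim\mathcal{H}_{\mathcal{T}}} \times \underbrace{2}_{\dim\mathcal{H}_{\mathcal{L}}},

i.e. the four independent stabilizers split the nine-qubit Hilbert space into $2^4$ sectors labelled by $\vec{e}$, and each 32-dimensional sector factors into four gauge qubits ($\mathcal{H}_{\mathcal{T}}$) and the single logical qubit ($\mathcal{H}_{\mathcal{L}}$).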

This formal definition of how BS coding works may appear more complicated than the standard stabilizer codes shown earlier, but the extra structure has significant benefits when we consider how error correction is performed.

In general, to perform error correction, each of the stabilizers of the codespace must be checked to determine which eigenvalue changes have occurred due to errors. The stabilizer group, $\mathcal{S}$, consists of operators whose weight scales with the size of the code: for an $n_1 \times n_2$ lattice, the X stabilizers have weight $2n_2$ and the Z stabilizers weight $2n_1$. If techniques such as Shor's method (section 12) were used, we would need to prepare a large ancilla state to perform fault-tolerant correction, which is clearly undesirable. This problem can be mitigated thanks to the gauge structure of these codes [AC07].

Each of the stabilizers in $\mathcal{S}$ is simply the product of certain elements from $\mathcal{T}$ , for example,

$X_{1,*}X_{2,*} = (X_{1,1}X_{2,1})(X_{1,2}X_{2,2})(X_{1,3}X_{2,3}).$    (93)

Therefore, if we check the eigenvalues of the three two-qubit operators from $\mathcal{T}$, we can calculate the eigenvalue of the weight-six stabilizer. This decomposition of the stabilizer set is only possible because it is in terms of operators from $\mathcal{T}$ which, when measured, have no effect on the logical information encoded within the system. In fact, when error correction is performed, the gauge state of the system will almost always change based on the order in which the eigenvalues of the gauge operators are checked.
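
The classical post-processing implied by equation (93) is just a product of ±1 outcomes. As a minimal sketch (our own, not from [AC07]; the function name is a hypothetical helper):

    # The eigenvalue of the weight-six stabilizer X_{1,*}X_{2,*} is the product of
    # the three two-qubit gauge outcomes X_{1,j}X_{2,j}, each measured as +1 or -1.
    from math import prod

    def stabilizer_parity(gauge_outcomes):
        """Combine +/-1 outcomes of the two-qubit gauge measurements."""
        return prod(gauge_outcomes)

    # A single Z error on one qubit of row 1 anti-commutes with exactly one XX gauge
    # operator, flipping that outcome and hence the reconstructed stabilizer eigenvalue.
    print(stabilizer_parity([+1, +1, +1]))  # no error  -> +1
    print(stabilizer_parity([-1, +1, +1]))  # one error -> -1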

This exploitation of the gauge properties of subsystem coding is extremely beneficial for the design of fault-tolerant correction circuits. As the stabilizer operators can now be decomposed into multiple weight-two operators, fault-tolerant circuits for error correction do not require any encoded ancilla states. Furthermore, even if we scale the code space to correct more errors (increasing the lattice size representing the code), we never need to measure operators of higher weight. Figure 19, taken from [AC07], illustrates the fault-tolerant circuit constructions for Bacon–Shor subsystem codes. As each ancilla qubit is only coupled to two data qubits, no further circuit constructions are required to ensure fault-tolerance (footnote 13). The measurement results from these two-qubit parity checks are then combined to calculate the parity of the higher weight stabilizers of the subsystem code.


Figure 19. (From [AC07]) Circuits for measuring the gauge operators and hence performing error correction for subsystem codes. (a) measures, fault-tolerantly, the operator $X_{j,k}X_{j+1,k}$ with only one ancilla. (b) measures $Z_{k,j}Z_{k,j+1}$. The results of these two-qubit parity checks can be used to calculate the parity of the higher weight stabilizer operators of the code.


A second benefit of utilizing subsystem codes is the ability to construct fault-tolerant circuits that perform dynamical code switching. Dynamical code switching is a technique that could be utilized when noise sources are highly biased. In many systems the physical mechanisms which lead to errors are not symmetric in X and Z. For example, simple Markovian dephasing introduces only Z errors into the system and is frequently the dominant error channel.

One technique is to use concatenation in a clever way. The QEC codes we have largely examined in this review are symmetric in the X and Z sectors; the 5-, 7- and 9-qubit codes each correct one X error and one Z error. Aliferis and Preskill considered a concatenated code in which the lower level is a simple n-qubit repetition code (section 4) that corrects only Z errors [AP08]. This lower level code symmetrizes the Z and X noise (the size, n, of the repetition code is determined by the level of asymmetry in the physical noise), so the standard QEC codes at subsequent levels of concatenation operate with symmetric rates of logical X and Z errors. By utilizing the simpler n-qubit repetition code at the lower level, qubit resources are reduced compared with using a symmetric code at every level. This work was extended and adapted to a realistic noise model in superconducting systems [ABD+09].
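
A toy estimate in the spirit of [AP08] (our own sketch, not a result from that paper; the error rates are illustrative numbers) shows how the inner repetition code size n might be chosen:

    # Choose the size n of an inner phase-flip repetition code so that the residual
    # logical Z rate after majority-vote correction is pulled down towards the
    # physical X rate, which the inner code leaves essentially unprotected.
    from math import comb

    def logical_z_rate(n, pz):
        """Probability that more than (n-1)//2 of the n qubits suffer a Z error."""
        t = (n - 1) // 2
        return sum(comb(n, k) * pz**k * (1 - pz)**(n - k) for k in range(t + 1, n + 1))

    px, pz = 1e-5, 1e-3          # illustrative, strongly biased physical error rates
    for n in (1, 3, 5, 7):
        print(n, logical_z_rate(n, pz), "target:", px)

Once the two residual channels are comparable, the outer symmetric code sees roughly balanced noise.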

Asymmetric coding may also be required if the noise acting on a quantum computer changes over the course of its operation. In this case it may be necessary to switch from a symmetric code to an asymmetric code dynamically. Naively, this can be done by simply decoding all qubits and then re-encoding them with the new code. However, this procedure would not be fault-tolerant, as any error that occurs while the system is decoded will propagate through to the new encoded state. A second technique would be to encode the system with an asymmetric code at the next level of concatenation and then decode the lower level. However, when utilizing the BS codes, we can perform this code switching in a different way. As noted before, the $\mathcal{C}(n_1,n_2)$ code corrects $\lfloor (n_1-1)/2 \rfloor$ Z errors and $\lfloor (n_2-1)/2 \rfloor$ X errors. The ability to convert between two codes, $\mathcal{C}(n_1,n_2)$ and $\mathcal{C}(n_1',n_2')$, in a fault-tolerant manner would allow us to dynamically change the strength (or asymmetry) of the error correction whenever noise rates fluctuate in the computer. It was shown in [SEDH08] how such switching can be achieved, allowing for the fault-tolerant conversion between arbitrarily sized QEC codes.

14.2. Topological codes

A conceptually similar (but technically distinct) coding technique to the Bacon–Shor subsystem codes is topological error correction, first introduced with the toric code of Kitaev in 1997 [Kit97]. Topological coding is similar to subsystem coding in that the code structure is defined on a lattice (which, in general, can be of dimension ⩾ 2) and the scaling of the code to correct more errors is conceptually straightforward. However, in topological coding schemes the protection afforded to logical information relies on the low likelihood of error chains that trace out topologically non-trivial paths over the code surface.

Topological error correction is a complicated area of QEC and fault-tolerance, and any attempt to summarize the field fairly is not possible within this review. In brief, there are two ways of approaching these schemes. The first is simply to treat topological codes as a class of stabilizer codes over a qubit system. This approach is more amenable to current information technologies and is being adapted to methods in cluster state computing [RHG07, FG09], optics [DFS+09, DMN11], ion traps [SJ09] and superconducting systems [IFI+02]. The second approach is to construct a physical Hamiltonian model based on the structure of the topological code, or to choose systems which appear to exhibit topological order. This leads to the more complicated field of anyonic quantum computation [Kit97]. For example, the coding structure of the toric code [Kit97] can be treated as a physical Hamiltonian system, with the ground state corresponding to the logical states of the code (in the case of the torus it is a 4-fold degenerate ground state corresponding to two encoded qubits). Excitations from the ground state of this Hamiltonian correspond to the creation of anyonic quasi-particles and are energetically unfavourable (since the physical Hamiltonian symmetries reflect the coding structure imposed).

This and other models of anyonic computation utilize quasi-particles that exhibit fractional quantum statistics: when two anyons are exchanged the state acquires a fractional phase shift, in contrast to bosons or fermions, which always acquire phase factors of ±1. The unique properties of anyonic systems therefore allow for natural, robust error protection. However, the major issue with this model is that it relies on quasi-particle excitations that do not, in general, arise naturally. Although certain physical systems have been shown to exhibit anyonic properties, most notably in the fractional quantum Hall effect [NSS+08], it remains a daunting task both to manufacture a reliable anyonic system and to reliably design and construct a large-scale computing system around one. We also note that anyonic models based on the 2D codes illustrated in this section have recently been shown not to exhibit self-correcting properties [KC08, BT09]; higher dimensional topological coding models, currently a minimum of 4D, are required to implement self-correcting quantum memories [AHHH10].

As there are several extremely good discussions of both anyonic [NSS+08] and non-anyonic topological computing [DKLP02, FSG09, FG09], we will not review the anyonic methods for topological computing and simply provide a brief example of one topological coding scheme, namely the surface code [BK01, DKLP02, FSG09]. The surface code is a desirable error correction model for several reasons. As it is defined over a two-dimensional lattice of qubits, it can be implemented on architectures that only allow for the coupling of nearest neighbour qubits (rather than the arbitrarily long distance coupling of qubits in separate regions of the computer). The surface code also exhibits one of the highest fault-tolerant thresholds of any QEC scheme; recent simulations estimate a threshold approaching 1% [RHG07, WFSH10]. Finally, the surface code has been analysed with respect to loss errors, showing a high tolerance when such errors are heralded [SBD09]. This subfield of error correction has been heavily researched in recent years: methods in statistical physics are now routinely utilized to help calculate fault-tolerant thresholds [BMD08, KBMD09, BAO+12, ABKMD12], and more advanced coding models, together with methods for computing with them, are under investigation [BMD07b, BMD07a, KBAMD10, Bom11, Fow12]. This section presents a basic introduction; we hope it will leave readers more comfortable studying these advanced treatments afterwards.

The surface code, as with subsystem codes, is a stabilizer code defined over a two-dimensional qubit lattice, as figure 20 illustrates. We identify each edge of the 2D lattice with a physical qubit. The stabilizer set consists of two types of operators: the set of $Z^{\otimes 4}$ operators around every lattice face (or plaquette), and the set of $X^{\otimes 4}$ operators around every vertex of the lattice. The stabilizer set is consequently generated by the operators

$A_p = \prod_{i\in b(p)} Z_i, \qquad B_v = \prod_{i\in s(v)} X_i,$    (94)

where b(p) denotes the four qubits surrounding a plaquette, p, and s(v) the four qubits surrounding a vertex, v, with identity operators on the other qubits implied. First note that all of these operators commute, as any plaquette and vertex stabilizer share either zero or two qubits. If the lattice is not periodic in either dimension, this stabilizer set completely specifies one unique state; for an N × N lattice there are $2N^2$ qubits and $2N^2$ stabilizer generators. Hence this stabilizer set defines a unique multi-qubit entangled state which is generally referred to as a 'clean' surface. Detailing exactly how this surface can be utilized to perform robust quantum computation is far outside the scope of this review, and we refer the reader to several papers for such a discussion [RH07, RHG07, FSG09, FG09]. Instead, we can quite adequately show how robust error correction is possible by simply examining how a 'clean' surface can be maintained in the presence of errors. The Z- and X-type stabilizer sets, $A_p$ and $B_v$, define two equivalent 2D lattices which are interlaced, as figure 21 illustrates: if the total 2D lattice is shifted along the diagonal by half a cell, the operators $B_v$ are arranged around a plaquette and the operators $A_p$ around a lattice vertex. Since protection against X errors is achieved by detecting eigenvalue flips of Z stabilizers, and vice versa, these two interlaced lattices correspond to error correction against X and Z errors respectively. Therefore we can quite happily restrict our discussion to one error channel, for example correcting X errors (the correction of Z errors proceeds identically when considering the stabilizers $B_v$ instead of $A_p$).
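
The commutation property quoted above can be checked mechanically. The sketch below (our own edge indexing; H, V, plaquette and star are hypothetical helper names) builds the stabilizer supports for qubits on the edges of an N × N lattice and verifies that every plaquette/vertex pair overlaps on an even number of qubits:

    # Qubits live on lattice edges: H(i, j) is the horizontal edge leaving vertex
    # (i, j) to the right, V(i, j) the vertical edge leaving it downwards.
    N = 3
    def H(i, j): return ('h', i, j)
    def V(i, j): return ('v', i, j)

    def plaquette(i, j):
        # Support of the Z-type operator A_p around face (i, j).
        return {H(i, j), H(i + 1, j), V(i, j), V(i, j + 1)}

    def star(i, j):
        # Support of the X-type operator B_v on the edges meeting vertex (i, j);
        # boundary vertices simply have fewer incident edges.
        edges = set()
        if j > 0: edges.add(H(i, j - 1))
        if j < N: edges.add(H(i, j))
        if i > 0: edges.add(V(i - 1, j))
        if i < N: edges.add(V(i, j))
        return edges

    plaquettes = [plaquette(i, j) for i in range(N) for j in range(N)]
    stars = [star(i, j) for i in range(N + 1) for j in range(N + 1)]

    # Any vertex touches 0 or 2 edges of a plaquette's boundary cycle, so all
    # Z-type and X-type generators commute.
    assert all(len(p & s) % 2 == 0 for p in plaquettes for s in stars)
    print("all plaquette/vertex stabilizers commute")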


Figure 20. General structure of the surface code. The edges of the lattice correspond to physical qubits. The surface is a +1 eigenstate of the operators $A_p$ around each face (or plaquette) and of the operators $B_v$ around each vertex. If all eigenstate conditions are met, a unique multi-qubit state is defined, referred to as a 'clean' surface.


Figure 21. The surface code embeds two self-similar lattices that are interlaced, generally referred to as the primal and dual lattices. (a) illustrates one lattice, where plaquettes are defined by the stabilizers $A_p$. (b) illustrates the dual structure, where plaquettes are defined by the stabilizer set $B_v$. The two lattices are related by a shift along the diagonal by half a lattice cell, and each is independently responsible for X or Z error correction.


Figure 22(a) illustrates the effect a single X error has on a pair of adjacent plaquettes. Since X and Z anti-commute, a single bit-flip error on one qubit in the surface will flip the eigenvalues of the $Z^{\otimes 4}$ stabilizers on the two plaquettes adjacent to that qubit. As single qubit errors flip the eigenvalues of adjacent plaquette stabilizers, we next examine how chains of errors affect the surface. Figures 22(b) and (c) examine two longer chains of errors. As these examples show, when multiple errors occur, only the eigenvalues of the stabilizers associated with the ends of the error chains flip: each plaquette along the chain has two X errors occurring on different boundaries, and consequently the eigenvalue of the $Z^{\otimes 4}$ stabilizer around each such plaquette flips twice.


Figure 22. Examples of error chains and their effect on the eigenvalues of each plaquette stabilizer. (a) A single X error causes the parity of two adjacent cells to flip. (b) and (c) Longer chains of errors only cause the end cells to flip eigenvalue, as each intermediate cell contains two X errors and hence the eigenvalue of its stabilizer flips twice.


If we now introduce an additional ancilla qubit which sits in the centre of each plaquette and can couple to the four surrounding qubits, we can check the parity by running the simple circuit shown in figure 23. Assume that we initially prepare a perfect 'clean' surface and then, at some later time, check the parity of every plaquette over the surface. If X errors have occurred on a certain subset of qubits, the parities associated with the endpoints of error chains will have flipped. We now take this two-dimensional classical data set of eigenvalue flips and pair the flipped plaquettes up into the most likely set of error chains. Since it is assumed that the probability of error on any individual qubit is low, the most likely set of errors which reflects the eigenvalue changes observed is the minimum weight set, i.e. connect up all plaquettes where eigenvalues have changed into pairs such that the total length of all connections is minimized. This classical data processing problem is well studied in computer science, and minimum weight matching algorithms such as the Blossom package [CR99, Kol09] have a running time polynomial in the total number of data points in the classical set. Once this minimal matching is achieved, we can identify the likely error chains corresponding to the endpoints and apply corrections accordingly.
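
The matching step can be illustrated with a deliberately naive toy decoder (our own sketch; the coordinates and the min_weight_pairing helper are hypothetical). Real decoders use polynomial-time minimum weight matching [CR99, Kol09]; brute force suffices for a handful of defects:

    # Pair up the plaquette coordinates whose eigenvalues flipped so that the
    # summed Manhattan distance (length of the connecting error chains) is minimal.
    from itertools import permutations

    def chain_length(a, b):
        return abs(a[0] - b[0]) + abs(a[1] - b[1])

    def min_weight_pairing(defects):
        best, best_cost = None, float('inf')
        for perm in permutations(defects):          # exponential: toy sizes only
            pairs = list(zip(perm[::2], perm[1::2]))
            cost = sum(chain_length(a, b) for a, b in pairs)
            if cost < best_cost:
                best, best_cost = pairs, cost
        return best, best_cost

    # Two error chains of lengths 1 and 2 leave four flipped plaquettes:
    defects = [(1, 1), (1, 2), (4, 4), (6, 4)]
    pairs, cost = min_weight_pairing(defects)
    print(pairs, cost)   # pairs the nearby defects; total corrected chain length 3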


Figure 23. (a) Lattice structure to check the parity of a surface plaquette. An additional ancilla qubit is coupled to the four neighbouring qubits that comprise the plaquette. (b) Quantum circuit to check the parity of the $Z^{\otimes 4}$ stabilizer for each surface plaquette.


The failure of this code is therefore dictated by error chains that cannot be detected through changes in plaquette eigenvalues. Figure 24 shows an error chain that connects one edge of the surface lattice to another. In this case every plaquette has two associated qubits that have experienced a bit flip and no eigenvalues in the surface have changed. Since we have assumed that we only wish to maintain a 'clean' surface, these error chains have no effect; but when one considers storing information in the lattice, these types of error chains correspond to logical errors on the qubit [BK98, FSG09]. Hence undetectable errors are chains which connect boundaries of the surface to other boundaries (in the case of information processing, qubits are defined by artificial boundaries within the larger lattice surface). It should be stressed that this is a simplified description of the full protocol, but it does encapsulate the basic idea. The important thing to realize is that the failure rate of the error correction procedure is suppressed exponentially with the size of the lattice. If we consider an error model where each qubit experiences a bit flip independently with probability p, then an error chain of length one occurs with probability p, error chains of weight two occur with probability $O(p^2)$, chains of three with $O(p^3)$, etc. If we have an N × N lattice and we extend the surface by one plaquette in each dimension, then the probability of an error chain connecting two boundaries drops by a factor of p, since one extra qubit has to experience an error (footnote 14). Extending an N × N lattice by one plaquette in each dimension requires O(N) extra qubits, hence this type of error correcting code suppresses the probability of undetectable errors exponentially with a qubit resource cost that grows only linearly.
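
A back-of-the-envelope illustration of this scaling argument (our own numbers, ignoring the combinatorial prefactors mentioned in footnote 14):

    # The shortest undetectable chain spanning an N-plaquette-wide surface requires
    # roughly N independent errors, so its probability falls as p^N while the qubit
    # count quoted in the text grows only as 2*N^2.
    p = 0.01
    for N in range(3, 8):
        print("N =", N, " qubits =", 2 * N * N, " spanning-chain probability ~", p**N)

Each extra layer of the lattice multiplies the spanning-chain probability by p while adding only O(N) qubits.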


Figure 24. Example of a chain of errors which does not cause any eigenvalue changes in the surface. If errors connect boundaries to other boundaries, the error correction protocol will not detect them. In the case of a 'clean' surface, these error chains are invariants of the surface code. When computation is considered, information qubits are artificial boundaries within the surface; hence if error chains connect these information qubits to other boundaries, logical errors occur.


As we showed in section 10, standard concatenated coding techniques allow for an error rate suppression which scales with the concatenation level as a double exponential while the resource increase scales exponentially. For the surface code, the error rate suppression scales exponentially while the resource increase scales linearly. While these scaling relations might be mathematically equivalent, the surface code offers much more flexibility at the architectural level. Being able to increase the error protection in the computer with only a linear change in the number of physical qubits is far more beneficial than using an exponential increase in resources when utilizing concatenated correction. Specifically, consider the case where an error protected computer is operating at a logical error rate which is just above what is required for an algorithm. If concatenated error correction is employed, then adding another layer of correction will not only increase the number of qubits by an exponential amount, but it will also drop the effective logical error rate far below what is actually required. In contrast, if surface codes are employed, we increase the qubit resources by a linear factor and drop the logical error rate sufficiently for successful application of the algorithm.

We now leave the discussion regarding topological correction models. We emphasize again that this was a very broad overview of the general concept of topological codes. There are many details and subtleties that we have deliberately left out of this discussion and we urge the reader, if interested, to refer to the referenced articles for a more thorough treatment of this topic.

15. Conclusions and future outlook

This review has hopefully provided a basic introduction to some of the most important theoretical aspects of QEC and fault-tolerant quantum computation. The goal of this discussion was not to provide a rigorous theoretical framework for QEC and fault-tolerance, but instead to illustrate most of the important rules, results and techniques that have evolved out of this field. Hopefully this introduction will serve as a starting point for individuals introducing themselves to this topic. For those wishing to continue research in the QEC field, we highly recommend consulting the papers referenced throughout this review, especially references [Got97, NC00, Got02, KLA+02, Ste01, Got09], which provide a more mathematically rigorous review of QEC and fault-tolerance.

We have not only covered the basic aspects of QEC through specific examples, but also briefly discussed how physical errors influence quantum computation and how these processes are interpreted within the context of QEC. One of the more important aspects of this review is the discussion of the stabilizer formalism, circuit synthesis and fault-tolerant circuit construction. The stabilizer formalism is arguably the most useful theoretical tool in QEC: once it is sufficiently understood, most of the important properties of error correcting codes can be investigated and understood largely by inspection.

The study of QEC and fault-tolerance is still an active area of QIP research. Although the library of quantum codes and error correction techniques is vast, there is still a significant disconnect between the abstract framework of quantum coding and the more physically realistic implementation of error correction for large-scale quantum information processing.

There are several future possibilities for the direction of quantum information processing. Even with the development of many of these advanced techniques, the physical construction and accuracy of current qubit fabrication is still insufficient to obtain any benefit from QEC. Many in the field now acknowledge that the future development of quantum computation will most likely split into two broad categories. The first is arguably the more physically realistic, namely small qubit applications in quantum simulation. Beyond these smaller qubit applications, we move to truly large-scale quantum computation, i.e. implementing large algorithms such as Shor's on qubit arrays well beyond 1000 physical qubits. This would undoubtedly require active techniques in error correction. Future work needs to focus on adapting the many codes and fault-tolerant techniques to the architectural level. As we noted in section 10.4, the implementation of QEC at the design level largely influences the fault-tolerant threshold exhibited by the code itself. Being able to efficiently incorporate both the actual quantum code and the error correction procedures at the physical level is extremely important when developing an experimentally viable, large-scale quantum computer.

There are many differing opinions within the quantum computing community as to the future prospects for quantum information processing. Many remain pessimistic regarding the development of a million qubit device and instead look towards quantum simulation in the absence of active error correction as the realistic goal of quantum information. However, in the past few years, the theoretical advances in error correction and the fantastic speed of experimental development of few-qubit devices continue to offer hope for the near-term construction of a large-scale computer, incorporating many of the ideas presented within this review. While we cannot foresee the possible successes or failures in quantum information science, we remain hopeful that a large-scale quantum computer is still a goal worth pursuing.

Acknowledgments

The authors wish to thank A M Stephens, R Van Meter, A G Fowler, L C L Hollenberg and A D Greentree for helpful comments and acknowledge the support of MEXT, JST and FIRST projects.

Footnotes

  • Worst case fidelity assumes that the state |ψ〉L is orthogonal to the state σxσxσx|ψ〉L.

  • Degenerate quantum codes are ones where different types of errors have the same effect on the code states.

  • In general, for an n qubit system, the total dimensionality of the Hilbert space is $2^n$. If a stabilizer set defined over this system contains k multiplicatively independent generators, then the dimension of the stabilized subspace is $2^{n-k}$ and the stabilizer set therefore defines a subspace containing n − k logical qubits.

  • An operator, U, which is both Hermitian and unitary can only have eigenvalues of ±1.

  • The stabilizer formalism was introduced after the 5-qubit code.

  • This assumption is for demonstration purposes. In reality, all qubits will experience errors and hence $E_j$ can be of higher weight (up to a weight-N operator on an N qubit system). The ability of the error correction code to correct these higher weight errors depends on how all the $E_j$ map the ancilla states under $U_{\rm QEC}$.

  • A degenerate quantum code is one where multiple unique errors can map to the same state; in the case of equation (60) this would mean two operators $E_j$ and $E_j'$ which, under the unitary $U_{\rm QEC}$, map |A0〉X and |A0〉Z to the same ancilla state.

  • 10. If U is not a member of the Clifford group (operators which map Pauli operators to Pauli operators) then E will map to a linear combination of Pauli errors.

  • 11. It should be noted that this process is equivalent to directly coupling qubits with quantum gates and then measuring one of the qubits; hence errors will be copied in the same manner as with direct coupling.

  • 12. Provided we are in eigenstates of $\mathcal{S}$ we can, in principle, be in superpositions of eigenstates of the operators in $\mathcal{T}$. However, the error correction procedure for the BS code projects us back to eigenstates of the gauge operators.

  • 13. It should be noted that for higher distance BS codes, specifically for d ⩾ 5, multiple measurements of each syndrome are required to ensure fault-tolerance.

  • 14. The reduction is not exactly p, as there is a combinatorial factor that accounts for all the extra error chains that are now possible.
