Towards scalable bosonic quantum error correction

B M Terhal; J Conrad; C Vuillot

doi:10.1088/2058-9565/ab98a5

1. Introduction

There has been a recent surge in interest in bosonic error correction, both from the experimental as well as from the theoretical side. By bosonic quantum error correction we mean the representation of a qubit as a two-dimensional subspace of an oscillator, a means of performing some error correction on this qubit, as well as a suite of techniques to perform universal computation on the qubit.

We review some of these recent developments and older proposals, with an eye towards integration of the ideas into a scalable (code) architecture. To be concrete, we concentrate on superconducting devices as physical realizations, due to the excellent control and engineerability of strong non-linearities, as described by the formalism of circuit quantum electrodynamics (circuit-QED). For more background, we refer the reader to a recent review of circuit-QED [1] and also the realization of quantum error correction in circuit-QED [2].

Due to the commonality of the quantum optics language, some of our discussion applies more generally to other physical systems realizing oscillators, such as optical modes or mechanical oscillators. Our paper does not aim to be comprehensive in reviewing all possible bosonic codes, but rather seeks to identify some promising approaches and future work to be undertaken, in particular emphasizing scalable bosonic GKP error correction.

A first condition to even consider encoding a qubit into an oscillator is that a high-Q oscillator is available⁶. Examples of such high-Q oscillators are microwave cavity modes of 3D or co-planar resonators in a frequency range f = 3–10 GHz, where single-photon life-times can be up to τ = 1/κ = 1–10 ms [3, 4]. Thus, without additional couplings and drives, the native noise model of such microwave cavities is simply photon loss, governed by the cavity decay rate κ.

In order to prepare and manipulate an encoded qubit as prescribed by some code, one induces additional errors which the chosen code should, ideally, also be able to correct. It is thus important to pick a code in which computational manipulations and error corrections are relatively simple and the chosen code can also handle the errors which occur in these processes. As is well known, no finite code can correct all errors, and hence the goal of bosonic quantum error correction is simply to provide a logical qubit which can be used as a building block in a further coding scheme. The repetition or surface codes are the simplest examples of such further encoding steps, using then multiple oscillators.

Even though we can embrace the surface code as the simplest scalable 2D coding scheme [5], the choice of which systems play the role of data and ancilla qubits, and of what additional error correction takes place on these qubits, is important in actually getting the very demanding engineering to pan out. If we have learned anything over the past 20 years of Hamiltonian engineering, it is that partially-coherent dynamics can be implemented in many quantum systems, while very few to none may allow for the high-precision control and scalability needed for quantum error correction.

Overall, the challenges of efficiently using a bosonic qubit encoding are in (1) keeping the harmonicity of the oscillators as high as possible while temporarily coupling to this mode, with high on/off ratio, to create and manipulate non-classical code states, and (2) finding a photon number regime in which approximations of engineered Hamiltonians are accurate while the error correcting properties and benefits of the encoding are valid. When using bosonic qubits as basic qubits in a code architecture, it may be advantageous to choose data qubits differently than ancilla qubits and we will give some examples of such choices. The simplest encoding of a single qubit into a bosonic mode can be done using Fock states: the vacuum state represents the logical |0⟩, denoted as $\vert \overline{0}\rangle$, and a single-photon state represents the logical |1⟩, denoted as $\vert \overline{1}\rangle$.

For superconducting devices one can view the difference between bosonic encoding versus the regular transmon qubit encoding [6] as an interchange between the roles played by the anharmonic and the harmonic oscillator. Using transmon qubits to store information, resonators are used as couplers and for read-out. Using bosonic qubits to store information, anharmonic oscillators can be used for state preparation and couplers generating effective nonlinearities to realize gates. In this review we will refer to systems in which the lowest two energy eigenstates (in the absence of couplers) are used as regular qubits: this definition covers a Fock encoding as well as a transmon or a fluxonium qubit.

1.1. Preliminaries & notation

Here we collect a few definitions and mathematical identities that are used throughout the paper. Additionally, useful textbooks for quantum optics and its mathematical description are [7–9]. We use $\hat{q}=\frac{1}{\sqrt{2}}\left(a+{a}^{{\dagger}}\right)$ and $\hat{p}=\frac{\mathrm{i}}{\sqrt{2}}\left({a}^{{\dagger}}-a\right)$, where a (a^†) are annihilation (creation) operators, so that $\left[\hat{q},\hat{p}\right]=\mathrm{i}I$, and we sometimes refer to $\hat{p}$ and $\hat{q}$ as quadratures. A displacement in phase space is denoted as D(α) = exp(αa^† − α*a) and acts as D^†(α)aD(α) = a + α, while a coherent state is defined as $D\left(\alpha \right)\vert 0\rangle =\vert \alpha \rangle =\mathrm{exp}\left(-\vert \alpha {\vert }^{2}/2\right){\sum }_{n=0}^{\infty }\frac{{\alpha }^{n}}{\sqrt{n!}}\vert n\rangle$. We have ${\mathrm{e}}^{\mathrm{i}\theta {a}^{{\dagger}}a}a {\mathrm{e}}^{-\mathrm{i}\theta {a}^{{\dagger}}a}=a {\mathrm{e}}^{-\mathrm{i}\theta }$ so that ${\mathrm{e}}^{\mathrm{i}\theta {a}^{{\dagger}}a}D\left(\alpha \right){\mathrm{e}}^{-\mathrm{i}\theta {a}^{{\dagger}}a}=D\left(\alpha {\mathrm{e}}^{\mathrm{i}\theta }\right)$. The following identities hold

$\begin{equation}\mathrm{exp}\left(-\mathrm{i}v\hat{q}\right)\vert p\rangle =\vert p-v\rangle ,\quad \mathrm{exp}\left(-\mathrm{i}u\hat{p}\right)\vert q\rangle =\vert q+u\rangle ,\quad \vert p\rangle =\frac{1}{\sqrt{2\pi }}{\int }_{\mathbb{R}}\mathrm{d}q {\mathrm{e}}^{\mathrm{i}pq}\vert q\rangle ,\end{equation} \tag{ 1 }$

so that

$\begin{equation}\mathrm{exp}\left(\mathrm{i}v\hat{q}\right)\hat{p} \mathrm{exp}\left(-\mathrm{i}v\hat{q}\right)=\hat{p}-v,\quad \mathrm{exp}\left(\mathrm{i}u\hat{p}\right)\hat{q} \mathrm{exp}\left(-\mathrm{i}u\hat{p}\right)=\hat{q}+u.\end{equation} \tag{ 2 }$
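As a brief check of equation (2), one can expand the conjugation in nested commutators. Because $\left[\hat{q},\hat{p}\right]=\mathrm{i}I$ is proportional to the identity, all higher nested commutators vanish and the series terminates after the first-order term. The intermediate steps are spelled out here for convenience:

$\begin{align}{\mathrm{e}}^{\mathrm{i}v\hat{q}}\hat{p} {\mathrm{e}}^{-\mathrm{i}v\hat{q}}& =\hat{p}+\mathrm{i}v\left[\hat{q},\hat{p}\right]+\frac{{\left(\mathrm{i}v\right)}^{2}}{2!}\left[\hat{q},\left[\hat{q},\hat{p}\right]\right]+\dots =\hat{p}+\mathrm{i}v\left(\mathrm{i}I\right)=\hat{p}-v,\\ {\mathrm{e}}^{\mathrm{i}u\hat{p}}\hat{q} {\mathrm{e}}^{-\mathrm{i}u\hat{p}}& =\hat{q}+\mathrm{i}u\left[\hat{p},\hat{q}\right]=\hat{q}+\mathrm{i}u\left(-\mathrm{i}I\right)=\hat{q}+u.\end{align}$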

A single-mode squeezing transformation is given by $\mathrm{exp}\left(-\mathrm{i}{H}_{\text{sq}}t\right)=\mathrm{S}\mathrm{q}\left(\xi \right)=\mathrm{exp}\left(\frac{1}{2}\left({\xi }^{{\ast}}{a}^{2}-\xi {{a}^{{\dagger}}}^{2}\right)\right)$ with Hamiltonian ${H}_{\text{sq}}=\mathcal{E}{{a}^{{\dagger}}}^{2}+{\mathcal{E}}^{{\ast}}{a}^{2}$ with $\xi =2\mathrm{i}\mathcal{E}t$ . The squeezer enacts the mode transformation a_out = Sq^†(ξ)aSq(ξ) = a cosh(r) − a^† e^iθ sinh(r) with r = |ξ| and θ = arg(ξ).
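The relation between ξ and the pump amplitude can be checked by comparing exponents term by term; this short verification is added here and uses only the definitions above:

$\begin{align}-\mathrm{i}{H}_{\text{sq}}t& =-\mathrm{i}\mathcal{E}t {{a}^{{\dagger}}}^{2}-\mathrm{i}{\mathcal{E}}^{{\ast}}t {a}^{2},\\ \frac{1}{2}\left({\xi }^{{\ast}}{a}^{2}-\xi {{a}^{{\dagger}}}^{2}\right){\Big\vert }_{\xi =2\mathrm{i}\mathcal{E}t}& =\frac{1}{2}\left(-2\mathrm{i}{\mathcal{E}}^{{\ast}}t {a}^{2}-2\mathrm{i}\mathcal{E}t {{a}^{{\dagger}}}^{2}\right)=-\mathrm{i}{H}_{\text{sq}}t.\end{align}$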

In Lindblad equations we use the notation $\mathcal{D}\left(A\right)\left(\rho \right)=A\rho {A}^{{\dagger}}-\frac{1}{2}\left\{{A}^{{\dagger}}A,\rho \right\}$ for some operator A.

It is standard to denote gates acting on a logical qubit subspace with overlines, i.e. $\overline{\mathrm{C}\mathrm{N}\mathrm{O}\mathrm{T}}$ etc. In order to avoid notation clutter, only in section 2.3.2 we denote logical gates on the GKP codewords without it, i.e. CNOT and Z instead of $\overline{\mathrm{C}\mathrm{N}\mathrm{O}\mathrm{T}}$ and $\overline{Z}$ .

2. Bosonic qubits and their components

2.1. Early birds & cats and their generalizations

The first bosonic codes were formulated in [10] and designed to protect against photon loss. Of particular interest is a two-mode code with codewords

$\begin{equation}\vert \overline{0}\rangle =\frac{1}{\sqrt{2}}\left(\vert 40\rangle +\vert 04\rangle \right) ,\quad \vert \overline{1}\rangle =\vert 22\rangle ,\end{equation} \tag{ 3 }$

with |k⟩ denoting a Fock state with k photons. If either $\vert \overline{0}\rangle$ or $\vert \overline{1}\rangle$ (or both) were hit by the loss of a single photon on any one of the two modes, we can readily see that the resulting states would still be orthogonal. This orthogonality is a prerequisite for being able to correct the photon loss error, but it is not a sufficient condition. To examine the error correction capability of a (bosonic) code, one asks whether a set of dominant errors satisfies the quantum error correction (QEC) conditions [11] of the code: if this holds (approximately) then there is an (approximate) recovery operation undoing these dominant errors. For a set of errors E = {E₁, ..., E_k} acting on the encoding of a single qubit, the QEC conditions are as follows. ∀i, j, we require

$\begin{equation}\langle \overline{0}\vert {E}_{i}^{{\dagger}}{E}_{j}\vert \overline{0}\rangle =\langle \overline{1}\vert {E}_{i}^{{\dagger}}{E}_{j}\vert \overline{1}\rangle \;\text{logical}\;\text{states}\;\text{are}\;\text{indistinguishable},\end{equation} \tag{ 4 }$

$\begin{equation}\langle \overline{0}\vert {E}_{i}^{{\dagger}}{E}_{j}\vert \overline{1}\rangle =\langle \overline{1}\vert {E}_{i}^{{\dagger}}{E}_{j}\vert \overline{0}\rangle =0\;\text{orthogonality}\;\text{remains.}\end{equation} \tag{ 5 }$

To exemplify the use of these conditions, let us first look at the single-mode version of the code in equation (3):

$\begin{equation}\vert \overline{0}\rangle =\frac{1}{\sqrt{2}}\left(\vert 0\rangle +\vert 4\rangle \right) ,\quad \vert \overline{1}\rangle =\vert 2\rangle .\end{equation} \tag{ 6 }$

This code was introduced in [12] as the smallest member of a family of so-called binomial codes, hence its name kitten or 'baby-binomial' code. This code and its logical gates have been implemented using a superconducting microwave cavity mode as an oscillator in reference [13], but the life-time of the encoded qubit was comparable to that of a Fock state encoding. One can easily check that for the error set $\mathsf{E}=\left\{I,\sqrt{\gamma }a\right\}$, the QEC conditions in equations (4) and (5) for this code are met. However, these errors are only an approximation of the real noise. A photon loss channel with photon decay rate κ lasting for time t with γ ≡ κt ≪ 1 can be modeled by a superoperator ${\mathcal{N}}_{\gamma }$ with Kraus operators ${E}_{0}=I-\frac{\gamma }{2}{a}^{{\dagger}}a+O\left({\gamma }^{2}\right)\approx {\mathrm{e}}^{-\gamma {a}^{{\dagger}}a/2}$ and ${E}_{1}=\sqrt{\gamma }a$, or

$\begin{equation}{\mathcal{N}}_{\gamma }\left(\rho \right)={E}_{0}\rho {E}_{0}^{{\dagger}}+{E}_{1}\rho {E}_{1}^{{\dagger}}, {E}_{0}^{{\dagger}}{E}_{0}+{E}_{1}^{{\dagger}}{E}_{1}=I+O\left({\gamma }^{2}\right).\end{equation} \tag{ 7 }$
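The O(γ²) remainder in equation (7) can be made explicit in the Fock basis, where both Kraus operators give diagonal contributions. The following minimal sketch assumes the linearized form E₀ = I − (γ/2)a^† a; the function name `completeness_defect` is ours and purely illustrative. It shows that the deviation from the identity on the Fock state |n⟩ is (γn)²/4.

```python
def completeness_defect(n, gamma):
    """Return <n|(E0^dag E0 + E1^dag E1)|n> - 1 for the linearized Kraus pair.

    Assumes E0 = I - (gamma/2) a^dag a and E1 = sqrt(gamma) a, which are both
    diagonal in the combination E^dag E when evaluated on Fock states.
    """
    e0 = (1.0 - 0.5 * gamma * n) ** 2   # <n|E0^dag E0|n>
    e1 = gamma * n                      # <n|E1^dag E1|n> = gamma <n|a^dag a|n>
    return e0 + e1 - 1.0


gamma = 0.01
for n in range(5):
    # The defect is second order in gamma and grows with the photon number.
    print(n, completeness_defect(n, gamma), (gamma * n) ** 2 / 4)
```

The growth of the defect with n illustrates why the approximation requires γ times the relevant photon numbers to be small, not just γ itself.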

For the Kraus operators E₀ and E₁ the QEC conditions in equations (4) and (5) are not quite met. In particular, we have

$\begin{equation}\langle \overline{0}\vert {E}_{0}^{{\dagger}}{E}_{0}\vert \overline{0}\rangle -\langle \overline{1}\vert {E}_{0}^{{\dagger}}{E}_{0}\vert \overline{1}\rangle =O\left({\gamma }^{2}\right)\left(\langle \overline{0}\vert {\left({a}^{{\dagger}}a\right)}^{2}\vert \overline{0}\rangle -\langle \overline{1}\vert {\left({a}^{{\dagger}}a\right)}^{2}\vert \overline{1}\rangle \right)\ne 0,\end{equation} \tag{ 8 }$

as $\vert \overline{1}\rangle$ is an eigenstate of a^† a, while $\vert \overline{0}\rangle$ is not. This means that upon the detection of no photon loss (corresponding to E₀) the code states undergo an irreversible distortion. The two-mode version of this code, equation (3), improves on this distortion issue as the quantum error correction conditions for the two-mode code are met for the error set $\mathsf{E}=\left\{\sqrt{\gamma }a,\sqrt{\gamma }b,\mathrm{exp}\left(-\frac{\gamma }{2}\left({\hat{n}}_{a}+{\hat{n}}_{b}\right)\right)\approx I-\frac{\gamma }{2}\left({\hat{n}}_{a}+{\hat{n}}_{b}\right)\right\}$. These error operators can be viewed as the three Kraus operators of a process in which there is either photon loss on mode a, photon loss on mode b, or no photon loss on either mode. For the states in equation (3) we have no distortion upon not detecting a photon from either mode, as $\vert \overline{0}\rangle$ and $\vert \overline{1}\rangle$ are both eigenstates of $\mathrm{exp}\left(-\frac{\gamma }{2}\left({\hat{n}}_{a}+{\hat{n}}_{b}\right)\right)$ with eigenvalue exp(−2γ). As far as we know, this two-mode code is still awaiting experimental realization.
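The statements about the single-mode kitten code in equation (6) can be checked by hand, or with a few lines of code. The sketch below represents states as dictionaries from photon number to (real) amplitude; the helper names `lower`, `inner` and `number_moment` are our own. It evaluates the QEC matrix elements of equations (4) and (5) for the error set {I, a}, and the second moment of a^† a that enters equation (8).

```python
import math


def lower(state):
    """Apply the annihilation operator a to a state {n: amplitude}."""
    return {n - 1: math.sqrt(n) * c for n, c in state.items() if n > 0}


def inner(bra, ket):
    """Inner product of two states with real amplitudes."""
    return sum(c * ket.get(n, 0.0) for n, c in bra.items())


def number_moment(state, k):
    """Expectation value of (a^dag a)^k in a state {n: amplitude}."""
    return sum(abs(c) ** 2 * n ** k for n, c in state.items())


# Kitten code words of equation (6).
zero = {0: 1 / math.sqrt(2), 4: 1 / math.sqrt(2)}
one = {2: 1.0}
errors = {"I": lambda s: s, "a": lower}

for name_i, err_i in errors.items():
    for name_j, err_j in errors.items():
        diag0 = inner(err_i(zero), err_j(zero))   # <0|Ei^dag Ej|0>
        diag1 = inner(err_i(one), err_j(one))     # <1|Ei^dag Ej|1>
        offd = inner(err_i(zero), err_j(one))     # <0|Ei^dag Ej|1>
        print(name_i, name_j, diag0, diag1, offd)

# Distortion under E0: the second moments of a^dag a differ.
print(number_moment(zero, 2) - number_moment(one, 2))
```

The diagonal entries agree and the off-diagonal entries vanish for {I, a}, while the difference of the second moments is nonzero, which is the origin of the distortion in equation (8).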

By allowing ourselves code states with higher average photon number, we can correct for more loss errors, as well as gain and dephasing errors. More precisely, reference [12] has introduced families of binomial and so-called cat codes which correct against the set of errors $\mathsf{E}=\left\{I,a,\dots ,{a}^{L},{a}^{{\dagger}},\dots ,{{a}^{{\dagger}}}^{G},{a}^{{\dagger}}a,\dots ,{\left({a}^{{\dagger}}a\right)}^{D}\right\}$ for arbitrary L, G and D. For example, the idea behind the binomial codes can be understood as follows. Using the Holstein–Primakoff transformation a^† a → J_z + J, the binomial code words $\vert \overline{0}\rangle$ and $\vert \overline{1}\rangle$ can be seen as spin-eigenstates of J_x = ±J with 2J = N + 1, where one defines N = max(L, G, 2D). Dephasing errors ${\left({a}^{{\dagger}}a\right)}^{m}$ , m = 1, ..., D thus lead to a change in J_x by at most D ⩽ ⌊J⌋, hence keeping codewords orthogonal. At the same time, protection against photon loss and gain is achieved by using a subspace of sufficiently separated Fock states stabilized by the operator Π_S = e^{i 2 π a^† a/ (S+1)} with S = L + G. For S = 1 this gives the photon parity operator ${{\Pi}}_{S=1}\equiv {{\Pi}}_{\text{photon}}={\mathrm{e}}^{\mathrm{i}\pi {a}^{{\dagger}}a}$ : the even-photon codewords in equation (6) are clearly +1 eigenstates of this photon parity operator.

Another family of single-mode codes are the cat codes. A very simple encoding is $\vert \overline{0}\rangle \approx \vert \alpha \rangle$ and $\vert \overline{1}\rangle \approx \vert -\alpha \rangle$ with coherent state |α⟩, first proposed in [14, 15]. Since |α⟩ and |−α⟩ are not orthogonal, it is more appropriate to define the code states as $\vert {C}_{\alpha }^{{\pm}}\rangle =\frac{1}{\sqrt{{N}_{{\pm}}}}\left(\vert \alpha \rangle {\pm}\vert -\alpha \rangle \right)$ with N_± = 2(1 ± exp(−2|α|²)). These states are orthogonal for all α and we can define $\vert {\pm}\rangle \equiv \vert {C}_{\alpha }^{{\pm}}\rangle$ . On this encoding, photon loss induces immediate phase-flip errors since $a\vert {C}_{\alpha }^{{\pm}}\rangle \propto \vert {C}_{\alpha }^{\mp }\rangle$ . Thus the phase-flip error rate (probability per unit time) is proportional to κ|α|² with κ the photon loss rate of the encoding mode.
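The action of photon loss on the two-component cat states can be illustrated numerically in a truncated Fock space. The sketch below assumes real α and a cutoff of 60 photons (both are choices made here for illustration, as are the helper names). It checks the normalization N_±, the orthogonality of $\vert {C}_{\alpha }^{{\pm}}\rangle$, and that $a\vert {C}_{\alpha }^{+}\rangle$ is proportional to $\vert {C}_{\alpha }^{-}\rangle$.

```python
import math


def coherent(alpha, nmax=60):
    """Fock amplitudes of |alpha> for real alpha, truncated at nmax levels."""
    pref = math.exp(-alpha ** 2 / 2)
    return [pref * alpha ** n / math.sqrt(math.factorial(n)) for n in range(nmax)]


def cat(alpha, sign, nmax=60):
    """Normalized cat state (|alpha> + sign*|-alpha>) / sqrt(N_sign)."""
    norm = math.sqrt(2 * (1 + sign * math.exp(-2 * alpha ** 2)))
    plus, minus = coherent(alpha, nmax), coherent(-alpha, nmax)
    return [(p + sign * m) / norm for p, m in zip(plus, minus)]


def lower(vec):
    """Apply a in the Fock basis: (a v)_n = sqrt(n + 1) * v_{n+1}."""
    return [math.sqrt(n + 1) * vec[n + 1] for n in range(len(vec) - 1)] + [0.0]


def dot(u, v):
    """Inner product of two real vectors."""
    return sum(x * y for x, y in zip(u, v))


alpha = 1.5
c_plus, c_minus = cat(alpha, +1), cat(alpha, -1)
a_c_plus = lower(c_plus)

mean_photons = dot(a_c_plus, a_c_plus)                    # <C+|a^dag a|C+>
fidelity = dot(a_c_plus, c_minus) ** 2 / mean_photons     # overlap with |C->

print("norms:", dot(c_plus, c_plus), dot(c_minus, c_minus))
print("orthogonality:", dot(c_plus, c_minus))
print("overlap of a|C+> with |C->:", fidelity)
print("<C+|a^dag a|C+>:", mean_photons)
```

For the even cat state the mean photon number evaluates to α² tanh(α²), which approaches α² for large α, consistent with a phase-flip rate proportional to κ|α|².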

On the other hand, for large enough α, bit-flips, α ↔ −α, can be expected to occur at a much lower error rate as they correspond to a large change of the state in phase space. Particularly interesting is the engineering of Hamiltonians or dissipative processes which have these code states |±α⟩ as degenerate fixed-points, so that there is a 'macroscopic' energy barrier to transition between them, leading to a bit-flip rate exponentially small in |α|². This design can lead to a qubit for which the noise is biased, as phase-flip errors are more prominent than bit-flip errors. We will discuss this noise-biased qubit in more detail in section 2.2.

The next-level cat encoding was introduced in [15, 16], and is sometimes referred to as the four-legged cat code since its codewords have four blobs in phase space:

$\begin{align}\hfill \vert \overline{0}\rangle & =\frac{1}{\sqrt{{N}_{0}}}\left(\vert \alpha \rangle +\vert -\alpha \rangle +\vert \mathrm{i}\alpha \rangle +\vert -\mathrm{i}\alpha \rangle \right),\quad \vert \overline{1}\rangle =\frac{1}{\sqrt{{N}_{1}}}\left(\vert \alpha \rangle +\vert -\alpha \rangle -\vert \mathrm{i}\alpha \rangle -\vert -\mathrm{i}\alpha \rangle \right),\hfill \\ \hfill {N}_{b}& =8 {\mathrm{e}}^{-{\alpha }^{2}}\left(\mathrm{cosh} {\alpha }^{2}+{\left(-1\right)}^{b}\mathrm{cos} {\alpha }^{2}\right),\quad \alpha \in \mathbb{R}.\hfill \end{align} \tag{ 9 }$

Using the standard identity $\langle \alpha \vert \beta \rangle ={\mathrm{e}}^{-\left(\vert \alpha {\vert }^{2}+\vert \beta {\vert }^{2}\right)/2}{\mathrm{e}}^{{\alpha }^{{\ast}}\beta }$ one can verify the orthogonality of these two states. As for the kitten code, we can verify that both states are +1 eigenstates of the photon parity operator ${{\Pi}}_{\text{photon}}={\mathrm{e}}^{\mathrm{i}\pi {a}^{{\dagger}}a}$ using that Π_photon|α⟩ = |−α⟩. The photon parity thus functions as a check operator, taking eigenvalue +1 on the code space and measuring it is a natural way to detect photon loss and perform error correction.
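The orthogonality, the normalization constant N₀ and the even photon parity of the four-legged cat code words can all be obtained from the coherent-state overlap identity alone. The following sketch does this with complex arithmetic; the value α = 1.3 and the helper names `overlap` and `gram` are arbitrary choices made for illustration.

```python
import cmath
import math


def overlap(a, b):
    """Coherent-state overlap <a|b> = exp(-(|a|^2 + |b|^2)/2 + conj(a) b)."""
    return cmath.exp(-(abs(a) ** 2 + abs(b) ** 2) / 2 + complex(a).conjugate() * b)


def gram(bra_coeffs, bra_amps, ket_coeffs, ket_amps):
    """Inner product of two (unnormalized) superpositions of coherent states."""
    return sum(
        complex(cb).conjugate() * ck * overlap(a, b)
        for cb, a in zip(bra_coeffs, bra_amps)
        for ck, b in zip(ket_coeffs, ket_amps)
    )


alpha = 1.3
amps = [alpha, -alpha, 1j * alpha, -1j * alpha]
c0 = [1, 1, 1, 1]      # coefficients of the unnormalized logical 0
c1 = [1, 1, -1, -1]    # coefficients of the unnormalized logical 1

n0 = gram(c0, amps, c0, amps)
n0_formula = 8 * math.exp(-alpha ** 2) * (math.cosh(alpha ** 2) + math.cos(alpha ** 2))
cross = gram(c0, amps, c1, amps)

# The parity operator maps |beta> to |-beta>, so it negates the ket amplitudes.
flipped = [-b for b in amps]
parity0 = gram(c0, amps, c0, flipped) / n0

print("N_0:", n0.real, "formula:", n0_formula)
print("<0|1> (unnormalized):", abs(cross))
print("<0|Parity|0>:", parity0.real)
```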

The states $\vert \overline{0}\rangle$ and $\vert \overline{1}\rangle$, both having even photon parity, are however distinguished by their photon number modulo 4, expressed as the ±1 eigenvalues of the operator ${{\Pi}}_{\text{photon}}^{1/2}=\mathrm{exp}\left(\mathrm{i}\pi {a}^{{\dagger}}a/2\right)$. To measure the photon parity operator via an ancilla qubit, a cavity mode-qubit dispersive interaction −χZa^† a/2 can be used [7]. In circuit-QED the interaction Za^† a comes about naturally as the effective interaction between, say, a cavity mode and a linearly-coupled, off-resonant, transmon qubit mode [6]. The measurement of Π_photon then proceeds by preparing the ancilla qubit in |+⟩, letting the interaction take place for time t = π/χ and subsequently measuring the qubit in the |±⟩ basis. Using a transmon qubit and cavity mode, reference [17] has shown that tracking the photon parity by repeated measurements of Π_photon makes for a logical qubit which has a longer life-time than a Fock qubit without error correction in the same cavity mode. This result has essentially been the first demonstration of quantum error correction lengthening the life-time as compared to that of native qubits (transmon and/or Fock encoding) in the hardware.
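The parity measurement described above can be sketched for a cavity in a Fock state |n⟩, assuming an ideal dispersive interaction without decoherence (the function name `prob_plus` and its parametrization by the product χt are our own). The ancilla components |0⟩ and |1⟩ acquire the phases exp(±iχt n/2), so that after χt = π the ancilla is found in |+⟩ for even n and in |−⟩ for odd n.

```python
import cmath
import math


def prob_plus(n, chi_t=math.pi):
    """Probability of finding the ancilla in |+> after the dispersive interaction.

    The ancilla starts in |+>, the cavity holds n photons, and the evolution is
    exp(i * chi_t * Z * n / 2), i.e. generated by -chi Z a^dag a / 2 for time t.
    """
    phase0 = cmath.exp(1j * chi_t * n / 2)     # phase on ancilla |0>
    phase1 = cmath.exp(-1j * chi_t * n / 2)    # phase on ancilla |1>
    amp_plus = (phase0 + phase1) / 2           # <+| applied to the final ancilla state
    return abs(amp_plus) ** 2


for n in range(6):
    print(n, round(prob_plus(n), 12), round(prob_plus(n, math.pi / 2), 12))
```

With half the interaction time, χt = π/2, the same circuit distinguishes n = 0 mod 4 from n = 2 mod 4 among the even Fock states, which is the information carried by ${{\Pi}}_{\text{photon}}^{1/2}$.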

Before we discuss further generalizations, let us examine the quantum error correction conditions, equations (4) and (5), for this cat code with respect to the set of errors $\mathsf{E}=\left\{I,\sqrt{\gamma }a\right\}$. One can quickly observe that all conditions are obeyed except possibly $\langle \overline{0}\vert {a}^{{\dagger}}a\vert \overline{0}\rangle \overset{?}{=}\langle \overline{1}\vert {a}^{{\dagger}}a\vert \overline{1}\rangle$. Besides the uninteresting case of taking α very large (so that all |±α⟩, |±iα⟩ are orthogonal), this last condition is exactly met at sweet spots given by the equation tan α² = −tanh α². The smallest sweet spot at |α|² ≈ 2.37 lies close to the number of photons $\overline{n}=2$ of the cat code used in the experiment [17].
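The sweet-spot condition can be located numerically. The sketch below uses closed-form mean photon numbers of the two code words, which we obtained from their Fock expansions (photon numbers 0 mod 4 and 2 mod 4, respectively) and which are not quoted from the references, together with a plain bisection; the bracket [2, 3] is chosen by inspection.

```python
import math


def mean_photon_difference(x):
    """<0|a^dag a|0> - <1|a^dag a|1> for the four-legged cat code, with x = alpha^2."""
    n0 = x * (math.sinh(x) - math.sin(x)) / (math.cosh(x) + math.cos(x))
    n1 = x * (math.sinh(x) + math.sin(x)) / (math.cosh(x) - math.cos(x))
    return n0 - n1


def bisect(f, lo, hi, tol=1e-12):
    """Find a root of f in [lo, hi], assuming f changes sign on the interval."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


sweet_spot = bisect(mean_photon_difference, 2.0, 3.0)
print("smallest sweet spot alpha^2:", sweet_spot)
print("tan + tanh at the root:", math.tan(sweet_spot) + math.tanh(sweet_spot))
```

Further sweet spots can be found by moving the bracket to larger α², since tan α² + tanh α² changes sign once in every interval of length π.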

There are several error channels which impact the performance of the cat code using repeated photon parity measurements. First of all, the code cannot fully correct against the photon loss channel as it cannot correct the distortion Kraus operator E₀ = I − γa^† a/2. Secondly, two photon-loss events ∝ a² implement a logical bit-flip $\vert \overline{0}\rangle {\leftrightarrow}\vert \overline{1}\rangle$. Thirdly, photon loss in combination with the inevitable Kerr nonlinearity $\sim {\left({a}^{{\dagger}}a\right)}^{2}$ on the cavity mode causes uncorrectable dephasing: the Kerr interaction makes the cavity rotation speed depend on the number of photons in the cavity, but this number becomes indeterminate in the presence of photon loss. Last but not least, transmon qubit decay during the qubit controlled-a^† a interaction is a serious source of feedback error. For example, when the qubit decays half-way through the interaction, |1⟩ → |0⟩, it applies only half the rotation on the cavity mode. The result is that the eigenvalues of ${{\Pi}}_{\text{photon}}^{1/2}=\overline{Z}$ are measured via the qubit measurement, collapsing the logical state.

This last feedback error problem is an important issue for any bosonic qubit, and it has been a central theme in the theory of fault-tolerant computing in general [18]. A disadvantage of the theoretical schemes for fault-tolerant quantum error correction is that they typically require additional hardware resources, such as logical ancilla qubits or (verified) multi-qubit GHZ states. Instead, we may seek hardware-efficient mitigation of the feedback error problem. As an example, reference [19] has addressed the feedback error due to transmon relaxation by drive-engineering the dispersive coupling Hamiltonian to equal −χ(|2⟩⟨2| + |1⟩⟨1|)a^† a and starting the ancilla transmon qubit in the state $\frac{1}{\sqrt{2}}\left(\vert 0\rangle +\vert 2\rangle \right)$. Transmon qubit decay from 2 → 1 then commutes with the transmon–cavity interaction and does not cause errors on the cavity mode. The decay does, as in the normal case, affect the reliability of the transmon qubit measurement outcome. All in all, this has led to an overall improvement by a factor of 5 in the life-time of the encoded cat qubit [19].

Another way of minimizing feedback errors on a bosonic code is to use a biased-noise qubit (section 2.2) as the ancilla qubit. As proposed in reference [20], the goal is then to let the strong-noise error channel affect only the ancilla qubit measurement, while the low-noise (bit-flip) channel on the ancilla feeds back little noise to the bosonic code.

Single-mode cat codes with higher photon numbers can be formulated and form a class of codes [12, 21]. Reference [22] studied the performance of binomial, cat and GKP codes (see section 2.3) against photon loss, assuming optimal noise-free recovery as permitted by the quantum error correction conditions. Reference [23] has formulated a general framework of rotation-symmetric codes of which the binomial and cat codes are subclasses: the unifying theme is rotation symmetry of the code states in phase space captured by invariance under the operator Π_S. Another interesting class of bosonic codes uses a three-wave mixing χ⁽²⁾-interaction, equation (32), as the central element for defining the code and correcting photon loss [24]. Various classes of multi-mode codes against photon loss exist, see for example [22, 25, 26] and references therein.

A challenge in using a bosonic qubit is that some computational manipulations can be more involved than for a regular qubit. For example, on a regular qubit such as a transmon qubit, rotations by an angle θ around axes X or Y are easily accomplished by temporarily supplying microwave radiation. On a bosonic qubit, these simple single-qubit gates can be non-trivial. An advocated solution in reference [27] is to always use a dual-rail (dr) encoding of a bosonic qubit with $\vert {\overline{0}\rangle }_{\text{dr}}=\vert \overline{0}\overline{1}\rangle$ and $\vert {\overline{1}\rangle }_{\text{dr}}=\vert \overline{1}\overline{0}\rangle$, where $\vert \overline{0}\rangle$ and $\vert \overline{1}\rangle$ are the states of (an arbitrary) bosonic qubit itself. Having mapped the Bloch-sphere of a qubit onto a two-mode state space, the exponential mode-SWAP operator exp(iθSWAP_a,b) becomes a universal gate to do single-qubit and two-qubit gates, and has been realized in [28]. Here the linear SWAP_a,b transformation interchanges two modes a and b, i.e. its action on quadrature operators for the modes is given by q_a ↔ q_b and p_a ↔ p_b. If we envision using a bosonic qubit as a building block qubit in a stabilizer code, it is however not necessary to be able to perform arbitrary gates; rather, we can focus on performing CNOT or CZ, Hadamard (H) and T gates, possibly using ancilla qubits, see e.g. reference [5].

2.2. Noise-biased cat qubit

A method to set up a dissipative process which stabilizes the coherent states |±α⟩ was devised in [15]. The idea is to engineer the Lindblad equation (in a frame rotating at the mode frequency):

$\begin{equation}\dot {\rho }=-\mathrm{i}\left[{H}_{\text{sq}},\rho \right]+{\kappa }_{2ph}\mathcal{D}\left({a}^{2}\right)\left(\rho \right)\equiv \mathcal{L}\left(\rho \right),\end{equation} \tag{ 10 }$

with ${H}_{\text{sq}}=\mathcal{E}{{a}^{{\dagger}}}^{2}+{\mathcal{E}}^{{\ast}}{a}^{2}$ where $\mathcal{E}=\mathrm{i}\vert \mathcal{E}\vert$ with $\vert \mathcal{E}\vert$ proportional to the strength of a pumped microwave mode acting as a classical field. To understand the fixed points of this evolution—ρ for which $\mathcal{L}\left(\rho \right)=0$ —we can write the Lindblad equation as

$\begin{equation}\dot {\rho }=-\mathrm{i}\left({H}_{\text{eff}}\rho -\rho {H}_{\text{eff}}^{{\dagger}}\right)+{\kappa }_{2ph}{a}^{2}\left(\rho \right){{a}^{{\dagger}}}^{2},\end{equation} \tag{ 11 }$

with ${H}_{\text{eff}}={H}_{\text{sq}}-\frac{\mathrm{i}{\kappa }_{2ph}}{2}{{a}^{{\dagger}}}^{2}{a}^{2}$ . We can then use, with $K=\frac{{\kappa }_{2ph}}{2}$ :

$\begin{align}\hfill -\mathrm{i}{H}_{\text{eff}}=-K{{a}^{{\dagger}}}^{2}{a}^{2}+\vert \mathcal{E}\vert \left({{a}^{{\dagger}}}^{2}-{a}^{2}\right)=& -K{\tilde {M}}_{\alpha }^{{\dagger}}{M}_{\alpha }-\frac{\vert \mathcal{E}{\vert }^{2}}{K},\hfill \\ \hfill \text{with}\quad {M}_{\alpha }={a}^{2}-{\alpha }^{2}I,\quad {\tilde {M}}_{\alpha }={a}^{2}+{\alpha }^{2}I,\quad & \alpha =\sqrt{\frac{\vert \mathcal{E}\vert }{K}}.\hfill \end{align} \tag{ 12 }$

This immediately implies that the states $\vert {\pm}\alpha ={\pm}\sqrt{\vert \mathcal{E}\vert /K}\rangle$ are fixed points of the Lindblad evolution, as M_α|±α⟩ = 0, and the last term in equation (11) is canceled by the constant $\frac{-\vert \mathcal{E}{\vert }^{2}}{K}$ which remains from the first term. Hence, any linear combination of the states $\vert \alpha ={\pm}\sqrt{\vert \mathcal{E}\vert /K}\rangle$ is a fixed point of the dynamics.
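For completeness, the factorization in equation (12) and the cancellation that makes |±α⟩ fixed points can be written out explicitly for real α. The intermediate steps below are worked out here from the definitions above and are not quoted from [15]:

$\begin{align}-K{\tilde {M}}_{\alpha }^{{\dagger}}{M}_{\alpha }& =-K\left({{a}^{{\dagger}}}^{2}+{\alpha }^{2}\right)\left({a}^{2}-{\alpha }^{2}\right)=-K{{a}^{{\dagger}}}^{2}{a}^{2}+K{\alpha }^{2}\left({{a}^{{\dagger}}}^{2}-{a}^{2}\right)+K{\alpha }^{4},\quad K{\alpha }^{2}=\vert \mathcal{E}\vert ,\quad K{\alpha }^{4}=\frac{\vert \mathcal{E}{\vert }^{2}}{K},\\ -\mathrm{i}{H}_{\text{eff}}\vert {\pm}\alpha \rangle & =-\frac{\vert \mathcal{E}{\vert }^{2}}{K}\vert {\pm}\alpha \rangle ,\quad {\kappa }_{2ph}{a}^{2}\vert {\pm}\alpha \rangle \langle {\pm}\alpha \vert {{a}^{{\dagger}}}^{2}=2K{\alpha }^{4}\vert {\pm}\alpha \rangle \langle {\pm}\alpha \vert ,\\ \mathcal{L}\left(\vert {\pm}\alpha \rangle \langle {\pm}\alpha \vert \right)& =\left(-2\frac{\vert \mathcal{E}{\vert }^{2}}{K}+2K{\alpha }^{4}\right)\vert {\pm}\alpha \rangle \langle {\pm}\alpha \vert =0.\end{align}$

The same cancellation holds for the coherences |α⟩⟨−α| and |−α⟩⟨α|, since a² acts with the same eigenvalue α² on both |α⟩ and |−α⟩.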

When the pump inducing the squeezing Hamiltonian H_sq is off, $\mathcal{E}=0$, we can observe that the Fock states $\vert 0\rangle ={\mathrm{lim}}_{\alpha \to 0}\vert {C}_{\alpha }^{+}\rangle$ and $\vert 1\rangle ={\mathrm{lim}}_{\alpha \to 0}\vert {C}_{\alpha }^{-}\rangle$ are fixed points, distinguished by their photon parity. Thus when $\mathcal{E}$ is gradually increased, we can smoothly change from a Fock encoding into the cat $\vert {C}_{\alpha }^{{\pm}}\rangle$ encoding. Photon loss at rate κ, which can be modeled by introducing an additional term $\kappa \mathcal{D}\left(a\right)\left(\rho \right)$ in equation (10), causes phase-flip errors, i.e. flipping between the states $\vert {C}_{\alpha }^{{\pm}}\rangle$, but does not interfere with the stabilization itself as |±α⟩ are eigenstates of a so that $\mathcal{D}\left(a\right)\left(\vert {\pm}\alpha \rangle \langle {\pm}\alpha \vert \right)=0$. One can add a drive term H_drive = ε(t)a^† + ε^*(t)a to the Lindblad equation and observe that the annihilation operator a will generate rotations around the Z-axis (periodically interchanging $\vert {C}_{\alpha }^{{\pm}}\rangle$). At the same time, a^† in principle leads to a departure from the qubit subspace spanned by |±α⟩, corresponding to leakage. However, due to the ∼|α|² gap of the Lindbladian, such departure from the eigenvalue-0 manifold is exponentially suppressed and the effect of the driving term can be analyzed by projecting it onto the stabilized subspace. In this subspace it then induces Rabi oscillations around an axis which is exponentially-closely aligned with the Z-axis, with Rabi frequency Ω ∝ |ε||α|, experimentally demonstrated in [29].⁷ A measurement in the X-basis can be accomplished by measuring the photon parity through a coupled transmon qubit. The (pumped) squeezing interaction and the required two-photon dissipative process have first been experimentally realized in [31]. This was achieved by coupling a 3D storage cavity (at frequency ω_a) via a bridging transmon to a lossy cavity (at different frequency ω_b) and applying a two-tone drive on the lossy cavity so as to set up a process to convert two storage photons to one lossy cavity photon which is subsequently lost (the ${\kappa }_{2ph}\mathcal{D}\left({a}^{2}\right)$ process). The lossy cavity is driven at pump frequency ω_p = 2ω_a − ω_b as well as close to its own frequency ω_b, generating, through the transmon nonlinearity, an effective degenerate parametric oscillator with resonant terms of the form ${{a}^{{\dagger}}}^{2}b+{b}^{{\dagger}}{a}^{2}$.
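The projection of a linear drive onto the cat subspace can be illustrated with the exact relation $a\vert {C}_{\alpha }^{{\pm}}\rangle =\alpha \sqrt{{N}_{\mp }/{N}_{{\pm}}}\vert {C}_{\alpha }^{\mp }\rangle$ for real α. The sketch below evaluates the matrix element of a drive of the form eps·a^† + eps*·a between the two cat states. The name `eps` for the drive amplitude and the function name are notation chosen here for illustration, and leakage out of the subspace is ignored.

```python
import math


def drive_matrix_element(alpha, eps):
    """Return <C-|(eps a^dag + conj(eps) a)|C+> for real alpha.

    Uses <C-|a|C+> = alpha * r and <C-|a^dag|C+> = alpha / r,
    with r = sqrt(N_- / N_+) = sqrt(tanh(alpha^2)).
    """
    eps = complex(eps)
    r = math.sqrt(math.tanh(alpha ** 2))
    return alpha * (eps / r + eps.conjugate() * r)


for alpha in (0.5, 1.0, 2.0, 3.0):
    in_phase = drive_matrix_element(alpha, 1.0)     # real drive amplitude
    quadrature = drive_matrix_element(alpha, 1.0j)  # imaginary drive amplitude
    print(alpha, in_phase.real, abs(quadrature))
```

For a real drive amplitude the matrix element approaches 2·eps·α, in line with a Rabi frequency proportional to the product of drive amplitude and α, while for an imaginary amplitude it is of order α·exp(−2α²) and thus quickly becomes negligible.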

A more recent experimental realization in reference [32] has been able to cleanly generate the desired interactions (via an effective three-wave mixing, see also section 3.3) and observe the exponential decrease of the bit-flip error rate in |α|² as well as the linear increase of the phase-flip error rate with |α|².

An alternative, non-dissipative, route towards a noise-biased qubit was first proposed in [33]. Instead of invoking dissipation, the idea is to engineer a Hamiltonian which has |±α⟩ as degenerate eigenstates, using a Kerr nonlinearity and squeezing. The two-photon dissipation is then considered an optional add-on which helps in mitigating leakage, i.e. a departure from the subspace spanned by |±α⟩. The target Hamiltonian (in the rotating frame of the cavity mode) is

$\begin{align}\hfill H& =-K{{a}^{{\dagger}}}^{2}{a}^{2}+\mathcal{E}{{a}^{{\dagger}}}^{2}+{\mathcal{E}}^{{\ast}}{a}^{2}=-K{M}_{\alpha }^{{\dagger}}{M}_{\alpha }+\frac{\vert \mathcal{E}{\vert }^{2}}{K},\hfill \\ \hfill {M}_{\alpha }& ={a}^{2}-{\alpha }^{2}, \alpha =\sqrt{\frac{\vert \mathcal{E}\vert }{K}} {\mathrm{e}}^{\mathrm{i}\varphi }, \mathcal{E}=\vert \mathcal{E}\vert {\mathrm{e}}^{2\mathrm{i}\varphi }.\hfill \end{align} \tag{ 13 }$

The spectrum of H has eigenvalues running from $\frac{\vert \mathcal{E}{\vert }^{2}}{K}$ downwards as the first term in H is negative-semi-definite. Omitting the constant $\frac{\vert \mathcal{E}{\vert }^{2}}{K}$, the highest eigenstates are the states |±α⟩ with degenerate zero eigenvalues. We can observe the similarity and difference with equation (12): here we consider a Hermitian operator and the phase of the pump amplitude $\mathcal{E}$ is variable and determines the phase of the coherent states which are the zero energy eigenstates. Thus, by adiabatically changing the phase of $\mathcal{E}$ we move to different zero energy eigenstates, allowing us to transform α → −α and hence realize an X gate on $\vert {C}_{\alpha }^{{\pm}}\rangle$. For the stability of the encoded space it is important to understand the spectrum of H and the gap below these degenerate zero eigenstates, see the analysis in [20, 33]. To understand this, assume that the phase φ = 0 for simplicity. We can displace the Hamiltonian by D(±α) with $\alpha =\sqrt{\vert \mathcal{E}\vert /K}$. For large $\vert \mathcal{E}\vert /K$, one can approximate D^†(±α)HD(±α) ≈ −4K|α|² a^† a, a harmonic oscillator Hamiltonian. This shows that for large α, the spectrum approximately has the gap 4K|α|² and the first excited states below |±α⟩ are roughly equal to D(±α)|1⟩. The so-called 'Cassinian' Hamiltonian in equation (13) was first studied in [34]: the surfaces of constant classical energy are described by Cassinian ovals in ⟨p⟩ and ⟨q⟩ with the foci of the ovals at $\langle q\rangle ={\pm}\sqrt{\vert \mathcal{E}\vert /K}$. As a quantum system the spectrum is that of an inverted double-well ('double-oscillator') with the well maxima at zero energy for the states |±α⟩. We can consider the effect of driving and several dissipative processes for the Hamiltonian in equation (13). For example, when one includes photon loss $\kappa \mathcal{D}\left(a\right)\left(\rho \right)$ in the Lindblad equation and the pump amplitude is sufficiently large, i.e. $16\vert \mathcal{E}{\vert }^{2}>{\kappa }^{2}$ [33, 35, 36], the fixed point of the Lindblad equation is the state $p\vert \tilde {\alpha }\rangle \langle \tilde {\alpha }\vert +\left(1-p\right)\vert -\tilde {\alpha }\rangle \langle -\tilde {\alpha }\vert$ with modified $\tilde {\alpha }$, $\vert \tilde {\alpha }{\vert }^{2}<\frac{\vert \mathcal{E}\vert }{K}$. In this regime the system neatly represents the dissipative storage of a classical bit.
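The harmonic approximation of the displaced Hamiltonian quoted above can be made explicit for φ = 0 and real α. Using D^†(α)aD(α) = a + α in equation (13), and omitting the constant $\vert \mathcal{E}{\vert }^{2}/K$, one finds (these intermediate steps are worked out here and are not quoted from [20, 33]):

$\begin{align}{D}^{{\dagger}}\left(\alpha \right){M}_{\alpha }D\left(\alpha \right)& ={\left(a+\alpha \right)}^{2}-{\alpha }^{2}={a}^{2}+2\alpha a,\\ -K{D}^{{\dagger}}\left(\alpha \right){M}_{\alpha }^{{\dagger}}{M}_{\alpha }D\left(\alpha \right)& =-K\left({{a}^{{\dagger}}}^{2}+2\alpha {a}^{{\dagger}}\right)\left({a}^{2}+2\alpha a\right)\\ & =-4K{\alpha }^{2}{a}^{{\dagger}}a-2K\alpha \left({{a}^{{\dagger}}}^{2}a+{a}^{{\dagger}}{a}^{2}\right)-K{{a}^{{\dagger}}}^{2}{a}^{2}.\end{align}$

For states with few excitations in the displaced frame, the cubic and quartic terms are suppressed relative to the first term by powers of 1/α, which leaves the harmonic term −4K|α|² a^† a. The displacement by −α gives the same result with α replaced by −α.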

The effect of other sources of noise such as dephasing ( ${\kappa }_{\text{deph}}\mathcal{D}\left({a}^{{\dagger}}a\right)\left(\rho \right)$ , see also equation (29)), photon gain ( $\kappa {\overline{n}}_{\text{therm}}\mathcal{D}\left({a}^{{\dagger}}\right)\left(\rho \right)$ ) due to the coupling with a finite temperature heat bath, as well as coupling with baths with other spectral densities are discussed in detail in [15, 20, 33, 37].

Reference [30] has implemented the Kerr-cat Hamiltonian in equation (13) and the corresponding qubit in the resonant mode of a so-called SNAIL element (see section 3.3), coupled to a read-out cavity mode. The fourth-order nonlinearity of the SNAIL element gives the wanted −Ka^†2 a² term, while one can drive the mode at twice its frequency so as to use the third-order SNAIL term $\propto {{a}^{{\dagger}}}^{3}+{a}^{3}$ to turn on squeezing. The experiment generated cat states with |α|² ≈ 2.5 with a dephasing life-time of 3 μs and an enhanced decay life-time of 105 μs, and a π/2 rotation around the Z-axis obtained by driving took 24 ns. The ability to convert the noise-biased qubit to a Fock encoding by turning off the squeezing drive allows one to measure Pauli X via a standard dispersive measurement [30]. One can also measure a noise-biased qubit in the X-basis by dispersively coupling (−χa^† aZ/2) it to an ancilla qubit to map the photon parity onto the state of the ancilla qubit which is subsequently measured. To realize a (nondestructive) Pauli Z measurement, distinguishing ±α, reference [30] applied, besides the squeezing drive, a drive at the difference frequency of the SNAIL mode and the read-out cavity mode (b) to get a resonant beam-splitting interaction ∝ a^† b + ab^†. The upshot is that the coherent states |±α⟩ are mapped to corresponding coherent states in the cavity mode which are heterodyne-measured when leaking out of the cavity.

Given that the noise-biased cat qubit is designed to have a low bit-flip error rate, it can function as an ancilla control qubit in the error correction circuit for another code [20] inducing low feedback noise. Assume we have a code which is an eigenspace of a stabilizer S = e^iA and S is to be measured using the noise-biased cat qubit to detect or correct errors. This requires an interaction of the noise-biased cat qubit and the code of the form H_int ∝ (a + a^†) ⊗ A since a + a^† ≈ Z on the noise-biased cat code space (besides some leakage), allowing for a qubit controlled-S operation. For example, for the cat code, $S={\mathrm{e}}^{\mathrm{i}\pi {b}^{{\dagger}}b}={{\Pi}}_{\text{photon}}$ , requiring a tunable photon–pressure coupling between the two modes of the form H_int ∝ (a + a^†)b^† b. For the GKP code, see section 2.3, S is a displacement so that H_int can be chosen to be a tunable beam-splitting interaction of the form a^† b + ab^†.

It has been argued that, if the noise-bias of this qubit is sufficiently strong, only a classical repetition code [38] might suffice to correct for the dominant phase-flip (Z) errors due to photon loss. Crucial in this idea is that the CNOT gate which is needed to measure the XX checks of this code preserves the noise-bias, that is, Z errors during the gate do not propagate to become X errors after the gate. For the Kerr-cat qubit a noise-bias preserving CNOT gate has been proposed in [37]. A similar idea is to use this Kerr-cat qubit as a basic qubit in a surface code architecture in which the XXXX and YYYY checks are measured [37, 39]. In this modified form of surface code one gains much more information about Z errors. It has been shown that when the probability for phase-flip errors and measurement errors is a factor 100 more than that of bit-flip errors within a phenomenological error model, the threshold against Z errors can be as high as 5% [39]. It is an open question whether such high bias will be feasible in practice as experiments for doing the CZ gate and the noise-bias preserving CNOT gate on these noise-biased qubits are still to come.

2.3. The GKP qubit

The (square) Gottesman–Kitaev–Preskill (GKP) qubit introduced in reference [40] is defined through two commuting displacement operators, acting as translations in phase space, i.e. ${S}_{q}=\mathrm{exp}\left(\mathrm{i}2\sqrt{\pi }\hat{q}\right)$ and ${S}_{p}=\mathrm{exp}\left(-\mathrm{i}2\sqrt{\pi }\hat{p}\right)$ .⁸ The ideal GKP code is the space invariant under these two phase-space translations. As a result, any wave function in q (resp. p) in this space has support on $q=k\sqrt{\pi }$ (resp. $p=l\sqrt{\pi }$ ) for integers $k,l\in \mathbb{Z}$ . The logical operators of the qubit are $\overline{Z}=\mathrm{exp}\left(\mathrm{i}\sqrt{\pi }\hat{q}\right)$ and $\overline{X}=\mathrm{exp}\left(-\mathrm{i}\sqrt{\pi }\hat{p}\right)$ with $\overline{X}\overline{Z}=-\overline{Z}\overline{X}$ . In addition, $\overline{Y}=\mathrm{i}\overline{X}\overline{Z}=\mathrm{exp}\left(\mathrm{i}\pi /2\right)\mathrm{exp}\left(-\mathrm{i}\sqrt{\pi }\hat{p}\right)\mathrm{exp}\left(\mathrm{i}\sqrt{\pi }\hat{q}\right)=\mathrm{exp}\left(\mathrm{i}\sqrt{\pi }\left(-\hat{p}+\hat{q}\right)\right)$ . This choice makes the wave function in q of $\vert \overline{0}\rangle$ a sum of delta functions at values of q which are even multiples of $\sqrt{\pi }$ , while $\vert \overline{1}\rangle$ has uniform support on values of q which are odd multiples of $\sqrt{\pi }$ . The ideal code meets the quantum error correction conditions for a continuous set of 'at most half-logical' displacements $\mathsf{E}=\left\{{\mathrm{e}}^{\mathrm{i}u\hat{p}},{\mathrm{e}}^{\mathrm{i}v\hat{q}}:\vert u\vert ,\vert v\vert {\leqslant}\sqrt{\pi }/2\right\}$ , since any product of these shifts maps $\vert \overline{0}\rangle$ onto a state orthogonal to both $\vert \overline{1}\rangle$ and $\vert \overline{0}\rangle$ (and vice-versa). 
The set of correctable displacements forms a square Wigner–Seitz or Voronoi cell (containing only one lattice point such that all points in the cell are closer to this point than to another lattice point) in the code lattice generated by the logical phase-space translations.
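The commutation relations used in this definition all follow from the Weyl relation for displacements. As a small sanity check, the sketch below represents a displacement exp(i(a q̂ + b p̂)) by the pair (a, b) and evaluates the phase picked up when two displacements are exchanged, using [q̂, p̂] = i. The helper name `commutation_phase` and the tuple representation are illustrative choices, not notation from the text.

```python
import numpy as np

def commutation_phase(d1, d2):
    """Return w such that D1 D2 = w D2 D1, where D = exp(i(a q + b p)) is stored as (a, b)."""
    (a, b), (c, d) = d1, d2
    # exp(A) exp(B) = exp(B) exp(A) exp([A, B]) with [A, B] = -i (a d - b c) for [q, p] = i
    return np.exp(-1j * (a * d - b * c))

s = np.sqrt(np.pi)
S_q, S_p = (2 * s, 0.0), (0.0, -2 * s)    # stabilizers exp(i 2 sqrt(pi) q), exp(-i 2 sqrt(pi) p)
Z_bar, X_bar = (s, 0.0), (0.0, -s)        # logical operators exp(i sqrt(pi) q), exp(-i sqrt(pi) p)

print("S_q vs S_p    :", commutation_phase(S_q, S_p))      # commute
print("X_bar vs Z_bar:", commutation_phase(X_bar, Z_bar))  # anticommute
print("S_q vs X_bar  :", commutation_phase(S_q, X_bar))    # commute
print("S_p vs Z_bar  :", commutation_phase(S_p, Z_bar))    # commute
```

The stabilizers commute with each other and with the logical operators, while the two logical displacements pick up the phase −1.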

Naturally, an asymmetric version of the GKP code which corrects more shift errors in $\hat{q}$ than shift errors in $\hat{p}$ can also be defined. However, when there is no hardware-based noise asymmetry between $\hat{p}$ and $\hat{q}$ this does not seem immediately useful.

In principle, and in theory, to perform quantum error correction the eigenvalues (phases) of the unitary operators S_p and S_q are to be measured. Performing such measurements projects the continuum of errors onto (superpositions of) possible displacements, and we perform error correction by choosing a displacement of minimal amplitude which resets these eigenvalues to +1, corresponding to the code space. In section 2.4 we will analyze GKP quantum error correction using encoded GKP ancilla qubits, see figure 7. The advantage of this form of error correction is that it does not suffer from feedback errors induced by a poor ancilla qubit (instead, it suffers feedback errors from a GKP ancilla qubit) and the information gained through measuring the GKP ancilla states is analog rather than binary. The disadvantage is that one needs to prepare GKP ancilla states themselves first.

For this latter task one can perform some form of phase estimation to measure the eigenvalues of the unitary operators S_p and S_q. Since the eigenvalues take continuous values, one only ever realizes an approximate estimation of these phases. Phase estimation can readily be executed by coupling the GKP mode repeatedly to a single ancilla qubit via controlled-displacement gates as was proposed and discussed in great detail in reference [41], focusing on a circuit-QED implementation. The idea behind this is simple. To measure the eigenvalue of a unitary operator U such as the displacements S_p or S_q, one can use ancilla qubits applying qubit controlled-U^k gates for k = 1, 2, .... For example, when k = 1, the circuit on the left in figure 1 has outcome probabilities $\mathbb{P}\left({\pm}\right)=\frac{1}{2}\left(1{\pm}\langle \text{Re}\left(U\right)\rangle \right)$ , while the circuit on the right has probabilities $\mathbb{P}\left({\pm}\right)=\frac{1}{2}\left(1\mp \langle \text{Im}\left(U\right)\rangle \right)$ .
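These outcome probabilities can be reproduced with a two-amplitude calculation for an input eigenstate with U|ψ⟩ = e^{iθ}|ψ⟩. The sketch below is a minimal model, not the circuit of figure 1 itself: it assumes that the ancilla starts in |+⟩, that the controlled-U kicks the phase e^{iθ} onto the ancilla state |1⟩, and that the second circuit differs from the first only by an extra phase i on |1⟩ before the X-basis measurement. The function name and the `extra_phase` parameter are illustrative.

```python
import numpy as np

def ancilla_plus_probability(theta, extra_phase=1.0):
    """P(+) for the ancilla after a controlled-U acting on an eigenstate with eigenvalue e^{i theta}."""
    amp0 = 1 / np.sqrt(2)                                  # amplitude on |0>
    amp1 = extra_phase * np.exp(1j * theta) / np.sqrt(2)   # phase kick-back on |1>
    # X-basis measurement: overlap with <+| = (<0| + <1|)/sqrt(2)
    return abs((amp0 + amp1) / np.sqrt(2)) ** 2

theta = 0.4
p_re = ancilla_plus_probability(theta)                     # (1 + cos(theta)) / 2
p_im = ancilla_plus_probability(theta, extra_phase=1j)     # (1 - sin(theta)) / 2
print(p_re, (1 + np.cos(theta)) / 2)
print(p_im, (1 - np.sin(theta)) / 2)
```

For a superposition of eigenstates the same expressions are averaged with weights |α_n|².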

**Figure 1.** Single round of phase estimation with $U\vert {\psi }_{n}\rangle ={\mathrm{e}}^{\mathrm{i}{\theta }_{n}}\vert {\psi }_{n}\rangle$ where the probability for ancilla qubit to be measured in the state |±⟩ equals $\mathbb{P}\left({\pm}\right)={\sum }_{n}\vert {\alpha }_{n}{\vert }^{2}\frac{1}{2}\left(1{\pm}\mathrm{cos}\left({\theta }_{n}\right)\right)$ (left) and $\mathbb{P}\left({\pm}\right)={\sum }_{n}\vert {\alpha }_{n}{\vert }^{2}\frac{1}{2}\left(1\mp \mathrm{sin}\left({\theta }_{n}\right)\right)$ (right). In the applications here U is a displacement S_p or S_q.

In phase-estimation schemes, higher powers k > 1 of U^k are often used, but applying U^k, a displacement of strength ∼k, increases the number of photons in the state by ∼k² and does not provide a good approximation of an approximate GKP state [41]. Instead of repeating the phase estimation to collect bits of the phase and then do a final corrective displacement, it is experimentally simpler to opt for immediate feedback on the code state based on each new bit obtained in a round of phase estimation. This is the route taken in the experimental realization of the GKP code in [42], where a small conditional displacement on the GKP qubit is executed depending on the ancilla qubit measurement outcome. In fact, using such immediate feedback the state of the ancilla qubit does not even need to be measured, as the feedback can be done depending on the qubit state itself, followed by an approximate disentangling step [43] or alternatively a qubit reset step (to avoid entropy build-up).

In addition, in reference [42] only the right circuit in figure 1 measuring Im(U) is used (instead of measuring both Re(U) and Im(U)). If the state to be measured is (approximately) symmetrically centered around the vacuum so that its wavefunction is symmetric under q → −q and p → −p, we have ∫dp|ψ(p)|²Im(S_p) = 0 and ∫dq|ψ(q)|²Im(S_q) = 0. This implies that $\mathbb{P}\left({\pm}\right)=\frac{1}{2}\left(1\mp \langle \text{Im}\left(U\right)\rangle \right)=\frac{1}{2}$ , suggesting that the measurement outcome ± can gain a maximal amount of information by weakly projecting onto sin(θ) ≷ 0, and subsequently shifting the state to the point θ = 0. These feedback shifts are realized in [42] by small displacements. Note that if the input state has eigenvalue phase θ close to 0, then Re(U) is close to 1, implying that not much is learned by doing the measurement with outcomes $\mathbb{P}\left({\pm}\right)=\frac{1}{2}\left(1{\pm}\langle \text{Re}\left(U\right)\rangle \right)$ .

We remark that the length of the displacement of the logical $\overline{Y}$ is $\sqrt{2}$ larger than that of $\overline{X}$ and $\overline{Z}$ . This implies some asymmetry in error correction. Namely, if we correct by measuring S_p and S_q, shifts such as $\mathrm{exp}\left(-\mathrm{i}u\hat{p}+\mathrm{i}v\hat{q}\right)$ with u² ⩽ π/4 and v² ⩽ π/4 can be corrected which, as displacements, are a factor $\sqrt{2}$ larger than correctable displacements in pure $\hat{p}$ and $\hat{q}$ directions. Given a noise model which is rotationally-symmetric in phase space, this does not seem to be an optimal choice. It also implies that logical Y eigenstates which can flip due to large displacements in pure $\hat{p}$ and $\hat{q}$ directions can have shorter lifetimes [42].

A 'hexagonal' GKP qubit has also been defined in [40] by choosing two phase-space lattice translations which are not orthogonal such that all three logical operators $\overline{X},\overline{Y}$ and $\overline{Z}$ have the same length as phase-space translation vectors. For this choice we take as stabilizers $\mathrm{exp}\left(\mathrm{i}\xi \left(\sqrt{3}\hat{q}-\hat{p}\right)/2\right)$ and $\mathrm{exp}\left(\mathrm{i}\xi \hat{p}\right)$ with $\xi =2\sqrt{2\pi /\sqrt{3}}$ , generating a hexagonal lattice in phase space. Again the logical operators are half-stabilizers, forming the vectors generating a hexagonal lattice. The correctable displacements now form a hexagonal Wigner–Seitz cell. This cell is larger in volume than the square Wigner–Seitz cell in the square GKP lattice. If we assume that displacement errors occur according to a stochastic Gaussian model as in equation (24), it implies that the hexagonal code can correct a larger probability volume of errors.
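The equal-length property is easy to verify from the stabilizers just given. In the sketch below each displacement exp(i(a q̂ + b p̂)) is stored as the vector (a, b). The hexagonal logical operators are taken to be the two half-stabilizers and their product. The labels `P1`, `P2`, `P3` are placeholders, since which of them is called $\overline{X}$ , $\overline{Y}$ or $\overline{Z}$ is a convention not fixed here.

```python
import numpy as np

sqrt_pi = np.sqrt(np.pi)
xi = 2 * np.sqrt(2 * np.pi / np.sqrt(3))

# Square code: exponents of X = exp(-i sqrt(pi) p), Z = exp(i sqrt(pi) q), Y = exp(i sqrt(pi)(q - p))
square = {"X": (0.0, -sqrt_pi), "Z": (sqrt_pi, 0.0), "Y": (sqrt_pi, -sqrt_pi)}

# Hexagonal code: half of exp(i xi (sqrt(3) q - p)/2), half of exp(i xi p), and their product
h1 = np.array([np.sqrt(3) * xi / 4, -xi / 4])
h2 = np.array([0.0, xi / 2])
hexagonal = {"P1": h1, "P2": h2, "P3": h1 + h2}

square_len = {name: float(np.hypot(*vec)) for name, vec in square.items()}
hex_len = {name: float(np.hypot(*vec)) for name, vec in hexagonal.items()}
print("square   :", square_len)
print("hexagonal:", hex_len)
```

In the square code the $\overline{Y}$ displacement is longer by $\sqrt{2}$ , whereas all three hexagonal logical displacements have length ξ/2, which exceeds the length $\sqrt{\pi }$ of the shortest square-code logical displacement.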

If we were to choose stabilizers ${S}_{q}=\mathrm{exp}\left(\mathrm{i}\sqrt{2\pi }\hat{q}\right)$ and ${S}_{p}=\mathrm{exp}\left(-\mathrm{i}\sqrt{2\pi }\hat{p}\right)$ , there would be no additional commuting displacement operators, implying that the +1 eigenspace of S_p and S_q is one-dimensional. This eigenstate, also called the sensor state |ψ_sensor⟩ in [44], is a uniform sum of delta functions at $q=k\sqrt{2\pi }$ with $k\in \mathbb{Z}$ (and similarly a uniform sum of delta functions at $p=l\sqrt{2\pi }$ with $l\in \mathbb{Z}$ ). The sensor state is interesting in allowing one to simultaneously estimate the real and imaginary parts of the amplitude α of a displacement D(α), by performing phase-estimation for S_q and S_p on D(α)|ψ_sensor⟩ [44].

We will focus exclusively on the square GKP code in the remainder of this review, although most points apply with small variation to the hexagonal code.

2.3.1. Approximate GKP states

Any physical GKP code state will occupy a finite volume in phase space and will have a finite number of photons. In principle, an infinite number of approximations to the perfect GKP code states exist, but some are more useful than others and here we will mention four. Reference [40] introduced a form of approximate GKP state obtained by applying a Gaussian superposition of displacements, characterized by a 'squeezing' parameter Δ > 0, to a perfect state:

$\begin{equation}\vert \overline{\psi }\rangle =\mathfrak{E}\vert {\overline{\psi }}_{\text{ideal}}\rangle ,\quad \mathfrak{E}\equiv \frac{1}{\sqrt{\pi {{\Delta}}^{2}}}{\int }_{{\mathbb{R}}^{2}} \mathrm{d}u\mathrm{d}v \mathrm{exp}\left(-\frac{{u}^{2}+{v}^{2}}{2{{\Delta}}^{2}}\right)\mathrm{exp}\left(-\mathrm{i}u\hat{p}+\mathrm{i}v\hat{q}\right).\end{equation} \tag{ 14 }$

For this model wavefunction it holds that $\overline{n}\approx \frac{1}{2{{\Delta}}^{2}}-\frac{1}{2}$ [40, 41]. One can perform the Gaussian phase-space integral in equation (14) and—neglecting contributions O(Δ⁴) ≪ 1, see e.g. [45]—one gets a different approximation using an operator $\mathfrak{D}$ :

$\begin{equation}\mathfrak{E}\approx 2\sqrt{\pi {{\Delta}}^{2}}\mathfrak{D},\quad \mathfrak{D}\equiv \mathrm{exp}\left(-{{\Delta}}^{2}\hat{n}\right).\end{equation} \tag{ 15 }$

The envelope operator $\mathfrak{D}$ has approximately the same effect as the 'no loss' Kraus operator of a photon loss channel ${\mathcal{N}}_{\gamma }$ , equation (7), with γ = 2Δ². Another approximation, valid for small Δ is

$\begin{equation}\mathfrak{E}\vert \overline{0}\rangle \approx \mathfrak{F}\vert \overline{0}\rangle \propto {\int }_{\mathbb{R}}\mathrm{d}q \sum _{k\in \mathbb{Z}}\underset{\mathrm{e}\mathrm{n}\mathrm{v}\mathrm{e}\mathrm{l}\mathrm{o}\mathrm{p}\mathrm{e}}{\underbrace{{\mathrm{e}}^{-2{{\Delta}}^{2}\pi {k}^{2}}}}\underset{\mathrm{c}\mathrm{o}\mathrm{m}\mathrm{b}}{\underbrace{{\mathrm{e}}^{-\frac{1}{2{{\Delta}}^{2}}{\left(q-2k\sqrt{\pi }\right)}^{2}}}}\vert q\rangle ,\end{equation} \tag{ 16 }$

$\begin{equation}\mathfrak{E}\vert \overline{+}\rangle \approx \mathfrak{F}\vert \overline{+}\rangle \propto {\int }_{\mathbb{R}}\mathrm{d}q \sum _{k\in \mathbb{Z}}\underset{\mathrm{e}\mathrm{n}\mathrm{v}\mathrm{e}\mathrm{l}\mathrm{o}\mathrm{p}\mathrm{e}}{\underbrace{{\mathrm{e}}^{-\frac{1}{2}{{\Delta}}^{2}\pi {k}^{2}}}}\underset{\mathrm{c}\mathrm{o}\mathrm{m}\mathrm{b}}{\underbrace{{\mathrm{e}}^{-\frac{1}{2{{\Delta}}^{2}}{\left(q-k\sqrt{\pi }\right)}^{2}}}}\vert q\rangle .\end{equation} \tag{ 17 }$

The state $\mathfrak{F}\vert \overline{0}\rangle$ can be interpreted as the result of preparing a squeezed state $\frac{1}{{\pi }^{1/4}}\int \mathrm{d}q \mathrm{exp}\left(-{q}^{2}/2{{\Delta}}^{2}\right)\vert q\rangle$ to which one applies a Gaussian-enveloped coherent sum over stabilizer translations, enacting $\rho \to {\sum }_{k,l\in \mathbb{Z}}{\mathrm{e}}^{-2{{\Delta}}^{2}\pi \left({k}^{2}+{l}^{2}\right)}{S}_{p}^{k}\rho {S}_{p}^{-l}$ . The result is a state which is both an approximate eigenstate of S_q (and $\overline{Z}$ ) due to squeezing, as well as an approximate eigenstate of the translation S_p. Note that unlike $\mathfrak{E}$ and $\mathfrak{D}$ , approximation $\mathfrak{F}$ has an asymmetry in p and q. The three approximations $\mathfrak{D},\mathfrak{E},\mathfrak{F}$ have been discussed and shown to fit a standard form in [46]. In addition, the normalization of these approximate forms can be computed and expressed in terms of theta functions, see e.g. appendix A for the $\mathfrak{D}$ -approximation.

In equation (B.2) we will see a fourth, von-Mises or reverse-Villain, approximation using a cosine function to represent the periodicity in the wave-function comb. This reverse-Villain approximation has been used in [47, 48]. All these approximate states $\mathfrak{E}\vert \overline{0}\rangle$ and $\mathfrak{E}\vert \overline{1}\rangle$ (or $\mathfrak{D}\vert \overline{0}\rangle$ and $\mathfrak{F}\vert \overline{0}\rangle$ etc) are +1 eigenstates of the photon parity operator ${\mathrm{e}}^{\mathrm{i}\pi {a}^{{\dagger}}a}$ as they are invariant under q → −q and p → −p, implying that they only have support on even photon number states. In appendix A we show how to get exact Fock state amplitudes for the approximation $\mathfrak{D}\vert \overline{0}\rangle$ (which for this purpose has the simplest form), and this turns out to involve nth-order derivatives of theta functions. We show in appendix A that the photon number distribution of these GKP states, as well as the sensor state, follows a thermal distribution [22] (see figures A1 and A2), with interesting oscillations on top.

One can propose various measures of state quality or fidelity besides the characterization of the state in terms of Δ. For example, when we measure $\hat{q}$ to infer $\overline{Z}$ on a state, all outcomes in which q is closer to an even multiple of $\sqrt{\pi }$ are interpreted as outcome $\overline{Z}=1$ and vice-versa. For a state ∫dqψ(q)|q⟩, the probability for this outcome is then

$\begin{equation}\mathbb{P}\left(\overline{Z}={\left(-1\right)}^{b}\right)={\int }_{{I}_{b}}\mathrm{d}q\vert \psi \left(q\right){\vert }^{2},\quad {I}_{b}=\left\{q\vert \exists k\in \mathbb{Z}, -\frac{\sqrt{\pi }}{2}{\leqslant}q+\left(2k+b\right)\sqrt{\pi }{\leqslant}\frac{\sqrt{\pi }}{2}\right\}.\end{equation} \tag{ 18 }$

If we apply this to the form $\mathfrak{E}\vert \overline{0}\rangle$ , the error probability $\mathbb{P}\left(\overline{Z}=-1\right){< }\frac{2{\Delta}}{\pi } \mathrm{exp}\left(-\pi /4{{\Delta}}^{2}\right)$ . Since a perfect (homodyne) measurement of $\hat{q}$ is practically not possible, $\mathbb{P}\left(\overline{Z}=-1\right)$ only provides a lower bound on the logical error probability of an approximate state $\vert \overline{0}\rangle$ . We can also examine the expectation value for $\overline{Z}$ on the approximate form $\mathfrak{F}\vert \overline{0}\rangle$ (for simplicity) which equals

$\begin{equation}\frac{\langle \overline{0}\vert {\mathfrak{F}}^{{\dagger}}\overline{Z}\mathfrak{F}\vert \overline{0}\rangle }{\langle \overline{0}\vert {\mathfrak{F}}^{{\dagger}}\mathfrak{F}\vert \overline{0}\rangle }\approx \frac{{\sum }_{k\in \mathbb{Z}}{\mathrm{e}}^{-4{{\Delta}}^{2}\pi {k}^{2}}{\int }_{\mathbb{R}}\mathrm{d}q {\mathrm{e}}^{\mathrm{i}\sqrt{\pi }q} {\mathrm{e}}^{-\frac{1}{{{\Delta}}^{2}}{\left(q-2k\sqrt{\pi }\right)}^{2}}}{{\sum }_{k\in \mathbb{Z}}{\mathrm{e}}^{-4{{\Delta}}^{2}\pi {k}^{2}}{\int }_{\mathbb{R}}\mathrm{d}q {\mathrm{e}}^{-\frac{1}{{{\Delta}}^{2}}{\left(q-2k\sqrt{\pi }\right)}^{2}}}={\mathrm{e}}^{-\pi {{\Delta}}^{2}/4},\end{equation} \tag{ 19 }$

and similarly $\frac{\langle \overline{1}\vert {\mathfrak{F}}^{{\dagger}}\overline{Z}\mathfrak{F}\vert \overline{1}\rangle }{\langle \overline{1}\vert {\mathfrak{F}}^{{\dagger}}\mathfrak{F}\vert \overline{1}\rangle }\approx -{\mathrm{e}}^{-\pi {{\Delta}}^{2}/4}$ , showing that the expectation decays exponentially in Δ² towards 0. In the approximation in equation (19) we have assumed that Δ is small enough so that the peaks at different k do not overlap, giving an easy expression for the probability distribution over q of the approximate GKP state. We further discuss the logical $\overline{Z}$ or $\overline{X}$ measurement of a GKP qubit in section 3.2.
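Equations (18) and (19) can be evaluated numerically for the comb state of equation (16). The sketch below builds the unnormalised density |ψ(q)|² on a grid and computes both ⟨Z̄⟩ and the probability of the outcome $\overline{Z}=-1$ . The grid range, the cut-off `kmax` and the choice Δ = 0.3 are illustrative assumptions. The comparison with the bound quoted for the $\mathfrak{E}$ -form is only indicative, since the state used here is the $\mathfrak{F}$ -form.

```python
import numpy as np

def gkp_zero_density(q, delta, kmax=12):
    """Unnormalised |psi(q)|^2 for the comb state F|0> of equation (16)."""
    k = np.arange(-kmax, kmax + 1)[:, None]
    centers = 2 * k * np.sqrt(np.pi)                       # peaks at even multiples of sqrt(pi)
    envelope = np.exp(-2 * delta**2 * np.pi * k**2)
    comb = np.exp(-(q[None, :] - centers) ** 2 / (2 * delta**2))
    return np.sum(envelope * comb, axis=0) ** 2

delta = 0.3
q = np.linspace(-45.0, 45.0, 90001)
dens = gkp_zero_density(q, delta)
dens /= dens.sum()                                         # normalise on the grid

z_expect = np.sum(dens * np.cos(np.sqrt(np.pi) * q))       # <Z_bar>; the sine part vanishes by symmetry
z_analytic = np.exp(-np.pi * delta**2 / 4)                 # right-hand side of equation (19)

nearest = np.rint(q / np.sqrt(np.pi)).astype(int)          # closest multiple of sqrt(pi)
p_error = dens[nearest % 2 == 1].sum()                     # weight in the set I_1 of equation (18)
bound = 2 * delta / np.pi * np.exp(-np.pi / (4 * delta**2))
print(z_expect, z_analytic)
print(p_error, bound)
```

For small Δ the peaks at different k do not overlap, so the grid value of ⟨Z̄⟩ should agree closely with the closed form in equation (19).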

It has become common to describe the quality of a GKP state in terms of an amount of squeezing expressed in dB. For a regular squeezed state (squeezed along q) one has variances $\mathrm{Var}\left(q\right)=\frac{{{\Delta}}^{2}}{2}$ , $\mathrm{Var}\left(p\right)=\frac{1}{2{{\Delta}}^{2}}$ with Δ = e^−|ξ| < 1, whereas the vacuum (or a coherent state) has $\mathrm{Var}\left(q\right)=\mathrm{Var}\left(p\right)=\frac{1}{2}$ . The convention which is used in the literature for denoting the dB of squeezing of an approximate GKP state is #dB = −10 log₁₀ Δ², see e.g. [45].
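As a quick numerical illustration of this convention, the sketch below converts Δ to dB and also evaluates the approximate mean photon number $\overline{n}\approx \frac{1}{2{{\Delta}}^{2}}-\frac{1}{2}$ quoted earlier for the $\mathfrak{E}$ -form state. The function names are illustrative.

```python
import math

def squeezing_db(delta):
    """Squeezing in dB under the convention #dB = -10 log10(delta^2)."""
    return -10 * math.log10(delta**2)

def mean_photon_number(delta):
    """Approximate mean photon number 1/(2 delta^2) - 1/2 of the model wavefunction."""
    return 1 / (2 * delta**2) - 0.5

for delta in (0.5, 0.3, 0.2):
    print(f"delta={delta}: {squeezing_db(delta):.2f} dB, n_bar ~ {mean_photon_number(delta):.2f}")
```

With this convention Δ = 1 corresponds to 0 dB, and Δ = 0.3 to roughly 10.5 dB with about five photons on average.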

We can view a GKP state as being 'squeezed' in both p and q and interpret this squeezing as the extent to which the state is an eigenstate of a unitary operator such as S_p or S_q. Since a quantum state may not fit one of the standard GKP approximations, a measure of the effective squeezing is useful in expressing the quality of the state. Since we are interested in modular values of $\hat{q}$ and $\hat{p}$ , it is appropriate to use the Holevo phase variance (or the variance of periodic variables such as phases used in circular statistics) to express this squeezing, i.e. one can define [44, 49]:

$\begin{equation}{{\Delta}}_{p/q}=\sqrt{\frac{1}{2\pi } \mathrm{ln}\left(\frac{1}{\vert \mathrm{Tr}{S}_{p/q}\rho {\vert }^{2}}\right)}.\end{equation} \tag{ 20 }$

Note that this measure does not express a logical error rate, e.g. the completely mixed state inside the perfect code space has Δ_p = Δ_q = 0.
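Equation (20) is straightforward to evaluate once the stabilizer expectation value is known. The sketch below is an illustration with a hypothetical helper `effective_squeezing`. As a consistency check it uses the fact that non-overlapping Gaussian peaks in q with variance Δ²/2 give |Tr S_q ρ| = e^{−πΔ²}, so that the measure returns the peak-width parameter Δ. That check is a calculation made here for the comb form of equation (16), not a statement taken from the references.

```python
import math

def effective_squeezing(stabilizer_expectation):
    """Effective squeezing of equation (20) from the (complex) value Tr(S rho)."""
    mag2 = abs(stabilizer_expectation) ** 2
    # Clip at zero to guard against rounding when |Tr(S rho)| is numerically 1.
    return math.sqrt(max(0.0, math.log(1 / mag2)) / (2 * math.pi))

delta = 0.3
print(effective_squeezing(math.exp(-math.pi * delta**2)))   # recovers delta
print(effective_squeezing(1.0))                             # any state in the ideal code space
```

The second call illustrates the remark above: every state in the perfect code space, pure or mixed, has stabilizer expectation 1 and hence zero effective squeezing parameter.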

2.3.2. Logical gates

An appealing feature of the GKP code is that all logical Clifford transformations are Gaussian quantum operations, realizable by optical elements [40, 45] which enact linear transformations on the operators $\hat{p}$ and $\hat{q}$ in the Heisenberg picture. Important gates such as the CNOT and S gate do however involve two-mode, respectively single-mode squeezing: the experimental realization of such squeezing transformations is typically achieved through pumped optical non-linearities. Such elements are relatively straightforward to obtain for optical fields which travel through nonlinear χ⁽²⁾ or χ⁽³⁾ materials, while for superconducting devices these elements are engineered through the use of Josephson junctions. In contrast, passive linear optical elements (beam-splitters and phase-shifters in optics language) are readily available in circuit-QED by linear capacitive or inductive (fixed) circuit couplings.

In section 3 we will discuss the engineered non-linearities in superconducting hardware which can be activated by microwave drives or flux drives, while here we discuss the logical gates for the GKP code at a formal level.

As unitary displacement operators, Z and X are not self-inverse, i.e. X ≠ X^†. On a perfect, completely shift-invariant code state X acts identically to X^†, but on a finite-photon number state, see e.g. the wave function in figure 2, it does not: a shift to the left or right moves the envelope away from the center. The Hadamard gate has Heisenberg action $\hat{p}\to -\hat{q}$ and $\hat{q}\to \hat{p}$ so that H^† XH = Z, H^† ZH = X^† and H^† YH = −Y. The Hadamard gate corresponds to a phase-space rotation by an angle π/2, i.e. we can choose $\mathrm{H}\mathrm{a}\mathrm{d}\equiv \mathrm{exp}\left(\mathrm{i}\frac{\pi }{2}{a}^{{\dagger}}a\right)$ , and note again that Had ≠ Had⁻¹. A Had gate could be done by a quarter-cycle waiting in the self-evolution of the oscillator (so comes for free).

**Figure 2.** Wigner function of the state $\mathfrak{F}\vert \overline{0}\rangle$ at Δ = 0.3, and the reduced probability distributions over q and p in black. Unlike the $\mathfrak{E}$ - and $\mathfrak{D}$ -approximation, the $\mathfrak{F}$ -approximation has a clear asymmetry with respect to p and q. Since the Wigner function has a grid-like periodic structure in phase space, the GKP states are also referred to as grid states.

A disadvantage of using such a quarter-cycle-waiting Hadamard gate in a GKP surface code architecture is discussed in section 4. The alternative is to use single-qubit rotations around the logical X, Y or Z axes to compose a Hadamard gate.

For the GKP code these rotations around logical axes, R_P(ϕ) ≡ exp(−iϕP/2) with logical Pauli P = X, Y, Z, are not natural as the logical Pauli, which is a displacement, sits in the exponent. Note also that this gate R_P(ϕ) is only unitary when acting on a subspace for which P² = I. However, one can perform R_P(ϕ) using a controlled-displacement coupling with a regular qubit and a regular qubit rotation, as shown in figure 3 and realized in [42, 50]. This circuit applies R_P(ϕ) on the space of states for which P² = I, but we can examine its effect more generally. Imagine applying the circuit in figure 3 with P = Y and ϕ = π/2. Upon outcome ±, the Kraus operator acting on the GKP qubit equals A₊ = cos(ϕ/2)I − i sin(ϕ/2)P resp. A₋ = −i sin(ϕ/2)I + cos(ϕ/2)P. On the perfect code subspace where P² = I, A₊ acts as a unitary and equals R_P(ϕ), while A₋ can be converted to R_P(ϕ) by the additional π-rotation P. On a finitely-squeezed GKP state, by contrast, these Kraus operators are not unitary and their action causes the envelope of the GKP state to no longer be centered around the vacuum. One can then apply a displacement P^−1/2 [42] to approximately re-center the GKP state.
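The algebra behind these Kraus operators can be checked on small matrices. In the sketch below the ideal code space is modelled by the 2×2 Pauli matrices, for which P² = I. The 3×3 cyclic shift is only a toy stand-in (an assumption made here for illustration) for a unitary with P² ≠ I, mimicking a displacement acting outside the ideal code space.

```python
import numpy as np

def rotation(P, phi):
    """cos(phi/2) I - i sin(phi/2) P, which equals exp(-i phi P / 2) when P^2 = I."""
    return np.cos(phi / 2) * np.eye(P.shape[0]) - 1j * np.sin(phi / 2) * P

def kraus_ops(P, phi):
    """Kraus operators A_plus and A_minus for the two ancilla outcomes."""
    eye = np.eye(P.shape[0])
    a_plus = np.cos(phi / 2) * eye - 1j * np.sin(phi / 2) * P
    a_minus = -1j * np.sin(phi / 2) * eye + np.cos(phi / 2) * P
    return a_plus, a_minus

def is_unitary(M):
    return np.allclose(M.conj().T @ M, np.eye(M.shape[0]))

paulis = {
    "X": np.array([[0, 1], [1, 0]], dtype=complex),
    "Y": np.array([[0, -1j], [1j, 0]], dtype=complex),
    "Z": np.array([[1, 0], [0, -1]], dtype=complex),
}
phi = np.pi / 2
for name, P in paulis.items():
    a_plus, a_minus = kraus_ops(P, phi)
    print(name, is_unitary(a_plus), np.allclose(P @ a_minus, rotation(P, phi)))

shift = np.roll(np.eye(3), 1, axis=0).astype(complex)      # unitary with shift^2 != I
print("toy displacement:", is_unitary(kraus_ops(shift, phi)[0]))
```

When P² = I both outcomes reproduce R_P(ϕ) after the correction by P, whereas for the toy unitary the operator A₊ fails to be unitary.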

**Figure 3.** Performing a single-qubit gate R_P(ϕ) with P = X, Y, Z on a perfect GKP qubit via a regular ancilla qubit, requiring a qubit controlled-displacement. The measurement is in the basis $\vert \phi ,{\pm}\rangle =\frac{1}{\sqrt{2}}\left({\mathrm{e}}^{\mathrm{i}\phi /2}\vert +\rangle {\pm}{\mathrm{e}}^{-\mathrm{i}\phi /2}\vert -\rangle \right)$ and upon outcome −1, P is applied.

A single-qubit gate such as the T = R_Z(π/4) gate can be done in this manner as well. The S gate with action S^† XS = −Y and S^† ZS = Z can be realized by the transformation $\hat{q}\to \hat{q}$ , $\hat{p}\to \hat{p}-\hat{q}$ corresponding to $S=\mathrm{exp}\left(-\mathrm{i}{\hat{q}}^{2}/2\right)$ .⁹ The S gate can thus be implemented by means of pump-activated squeezing, see section 3, or by using an ancilla qubit as in the circuit in figure 3. Alternative methods for performing a T gate via magic state preparation or using a cubic phase gate ${V}_{\gamma }=\mathrm{exp}\left(\mathrm{i}\gamma {\hat{q}}^{3}\right)$ exist [40]. For example, one can create a +1 eigenstate of the Hadamard gate $\mathrm{H}\mathrm{a}\mathrm{d}=\mathrm{exp}\left(\mathrm{i}\frac{\pi }{2}{a}^{{\dagger}}a\right)$ by starting with a vacuum state, which is already a +1 eigenstate of Had, and measuring S_p and S_q without photon-number changing feedback [51].

When using GKP qubits as basic qubits in a surface code, see section 4, we note that T and S gates are not needed for error correction: their only use is to prepare magic GKP ancilla qubits to be grown into the surface code-encoded magic states using GKP CZ and CNOT gates or parity check measurements, see e.g. [52] and references therein.

The CNOT gate can be realized by the Heisenberg action ${\hat{q}}_{c}\to {\hat{q}}_{c}$ , ${\hat{p}}_{c}\to {\hat{p}}_{c}-{\hat{p}}_{t}$ , ${\hat{q}}_{t}\to {\hat{q}}_{c}+{\hat{q}}_{t}$ and ${\hat{p}}_{t}\to {\hat{p}}_{t}$ . This gate is also called the SUM gate in [40] and SUM(g) with g = 1 in [45]. We see that $\mathrm{C}\mathrm{N}\mathrm{O}\mathrm{T}=\mathrm{exp}\left(-\mathrm{i}{\hat{p}}_{t}{\hat{q}}_{c}\right)$ by using equation (2) with $v={\hat{p}}_{t}$ and $u={\hat{q}}_{c}$ . The inverse CNOT has action ${\hat{q}}_{c}\to {\hat{q}}_{c}$ , ${\hat{p}}_{c}\to {\hat{p}}_{c}+{\hat{p}}_{t}$ , ${\hat{q}}_{t}\to {\hat{q}}_{t}-{\hat{q}}_{c}$ and ${\hat{p}}_{t}\to {\hat{p}}_{t}$ .

We define the action of the CZ gate as ${\mathrm{H}\mathrm{a}\mathrm{d}}_{t} \mathrm{C}\mathrm{N}\mathrm{O}\mathrm{T} {\mathrm{H}\mathrm{a}\mathrm{d}}_{t}^{{\dagger}}$ where Had_t is a Hadamard gate on the target mode. That is, it enacts the transformation ${\hat{q}}_{t}\to {\hat{q}}_{t}$ , ${\hat{p}}_{t}\to {\hat{p}}_{t}-{\hat{q}}_{c}$ , ${\hat{q}}_{c}\to {\hat{q}}_{c}$ , ${\hat{p}}_{c}\to {\hat{p}}_{c}-{\hat{q}}_{t}$ , or $\mathrm{C}\mathrm{Z}=\mathrm{exp}\left(-\mathrm{i}{\hat{q}}_{t}{\hat{q}}_{c}\right)$ . If either oscillator is in a state where q is an even multiple of $\sqrt{\pi }$ , then CZ acts as exp(−iπ2k) = 1. If both oscillators are in a state where q is an odd multiple of $\sqrt{\pi }$ , then CZ acts as exp(−iπ(2n + 1)(2k + 1)) = −1 for $n,k\in \mathbb{Z}$ .
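Since these gates act linearly on the quadratures, their Heisenberg actions can be collected in 4×4 matrices on the vector (q_c, p_c, q_t, p_t) and composed by matrix multiplication. The sketch below checks that the CNOT and Hadamard actions stated above are symplectic and that conjugating the CNOT by a Hadamard on the target gives the CZ action. The ordering convention, writing U^† x U = M x so that a product U₁U₂ corresponds to M₁M₂, is a choice made for this illustration.

```python
import numpy as np

J = np.array([[0.0, 1.0], [-1.0, 0.0]])
OMEGA = np.kron(np.eye(2), J)       # [x_i, x_j] = i OMEGA_ij for x = (q_c, p_c, q_t, p_t)

# Rows give the image of q_c, p_c, q_t, p_t under the CNOT (SUM) gate.
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, -1],
                 [1, 0, 1, 0],
                 [0, 0, 0, 1]], dtype=float)
CNOT_INV = np.array([[1, 0, 0, 0],
                     [0, 1, 0, 1],
                     [-1, 0, 1, 0],
                     [0, 0, 0, 1]], dtype=float)
# Hadamard on the target mode only: q_t -> p_t, p_t -> -q_t.
HAD_T = np.block([[np.eye(2), np.zeros((2, 2))],
                  [np.zeros((2, 2)), J]])
CZ_EXPECTED = np.array([[1, 0, 0, 0],
                        [0, 1, -1, 0],
                        [0, 0, 1, 0],
                        [-1, 0, 0, 1]], dtype=float)

def is_symplectic(M):
    """Linear quadrature maps preserve the commutators iff M OMEGA M^T = OMEGA."""
    return np.allclose(M @ OMEGA @ M.T, OMEGA)

CZ = HAD_T @ CNOT @ np.linalg.inv(HAD_T)    # Had_t CNOT Had_t^dagger in this convention
print(is_symplectic(CNOT), is_symplectic(HAD_T), np.allclose(CZ, CZ_EXPECTED))
```

The same matrices also confirm that the stated inverse CNOT action is the matrix inverse of the CNOT action.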

Sections 3.3 and 3.4 will discuss how the GKP CZ gate between two GKP modes can be executed using a three-wave or four-wave mixing element. There is however another circuit to perform a CNOT gate which uses a sequence of beam-splitters and some single-mode squeezing [41, 45] which can be more useful in some circumstances, see figure 4. For the CNOT gate the mode transformation on control (c) and target (t) mode equals

$\begin{equation}\begin{pmatrix}{a}_{c}^{\text{out}}\\ {a}_{t}^{\text{out}}\end{pmatrix}=A\begin{pmatrix}{a}_{c}\\ {a}_{t}\end{pmatrix}+B\begin{pmatrix}{a}_{c}^{{\dagger}}\\ {a}_{t}^{{\dagger}}\end{pmatrix},\end{equation} \tag{ 21 }$

with

$\begin{equation}A=\begin{pmatrix}\hfill 1\hfill & \hfill -\frac{1}{2}\hfill \\ \hfill \frac{1}{2}\hfill & \hfill 1\hfill \end{pmatrix},\quad B=\begin{pmatrix}\hfill 0\hfill & \hfill \frac{1}{2}\hfill \\ \hfill \frac{1}{2}\hfill & \hfill 0\hfill \end{pmatrix}.\end{equation} \tag{ 22 }$

By the Bloch–Messiah decomposition [53] the singular value decompositions are A = UD_A V^† and B = UD_B V^T with unitary matrices U and V. For the CNOT gate the singular values are degenerate: ${D}_{A}=\mathrm{diag}\left(\frac{\sqrt{5}}{2},\frac{\sqrt{5}}{2}\right)$ and ${D}_{B}=\mathrm{diag}\left(\frac{1}{2},\frac{1}{2}\right)$ , implying that the beam-splitting transformations U and V are not unique. Reference [53] notes that 50:50 beam-splitters with

$\begin{equation}U={U}_{\text{BS}}\equiv \frac{1}{\sqrt{2}}\begin{pmatrix}\hfill \mathrm{i}{\mathrm{e}}^{\mathrm{i}\theta }\hfill & \hfill \mathrm{i}{\mathrm{e}}^{-\mathrm{i}\theta }\hfill \\ \hfill -{\mathrm{e}}^{\mathrm{i}\theta }\hfill & \hfill {\mathrm{e}}^{-\mathrm{i}\theta }\hfill \end{pmatrix},\quad V={V}_{\text{BS}}\equiv \begin{pmatrix}\hfill 0\hfill & \hfill 1\hfill \\ \hfill 1\hfill & \hfill 0\hfill \end{pmatrix}{U}^{{\ast}},\end{equation} \tag{ 23 }$

with $\theta =\frac{1}{2}{\mathrm{sin}}^{-1}\left(2/\sqrt{5}\right)$ can be chosen (while [45] makes a different choice). We see that the single-mode squeezing represented by the diagonal matrices corresponds to a squeezer Sq(ξ) with $\xi =-{\mathrm{cosh}}^{-1}\left(\sqrt{5}/2\right)$ .
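The numbers quoted in equations (22) and (23) can be cross-checked numerically. The following NumPy sketch recomputes the singular values of A and B, the squeezing parameter ξ, and tests whether the 50:50 beam-splitter matrices reproduce A and B. The variable names are illustrative, and the diagonal matrices are taken as cosh|ξ| and sinh|ξ| times the identity, which is an assumption about how the squeezer enters; it is a sanity check, not part of the construction in [53].

```python
import numpy as np

# Mode-mixing matrices of the CNOT gate, equation (22).
A = np.array([[1.0, -0.5], [0.5, 1.0]])
B = np.array([[0.0, 0.5], [0.5, 0.0]])

# Singular values of A and B (both spectra are degenerate).
sing_A = np.linalg.svd(A, compute_uv=False)
sing_B = np.linalg.svd(B, compute_uv=False)

# Squeezing strength: cosh|xi| = sqrt(5)/2 and sinh|xi| = 1/2.
xi = -np.arccosh(np.sqrt(5) / 2)

# 50:50 beam-splitters of equation (23).
theta = 0.5 * np.arcsin(2 / np.sqrt(5))
U = np.array([[1j * np.exp(1j * theta), 1j * np.exp(-1j * theta)],
              [-np.exp(1j * theta), np.exp(-1j * theta)]]) / np.sqrt(2)
V = np.array([[0, 1], [1, 0]]) @ U.conj()

# Assumed form of the diagonal (single-mode squeezing) factors.
D_A = np.cosh(abs(xi)) * np.eye(2)
D_B = np.sinh(abs(xi)) * np.eye(2)

A_rebuilt = U @ D_A @ V.conj().T
B_rebuilt = U @ D_B @ V.T

print("singular values of A:", sing_A)
print("singular values of B:", sing_B)
print("xi =", xi)
print("A reproduced:", np.allclose(A_rebuilt, A))
print("B reproduced:", np.allclose(B_rebuilt, B))
```

Because the singular values are degenerate, other choices of U and V would pass the same check, in line with the remark that the beam-splitting transformations are not unique.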

**Figure 4.** The realization of a CNOT via 50:50 beam-splitters, i.e. V_BS and U_BS defined in equation (23), and single-mode squeezing Sq(ξ) with ξ ≈ −0.4812.

It is clear that logical gates are not unique as physical operations as they only have to perform the right action on the code space. Reference [45] has discussed how logical gates propagate or amplify errors on the approximate GKP code states. Keeping the (average) number of photons in an approximate GKP state low, by centering the state symmetrically around the vacuum, emerges as a good overall strategy to minimize the propagation of errors and the effect of the inaccurate action of gates.

2.3.3. Noise on a GKP qubit

A simple numerically convenient noise channel, playing the role of depolarizing channel for an oscillator, is the independent Gaussian displacement channel $\mathcal{N}\left(\rho \right)$ with standard deviation σ₀:

$\begin{equation}\mathcal{N}\left(\rho \right)={\int }_{-\infty }^{\infty } \mathrm{d}u{\int }_{-\infty }^{\infty } \mathrm{d}v {\mathbb{P}}_{{\sigma }_{0}}\left(u\right){\mathbb{P}}_{{\sigma }_{0}}\left(v\right){\mathrm{e}}^{\mathrm{i}u\hat{p}+\mathrm{i}v\hat{q}}\rho {\mathrm{e}}^{-\mathrm{i}u\hat{p}-\mathrm{i}v\hat{q}}.\end{equation} \tag{ 24 }$

Here ρ is a single-mode density matrix and ${\mathbb{P}}_{{\sigma }_{0}}\left(x\right)$ the Gaussian probability density function with mean zero and variance ${\sigma }_{0}^{2}$ , i.e. ${\mathbb{P}}_{{\sigma }_{0}}\left(x\right)={\left(2\pi {\sigma }_{0}^{2}\right)}^{-1/2}{\mathrm{e}}^{-{x}^{2}/2{\sigma }_{0}^{2}}$ . This channel does not naturally correspond to physical sources of noise, but (1) one can convert photon loss via amplification to this channel [22], and (2) one can 'displacement twirl' noise so that the effective channel is that of a probabilistic mixture of displacements [54]. The exact displacement twirl is not a physical operation as it requires large displacements, so this type of modeling should be considered less justified than in the qubit Pauli case when we use a depolarizing noise model through a Pauli twirling approximation.

It is thus of interest to study how realistic noise affects the approximate GKP states beyond this toy model. We will explore the question of stochastic Gaussian displacement noise versus coherent finite-squeezing error during quantum error correction in the next section 2.4. In this section we describe the interesting effect of photon loss on a GKP qubit using Wigner function dynamics [42], and mention some literature discussing other sources of noise.

An oscillator state undergoing photon loss at rate κ can be described, in a rotating frame at its resonant frequency, by a Lindblad equation $\dot {\rho }=\kappa \mathcal{D}\left(a\right)\left(\rho \right)$ for the density matrix ρ. Here we assume that the thermal environment which induces this photon loss is at zero temperature, hence there are no photon gain processes. Alternatively, and conveniently, one describes this dynamics through differential equations using phase-space probability distributions such as the Wigner function. The Wigner function $W\left(q,p,t\right)\equiv \frac{1}{2\pi }\int \mathrm{d}x {\mathrm{e}}^{-\mathrm{i}px}\langle q+\frac{x}{2}\vert \rho \left(t\right)\vert q-\frac{x}{2}\rangle$ for the photon loss dynamics can be shown to obey a two-dimensional Fokker–Planck equation, see [9, 42, 55]

$\begin{equation*}\frac{\partial W\left(q,p,t\right)}{\partial t}=\frac{\kappa }{2}\left(\frac{\partial }{\partial q}\left(qW\left(q,p,t\right)\right)+\frac{\partial }{\partial p}\left(pW\left(q,p,t\right)\right)+\frac{1}{2}\left(\frac{{\partial }^{2}W\left(q,p,t\right)}{\partial {p}^{2}}+\frac{{\partial }^{2}W\left(q,p,t\right)}{\partial {q}^{2}}\right)\right).\end{equation*}$

This Fokker–Planck equation describes a process of diffusion—a spread in the variance of the variables p and q to the vacuum noise variance equal to 1/2—and drift, i.e. the mean values of p and q flow towards 0. Instead of considering the Wigner function dynamics, we can integrate over, say, p and consider the corresponding Fokker–Planck equation for the probability distribution $P\left(q,t\right){=\int }_{\mathbb{R}}\mathrm{d}p W\left(q,p,t\right)$ , which has the solution:

$\begin{align}\hfill P\left(q,t\right)=& \int \mathrm{d}{q}^{\prime }{P}_{\text{trans}}\left(q,t\vert {q}^{\prime },0\right)P\left({q}^{\prime },t=0\right),\hfill \\ \hfill {P}_{\text{trans}}\left(q,t\vert {q}^{\prime },0\right)=& \sqrt{\frac{1}{2\pi {\sigma }^{2}\left(t\right)}} \mathrm{exp}\left(-\frac{{\left(q-{q}^{\prime } {\mathrm{e}}^{-\kappa t/2}\right)}^{2}}{2{\sigma }^{2}\left(t\right)}\right),\hfill \\ \hfill {\sigma }^{2}\left(t\right)=& \frac{1}{2}\left(1-\mathrm{exp}\left(-\kappa t\right)\right).\hfill \end{align} \tag{ 25 }$

In figure 5 we plot the effect of photon loss on a normalized state $\mathfrak{F}\vert \overline{0}\rangle$ with Δ = 0.3 for κt = 0.1, 0.5 and 1.

We can consider the expectation of a stabilizer or logical $\overline{Z}$ over time, i.e. we consider $\mathrm{Tr} {\mathrm{e}}^{\mathrm{i}\alpha \hat{q}}\rho \left(t\right)=\int \mathrm{d}qP\left(q,t\right){\mathrm{e}}^{\mathrm{i}\alpha q}$ with $\alpha =\sqrt{\pi }$ or $2\sqrt{\pi }$ . Using Gaussian integration and equation (25) this gives

$\begin{equation}\mathrm{Tr} {\mathrm{e}}^{\mathrm{i}\alpha \hat{q}}\rho \left(t\right)=\left(\sqrt{\frac{1}{2\pi {\sigma }^{2}\left(t\right)}}\int \mathrm{d}q {\mathrm{e}}^{-\frac{{q}^{2}}{2{\sigma }^{2}\left(t\right)}} {\mathrm{e}}^{\mathrm{i}\alpha q}\right)\mathrm{Tr} {\mathrm{e}}^{\mathrm{i}\alpha \left(t\right)\hat{q}}\rho \left(0\right)={\mathrm{e}}^{-\frac{1}{4}\left({\alpha }^{2}\left(0\right)-{\alpha }^{2}\left(t\right)\right)}\mathrm{Tr} {\mathrm{e}}^{\mathrm{i}\alpha \left(t\right)\hat{q}}\rho \left(0\right),\end{equation} \tag{ 26 }$

with $\alpha \left(t\right)=\alpha {\mathrm{e}}^{-\kappa t/2}$ . On the right-hand side, we see an exponential decrease as well as a direct dependence on the expectation value of a displacement operator with exponentially shrinking shift on the initial state. When the initial state ρ(0) is invariant under q → −q, we can replace $\mathrm{Tr} {\mathrm{e}}^{\mathrm{i}\alpha \left(t\right)\hat{q}}\rho \left(0\right)$ by $\mathrm{Tr}\mathrm{cos}\left(\alpha \left(t\right)\hat{q}\right)\rho \left(0\right)$ . Thus when symmetrically centering the state in phase-space the expectation values of the stabilizer or logical $\overline{Z}$ remain real at all times. In addition, when the initial state is an approximate logical $\vert \overline{0}\rangle$ such as $\mathfrak{F}\vert \overline{0}\rangle$ , the expectation value $\langle \overline{Z}\left(t\right)\rangle {\geqslant}0$ at all times as shown for a few points in figure 5 on the right. This is interesting as it shows that $\vert \overline{0}\rangle$ 'never looks more like a $\vert \overline{1}\rangle$ than a $\vert \overline{0}\rangle$ ' under photon loss. The state $\mathfrak{F}\vert \overline{1}\rangle$ whose decay is plotted in figure 6 starts at $\overline{Z}\left(t=0\right){< }0$ and eventually, for large enough t, $\overline{Z}\left(t\right){ >}0$ as the final state is the vacuum centered around q = 0. This asymmetry in the effect of photon loss on $\vert \overline{0}\rangle$ versus $\vert \overline{1}\rangle$ is reminiscent of a logical amplitude-damping channel.
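Equation (26) can be checked numerically by propagating an initial distribution with the transition kernel of equation (25) and comparing against the closed form. The sketch below does this on a grid for a toy approximate $\vert \overline{0}\rangle$ density. The function names, the grid, and the specific sum-of-Gaussians form of the initial density are assumptions made for illustration and are not taken from the simulations behind figure 5.

```python
import numpy as np

SQRT_PI = np.sqrt(np.pi)

def gkp0_density(q, delta=0.3, k_max=5):
    """Toy |psi(q)|^2 for an approximate GKP |0> (assumed form): Gaussian peaks
    of width delta at even multiples of sqrt(pi) under a Gaussian envelope."""
    k = np.arange(-k_max, k_max + 1)[:, None]
    psi = np.sum(np.exp(-2 * delta**2 * np.pi * k**2)
                 * np.exp(-(q[None, :] - 2 * k * SQRT_PI) ** 2 / (2 * delta**2)),
                 axis=0)
    density = psi**2
    return density / (np.sum(density) * (q[1] - q[0]))

def transition_kernel(q, q0, kappa_t):
    """P_trans(q, t | q0, 0) of equation (25), as a function of kappa * t."""
    var = 0.5 * (1.0 - np.exp(-kappa_t))
    mean = q0 * np.exp(-kappa_t / 2)
    return np.exp(-(q - mean) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)

def expectation_both_ways(alpha, kappa_t, q, p0):
    """Return (grid-propagated, closed-form) values of Tr exp(i alpha q) rho(t)."""
    dq = q[1] - q[0]
    # Left-hand side: propagate P(q, 0), then integrate against exp(i alpha q).
    p_t = transition_kernel(q[:, None], q[None, :], kappa_t) @ p0 * dq
    lhs = np.sum(p_t * np.exp(1j * alpha * q)) * dq
    # Right-hand side of equation (26): damping factor times a shrunk displacement.
    alpha_t = alpha * np.exp(-kappa_t / 2)
    rhs = (np.exp(-(alpha**2 - alpha_t**2) / 4)
           * np.sum(p0 * np.exp(1j * alpha_t * q)) * dq)
    return lhs, rhs

q_grid = np.linspace(-24.0, 24.0, 2401)
p_initial = gkp0_density(q_grid)
for kappa_t in (0.1, 0.5, 1.0):
    lhs, rhs = expectation_both_ways(SQRT_PI, kappa_t, q_grid, p_initial)
    print(f"kappa*t = {kappa_t}: grid = {lhs.real:.6f}, closed form = {rhs.real:.6f}")
```

Since the toy density is symmetric under q → −q, the imaginary parts vanish up to rounding, which mirrors the statement about symmetrically centered states.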

**Figure 6.** The probability distribution P₁(q) of the state $\mathfrak{F}\vert \overline{1}\rangle$ at Δ = 0.3 undergoing photon loss. We observe that ${\langle \overline{Z}\rangle }_{1}$ moves from a negative initial value to a final positive value as the state moves to the vacuum state.

Now assume that the initial state is displaced away from its centered location by, say, a stabilizer shift ${S}_{p}^{m}$ which does not affect its initial eigenvalue for $\overline{Z}$ . Using equation (26) we get

$\begin{equation}\langle \overline{Z}\left(t\right)\rangle ={\mathrm{e}}^{2\pi \mathrm{i}m{\mathrm{e}}^{-\kappa t/2}} {\mathrm{e}}^{-\frac{1}{4}\left({\alpha }^{2}\left(0\right)-{\alpha }^{2}\left(t\right)\right)}\mathrm{Tr} {\mathrm{e}}^{\mathrm{i}\alpha \left(t\right)\hat{q}}\rho \left(0\right),\end{equation} \tag{ 27 }$

which shows that the expectation value can now become complex, but is not faster decaying in its absolute value. When m is large, we see that the additional phase changes rapidly in time, so that the expectation can rapidly change from positive to negative. However, if we know m and κ and it is the only source of noise, this phase change can be treated as a systematic error. Note that if we had applied an arbitrary but known displacement ${\mathrm{e}}^{\mathrm{i}u\hat{p}}$ on the initial state, the effect would have been similar.

Going beyond photon loss, other sources of inaccuracy and error could also readily be described using dynamics of the Wigner function. A Lindblad equation dynamics of an n-mode system for which the Hamiltonian is quadratic in creation and annihilation operators (beam-splitting, squeezing etc) or linear (driving terms ∼a + a^† enacting displacements) while the dissipator models photon loss or photon gain, can be mapped to a Fokker–Planck equation of a general solvable form:

$\begin{align}\hfill & \frac{\partial }{\partial t}W\left({q}_{1},{p}_{1},\dots ,{q}_{n},{p}_{n}\right)=\left(-\vec{\nabla }\cdot \left(A\vec{x}\right)+\frac{1}{2}\vec{\nabla }\cdot D\vec{\nabla }\right)W\left({q}_{1},{p}_{1},\dots ,{q}_{n},{p}_{n}\right),\hfill \\ \hfill & {\vec{x}}^{T}=\left({q}_{1},{p}_{1},\dots ,{q}_{n},{p}_{n}\right),\quad {\vec{\nabla }}^{T}=\left(\frac{\partial }{\partial {q}_{1}},\frac{\partial }{\partial {p}_{1}},\dots ,\frac{\partial }{\partial {q}_{n}},\frac{\partial }{\partial {p}_{n}}\right),\hfill \end{align} \tag{ 28 }$

with constant 2n × 2n matrices A and D. This general behavior follows from the fact that every term in a Lindblad equation which is linear in a or a^† (e.g. aρ) gives rise to a first-order derivative in the differential equation for the Wigner function (plus a term which is linear in $\hat{p}$ and $\hat{q}$ ) [9, 55], so that terms quadratic in a and a^† (e.g. a^† aρ) give second-order derivatives. The Gaussian Green's function for equation (28) can be readily given, basically forming a multi-dimensional analog of equation (25), see [9]. All these Gaussian processes keep an initially nonnegative Wigner function nonnegative and hence are simulatable by stochastic means.

On the other hand, nonlinear elements such as a self-Kerr nonlinearity $-K{{a}^{{\dagger}}}^{2}{a}^{2}$ lead to third-order derivatives in the differential equation for the Wigner function, as well as terms in which A is not constant (corresponding to a so-called nonlinear Fokker–Planck equation): the upshot is that the Wigner function can become negative and non-classical during the dynamics and attempts at classical stochastic simulation will suffer from the sign problem. As an example, reference [56] discusses Wigner function dynamics for a single oscillator in the presence of a self-Kerr nonlinearity and dissipation.

Dephasing, meaning the application of a rotation ${\mathrm{e}}^{\mathrm{i}\theta {a}^{{\dagger}}a}$ with unknown θ, is a possible error mechanism as it rotates the quadratures $\hat{p}$ and $\hat{q}$ into each other. Dephasing can come about, for example, from an interplay of photon loss and a Kerr nonlinearity, or a fluctuating mode frequency. In a simple stochastic model the angle θ is drawn from a distribution $\mathbb{P}\left(\theta \right)$ with mean ⟨θ⟩ = 0 and some moments ⟨θ^k⟩. For small higher-order moments ⟨θ^k⟩ ≪ 1 for k > 2, we can expand

$\begin{equation}{\mathcal{N}}_{\text{deph},\langle {\theta }^{2}\rangle }\left(\rho \right)=\int \mathrm{d}\theta \mathbb{P}\left(\theta \right){\mathrm{e}}^{\mathrm{i}\theta {a}^{{\dagger}}a}\rho {\mathrm{e}}^{-\mathrm{i}\theta {a}^{{\dagger}}a}\approx \rho +\langle {\theta }^{2}\rangle {a}^{{\dagger}}a\rho {a}^{{\dagger}}a-\frac{1}{2}\langle {\theta }^{2}\rangle \left({\left({a}^{{\dagger}}a\right)}^{2}\rho +\rho {\left({a}^{{\dagger}}a\right)}^{2}\right)+O\left(\langle {\theta }^{3}\rangle \right).\end{equation} \tag{ 29 }$

This is a dephasing channel which corresponds to the dynamics of a Lindblad equation $\dot {\rho }={\kappa }_{\text{deph}}\mathcal{D}\left({a}^{{\dagger}}a\right)\rho$ for a short time with κ_deph t = ⟨θ²⟩ ≪ 1. The fixed points of this equation are the mixtures of Fock states |n⟩⟨n|; when the initial state is ∑_n c_n|n⟩ the channel maps it onto ∑_n|c_n|²|n⟩⟨n|. In appendix A we evaluate the photon number distribution of such fully-dephased $\mathfrak{D}\vert \overline{0}\rangle$ and $\mathfrak{D}\vert \overline{1}\rangle$ . We prove that the photon number distribution is asymptotically thermal, independent of the logical state. Hence complete dephasing seems to wash out much of the distinction between the two logical GKP states.
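The short-time action of the dissipator $\mathcal{D}\left({a}^{{\dagger}}a\right)$ and the fully-dephased limit can both be illustrated in a truncated Fock basis. The sketch below assumes, purely for illustration, that θ is Gaussian distributed with zero mean, in which case the averaged rotation multiplies the matrix element ρ_mn by exp(−⟨θ²⟩(m − n)²/2). The function names and the truncation dimension are hypothetical.

```python
import numpy as np

def dephase_exact(rho, theta_var):
    """Average of exp(i theta n) rho exp(-i theta n) over a zero-mean Gaussian
    theta (an assumed distribution) with variance theta_var, in the Fock basis."""
    n = np.arange(rho.shape[0])
    diff = n[:, None] - n[None, :]
    return rho * np.exp(-0.5 * theta_var * diff**2)

def dephase_second_order(rho, theta_var):
    """Second-order expansion: rho + <theta^2> (n rho n - (n^2 rho + rho n^2) / 2),
    i.e. one short step of the dissipator D(a^dagger a)."""
    num = np.diag(np.arange(rho.shape[0], dtype=float))
    num2 = num @ num
    return rho + theta_var * (num @ rho @ num - 0.5 * (num2 @ rho + rho @ num2))

dim = 6  # hypothetical Fock-space truncation
amplitudes = np.ones(dim) / np.sqrt(dim)
rho0 = np.outer(amplitudes, amplitudes.conj())

small_var = 1e-3
deviation = np.max(np.abs(dephase_exact(rho0, small_var)
                          - dephase_second_order(rho0, small_var)))
fully_dephased = dephase_exact(rho0, 1e3)

print("max deviation of the expansion:", deviation)
print("fully dephased state is diagonal:",
      np.allclose(fully_dephased, np.diag(np.diag(rho0))))
```

For small variance the two maps agree up to terms of order ⟨θ²⟩², while for large variance only the Fock-state populations |c_n|² survive.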

Reference [22] has discussed the detrimental effect of a Kerr nonlinearity on a variety of single-mode bosonic codes. Numerical simulations of several sources of inaccuracies on GKP state preparation using an ancilla qubit were also discussed in e.g. [42, 44, 45, 50, 57] using Lindblad equation dynamics.

2.4. Repeated GKP error correction and decoding: finite squeezing

In this section we examine the effect of (coherent) finite-squeezing errors on repeated GKP error correction using GKP ancillas. This is follow-up work to reference [48], in which a similar problem was examined using a stochastic Gaussian displacement error model, equation (24), applied to GKP ancilla and data qubits as a proxy for finite-squeezing errors. The goal of this section is to understand whether there are crucial differences between finite-squeezing coherent errors and the Gaussian displacement error model, and to try to develop a dedicated, computationally-efficient decoder with good performance.

The dynamics to be analyzed is the repeated execution of the quantum circuit in figure 7 on a single GKP input state $\mathfrak{F}\left({\Delta}\right)\vert \overline{\psi }\rangle$ for m = 1, ..., M cycles. We remark that a variant of such 'Steane error correction' exists: in [58] the authors observed that applying a beam-splitter between GKP ancilla and GKP data qubit followed by squeezing on the GKP data qubit, can also perform error correction. Reference [59] has analyzed the repeated execution of this variant of error correction in more detail.

**Figure 7.** A single round of fault-tolerant GKP error correction for both logical X and Z errors. Each measurement is a perfect homodyne measurement of $\hat{q}$ or $\hat{p}$ with outcomes $\mathfrak{q}$ and $\mathfrak{p}$ respectively. The finitely-squeezed ancilla states are modeled as approximate GKP states, using a slightly-different small-Δ approximation than $\mathfrak{E},\mathfrak{D},\mathfrak{F}$ in equations (14), (15) and (17), given in equation (B.2) and denoted as ${\mathfrak{F}}_{V}$ . We show that the optional corrective displacement (dashed box) keeps the state at low average photon number $\overline{n}$ in figure 9.

A clear difference between a stochastic error model and the finite-squeezing model is that in the former entropy build-up is possible, while in the latter the state conditioned on the measurement outcomes in figure 7 is pure at all times. One can invoke displacement twirling as a method to convert a coherent noise model in which one applies a superposition of displacements to a stochastic mixture of displacements. For example, displacement twirling a finitely-squeezed state $\mathfrak{E}\vert \overline{\psi }\rangle$ with some Δ gives a perfect state $\vert \overline{\psi }\rangle$ subject to the Gaussian displacement channel with ${{\Delta}}^{2}=2{\sigma }_{0}^{2}$ [48]. After such stochastification of the noise on a GKP ancilla, one can then represent the feedback error (a shift in one of the quadratures) induced by the ancilla in the circuit in figure 7 effectively as an incoming stochastic shift error on the data qubit. The stochastic shift of the other quadrature of the ancilla then causes a measurement error of the same strength. On this basis, reference [48] stochastically modeled finite-squeezing errors as incoming stochastic displacement errors on the GKP data qubit and measurement errors. Another difference in the models is that in the finite-squeezing error model the measurement outcomes carry non-modular information about the measured quadrature. This can be exploited to recenter the state by choosing a corrective displacement immediately after a single round of error correction, while such corrective displacements would have no effect in the stochastic model.

We will represent the GKP wave-function in the q-basis with ψ_m(q), the wavefunction after m rounds of error correction. We will sometimes omit the normalization of states when these normalizations play no role.

We will now analyze the time evolution without the corrective displacement in the dashed box in figure 7. A single round of quantum error correction shown in figure 7 with measurement outcomes ${\mathfrak{p}}_{m},{\mathfrak{q}}_{m}$ gives ${\psi }_{m}\left(q\right)=\int \mathrm{d}{q}^{\prime }G\left(q{\leftarrow}{q}^{\prime }\vert {\mathfrak{q}}_{m},{\mathfrak{p}}_{m}\right){\psi }_{m-1}\left({q}^{\prime }\right)$ with Green's function

$\begin{align}\hfill G\left(q{\leftarrow}{q}^{\prime }\vert {\mathfrak{q}}_{m},{\mathfrak{p}}_{m}\right)& =\int \mathrm{d}{q}^{{\prime\prime}}{G}_{+}\left(q{\leftarrow}{q}^{{\prime\prime}}\vert {\mathfrak{q}}_{m}\right){G}_{0}\left({q}^{{\prime\prime}}{\leftarrow}{q}^{\prime }\vert {\mathfrak{p}}_{m}\right)\hfill \\ \hfill & ={\psi }^{+}\left(q-{\mathfrak{q}}_{m}\right){\psi }^{0}\left(q-{q}^{\prime }\right){\mathrm{e}}^{-\mathrm{i}{\mathfrak{p}}_{m}\left(q-{q}^{\prime }\right)},\hfill \end{align} \tag{ 30 }$

using ${G}_{0}\left({q}^{{\prime\prime}}{\leftarrow}{q}^{\prime }\vert {\mathfrak{p}}_{m}\right)={\psi }^{0}\left({q}^{{\prime\prime}}-{q}^{\prime }\right){\mathrm{e}}^{-\mathrm{i}{\mathfrak{p}}_{m}\left({q}^{{\prime\prime}}-{q}^{\prime }\right)}$ (Z-error correction) followed by ${G}_{+}\left(q{\leftarrow}{q}^{{\prime\prime}}\vert {\mathfrak{q}}_{m}\right)=\delta \left({q}^{{\prime\prime}}-q\right){\psi }^{+}\left(q-{\mathfrak{q}}_{m}\right)$ (X-error correction). To understand this Green's function, observe that in the limit Δ → 0, the wavefunction ψ⁺(q) has uniform support on $q=k\sqrt{\pi }$ , with $k\in \mathbb{Z}$ , so that the outgoing wave function is supported solely on $q={\mathfrak{q}}_{m}+k\sqrt{\pi }$ , hence the code state sits in the perfect code space with a known shift on top. However, before this, the interaction with the imperfect $\vert \overline{0}\rangle$ ancilla for the Z-error correction applies a convolution to the incoming wavefunction. If the ancilla is perfect (Δ → 0), this convolution amounts to applying superpositions of stabilizer shifts ${S}_{p}^{k}$ with $2k\sqrt{\pi }=q-{q}^{\prime }$ , each with a phase which depends on ${\mathfrak{p}}_{m}$ , on the incoming wave-function. If we assume that all wavefunctions are of the form $\mathfrak{F}\vert \overline{0}\rangle$ , i.e. sums of Gaussians, the convolution leads again to a sum of Gaussians and can be exactly evaluated, that is, one has

$\begin{align}\hfill & \int \mathrm{d}{q}^{\prime }{\psi }^{0}\left(q-{q}^{\prime }\right){\mathrm{e}}^{-\mathrm{i}{\mathfrak{p}}_{m}\left(q-{q}^{\prime }\right)}{\psi }^{0}\left({q}^{\prime }\right)\hfill \\ \hfill & \quad \propto \sum _{k,{k}^{\prime }\in \mathbb{Z}}{\mathrm{e}}^{-2{{\Delta}}^{2}\pi \left({k}^{\prime 2}+{k}^{2}\right)}\int \mathrm{d}{q}^{\prime } {\mathrm{e}}^{-\mathrm{i}{\mathfrak{p}}_{m}\left(q-{q}^{\prime }\right)} {\mathrm{e}}^{-\frac{1}{2{{\Delta}}^{2}}\left[{\left({q}^{\prime }-2{k}^{\prime }\sqrt{\pi }\right)}^{2}+{\left(q-{q}^{\prime }-2k\sqrt{\pi }\right)}^{2}\right]}\hfill \\ \hfill & \quad =\sqrt{\pi {{\Delta}}^{2}} {\mathrm{e}}^{-{{\Delta}}^{2}{\mathfrak{p}}_{m}^{2}/4}\sum _{k,{k}^{\prime }\in \mathbb{Z}}{\mathrm{e}}^{-2{{\Delta}}^{2}\pi \left({k}^{\prime 2}+{k}^{2}\right)} {\mathrm{e}}^{-\frac{{\left(q-2\sqrt{\pi }\left(k+{k}^{\prime }\right)\right)}^{2}}{4{{\Delta}}^{2}}} {\mathrm{e}}^{-\mathrm{i}\frac{{\mathfrak{p}}_{m}}{2}\left(q+2\sqrt{\pi }\left(k-{k}^{\prime }\right)\right)}.\hfill \end{align} \tag{ 31 }$

What we observe is that the convolution broadens the peaks and they acquire phases which depend on the location of the peaks and the outcome ${\mathfrak{p}}_{m}$ . The convolution step, which corrects shifts in p, thus introduces a feedback error in the form of peak broadening for the q variable.
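The Gaussian integral behind equation (31) can be verified term by term. The sketch below evaluates a single (k, k′) term of the convolution on a grid and compares it with the corresponding closed-form term; the common weight exp(−2Δ²π(k′² + k²)) is omitted on both sides. The function names, the grid parameters and the sample values of q and the outcome are arbitrary choices made for illustration.

```python
import numpy as np

SQRT_PI = np.sqrt(np.pi)

def convolution_term_numeric(q, p_m, k, k_prime, delta,
                             half_width=40.0, n_points=40001):
    """Grid integral over q' of one (k, k') term in the second line of (31)."""
    q_prime = np.linspace(-half_width, half_width, n_points)
    dq = q_prime[1] - q_prime[0]
    integrand = (np.exp(-1j * p_m * (q - q_prime))
                 * np.exp(-((q_prime - 2 * k_prime * SQRT_PI) ** 2
                            + (q - q_prime - 2 * k * SQRT_PI) ** 2)
                          / (2 * delta**2)))
    return np.sum(integrand) * dq

def convolution_term_closed(q, p_m, k, k_prime, delta):
    """Closed form of the same term, last line of equation (31)."""
    return (np.sqrt(np.pi * delta**2)
            * np.exp(-delta**2 * p_m**2 / 4)
            * np.exp(-(q - 2 * SQRT_PI * (k + k_prime)) ** 2 / (4 * delta**2))
            * np.exp(-0.5j * p_m * (q + 2 * SQRT_PI * (k - k_prime))))

numeric = convolution_term_numeric(q=3.4, p_m=0.7, k=1, k_prime=0, delta=0.3)
closed = convolution_term_closed(q=3.4, p_m=0.7, k=1, k_prime=0, delta=0.3)
print("numeric:", numeric)
print("closed :", closed)
```

The closed form makes the broadening explicit: the incoming peaks have exponent 1/(2Δ²), while the outgoing peaks have exponent 1/(4Δ²), so the variance of each peak doubles.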

In figure 8 we plot the effective squeezing parameters Δ_p, Δ_q of the state, equation (20), after rounds of Z and X-error correction.

Represented as a state evolution, the stabilizer measurements of S_p and S_q with outcomes ${\mathfrak{q}}_{m}, {\mathfrak{p}}_{m}$ effectively map an incoming state |ϕ⟩ to ${\psi }^{+}\left(\hat{p}+{\mathfrak{p}}_{m}\right)\vert \phi \rangle$ and ${\psi }^{+}\left(\hat{q}-{\mathfrak{q}}_{m}\right)\vert \phi \rangle$ . The finite envelope of the approximate GKP states has the effect that the outgoing states are dominantly supported around $\hat{p}=-{\mathfrak{p}}_{m}, \hat{q}={\mathfrak{q}}_{m}$ . We understand this gain of non-modular information in one quadrature as a reflection of the loss of modular information (i.e. in terms of the increase in Δ_q/p) in its conjugate. In figure 9 we observe that a displacement by $\alpha =\frac{-{\mathfrak{q}}_{m}+\mathrm{i}{\mathfrak{p}}_{m}}{\sqrt{2}}$ indeed contributes to a recentering of the state.
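To see why this particular α recenters the state, one can track the quadrature means alone. The sketch below assumes the common conventions $a=(\hat{q}+\mathrm{i}\hat{p})/\sqrt{2}$ and $D\left(\alpha \right)=\mathrm{exp}\left(\alpha {a}^{{\dagger}}-{\alpha }^{{\ast}}a\right)$ , under which a displacement shifts ⟨q⟩ by √2 Re α and ⟨p⟩ by √2 Im α. The function name is hypothetical and only mean values are followed, not the full wavefunction.

```python
import math

def recenter(mean_q, mean_p, q_m, p_m):
    """Apply the corrective displacement alpha = (-q_m + i p_m) / sqrt(2)
    to the quadrature means (assumed convention: a = (q + i p) / sqrt(2))."""
    alpha = complex(-q_m, p_m) / math.sqrt(2)
    shift_q = math.sqrt(2) * alpha.real  # equals -q_m up to rounding
    shift_p = math.sqrt(2) * alpha.imag  # equals +p_m up to rounding
    return mean_q + shift_q, mean_p + shift_p

# A state supported around q = q_m, p = -p_m is moved back to the origin.
q_m, p_m = 0.37, -0.81
print(recenter(q_m, -p_m, q_m, p_m))
```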

**Figure 9.** Photon number evolution of an initial state $\mathfrak{F}\vert 0\rangle$ under repeated stabilizer measurement for Δ = 0.3. Here, ${\overline{n}}_{\mathrm{f}\mathrm{e}\mathrm{e}\mathrm{d}\mathrm{b}\mathrm{a}\mathrm{c}\mathrm{k}}$ denotes the photon number evolution where each EC-circuit as in figure 7 is followed by a corrective displacement $D\left(\frac{-\mathfrak{q}+\mathrm{i}\mathfrak{p}}{\sqrt{2}}\right)$ . The data is obtained by averaging over N = 10⁴ trajectories. One observes that the average photon number stays low, corresponding to a centering of the state.

What is noteworthy about the error correcting dynamics in equations (30) and (31) is that the support of the outgoing wavefunction lies within the support of the incoming wavefunction (ψ⁰(q)) plus the support of the ancilla wavefunction (here also ψ⁰(q)) since Supp(f ⋆ g) ⊂ Supp(f) + Supp(g) for a convolution of two functions f and g. The X-error correction step multiplies the convoluted wavefunction by ${\psi }^{+}\left(q-{\mathfrak{q}}_{m}\right)$ which cannot extend its support. Within its support the outgoing wavefunction can have changed amplitudes, depending on the outcomes ${\mathfrak{p}}_{m}$ and ${\mathfrak{q}}_{m}$ . These arguments are relevant as the GKP state $\mathfrak{F}\vert \overline{0}\rangle$ has support which is concentrated around even multiples of $\sqrt{\pi }$ overlapping by an amount exponentially suppressed in Δ² with the support of $\mathfrak{F}\vert \overline{1}\rangle$ . When one uses code states ψ⁰(q) and ψ¹(q) whose supports have negligible overlap—which is the case for $\mathfrak{F}\vert \overline{0}\rangle$ and $\mathfrak{F}\vert \overline{1}\rangle$ for sufficiently small Δ—it implies that, no matter what the measurement outcomes, 0 will largely remain a 0 and 1 will largely remain a 1 in the error correction rounds.

It implies that the picture of stochastic error feedback by using a finitely-squeezed $\vert \overline{+}\rangle$ ancilla is inadequate as in this picture the support of the wavefunction gets shifted around by such feedback error, while here we observe that instead amplitudes get changed within the support. We see some of this behavior in the performance of the maximum likelihood decoder versus a passive decoder in figure 10: a passive decoder which decides that 0 stays a 0 does surprisingly well, which may be understandable if we consider that the support of the ψ⁰(q) wavefunction can never grow by QEC (of course, in principle, the wave functions have support everywhere albeit exponentially-suppressed with 1/Δ²).

**Figure 10.** Logical error rate $\overline{P}$ for decoding the circuit in figure 7 ( *without* the corrective displacement) for different decoders and squeezing parameters Δ = 0.3 (top data) and 0.4 (bottom data) with the number of stabilizer measurement rounds M = 1, ..., 10. For Δ = 0.3 the mean performance of the forward decoder outperforms passive decoding, while the opposite is true for Δ = 0.4. We have observed that the single sample logical error rates strongly fluctuate at O(10⁻²) − O(10⁻¹) around the averages, suggesting rather chaotic behavior. The data are obtained by sampling over N = 5 × 10⁴ trajectories of measurement outcomes. Error bars denote the standard deviations of the averaged logical error rates. The memoryless decoder applies a different corrective displacement after each QEC round described in appendix B.

We have formulated a classical 'forward' decoder, see details in appendix B, and compared its performance with an optimal, density matrix decoding method (maximum likelihood decoding) as well as a so-called passive decoder, see the numerical results without active displacement feedback in figure 10. The passive decoder functions as an important sanity check: this decoder throws away all the measurement data, including the final measurement and simply always decides that the outcome is 0 when the input state to the rounds of error correction is an approximate GKP 0 state. Since we do not necessarily know the input state, we note that this decoder is of little practical value. By comparing the performance of this decoder with the MLD decoder we learn to what extent the quantum error correction circuits are preserving quantum information irrespective of the measurement outcomes and to what extent the measurement outcomes provide the proper logical information for correction. This decoder is similar to the passive decoder in the stochastic model of [48] in which none of the error information in each round is used and only the last perfect measurement determines whether the state is identified as 0 or 1. However, in [48] this passive decoder clearly performed worse for these values for Δ.

We compare the decoder performance in figure 10 to a 'memoryless' decoder using the measurement of the current QEC round for immediate logical feedback, see appendix B, clearly showing worse performance with this strategy. In figure 11 we show the better performance of the MLD and the (feedback-adapted) forward decoder using active feedback which minimizes photon number. We compare their performance with a 'parity' decoder which similarly applies the corrective displacement, but then applies a final logical correction (or not) when the sum of all applied shifts is closer to an odd (or even) multiple of $\sqrt{\pi }$ .
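The decision rule of the 'parity' decoder, as described in words above, can be sketched in a few lines. The sketch tracks a single quadrature only, and the function name and interface are hypothetical; the decoders actually used for the figures are specified in appendix B and in the linked simulation code, not here.

```python
import math

SQRT_PI = math.sqrt(math.pi)

def parity_decoder_needs_logical_flip(applied_shifts):
    """Return True when the accumulated corrective shift in one quadrature is
    closer to an odd multiple of sqrt(pi) than to an even one."""
    total = sum(applied_shifts)
    nearest_multiple = round(total / SQRT_PI)
    return nearest_multiple % 2 == 1

# Shifts summing to roughly one sqrt(pi) call for a final logical correction.
print(parity_decoder_needs_logical_flip([1.7, 0.1]))
# Shifts that nearly cancel do not.
print(parity_decoder_needs_logical_flip([0.1, -0.2, 0.05]))
```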

We observe that for small Δ and number of EC rounds M ⩽ 3, the passive decoder performs comparably to the MLD decoder, consistent with the intuition given earlier in this section. At larger M it performs worse at Δ = 0.3. Figures 10 and 11 show the average logical error rates obtained from N = 5 × 10⁴ samples for all decoders. We note (not shown in the figures) that there are large fluctuations of O(10⁻²) − O(10⁻¹) per run around the average logical error rate.

Last, to study the difference between coherent and stochastic errors, we plot ${\overline{P}}_{\text{MLD}}^{\text{stoch}}\left({\sigma }_{0}\right)$ and ${\overline{P}}_{\text{MLD}}\left({\Delta}\right)$ using the identification ${{\Delta}}^{2}=2{\sigma }_{0}^{2}$ and ${{\Delta}}^{2}={\sigma }_{0}^{2}$ as a function of M and Δ, see figure 12. The simulation for the MLD decoder based on a stochastic error model is implemented following [48]. We observe that the conversion ${{\Delta}}^{2}=2{\sigma }_{0}^{2}$ underestimates the logical error, while it is overestimated for ${{\Delta}}^{2}={\sigma }_{0}^{2}$ .

**Figure 12.** Logical error rate for MLD decoders for finite-squeezing (coherent) errors characterized by Δ versus the stochastic error model characterized by σ₀ in [48]. We compare the logical error rates using two different mappings from Δ to σ₀, neither of which is entirely satisfactory.

The simulation and data are accessible at https://github.com/JonCYeh/GKP_EC_Sim.

3. Circuit-QED realizations of GKP qubit components

In this section we review and discuss schemes for state preparation, logical gates and quantum error correction for GKP qubits in circuit-QED. In circuit-QED a natural candidate for a bosonic GKP encoding is a resonant mode of a 3D microwave cavity, having low loss rate. Multiple GKP qubits are then stored in multiple low-loss 3D cavities: an engineering platform for multiple coupled cavities—multi-layer microwave-integrated quantum circuits (MMIQC) [60]—is under development.

The coupling between cavity modes of different cavities can be mediated by dipolar 'antenna' couplings between the electric field of the cavity mode and that of an inserted planar chip in the cavity wall hosting a coupler mode. The idea is then to activate two-mode gates such as the CZ gate by applying microwave drives or flux-modulation outside of the 3D cavities on the coupler mode. As a simple circuit example one can take the electric circuit in figure 13, in which the two LC oscillators correspond to the cavity modes: two superconducting islands, each protruding into one cavity and coupling to the electric field of the resonant mode, are connected by a Josephson junction (the so-called bridge configuration). Such a setup can generate the Φ⁴-interaction for a CZ gate as will be described in section 3.4, while more involved circuits could be used to engineer an effective Φ³-interaction for the same purpose, see section 3.3.

3.1. Coupling with regular qubits

To prepare a logical GKP state or to realize single-qubit gates, one can employ an interaction with a regular qubit in which the state of the regular qubit controls the application of a displacement on the GKP mode as in figure 1. If the regular qubit is realized by an anharmonic oscillator such as a transmon qubit, this then requires the engineering of a tunable qubit controlled-displacement interaction of the form ${b}^{{\dagger}}b {\hat{q}}_{\theta }$ with ${\hat{q}}_{\theta }=\mathrm{cos}\left(\theta \right)\hat{q}+\mathrm{sin}\left(\theta \right)\hat{p}$ so that b^† b acts as Pauli Z in the regular qubit subspace.

As mentioned earlier, a common interaction between an off-resonantly coupled transmon qubit and cavity mode is the dispersive or cross-Kerr interaction of the form −χa^† ab^† b → −χa^† aZ/2. This interaction realizes a qubit controlled-rotation which can be converted, in principle, to a qubit controlled-displacement, using additional displacements and qubit-flips as follows. Since ${\mathrm{e}}^{\mathrm{i}\theta {a}^{{\dagger}}aZ}D\left(\alpha \right){\mathrm{e}}^{-\mathrm{i}\theta {a}^{{\dagger}}aZ}=D\left(\alpha {\mathrm{e}}^{\mathrm{i}\theta Z}\right)$ , choosing θ = π/2 gives the displacement D(iα) when Z = 1 and the displacement D(−iα) when Z = −1, i.e. two opposite displacements which, after relabeling α → −iα, are D(α) and D(−α). Thus this sequence of gates does what is needed. To realize ${\mathrm{e}}^{\mathrm{i}\theta {a}^{{\dagger}}aZ}$ , one can simply conjugate the interaction ${\mathrm{e}}^{-\mathrm{i}\theta {a}^{{\dagger}}aZ}$ by π-bit-flips on the qubit, so that we can do all 3 gates in the decomposition of D(αe^iθZ).
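The conjugation identity above can be checked numerically on the vacuum state. The following is a minimal sketch, assuming a truncated Fock basis and an illustrative value of α; the helper names are hypothetical and not taken from the paper. It uses the fact that exp(iθ a^† a) multiplies the Fock state |n⟩ by exp(iθn), so rotating the coherent state D(α)|0⟩ should reproduce the coherent state with amplitude αe^{iθZ}.

```python
import cmath
import math

def coherent_amplitudes(alpha, n_max=40):
    """Fock amplitudes <n|alpha> of a coherent state, truncated at n_max levels."""
    prefactor = math.exp(-abs(alpha) ** 2 / 2)
    return [prefactor * complex(alpha) ** n / math.sqrt(math.factorial(n))
            for n in range(n_max)]

def rotate(amplitudes, theta):
    """Apply exp(i*theta*a^dag a): Fock state |n> picks up the phase exp(i*theta*n)."""
    return [c * cmath.exp(1j * theta * n) for n, c in enumerate(amplitudes)]

def overlap(u, v):
    """Inner product <u|v> of two amplitude lists."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

alpha = 0.8          # illustrative displacement amplitude (assumption)
theta = math.pi / 2  # rotation angle chosen in the text

overlaps = {}
targets = {}
for z in (+1, -1):   # eigenvalue of Z on the regular qubit
    rotated = rotate(coherent_amplitudes(alpha), theta * z)
    targets[z] = alpha * cmath.exp(1j * theta * z)
    overlaps[z] = overlap(coherent_amplitudes(targets[z]), rotated)
    print(f"Z = {z:+d}: overlap with |alpha e^(i theta Z)> = {abs(overlaps[z]):.6f}")
```

For θ = π/2 the two qubit branches end at amplitudes αe^{±iπ/2}, which differ only by a sign, so the qubit state selects between two opposite displacements.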

**Figure 13.** Electric circuit representing a typical capacitive coupling between two LC oscillators, modeling cavity modes, and a (possibly flux-tunable) transmon qubit.

Note that the strength of the controlled-displacement α only depends on the strength of the uncontrolled displacement (which can easily be made very strong in O(10) ns). If the entire controlled-displacement is to be done in, say 50 ns, it requires two rotations each with time t = π/χ = 20 ns, or $\frac{\chi }{2\pi }=25$ MHz. This realization thus requires making χ tunable, i.e. when the transmon qubit is to be measured or prepared, it is important for the cross-Kerr interaction to be 'off', as it induces rotations on the GKP grid state which dephase the state in the Fock basis. However, there is a limit to the on–off ratio of χ obtained by flux-tuning of the transmon qubit, in particular if χ is flux-tuned to be stronger, then the resonator becomes more anharmonic as well, see [61] and equation (37).
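The timing arithmetic in this paragraph can be reproduced with a few lines. This is a small sketch, assuming only the relation t = π/χ quoted above; the function names are hypothetical.

```python
import math

def rotation_time_ns(chi_over_2pi_mhz):
    """Time t = pi/chi for one conditional rotation, in ns, given chi/(2*pi) in MHz."""
    chi_rad_per_ns = 2 * math.pi * chi_over_2pi_mhz * 1e-3  # 1 MHz = 1e-3 cycles per ns
    return math.pi / chi_rad_per_ns

def required_chi_over_2pi_mhz(t_ns):
    """Dispersive shift chi/(2*pi) in MHz such that pi/chi equals t_ns."""
    return 1e3 / (2 * t_ns)

t_rot = rotation_time_ns(25.0)          # one rotation at chi/2pi = 25 MHz
chi = required_chi_over_2pi_mhz(20.0)   # shift needed for a 20 ns rotation
total = 2 * t_rot                       # two rotations per controlled-displacement

print(f"rotation time: {t_rot:.1f} ns, two rotations: {total:.1f} ns")
print(f"required chi/2pi: {chi:.1f} MHz")
```

The two rotations take 40 ns in total, which leaves roughly 10 ns of the 50 ns budget for the uncontrolled displacements and qubit flips.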

Instead of needing a tunable interaction, reference [42] realized a qubit controlled-displacement in 1.2 μs, using a very weak dispersive coupling $\frac{\chi }{2\pi }=28$ kHz between transmon qubit and cavity mode. This weak coupling obviates the need for a tunable interaction, but would make for a very slow gate when using the method described in the previous paragraph. The idea in [42] is to realize a qubit controlled-displacement by temporarily displacing the cavity mode to states with |β|² = 320 photons, so that even a small qubit-induced cavity rotation can have a large effect. The scheme is best understood by using a displacement frame, see appendix C.1, on a cavity-driven dispersive shift Hamiltonian $H=-\frac{\chi }{2}{a}^{{\dagger}}aZ+\mathrm{i}\mathcal{E}\left(t\right){a}^{{\dagger}}-\mathrm{i}{\mathcal{E}}^{{\ast}}\left(t\right)a$ . The displacement frame shift shows that the dynamics is due to an effective Hamiltonian

$\begin{equation}\tilde {H}\left(t\right)=-\frac{\chi }{2}{a}^{{\dagger}}aZ-\frac{\chi }{2}\vert \beta \left(t\right){\vert }^{2}Z-\frac{\chi }{2}\left(a{\beta }^{{\ast}}\left(t\right)+{a}^{{\dagger}}\beta \left(t\right)\right)Z,\end{equation} \tag{ 32 }$

where $\beta \left(t\right){=\int }_{0}^{t} \mathrm{d}{t}^{\prime }\mathcal{E}\left({t}^{\prime }\right)$ . The effect of the last term in this Hamiltonian after a time T (taking β = β* for simplicity) is the qubit controlled-displacement ${\frac{\chi }{\sqrt{2}}\int }_{0}^{T} \mathrm{d}t{\vert \int }_{0}^{t} \mathrm{d}{t}^{\prime }\mathcal{E}\left({t}^{\prime }\right)\vert \hat{q}Z$ . In order to cancel the qubit controlled-rotation (first term) the qubit state is flipped midway in the interval T, requiring that the cavity displacement direction is also inverted midway, i.e. β → −β. We see that in this realization the applied displacement power and the dispersive shift χ together determine the strength of the qubit controlled-displacement.

We can ask how to improve on the execution of the qubit controlled-displacement gate and the subsequent qubit measurement, where improvement means a faster as well as more reliable execution. As for the realization in [42] one may worry that the large displacements of the state in phase-space during the execution of the gate lead to errors on the GKP state. Even though the cavity has a long lifetime (single-photon life-time T = 245 μs in [42]), the logical operator of a displaced GKP state takes on a complex oscillatory value in time due to photon loss, see equation (27). It is desirable to shorten the duration of the transmon qubit measurement (700 ns in [42]), but it is hard to make the dispersive read-out of the qubit via a read-out resonator very fast. For example, the measurement pulse followed by active read-out resonator depletion is O(600) ns in [62] and O(250) ns (including resonator occupancy) in [63]. Replacing the qubit measurement by feedback and disentangling [43] requiring other controlled-displacements can only lead to a shorter overall preparation time if the duration of such controlled-displacement can be shortened from what was achieved in [42]. Note that the replacement of measurement by coherent interactions could also be done for the GKP qubit rotation in figure 3.

Instead of a transmon qubit as ancilla, one may consider a different qubit such as fluxonium [64], again dispersively coupled to the 3D cavity mode. Advantages of a fluxonium qubit are its long coherence and larger anharmonicity leading to lower leakage [65], equally fast single-qubit gate operations (O(10) ns) as well as potentially very fast and powerful qubit measurement, see e.g. the GrAl-based fluxonium qubit in [66–68]. In addition, flux-tuning fluxonium may give a strongly-tunable dispersive shift χ [64, 69], without the unwanted side-effect of strengthening the cavity anharmonicity.

Another proposal is to use a noise-biased cat qubit to measure the stabilizer displacements of a GKP qubit [20] using a tunable beam-splitter interaction between the two cavity modes of the form H = g(t)ab^† + g*(t)a^† b, as argued in section 2.2. To use the interaction, we thus imagine first preparing the cat qubit in $\vert {C}_{\alpha }^{+}\rangle$ (by starting in the vacuum state |0⟩ and turning on the pump), then activate the tunable beam-splitter, and measure the noise-biased cat in the $\vert {C}_{\alpha }^{{\pm}}\rangle$ basis or employ feedback and disentangling via qubit controlled-displacements [43].

3.2. Logical GKP measurement

How does one determine whether a GKP state is $\vert \overline{0}\rangle$ or $\vert \overline{1}\rangle$ , that is, realize a logical $\overline{Z}$ -measurement? Such a logical measurement may be completely destructive, but it should have high fidelity and hence be fault-tolerant in its implementation, meaning that the outcome is insensitive to imperfections in the state. Even though one can measure a logical displacement, i.e. $\overline{X},\overline{Z}$ or $\overline{Y}$ , using a single ancilla qubit as was done in references [42, 50], such a measurement has an intrinsic probability of error on an approximate GKP state. For example, measuring $\overline{Z}$ on $\vert \overline{0}\rangle$ does not give outcome +1 with certainty, since the state is not a perfect eigenstate of $\overline{Z}$ but obeys equation (19). If we assume that this measurement circuit is otherwise perfect and is applied to $\mathfrak{F}\vert \overline{b}\rangle$ with b = 0, 1, the probability for the ancilla qubit to be measured as ± equals $\mathbb{P}\left({\pm}\vert \overline{b}\right)=\frac{1}{4}\frac{\langle \overline{b}\vert {\mathfrak{F}}^{{\dagger}}\left(I{\pm}{\overline{Z}}^{{\dagger}}\right)\left(I{\pm}\overline{Z}\right)\mathfrak{F}\vert \overline{b}\rangle }{\langle \overline{b}\vert {\mathfrak{F}}^{{\dagger}}\mathfrak{F}\vert \overline{b}\rangle }\approx \frac{1}{2}\left(1{\pm}{\left(-1\right)}^{b}{\mathrm{e}}^{-\pi {{\Delta}}^{2}/4}\right)$ , using equation (19). The upshot is that the ancilla measurement is flipped with symmetric error probability $q=\frac{1}{2}\left(1-{\mathrm{e}}^{-\pi {{\Delta}}^{2}/4}\right)$ , which goes to 0 when Δ → 0. At, say, Δ = 0.3, this readout error probability q is about 3.4% and much larger than the probability for an incorrect $\overline{Z}$ -outcome through the ideal homodyne measurement given in equation (18). Some repetition of the controlled-displacement circuit with the ancilla qubit and taking a majority vote of the answers could bring down the error probability q at the price of more time and possibly additional feedback error.
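The quoted readout error can be evaluated directly from the formula for q. The sketch below also estimates the effect of a majority vote under the simplifying assumption, not made in the text, that repeated readouts are flipped independently with the same probability q and that no additional feedback error is introduced; the function names are hypothetical.

```python
import math
from math import comb

def readout_error(delta):
    """Intrinsic ancilla readout error q = (1 - exp(-pi*delta^2/4)) / 2."""
    return 0.5 * (1 - math.exp(-math.pi * delta ** 2 / 4))

def majority_vote_error(q, n):
    """Probability that more than half of n readouts (n odd) are flipped,
    assuming independent flips with equal probability q (an idealization)."""
    return sum(comb(n, k) * q ** k * (1 - q) ** (n - k)
               for k in range(n // 2 + 1, n + 1))

q = readout_error(0.3)
q3 = majority_vote_error(q, 3)

print(f"q at Delta = 0.3: {100 * q:.1f}%")
print(f"majority vote over 3 rounds (idealized): {100 * q3:.2f}%")
```

In this idealized model three repetitions already push the error well below q, but each repetition back-acts on the GKP state, so the actual gain is expected to be smaller.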

A target for future work could be to achieve an improved logical GKP qubit measurement by releasing the GKP state from a superconducting cavity via a switch-release mechanism [70] (taking O(1) μs in time in [70]) into a transmission line and then enact phase-sensitive amplification (e.g. squeezing) so as to measure one quadrature, say $\hat{q}$ , with no further added noise. After calibration of the measurement using Δ-squeezed displaced states and their targeted measurement outcomes, the measurement could proceed by determining whether the amplified signal corresponds to a $\hat{q}$ which is closer to an even (→outcome 0) or odd multiple (→outcome 1) of $\sqrt{\pi }$ . Photon loss in this process may be expected to be a dominant source of noise. To get an estimate of the error rate in the presence of photon loss, we can compute equation (18) for a state at Δ = 0.3 undergoing photon loss as in equation (25) with κt = 0.1 (so that a coherent state loses 1 − exp(−κt) ≈ 10% of its intensity), giving $\mathbb{P}\left(\overline{Z}=1\vert \mathfrak{F}\vert \overline{0}\rangle ,\kappa t=0.1\right)=99.5\%$ . For κt = 0.5 this measurement success probability is already down to 80%.

3.3. GKP CZ gate via three-wave mixing

In this section we describe how one could realize the CZ interaction between two GKP modes via a three-wave mixing element which is activated by applying a (strong) microwave pump tone to a coupler mode, see e.g. [71]. An example of pure three-wave mixing used for broadband parametric amplification is the Josephson-ring modulator circuit [72, 73].

An example of the use of parametrically-activated three-wave mixing is the experiment in [74]: flux-modulation through a coupling Josephson junction (instead of microwave driving) is used to activate a ${{a}^{{\dagger}}}^{2}b$ coupling between a logical (co-planar microwave) resonator whose state is to be manipulated and an ancilla (co-planar microwave) resonator.

For simplicity, we here assume that the following non-degenerate three-wave mixing Hamiltonian is available:

$\begin{equation}{H}_{{\chi }^{\left(2\right)}}={\omega }_{a}\left({a}^{{\dagger}}a+\frac{1}{2}\right)+{\omega }_{b}\left({b}^{{\dagger}}b+\frac{1}{2}\right)+{\omega }_{c}\left({c}^{{\dagger}}c+\frac{1}{2}\right)+{\chi }^{\left(2\right)}\left(a+{a}^{{\dagger}}\right)\left(b+{b}^{{\dagger}}\right)\left(c+{c}^{{\dagger}}\right).\end{equation} \tag{ 33 }$

Here a, b are annihilation operators for two GKP oscillators while c is the annihilation operator of the pump oscillator. We assume that all frequencies ω_a, ω_b and ω_c are sufficiently detuned, so that the χ⁽²⁾ interaction between the modes will approximately time-average away in the rotating wave approximation (RWA) in the absence of any active driving, see the discussion in appendix C. Moving to the rotating frame of the GKP oscillators we have

$\begin{equation}{\tilde {H}}_{{\chi }^{\left(2\right)}}={\omega }_{c}\left({c}^{{\dagger}}c+\frac{1}{2}\right)+{\chi }^{\left(2\right)}\left(ab {\mathrm{e}}^{-\mathrm{i}\left({\omega }_{a}+{\omega }_{b}\right)t}+a{b}^{{\dagger}} {\mathrm{e}}^{-\mathrm{i}\left({\omega }_{a}-{\omega }_{b}\right)t}+\mathrm{h}.\mathrm{c}.\right)\left(c+{c}^{{\dagger}}\right).\end{equation} \tag{ 34 }$

Since there are two time-dependencies involved, we can make all χ⁽²⁾-interactions resonant by driving the pump mode c with a two-tone drive, namely at ω_p = ω_a + ω_b and ω_p = ω_a − ω_b. Both pump tones will need to be of equal amplitude to get equal contributions from beam-splitting (a^† b + ab^†) as well as two-mode squeezing (ab + a^† b^†). Assuming that the pump mode is a (fairly) harmonic mode which can be strongly driven, we replace the operator c by its classical time-dependent expectation value $\langle c\left(t\right)\rangle =\mathcal{E}\left({\mathrm{e}}^{-\mathrm{i}\left({\omega }_{a}+{\omega }_{b}\right)t}+{\mathrm{e}}^{-\mathrm{i}\left({\omega }_{a}-{\omega }_{b}\right)t}\right)$ with, say, $\mathcal{E}\in \mathbb{R}$ . Making a rotating-wave-approximation, appendix C, gives the generating interaction of the CZ gate between modes a and b:

$\begin{equation}{H}_{\text{CZ}}=2{\chi }^{\left(2\right)}\mathcal{E}{\hat{q}}_{a}{\hat{q}}_{b}.\end{equation} \tag{ 35 }$
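The resonance bookkeeping behind equation (35) can be sketched as follows. This is an illustrative derivation, not the authors' own: it assumes the convention $\hat{q}=(a+{a}^{\dagger})/\sqrt{2}$ (consistent with the factor 2 in equation (35)), drops the c-number term that remains from ω_c after the classical replacement of c, and keeps only time-independent terms in the RWA. The shorthands Σ and δ are introduced here for convenience.

```latex
\begin{align*}
\langle c(t)\rangle + \langle c(t)\rangle^{*}
  &= \mathcal{E}\left(e^{-i\Sigma t} + e^{i\Sigma t} + e^{-i\delta t} + e^{i\delta t}\right),
  \qquad \Sigma = \omega_a + \omega_b,\quad \delta = \omega_a - \omega_b, \\
\chi^{(2)}\left(ab\,e^{-i\Sigma t} + a b^{\dagger} e^{-i\delta t} + \mathrm{h.c.}\right)
  \left(\langle c(t)\rangle + \langle c(t)\rangle^{*}\right)
  &\;\xrightarrow{\ \mathrm{RWA}\ }\;
  \chi^{(2)}\mathcal{E}\left(ab + a^{\dagger}b^{\dagger} + a b^{\dagger} + a^{\dagger} b\right) \\
  &= \chi^{(2)}\mathcal{E}\,(a + a^{\dagger})(b + b^{\dagger})
   = 2\chi^{(2)}\mathcal{E}\,\hat{q}_a \hat{q}_b .
\end{align*}
```

Each of the four operator terms is made static by exactly one of the four pump exponentials, which is why the two tones need equal amplitude for the beam-splitter and two-mode squeezing parts to combine into the product of the two quadratures.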

Changing the phase of the pump tone ( $\mathcal{E}$ ) allows one to realize CZ⁻¹. If we want to do a GKP CNOT gate via two-tone pump, we cannot start with the interaction in equation (33), but a Hadamard or single-qubit R_Y(π/2) would be required to convert $\hat{q}\to \hat{p}$ .

Instead of applying two simultaneous pump tones to get a CZ (and with extra rotations, a CNOT), we could also decompose the CNOT circuit as in figure 4, i.e. a sequence of beam-splitters and single-mode squeezers. When we drive the pump mode at the difference frequency of the modes, ω_p = ω_a − ω_b in the nondegenerate three-wave mixing Hamiltonian in equation (33), we realize a beam-splitter interaction as the pump photon assists in converting one mode-a photon to a mode-b photon.

Single-mode squeezing can be activated by using a degenerate version of the three-wave mixing element ${\hat{q}}_{a}{\hat{q}}_{b}{\hat{q}}_{c}$ in equation (33) with a Hamiltonian proportional to ${\hat{q}}_{a}^{2}{\hat{q}}_{c}$ (or similarly ${\hat{q}}_{b}^{2}{\hat{q}}_{c}$ ). By applying a pump tone at frequency ω_p = 2ω_a, one activates a squeezing Hamiltonian H_sq on mode a: we down-convert one pump photon into two mode-a photons and vice-versa.

For superconducting devices the only native non-linear circuit element that we have at our disposal are Josephson junctions which—in their simplest use, without externally applied fluxes—realize a U(Φ) = −E_J cos(2πΦ/Φ₀) potential interaction. Here the flux variable Φ can be expanded as a linear combination of the q quadratures of the bosonic modes which participate in the junction¹⁰ . Usually, if E_J is large (≫E_C), we expand this cosine potential around its potential minimum Φ = 0, obtaining only interactions which are symmetric under Φ → −Φ such as Φ⁴ (while absorbing the Φ²-terms in the quadratic part of the Hamiltonian).

It is clear from the discussion above that it would be desirable to engineer a Φ³-interaction where 2πΦ/Φ₀ = αq_a + βq_b + γq_c. When the three modes have sufficiently different frequencies, we can observe that all terms in this Φ³-interaction, except those proportional to a^† a, b^† b or c^† c, average out in time, hence the interaction is 'off' in the absence of active driving of one of the modes, not inducing any nonlinearity on the modes in this off-state. At the same time, by choosing the pump mode drive frequencies appropriately, we can activate, with the same interaction element, either a squeezer for mode a, a squeezer for mode b, a beam-splitter between modes a and b and/or a two-mode squeezer between modes a and b.

Besides the Josephson ring modulator, another three-wave mixing element, called a SNAIL, has been proposed in [77]: it uses a superconducting (SQUID-like) loop containing an asymmetric array of a few Josephson junctions, with an external flux applied through the loop. The effective potential induced by this SNAIL is of the form U(Φ) = c₂Φ² + c₃Φ³ + c₄Φ⁴ with c₃ ≫ c₄ ≠ 0 where Φ is the flux variable expanded around its potential minimum, determined by the external flux Φ_ext.

A recent paper [78] discusses the circuit-QED engineering required to realize a universal set of gates for continuous-variable computation using GKP states. By flux-modulating the SNAIL, one can activate some of the terms in the three-wave mixing Hamiltonian in equation (33), mimicking the effect of microwave driving of the pump mode. The authors in [78] then use this activation to show for example how to realize an interaction ${\hat{q}}^{3}$ , required to enact the cubic phase gate ${V}_{\gamma }={\mathrm{e}}^{\mathrm{i}\gamma {\hat{q}}^{3}}$ .

Another use of a Φ³-coupling for GKP state preparation has been proposed in [49]. In this paper the aim is to produce a tunable opto-mechanical coupling of the form ${b}^{{\dagger}}b {\hat{q}}_{\text{GKP}}$ between a GKP mode and a (harmonic) ancilla mode (b) which is initially prepared in a coherent state. Such a coupling can be used to prepare the GKP mode into a logical state starting from a vacuum state, similar to the preparation via regular qubits discussed in section 3.1. The idea here is that the frequency of the ancilla oscillator is shifted depending on the value for $\hat{q}$ of the GKP mode, leading to a $\hat{q}$ -dependent rotation of the coherent state of the ancilla mode. When the interaction time is chosen so that all $q=k\sqrt{\pi }$ for $k\in \mathbb{Z}$ lead to the same rotation of the coherent state, measuring the coherent amplitude realizes an approximate modular measurement of $\hat{q}$ , resolving the value of $q \text{mod} \sqrt{\pi }$ . Such a modular measurement of $\hat{q}$ is equivalent to measuring the eigenvalue phases of ${S}_{q}=\mathrm{exp}\left(\mathrm{i}2\sqrt{\pi }\hat{q}\right)$ . A possible advantage of this method over the coupling with regular qubits is that one gets more information per ancilla mode measurement than 1 bit. In this proposal an externally-applied flux is modulated around a value for which there is an effective third-order Φ³-coupling between the two oscillators while the Φ⁴-coupling vanishes at this flux setting. Choosing the GKP oscillator at much lower frequency (∼0.5 GHz) than the ancilla oscillator (∼10 GHz) creates an asymmetry so that a term like ${b}^{{\dagger}}b {\hat{q}}_{\text{GKP}}$ dominates in the Φ³-interaction and the term is made resonant via flux modulation.

3.4. Use of four-wave mixing?

We comment on the use of a Φ⁴-interaction for realizing the GKP CZ gate. The set-up we have in mind is modeled by the electric circuit in figure 13. Applying circuit-quantization to this circuit leads to a Hamiltonian with three active modes. Due to the coupling between the LC oscillators, each described as a single mode, some hybridization will happen between the bare cavity modes and the transmon coupler mode, and so we will associate annihilation operators a, b and c with these dressed modes. Due to this hybridization the three dressed modes with annihilation operators a, b and c will partake in the Josephson junction. This means that for the flux-variable operator $\hat{{\Phi}}$ across the Josephson-junction branch, we can write $2\pi \hat{{\Phi}}/{{\Phi}}_{0}=\alpha {\hat{q}}_{a}+\beta {\hat{q}}_{b}+\gamma {\hat{q}}_{c}$ with dimensionless α, β, γ modeling the participation of the effective modes in the Josephson junction [61]. Expanding the cosine potential up to fourth-order, and diagonalizing the linear interactions of the Hamiltonian (quadratic in creation and annihilation operators) thus gives rise to three dressed eigenmodes at frequencies ${\tilde {\omega }}_{a}$ , ${\tilde {\omega }}_{b}$ and ${\tilde {\omega }}_{c}$ , and we have the Hamiltonian:

$\begin{equation}H={\tilde {\omega }}_{a}\left({a}^{{\dagger}}a+\frac{1}{2}\right)+{\tilde {\omega }}_{b}\left({b}^{{\dagger}}b+\frac{1}{2}\right)+{\tilde {\omega }}_{c}\left({c}^{{\dagger}}c+\frac{1}{2}\right)-\frac{{E}_{J}}{4!}{\left(\alpha {\hat{q}}_{a}+\beta {\hat{q}}_{b}+\gamma {\hat{q}}_{c}\right)}^{4}.\end{equation} \tag{ 36 }$

As in the discussion on three-wave mixing we assume that all frequencies ${\tilde {\omega }}_{a},{\tilde {\omega }}_{b},{\tilde {\omega }}_{c}$ are sufficiently different (detuned). If there is no active driving (or flux-modulation), a full RWA, whose accuracy depends on the amount of detuning, will leave only energy-conserving self-Kerr and cross-Kerr terms. In other words, in the off-state, the Hamiltonian is approximately

$\begin{align}\hfill {H}_{\text{off}}& \approx {\omega }_{a}\left({a}^{{\dagger}}a+\frac{1}{2}\right)+{\omega }_{b}\left({b}^{{\dagger}}b+\frac{1}{2}\right)+{\omega }_{c}\left({c}^{{\dagger}}c+\frac{1}{2}\right)-\frac{1}{2}\left({\chi }_{aa}{\left({a}^{{\dagger}}a\right)}^{2}+{\chi }_{bb}{\left({b}^{{\dagger}}b\right)}^{2}+{\chi }_{cc}{\left({c}^{{\dagger}}c\right)}^{2}\right)\hfill \\ \hfill & \quad - {\chi }_{ab}{a}^{{\dagger}}a{b}^{{\dagger}}b-{\chi }_{ac}{a}^{{\dagger}}a{c}^{{\dagger}}c-{\chi }_{bc}{b}^{{\dagger}}b{c}^{{\dagger}}c,\hfill \end{align} \tag{ 37 }$

where ${\chi }_{i{i}^{\prime }}=2\sqrt{{\chi }_{ii}{\chi }_{{i}^{\prime }{i}^{\prime }}}$ [61]. Here ${\omega }_{a}={\tilde {\omega }}_{a}-\frac{{E}_{J}{\alpha }^{4}}{24}$ (and similarly for ω_b and ω_c) due to rewriting the excitation-conserving terms in ${\hat{q}}_{a}^{4}$ as a quadratic term ∝a^† a and the self-Kerr term $\propto {\left({a}^{{\dagger}}a\right)}^{2}$ and ${\chi }_{aa}=\frac{{E}_{J}{\alpha }^{4}}{12}$ . Here we clearly see the advantage of a pure three-wave mixing element over a four-wave mixing element: in the off-state the four-wave mixing element induces unwanted Kerr and cross-Kerr anharmonicities on the GKP storage modes a and b.

In the off-state, mode c is (ideally) in its vacuum state, hence the cross-Kerr interaction with this mode does not contribute. However, if this mode were driven these corrections are relevant and they induce additional cavity rotations. Let us now indeed discuss the effect of applying a drive on mode c. For this, we expand the fourth-order term in equation (36) which becomes

$\begin{equation}{H}_{{\chi }^{\left(3\right)}}=-\frac{{E}_{J}}{4!}\left[\left(\genfrac{}{}{0pt}{}{4}{2}\right){\times}2\alpha \beta {\gamma }^{2}{\hat{q}}_{a}{\hat{q}}_{b}{\hat{q}}_{c}^{2}+\left(\genfrac{}{}{0pt}{}{4}{2}\right){\alpha }^{2}{\gamma }^{2}{\hat{q}}_{a}^{2}{\hat{q}}_{c}^{2}+\cdots \right].\end{equation} \tag{ 38 }$

We can apply a two-tone drive on mode c at frequency $\frac{{\omega }_{a}+{\omega }_{b}}{2}$ and $\frac{{\omega }_{a}-{\omega }_{b}}{2}$ with equal amplitudes $\mathcal{E}$ . Replacing ${\hat{q}}_{c}$ by its time-dependent expectation $\langle {\hat{q}}_{c}\left(t\right)\rangle =\frac{\mathcal{E}}{\sqrt{2}}\left({\mathrm{e}}^{\mathrm{i}t\frac{{\omega }_{a}+{\omega }_{b}}{2}}+{\mathrm{e}}^{\mathrm{i}t\frac{{\omega }_{a}-{\omega }_{b}}{2}}+\mathrm{h}.\mathrm{c}.\right)$ in equation (38) and going to the rotating frame of all modes a and b, we find that a term like ${\hat{q}}_{a}{\hat{q}}_{b}{\hat{q}}_{c}^{2}$ leads to a time-independent resonant term proportional to ${\hat{q}}_{a}{\hat{q}}_{b}$ . This can be seen as follows. First, note that the signal ${\langle {q}_{c}\left(t\right)\rangle }^{2}$ only contains frequencies ω_a, ω_b, ω_a + ω_b and ω_a − ω_b, all of equal strength. The frequency ω_a + ω_b matches two-mode squeezing (ab + h.c.), while ω_a − ω_b matches beamsplitting (ab^† + h.c.). Besides this, we throw out all time-dependent terms (RWA). In particular we have

Terms without any ${\hat{q}}_{c}$ lead only to self-Kerr and cross-Kerr for modes a and b.
Terms with a single ${\hat{q}}_{c}$ or ${\hat{q}}_{c}^{3}$ are not frequency-matched to become time-independent (as ω_a and ω_b are sufficiently different).
The term ${\hat{q}}_{c}^{4}$ leads to self-Kerr for mode c.
Terms with ${\hat{q}}_{c}^{2}$ lead to cross-Kerr between modes c and a or c and b. Note that from a term such as ${\hat{q}}_{a}^{2}{\hat{q}}_{c}^{2}$ there is a contribution proportional to ${\mathcal{E}}^{2}{a}^{{\dagger}}a$ (and similarly ${\hat{q}}_{b}^{2}{\hat{q}}_{c}^{2}$ gives ${\mathcal{E}}^{2}{b}^{{\dagger}}b$ ).

We could also realize the CNOT gate using the beam-splitter and squeezer sequence in figure 4, i.e. we choose a single-tone pump at ω_p = (ω_a − ω_b)/2 for the beam-splitter to let two pump photons assist in converting one mode-a photon to one mode-b photon. Single-mode squeezing can be realized by using the interaction ${\hat{q}}_{a}^{2}{\hat{q}}_{c}^{2}$ in equation (38), i.e. we should take ω_p = ω_a so that two pump photons are converted into two mode-a photons. However, note that this also makes the unwanted interaction b^† b(a^† c + ac^†) (which comes from the ${\hat{q}}_{b}^{2}{\hat{q}}_{a}{\hat{q}}_{c}$ term) resonant, which makes this scheme unattractive.

An important parameter measuring the quality of the CZ gate via this Φ⁴-interaction is the relative strength of the unwanted Kerr and cross-Kerr terms versus the strength of the two-tone pump-activated wanted interactions ${\hat{q}}_{a}{\hat{q}}_{b}$ . In part, this relies on the error contributions due to the rotating wave approximation which should be better quantified theoretically (appendix C). Another contributing factor is the relative strength of the participation parameters α, β, γ and the pump strength, namely, without error contribution from the RWA, one has

$\begin{equation}\frac{{\Vert}{H}_{\text{CZ}}\left(a,b\right){\Vert}}{{\Vert}{H}_{\mathrm{c}\mathrm{r}\mathrm{o}\mathrm{s}\mathrm{s}-\mathrm{K}\mathrm{e}\mathrm{r}\mathrm{r}}\left(a,b\right){\Vert}}\propto \frac{{\mathcal{E}}^{2}{\gamma }^{2}}{\alpha \beta };\end{equation} \tag{ 39 }$

Hence, a large γ²/αβ is desirable, but a large γ also makes mode c more anharmonic as χ_cc ∝ γ⁴ and this again severely restricts the pump power $\mathcal{E}$ . These conflicting constraints may make this scheme less suitable in practice.

4. Prospects for a GKP-surface code architecture

In this section we would like to provide a perspective of what it would take to build a surface code architecture based on GKP qubits, point out the challenges in this approach, as well as contrast it with existing efforts to engineer a similar architecture using transmon qubits [79], see section 4.1.

We can partially use the results in reference [48] as a starting point for such GKP-surface code architecture. In this code architecture, there are two layers of protection. On the one hand, each GKP qubit is either stabilized or error corrected individually, reducing a continuous set of (displacement) errors to a mostly discrete set of GKP qubit Pauli errors. On the other hand, the surface code layer is there to suppress the logical error rate of a GKP qubit to values which decrease exponentially with the side length of the surface code lattice.

In reference [48] GKP error correction takes place with ancilla GKP modes using the circuits in figure 7. Note that these circuits can also be implemented via CZ gates, but will then require Hadamard or R_Y(π/2) rotations on the GKP data mode. Interspersed with this GKP error correction, parity checks of the surface code, shown in figure 15, are to be measured in QEC cycles. These circuits are similar to those for a regular surface code, except that the underlying qubits are GKP qubits encoded in oscillators, see figure 14 for a Z-check. We use the fact that GKP logical operators are not self-inverse as displacements (and as displacements they obey $\left[{\overline{X}}_{1}{\overline{X}}_{2},{\overline{Z}}_{1}^{-1}{\overline{Z}}_{2}\right]=0$ ) to measure checks which mutually commute on the entire oscillator space¹¹. In this figure the measurement of the GKP ancilla is shown as the release and amplification of the cavity state, followed by a quadrature measurement, as discussed in section 3.2. Such a measurement would give useful analog information, but the usefulness of this analog information is challenged by the loss of photons in this step, and such a measurement has not yet been experimentally realized.

We call this GKP-surface code architecture $\mathsf{Only-SurfaceCode-GKP-Ancilla}$ and $\mathsf{All-Regular-Qubit-Ancilla}$ . This set-up would require each GKP qubit oscillator to have CZ capability with 5 other GKP oscillators, namely 4 GKP ancilla oscillators for the surface code and 1 GKP ancilla oscillator for its own error correction.

Reference [48] used a model of Gaussian stochastic displacement noise, equation (24), as an effective, numerically-simulatable, error model for this architecture. The noise channel acts at three different locations: (1) on each GKP qubit prior to GKP error correction and a round of surface-code parity check measurements, (2) prior to the homodyne measurement in figure 7 and (3) prior to the homodyne measurement of the ancilla GKP in the surface code check. Taking the standard deviations of these Gaussian channels to be equal, a threshold standard deviation ${{\sigma }_{0}}_{c}\approx 0.243$ for the toric code was found. Note that this model includes all sources of errors, including finite squeezing and feedback errors, albeit stochastically. Using the conversion ${{\Delta}}^{2}=2{\sigma }_{0}^{2}$ , this gives a threshold of Δ = 0.34 or 9.3 dB, but the data in figure 12 show this conversion is somewhat too optimistic: using squeezed states with Δ gives an error rate which is somewhat worse than a stochastic model with ${\sigma }_{0}={\Delta}/\sqrt{2}$ , so a worst-case threshold estimate using ${{\Delta}}^{2}={\sigma }_{0}^{2}$ would be 12.3 dB. Reference [81] considered a variation on this stochastic noise model, explicitly including error feedback, and applied this noise to a concatenation of the GKP qubit with the surface code. Both [48, 81] used minimum weight matching decoders to find thresholds.
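The two threshold figures quoted above follow from a short conversion. The sketch below assumes that squeezing in dB is defined as −10 log₁₀(Δ²), a convention inferred here from the quoted pair Δ = 0.34 and 9.3 dB rather than stated in this section; the function name is hypothetical.

```python
import math

def squeezing_db(delta):
    """Squeezing in dB, assuming the convention dB = -10*log10(delta^2)."""
    return -10 * math.log10(delta ** 2)

sigma_c = 0.243                            # threshold standard deviation from [48]

delta_optimistic = math.sqrt(2) * sigma_c  # conversion Delta^2 = 2*sigma_0^2
delta_worst_case = sigma_c                 # conversion Delta^2 = sigma_0^2

db_optimistic = squeezing_db(delta_optimistic)
db_worst_case = squeezing_db(delta_worst_case)

print(f"Delta = {delta_optimistic:.2f} -> {db_optimistic:.1f} dB")
print(f"Delta = {delta_worst_case:.3f} -> {db_worst_case:.1f} dB")
```

The factor of 2 between the two conversions corresponds to a fixed offset of 10 log₁₀(2) ≈ 3 dB between the optimistic and worst-case squeezing requirements.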

**Figure 14.** A Z-check parity measurement for the surface-GKP code in figure 15, on oscillators labelled NE–SW (northeast to southwest of ancilla qubit) using CZ and CZ⁻¹ gates as defined in section 2.3.2. $\vert \overline{+}\rangle$ is an approximate +1 eigenstate of $\overline{X}$ and the GKP stabilizers. Release and amplification (Amp) followed by measurement of the quadrature $\hat{p}$ is a way to do a logical $\overline{X}$ measurement and reprep is a unit standing for the repreparation of the GKP ancilla state.

Reference [48] identified the defects and the distance function between them following the associated compact-QED model closely in order to approach exact minimum-weight decoding. Different from this, reference [81] identified the positions where the surface code check outcomes change as defects, but altered the distance function between these defects based on GKP error information.

We add two more observations about this scheme. First, when we use stabilizer error correction, such as surface code error correction, on bosonic codes, we need to implement parity check operators which sometimes act like a logical X on a bosonic qubit, and sometimes like a logical Z. For GKP qubits this translates into the ability to perform CZ gates as well as CNOT gates. For standard (transmon) qubits, the switch between CZ and CNOT is easily achieved by applying a layer of Hadamard gates between a parity X-cycle and a parity Z-cycle. For a GKP qubit encoded into an oscillator with frequency f, such a Hadamard gate seems simple: it amounts to waiting for a time t = 1/(4f). But since all data qubits have to undergo this Hadamard gate, the resonant frequencies of the data qubit oscillators (resonant modes of identical 3D cavities) should all be identical, which seems like a narrow target to aim at (although the difference between a simulation-based predicted 3D cavity frequency and the measured frequency can be less than 0.1% [3]).

As an alternative to the Hadamard gate one can use an R_Y(π/2) gate and an R_Y(−π/2) gate, using a regular qubit as in figure 3, to toggle back and forth between Z and X error correction, but this costs considerably more effort and time than doing an R_Y(π/2) on a transmon qubit in O(10) ns. A second observation is that the use of parametrically-driven three-wave or four-wave mixing as discussed in sections 3.3 and 3.4 could allow for the simultaneous execution of the CZ and CZ⁻¹ gates needed to do a surface-code parity check measurement, as the activation of the CZ (or CZ⁻¹) gate only requires the application of a pump tone to the coupler between each data oscillator and ancilla oscillator (4 couplers in total). The coupling strengths of these CZ couplers may not be equal, hence the durations of these four pump drives can vary, but an advantage of only driving the coupler mode (instead of the GKP mode) is that it enables the simultaneous execution of these commuting gates. Another way of looking at the simultaneously-executed parity check is to observe that a green Z-check in figure 15 on oscillators NE, SE, NW, SW corresponds to

$\begin{equation}\hat{G}={\hat{q}}_{\text{NE}}+{\hat{q}}_{\text{SE}}-{\hat{q}}_{\text{NW}}-{\hat{q}}_{\text{SW}}.\end{equation} \tag{ 40 }$

An interaction Hamiltonian ${H}_{G}=-{\hat{q}}_{A}\hat{G}$ applied for time t has the effect that ${\hat{p}}_{A}\to {\hat{p}}_{A}-\hat{G}t$ , using equation (2) (while for all data oscillators i = NE, SE, NW, SW participating in the check, we have ${\hat{p}}_{i}\to {\hat{p}}_{i}{\pm}t{\hat{q}}_{A}$ ). Taking t = 1, we see that by measuring the ancilla quadrature ${\hat{p}}_{A}$ , we measure Ĝ modulo even multiples of $\sqrt{\pi }$ (as $\vert \overline{+}\rangle$ has sharp peaks at p_A being even multiples of $\sqrt{\pi }$ ). Thus, if one of the oscillators undergoes a $\sqrt{\pi }$ shift in q, the measurement will detect this.
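The detection statement can be illustrated with a minimal classical sketch: we track only the displacement of each data oscillator in q, form Ĝ with the signs of equation (40), and reduce it modulo 2√π, as the ancilla measurement does. Ideal (infinitely squeezed) states are assumed, and the function names and the decision rule (fire when the reduced value is closer to ±√π than to 0) are illustrative choices, not taken from the text.

```python
import math

SQRT_PI = math.sqrt(math.pi)
# Signs of the four data oscillators in the Z-check of equation (40).
SIGNS = {"NE": +1, "SE": +1, "NW": -1, "SW": -1}

def check_value(q_shifts):
    """Value of G for the given q-displacements, reduced to [-sqrt(pi), sqrt(pi))."""
    g = sum(SIGNS[name] * q_shifts.get(name, 0.0) for name in SIGNS)
    period = 2.0 * SQRT_PI
    return (g + period / 2.0) % period - period / 2.0

def z_check_fires(q_shifts):
    """True if the reduced value of G is closer to +-sqrt(pi) than to 0."""
    return abs(check_value(q_shifts)) > SQRT_PI / 2.0

print(z_check_fires({}))                            # no error
print(z_check_fires({"NW": SQRT_PI}))               # one logical shift
print(z_check_fires({"NE": SQRT_PI, "NW": SQRT_PI}))  # two shifts, even parity
print(check_value({"SE": 0.1}))                     # small analog shift is still visible
```

A single √π shift changes the reduced value by √π whatever its sign, whereas two such shifts cancel modulo 2√π, which is the usual parity behaviour of the check.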

**Figure 15.** Distance-3 rotated surface code in its standard surface-17 layout with green Z-checks and red X-checks: black filled circles are data qubits, open circles are ancilla qubits, dashed lines are two-qubit interactions. The ±1 pattern on each check denotes the use of inverses as in equation (40), so that all checks commute as displacements.

Besides the Steane error correction in figure 7, one can also imagine a more hardware-efficient form of GKP error correction via stabilization using a regular qubit as discussed in section 3.1, regularly interspersed with parity check measurements for the surface code which, for example, do use a GKP ancilla. The advantage here is that one does not need to prepare and couple the ancillary GKP qubit as in figure 7 (which again requires a regular qubit). In particular, (ancilla) GKP state preparation is time-consuming (60 μs in [42]) because it requires slow controlled-displacement gates and slow qubit measurement, and during this process photon loss affects the GKP qubit. At the same time, we keep the GKP ancilla for the possibly less frequent surface code QEC cycle in order to still obtain analog error information. We refer to this intermediate scheme as $\mathsf{Only-SurfaceCode-GKP-Ancilla}$ .

Another choice is to use regular qubits to extract both GKP and surface code error information, see the circuits in figure 16. We refer to this scheme as $\mathsf{All-Regular-Qubit-Ancilla}$ . An advantage of this scheme is that no Hadamards or R_Y(π/2) rotations are needed on GKP modes and tunable controlled-displacement gates are used throughout.

The $\mathsf{All-Regular-Qubit-Ancilla}$ architecture can however be less tolerant towards errors: it might be hard to get below threshold for the surface code when all error information is obtained through qubits, giving 1 bit of information at a time. We can provide arguments for this by using a simple error model in which we assume that GKP error correction generates an effective phenomenological error model in each surface code QEC cycle and we assume that the surface code QEC cycle is otherwise perfect. We model the effect of GKP error correction as stabilizing an approximate GKP qubit of the form, say, $\mathfrak{F}\vert \overline{b}\rangle$ at some Δ, besides having a logical error b → ¬b on top with probability p. Effectively then, the approximate GKP code states coming into a perfect surface-code parity check circuit as in figure 16 will flip the regular qubit ancilla with some effective error probability q which depends on Δ. We thus map our error model onto a known (phenomenological) surface code error model in which there is an incoming error with probability p in each QEC round and a measurement error with probability q(Δ). For this model, figure 3 in reference [82] shows the numerically-found below-threshold region and for q = p the threshold is optimally 3.3% [83]. Reference [82] does not investigate the below-threshold region and its shape for low p, but it certainly lies within the below-threshold region for the repetition code which is conjectured to have a below-threshold region given by H₂(p) + H₂(q) ⩽ 1 with H₂(p) the binary entropy [84].
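The conjectured repetition-code region is easy to evaluate for a given pair (p, q). The sketch below only tests the inequality H₂(p) + H₂(q) ⩽ 1 stated above; it is an outer bound for the surface code and says nothing about the actual surface-code region of reference [82]. Function names are illustrative.

```python
import math

def binary_entropy(p):
    """Binary entropy H2(p) in bits, with H2(0) = H2(1) = 0."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def within_conjectured_region(p, q):
    """True if (p, q) satisfies H2(p) + H2(q) <= 1."""
    return binary_entropy(p) + binary_entropy(q) <= 1.0

for p, q in [(0.033, 0.033), (0.01, 0.12), (0.12, 0.12)]:
    total = binary_entropy(p) + binary_entropy(q)
    print(f"p={p}, q={q}: H2(p)+H2(q)={total:.3f}, inside={within_conjectured_region(p, q)}")
```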

To estimate q(Δ) we can write the probability to measure the ancilla qubit in the X-basis as $\mathbb{P}\left({\pm}\right)=\frac{1}{2}\frac{{\left(\langle \overline{0}\vert {\mathfrak{F}}^{{\dagger}}\mathfrak{F}\vert \overline{0}\rangle \right)}^{4}{\pm}\frac{1}{2}\left[{\left(\langle \overline{0}\vert {\mathfrak{F}}^{{\dagger}}\overline{Z}\mathfrak{F}\vert \overline{0}\rangle \right)}^{4}+{\left(\langle \overline{0}\vert {\mathfrak{F}}^{{\dagger}}{\overline{Z}}^{{\dagger}}\mathfrak{F}\vert \overline{0}\rangle \right)}^{4}\right]}{{\left(\langle \overline{0}\vert {\mathfrak{F}}^{{\dagger}}\mathfrak{F}\vert \overline{0}\rangle \right)}^{4}}$ , which, using equation (19), approximately gives $\mathbb{P}\left({\pm}\right)\approx \frac{1}{2}\left(1{\pm}{\mathrm{e}}^{-\pi {{\Delta}}^{2}}\right)$ and thus $q\left({\Delta}\right)=\frac{1}{2}\left(1-{\mathrm{e}}^{-\pi {{\Delta}}^{2}}\right)$ . At Δ = 0.3, we already have q(Δ) = 12% while Δ = 0.15 just suffices to get q(Δ) = 3.4%.
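The two quoted values follow directly from the approximate expression for q(Δ). The sketch below evaluates it and also inverts it to find the Δ at which q reaches a target value such as the 3.3% threshold; the inversion is simple algebra on the same formula, and the function names are illustrative.

```python
import math

def measurement_error(delta):
    """q(delta) = (1 - exp(-pi delta^2)) / 2."""
    return 0.5 * (1.0 - math.exp(-math.pi * delta ** 2))

def delta_for_error(q):
    """Inverse of measurement_error for 0 <= q < 1/2."""
    return math.sqrt(-math.log(1.0 - 2.0 * q) / math.pi)

for delta in (0.3, 0.15):
    print(f"delta = {delta}: q = {100 * measurement_error(delta):.1f}%")
print(f"q = 3.3% is reached at delta = {delta_for_error(0.033):.3f}")
```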

The frequency of doing the surface code error correction could also be adapted to the logical decay rate of the stabilized GKP state, e.g. 275 μs in [42], so that the logical qubit error rate between surface code QEC cycles is at least less than 3.3%. It is an open question how to analyze the noise threshold for the $\mathsf{All-Regular-Qubit-Ancilla}$ architecture for a more elaborate error model.

In this $\mathsf{All-Regular-Qubit-Ancilla}$ architecture the workhorse is the controlled-displacement gate with the regular qubit and the regular qubit preparation and measurement. The desiderata for these regular qubits are clearly (1) the ability to enact a fast and accurate tunable controlled-displacement with a 3D cavity mode, (2) low leakage to higher excited states, (3) long T₁ and T₂, beyond 100 μs, (4) fast measurement below O(100) ns, and (5) fast preparation of |+⟩ and single-qubit gates (O(10) ns). At first sight, this seems like a wishlist for any good qubit; however, it is not necessary to have a high-quality two-qubit gate between these qubits, which is a nontrivial component for the surface code with transmon qubits. Furthermore, the frequency of the 3D cavity GKP modes can be taken to be very different from those of the ancillary qubits and their coupled read-out resonators, possibly leading to easier frequency control and less frequency crowding than in an architecture with only one type of device qubit such as the surface code with transmon qubits [79, 85].

4.1. Comparison: Fock qubit surface code and transmon qubit surface code

Given that we imagine using high-Q 3D cavities for qubit storage, we can ask how to compare a GKP encoding with a simple Fock encoding in a surface code architecture, omitting any additional error correction. CZ gates between a 3D-cavity encoded Fock qubit (mode a) and an ancilla transmon qubit can be realized by a dispersive coupling −χa^† aZ/2, allowing for the execution of the Z-check measurement. As for the controlled-displacement in the GKP encoding, tunability of this interaction, for example by using an intermediate frequency-tunable resonator to vary the coupling strength, is important. This type of parity check, using 1 transmon qubit to read out 4 Fock qubits, is the reverse of using one bosonic mode to read out the parity of four coupled transmon qubits as realized in reference [86]. For the X-check measurement one requires a CNOT gate with the transmon qubit as target, which can be realized by performing a CZ preceded and followed by Hadamard or R_Y(π/2) gates. Again, as in the GKP encoding, these simple single-qubit gates require the use of an ancillary qubit, but arbitrary cavity manipulations through such a coupled transmon qubit have been demonstrated in [87], albeit of rather long, O(1) μs, duration, and with some inevitable leakage towards the state |2⟩ or higher.

An engineering effort for making a surface code architecture using 2D transmon qubits is underway at e.g. Google, IBM, TU Delft and ETH Zürich. Besides using an optimized decoder [88], the crucial numbers which determine whether such an architecture will be 'below threshold' are the quality, leakage [80], time-duration and cross-talk of the two-qubit gate and the duration (and cross-talk) of the qubit measurement versus the dephasing and relaxation time of the qubits. Flux-tunable transmon qubits have recently achieved very good numbers for their two-qubit gates: reference [89] reports a CZ gate of 40 ns duration with 99.1% fidelity and low leakage of 0.1%, while Google's supremacy experiments [90] have shown the performance of ISWAP-like two-qubit gates on a 54-qubit Sycamore chip with an average error rate of 0.62% and duration O(10) ns. It is an open question how much further transmon performance numbers, including measurement duration, can be pushed beyond their current values. The use of different superconducting materials [91] can provide new opportunities to lengthen T₂ and T₁ times. Note however that an enhanced T₁ also leads to an enhanced duration of leakage. Frequency crowding and limits on highly-accurate frequency targeting, in particular for non-flux-tunable transmons, which lead to spurious cross-talk couplings, are further challenges in realizing the surface code.

We thus believe that there is plenty of room and, in fact, necessity for developing an alternative surface code architecture in which a data qubit, such as a Fock or GKP qubit, is encoded in a very harmonic mode of a high-Q (3D) cavity, while transmon qubits or their next-generation versions such as fluxonium or noise-biased cat qubits, are used as ancilla qubits. If a pump-activated CZ or controlled-displacement gate has a high on/off ratio, one expects that spurious couplings between the 3D cavity data modes, due to common coupling to the ancilla qubits, would be well suppressed.

Acknowledgments

We thank Alessandro Ciani, David DiVincenzo, Ioan Pop and Daniel Weigand for useful feedback and discussions and some help with the figures and the numerics. We acknowledge support from the European Research Council (EQEC, ERC Consolidator Grant No. 682726). CV and BMT acknowledge support from a QuantERA grant for the QCDA consortium. This research was supported in part by Perimeter Institute for Theoretical Physics. Research at Perimeter Institute is supported by the Government of Canada through Industry Canada and by the Province of Ontario through the Ministry of Economic Development & Innovation.

Appendix A.: Fock state representation of GKP grid states

In this appendix we examine the Fock coefficients of approximate GKP states, using the $\mathfrak{D}$ -approximation, and the sensor grid state [44]. We study the asymptotic behavior of these Fock coefficients, showing that the photon number distribution trends along a geometric or thermal distribution. It turns out that these Fock coefficients relate to some interesting nontrivial mathematics.

The theta function with rational parameters $\left(a,b\right)\in {\mathbb{Q}}^{2}$ , adopting the notations of [46, 92], is given by

$\begin{equation}\vartheta \left[\begin{matrix}\hfill a\hfill \\ \hfill b\hfill \end{matrix}\right]\left(z,\tau \right)=\sum _{k\in \mathbb{Z}}\mathrm{exp}\left(\pi \mathrm{i}\tau {\left(k+a\right)}^{2}+2\pi \mathrm{i}\left(k+a\right)\left(z+b\right)\right),\end{equation} \tag{ A.1 }$

where $\left(z,\tau \right)\in {\mathbb{C}}^{2}$ and Im(τ) > 0, which ensures absolute convergence of the series. Some common shorthands are the following:

$\begin{equation}\vartheta \left[\begin{matrix}\hfill 0\hfill \\ \hfill 0\hfill \end{matrix}\right]\left(0,\mathrm{i}x\right)={\theta }_{3}\left(x\right)=\sum _{k\in \mathbb{Z}}\mathrm{exp}\left(-\pi x{k}^{2}\right),\end{equation} \tag{ A.2 }$

$\begin{equation}\vartheta \left[\begin{matrix}\hfill \frac{1}{2}\hfill \\ \hfill 0\hfill \end{matrix}\right]\left(0,\mathrm{i}x\right)={\theta }_{2}\left(x\right)=\sum _{k\in \mathbb{Z}}\mathrm{exp}\left(-\pi x{\left(k+1/2\right)}^{2}\right),\end{equation} \tag{ A.3 }$

$\begin{equation}\vartheta \left[\begin{matrix}\hfill 0\hfill \\ \hfill \frac{1}{2}\hfill \end{matrix}\right]\left(0,\mathrm{i}x\right)={\theta }_{4}\left(x\right)=\sum _{k\in \mathbb{Z}}\mathrm{exp}\left(-\pi x{k}^{2}+\pi \mathrm{i}k\right).\end{equation} \tag{ A.4 }$

The multidimensional generalization with rational vectors $\left(\vec{a},\vec{b}\right)\in {\left({\mathbb{Q}}^{m}\right)}^{2}$ is given by

$\begin{equation}\vartheta \left[\begin{matrix}\hfill \vec{a}\hfill \\ \hfill \vec{b}\hfill \end{matrix}\right]\left(\vec{z},{\Omega}\right)=\sum _{\vec{k}\in {\mathbb{Z}}^{m}}\mathrm{exp}\left[\pi \mathrm{i}{\left(\vec{k}+\vec{a}\right)}^{\text{T}}{\Omega}\left(\vec{k}+\vec{a}\right)+2\pi \mathrm{i}{\left(\vec{k}+\vec{a}\right)}^{\text{T}}\left(\vec{z}+\vec{b}\right)\right],\end{equation} \tag{ A.5 }$

where $\vec{z}\in {\mathbb{C}}^{m}$ is a complex vector, ${\Omega}\in {\mathbb{C}}^{m{\times}m}$ is a complex matrix and $\text{Im}\left({\Omega}\right)$ is positive definite which ensures the absolute convergence of the series.

First, it is important to note that one can properly normalize the approximate GKP state |j_approx⟩, using the $\mathfrak{D}$ -approximation defined in equation (15), for the two logical states j = 0, 1:

$\begin{equation}\vert {j}_{\text{approx}}\rangle =\frac{1}{\sqrt{N\left({\Delta},j\right)}} \mathfrak{D}\vert {\overline{j}}_{\text{ideal}}\rangle \end{equation} \tag{ A.6 }$

$\begin{equation}=\frac{1}{\sqrt{N\left({\Delta},j\right)}} \mathrm{exp}\left(-{{\Delta}}^{2}\hat{n}\right)\sum _{k\in \mathbb{Z}}\vert q=\left(2k+j\right)\sqrt{\pi }\rangle \end{equation} \tag{ A.7 }$

with

$\begin{equation}N\left({\Delta},j\right)=\frac{1}{\sqrt{\pi \left(1-{u}^{2}\right)}} \vartheta \left[\begin{matrix}\frac{j}{2}\vec{1}\\ \vec{0}\end{matrix}\right]\left(\vec{0},{\Omega}\right),\end{equation} \tag{ A.8 }$

where

$\begin{equation}u=\mathrm{exp}\left(-2{{\Delta}}^{2}\right),\end{equation} \tag{ A.9 }$

$\begin{equation}{\Omega}=\frac{2\mathrm{i}}{1-{u}^{2}}\begin{pmatrix}1+{u}^{2} & -2u\\ -2u & 1+{u}^{2}\end{pmatrix},\end{equation} \tag{ A.10 }$

and $\vec{0}$ and $\vec{1}$ are the all-zeros and all-ones vectors respectively. Note that there is a bijection between Δ ∈ (0, ∞) and u ∈ (0, 1) so we write either N(Δ, j) or N(u, j) depending on convenience.

In order to obtain this expression one can use the position representation of Fock states in terms of Hermite functions Ψ_n(q) (or Hermite polynomials H_n(q)),

$\begin{equation}\langle n\vert q\rangle ={{\Psi}}_{n}\left(q\right)=\frac{1}{\sqrt{{2}^{n}\sqrt{\pi }n!}} \mathrm{exp}\left(-\frac{{q}^{2}}{2}\right){H}_{n}\left(q\right),\end{equation} \tag{ A.11 }$

and Mehler's formula for the Hermite functions:

$\begin{equation}\sum _{n\in \mathbb{N}}{u}^{n}{{\Psi}}_{n}\left(x\right){{\Psi}}_{n}\left(y\right)=\frac{1}{\sqrt{\pi \left(1-{u}^{2}\right)}} \mathrm{exp}\left(-\frac{\left(1+{u}^{2}\right)\left({x}^{2}+{y}^{2}\right)-4uxy}{2\left(1-{u}^{2}\right)}\right).\end{equation} \tag{ A.12 }$
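Mehler's formula can be checked numerically by comparing a truncated sum with the closed-form kernel. In the sketch below the kernel is written with an overall minus sign in the exponent, so that u = 0 reproduces Ψ₀(x)Ψ₀(y); the Hermite functions are generated with the standard three-term recursion, and the truncation at n_max = 200 is an arbitrary choice that is ample for the value of u used.

```python
import math

def hermite_functions(q, n_max):
    """Return [Psi_0(q), ..., Psi_{n_max}(q)] via the three-term recursion."""
    psi = [math.pi ** -0.25 * math.exp(-q * q / 2.0)]
    if n_max >= 1:
        psi.append(math.sqrt(2.0) * q * psi[0])
    for n in range(2, n_max + 1):
        psi.append(q * math.sqrt(2.0 / n) * psi[n - 1]
                   - math.sqrt((n - 1) / n) * psi[n - 2])
    return psi

def mehler_sum(u, x, y, n_max=200):
    """Truncated left-hand side: sum_n u^n Psi_n(x) Psi_n(y)."""
    px, py = hermite_functions(x, n_max), hermite_functions(y, n_max)
    return sum(u ** n * px[n] * py[n] for n in range(n_max + 1))

def mehler_kernel(u, x, y):
    """Closed-form right-hand side with a negative exponent."""
    exponent = -((1 + u * u) * (x * x + y * y) - 4 * u * x * y) / (2 * (1 - u * u))
    return math.exp(exponent) / math.sqrt(math.pi * (1 - u * u))

u, x, y = 0.5, 0.7, -0.3
print(mehler_sum(u, x, y), mehler_kernel(u, x, y))
```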

It is possible to rewrite equation (A.8) in the form given in reference [46]

$\begin{equation}N\left(u,j\right)=\frac{1}{2\sqrt{\pi }\left(1+u\right)}\left(\vartheta \left[\begin{matrix}\hfill \frac{j}{2}\hfill \\ \hfill 0\hfill \end{matrix}\right]\left(0,\mathrm{i}8{\sigma }^{2}\right) \vartheta \left[\begin{matrix}\hfill 0\hfill \\ \hfill 0\hfill \end{matrix}\right]\left(0,\mathrm{i}{\sigma }^{2}/2\right)+\vartheta \left[\begin{matrix}\hfill \frac{j+1}{2}\hfill \\ 0\end{matrix}\right]\left(0,\mathrm{i}8{\sigma }^{2}\right) \vartheta \left[\begin{matrix}\hfill 0\hfill \\ \hfill \frac{1}{2}\hfill \end{matrix}\right]\left(0,\mathrm{i}{\sigma }^{2}/2\right)\right).\end{equation} \tag{ A.13 }$

This expression can be recovered using

$\begin{equation}{\Omega}=\mathrm{i}\begin{pmatrix}1 & 1\\ 1 & -1\end{pmatrix}\begin{pmatrix}2{\sigma }^{2} & 0\\ 0 & \frac{1}{2{\sigma }^{2}}\end{pmatrix}\begin{pmatrix}1 & 1\\ 1 & -1\end{pmatrix}\end{equation} \tag{ A.14 }$

$\begin{equation}2{\sigma }^{2}=\mathrm{tanh}\left({{\Delta}}^{2}\right)=\frac{1-u}{1+u},\end{equation} \tag{ A.15 }$

and the modular transformation

$\begin{equation}\vartheta \left[\begin{matrix}\hfill \frac{j}{2}\hfill \\ \hfill 0\hfill \end{matrix}\right]\left(0,\mathrm{i}/x\right)=\sqrt{x} \vartheta \left[\begin{matrix}\hfill 0\hfill \\ \hfill \frac{j}{2}\hfill \end{matrix}\right]\left(0,\mathrm{i}x\right).\end{equation} \tag{ A.16 }$
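For j = 0 and j = 1 the modular transformation (A.16) reads θ₃(1/x) = √x θ₃(x) and θ₂(1/x) = √x θ₄(x) in the shorthands (A.2)–(A.4). A minimal numerical check, with truncated sums and an arbitrary cutoff k_max, could look as follows.

```python
import math

def theta3(x, k_max=60):
    """theta_3(x) = sum_k exp(-pi x k^2), equation (A.2)."""
    return sum(math.exp(-math.pi * x * k * k) for k in range(-k_max, k_max + 1))

def theta2(x, k_max=60):
    """theta_2(x) = sum_k exp(-pi x (k + 1/2)^2), equation (A.3)."""
    return sum(math.exp(-math.pi * x * (k + 0.5) ** 2) for k in range(-k_max, k_max + 1))

def theta4(x, k_max=60):
    """theta_4(x) = sum_k (-1)^k exp(-pi x k^2), equation (A.4)."""
    return sum((1 - 2 * (k % 2)) * math.exp(-math.pi * x * k * k)
               for k in range(-k_max, k_max + 1))

x = 0.3
print(theta3(1 / x), math.sqrt(x) * theta3(x))  # j = 0 case of (A.16)
print(theta2(1 / x), math.sqrt(x) * theta4(x))  # j = 1 case of (A.16)
```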

We can turn to the Fock coefficients

$\begin{equation}{c}_{n}\left(j\right)=\langle n\vert {j}_{\text{approx}}\rangle =\frac{1}{\sqrt{N\left({\Delta},j\right)}} {\mathrm{e}}^{-{{\Delta}}^{2}n}\sum _{k\in \mathbb{Z}}{{\Psi}}_{n}\left(\left(2k+j\right)\sqrt{\pi }\right).\end{equation} \tag{ A.17 }$

Since the Hermite function Ψ_n(q) has the same parity as n, and the grid points $\left(2k+j\right)\sqrt{\pi }$ are symmetric under q → −q, the coefficients c_n(j) vanish for odd n. For the even coefficients we can use that for q ⩾ 0, see [93],

$\begin{equation}{{\Psi}}_{2n}\left(q\right)=\sqrt{\frac{{4}^{n}}{\sqrt{\pi }\left(2n\right)!}} \frac{{\mathrm{d}}^{n}}{\mathrm{d}{z}^{n}}\left(\frac{1}{\sqrt{1+z}} \mathrm{exp}\left(-\frac{{q}^{2}}{2}\cdot \frac{1-z}{1+z}\right)\right){\left\vert \right.}_{z=0},\end{equation} \tag{ A.18 }$

with $q=\left(2k+j\right)\sqrt{\pi }$ . This implies that we have

$\begin{equation}{c}_{2n}\left(j\right)=\sqrt{\frac{{4}^{n}}{N\left(u,j\right)\sqrt{\pi }\left(2n\right)!}} {u}^{n}\frac{{\mathrm{d}}^{n}}{\mathrm{d}{z}^{n}}\left[\frac{1}{\sqrt{1+z}}{\theta }_{3-j}\left(2\frac{1-z}{1+z}\right)\right]{\left\vert \right.}_{z=0}.\end{equation} \tag{ A.19 }$

This gives a somewhat concise expression although it is not directly useful. In order to numerically evaluate the coefficients for example, equation (A.17) is more convenient as the Hermite functions, Ψ_n(q), have support essentially within $\left[-2\sqrt{n}, 2\sqrt{n}\right]$ and are easily computed recursively using

$\begin{equation}{{\Psi}}_{0}\left(q\right)={\pi }^{-1/4} {\mathrm{e}}^{-\frac{{q}^{2}}{2}},\end{equation} \tag{ A.20 }$

$\begin{equation}{{\Psi}}_{n}\left(q\right)=q\sqrt{\frac{2}{n}}{{\Psi}}_{n-1}\left(q\right)-\sqrt{\frac{n-1}{n}}{{\Psi}}_{n-2}\left(q\right).\end{equation} \tag{ A.21 }$

This is used together with equation (A.13) to plot the coefficients in figure A1.
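A minimal sketch of such a numerical evaluation is given below. It builds the Hermite functions with the recursion (A.20)–(A.21), sums them over a symmetric, truncated set of grid points as in equation (A.17), and applies the damping e^{−Δ²n}. As a simplifying assumption, the coefficients are normalized numerically over the truncated Fock range instead of using the theta-function expression (A.13); the cutoffs n_max and k_max are illustrative.

```python
import math

def hermite_functions(q, n_max):
    """Return [Psi_0(q), ..., Psi_{n_max}(q)] using (A.20) and (A.21)."""
    psi = [math.pi ** -0.25 * math.exp(-q * q / 2.0)]
    if n_max >= 1:
        psi.append(math.sqrt(2.0) * q * psi[0])
    for n in range(2, n_max + 1):
        psi.append(q * math.sqrt(2.0 / n) * psi[n - 1]
                   - math.sqrt((n - 1) / n) * psi[n - 2])
    return psi

def gkp_fock_coefficients(delta, j, n_max=120, k_max=8):
    """Fock coefficients c_n(j), n = 0..n_max, normalized numerically."""
    sums = [0.0] * (n_max + 1)
    # Symmetric set of grid points q = (2k + j) sqrt(pi) for j = 0 or 1.
    for k in range(-k_max - j, k_max + 1):
        psi = hermite_functions((2 * k + j) * math.sqrt(math.pi), n_max)
        for n in range(n_max + 1):
            sums[n] += psi[n]
    c = [math.exp(-delta ** 2 * n) * s for n, s in enumerate(sums)]
    norm = math.sqrt(sum(x * x for x in c))
    return [x / norm for x in c]

c0 = gkp_fock_coefficients(0.4, 0)
print([round(x * x, 4) for x in c0[:10]])  # photon-number probabilities, n = 0..9
```

The grid cutoff only needs to exceed the classical turning point of the highest Fock state kept, since Ψ_n(q) is exponentially small beyond it.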

The case of the Fock representation of the so-called sensor state [44] which is an approximate eigenstate of ${\mathrm{e}}^{\mathrm{i}\sqrt{2\pi }\hat{q}}$ and ${\mathrm{e}}^{\mathrm{i}\sqrt{2\pi }\hat{p}}$ is very similar. For the perfect sensor state |ψ_ideal⟩ the wavefunction in q is a sum over δ-functions at integer multiples of $\sqrt{2\pi }$ , and similarly the wavefunction in p is a sum over δ-functions at integer multiples of $\sqrt{2\pi }$ .

The approximate state is $\vert {\psi }_{\text{approx}}\rangle =\frac{1}{\sqrt{N\left({\Delta}\right)}}\mathfrak{D}\vert {\psi }_{\text{ideal}}\rangle$ with normalization

$\begin{equation}N\left({\Delta}\right)=\frac{1}{\sqrt{\pi \left(1-{u}^{2}\right)}} \vartheta \left[\begin{matrix}\hfill \vec{0}\hfill \\ \hfill \vec{0}\hfill \end{matrix}\right]\left(\vec{0},\frac{1}{2}{\Omega}\right),\end{equation} \tag{ A.22 }$

where u and Ω are the same as defined in equations (A.9) and (A.10). This can be obtained in a similar way and we can also write N(u) instead of N(Δ) when convenient. The sensor state |ψ_approx⟩ is an eigenstate of exp(iπa^† a/2), hence the photon number is 0 mod 4 for this state [44]. We now have c_n = ⟨n|ψ_approx⟩ with $c_{2n+1}=0$ and

$\begin{equation}{c}_{2n}=\frac{1}{\sqrt{N\left({\Delta}\right)}} {\mathrm{e}}^{-2{{\Delta}}^{2}n}\sum _{k\in \mathbb{Z}}{{\Psi}}_{2n}\left(k\sqrt{2\pi }\right).\end{equation} \tag{ A.23 }$

Reference [93] derives an expression analogous to equation (A.19) for the sum on the right-hand side:

$\begin{align}\sum _{k\in \mathbb{Z}}{{\Psi}}_{2n}\left(k\sqrt{2\pi }\right)&={\theta }_{3}\left(1\right)\sqrt{\frac{{4}^{n}{{\Phi}}^{n}}{\sqrt{\pi }\left(2n\right)!}}\,\mathrm{d}\left(n/2\right),\quad n\ \text{even},\\ \sum _{k\in \mathbb{Z}}{{\Psi}}_{2n}\left(k\sqrt{2\pi }\right)&=0,\quad n\ \text{odd},\end{align} \tag{ A.24 }$

with ${\Phi}=\frac{{\Gamma}{\left(\frac{1}{4}\right)}^{8}}{128{\pi }^{4}}$ , with Γ() the Euler gamma function, and ${\left\{\mathrm{d}\left(n\right)\right\}}_{n=0}^{\infty }$ is a particular integer sequence studied in [93] which is directly related to the derivatives of θ₃(x) at x = 1 by

$\begin{equation}\frac{1}{\sqrt{1+z}}{\theta }_{3}\left(\frac{1-z}{1+z}\right)={\theta }_{3}\left(1\right)\sum _{n=0}^{\infty }\frac{\mathrm{d}\left(n\right)}{\left(2n\right)!}{{\Phi}}^{n}{z}^{2n}.\end{equation} \tag{ A.25 }$

We show the Fock coefficients for the sensor state in figure A2. Note that the sign of the integers d(n) is the same as the sign of c_4n through equations (A.23) and (A.24). We can write the sensor state $\vert {\psi }_{\text{ideal}}\rangle \propto {\sum }_{t,s\in \mathbb{Z}}{\mathrm{e}}^{-\mathrm{i}\sqrt{2\pi }\hat{p}t} {\mathrm{e}}^{\mathrm{i}\sqrt{2\pi }\hat{q}s}\vert 0\rangle$ where |0⟩ is the vacuum state. Using that ${\mathrm{e}}^{\mathrm{i}\sqrt{2\pi }\hat{q}}=D\left(\mathrm{i}\sqrt{\pi }\right)$ , ${\mathrm{e}}^{-\mathrm{i}\sqrt{2\pi }\hat{p}}=D\left(\sqrt{\pi }\right)$ and $D\left(\alpha \right)\vert \beta \rangle ={\mathrm{e}}^{\mathrm{i}\text{Im}\left(\alpha {\beta }^{{\ast}}\right)}\vert \alpha +\beta \rangle$ , $\langle n\vert \alpha \rangle ={\mathrm{e}}^{-\frac{\vert \alpha {\vert }^{2}}{2}}\frac{{\alpha }^{n}}{\sqrt{n!}}$ , we have

$\begin{equation}\mathrm{sign}\left(\mathrm{d}\left(n\right)\right)=\mathrm{sign}\left({c}_{4n}\right)=\mathrm{sign}\left(\sum _{t\in \mathbb{Z},s\in \mathbb{Z}:z=t+\mathrm{i}s}{\mathrm{e}}^{-\frac{1}{2}\pi \vert z{\vert }^{2}}{z}^{4n}{\left(-1\right)}^{ts}\right).\end{equation} \tag{ A.26 }$

**Figure A2.** Fock coefficients of the $\mathfrak{D}$ -approximate sensor state |ψ_approx⟩ for Δ ∈ {0.3, 0.4, 0.5}. Equations (A.23) and (A.22) are used to numerically compute the coefficients. The thermal distributions with average photon number $\overline{n}=\frac{u}{1-u}=\frac{{\mathrm{e}}^{-2{{\Delta}}^{2}}}{1-{\mathrm{e}}^{-2{{\Delta}}^{2}}}$ are shown for comparison.

We now want to derive the asymptotic behavior of the coefficients. This can be done by expressing the normalization of the states as an equality which has to hold for all Δ ∈ (0, ∞) or equivalently all u ∈ (0, 1). In the case of the sensor state, ${\sum }_{n\in \mathbb{N}}\vert {c}_{4n}{\vert }^{2}=1$ implies:

$\begin{equation}N\left({\Delta}\right)=\frac{{\theta }_{3}{\left(1\right)}^{2}}{\sqrt{\pi }}\sum _{n\in \mathbb{N}}\frac{{\left(2\sqrt{{\Phi}}u\right)}^{4n}}{\left(4n\right)!}{\mathrm{d}}^{2}\left(n\right),\end{equation} \tag{ A.27 }$

which is of the form:

$\begin{equation}N\left(u\right)=\frac{A}{\sqrt{\pi }}\sum _{n\in \mathbb{N}}{a}_{4n}{u}^{4n},\end{equation} \tag{ A.28 }$

where the constant A and the sequence a_4n, both independent of u, are defined by equation (A.27). Similarly for the approximate GKP states we have:

$\begin{equation}N\left(u,j\right)=\frac{1}{\sqrt{\pi }}\sum _{n\in \mathbb{N}}\frac{{\left(2u\right)}^{2n}}{\left(2n\right)!}{\left(\frac{{\mathrm{d}}^{n}}{\mathrm{d}{z}^{n}}\left[\frac{1}{\sqrt{1+z}}{\theta }_{3-j}\left(2\frac{1-z}{1+z}\right)\right]{\left\vert \right.}_{z=0}\right)}^{2},\end{equation} \tag{ A.29 }$

which is of the same form:

$\begin{equation}N\left(u,j\right)=\frac{B\left(j\right)}{\sqrt{\pi }}\sum _{n\in \mathbb{N}}{b}_{2n}\left(j\right){u}^{2n},\end{equation} \tag{ A.30 }$

for some other constant B(j) and sequence b_2n(j), also both independent of u, defined by equation (A.29).

Seen as complex functions of $u\in \mathbb{C}$ , N(u, j) and N(u) are both analytic and have a convergence radius of 1. Their behavior for u → 1 can be obtained (see also [46]) using equation (A.13) and the fact that

$\begin{equation}{\theta }_{3-j}\left(x\right) \underset{x\to 0}{\sim } \frac{1}{\sqrt{x}},\quad {\theta }_{4}\left(x\right) \underset{x\to 0}{\sim } \frac{2 {\mathrm{e}}^{-\pi /\left(4x\right)}}{\sqrt{x}}.\end{equation} \tag{ A.31 }$

This gives

$\begin{equation}\frac{A}{\sqrt{\pi }}\sum _{n\in \mathbb{N}}{a}_{4n}{u}^{4n}=N\left(u\right) \underset{u\to 1}{\sim } \frac{1}{2\sqrt{\pi }\left(1-u\right)},\end{equation} \tag{ A.32 }$

$\begin{equation}\frac{B\left(j\right)}{\sqrt{\pi }}\sum _{n\in \mathbb{N}}{b}_{2n}\left(j\right){u}^{2n}=N\left(u,j\right) \underset{u\to 1}{\sim } \frac{1}{2\sqrt{\pi }\left(1-u\right)}.\end{equation} \tag{ A.33 }$

We then apply a transfer theorem, see [94], to deduce the asymptotic behavior of the a_4n and b_2n(j) sequences. More precisely they both converge to some finite value

$\begin{equation}{a}_{4n} \underset{n\to \infty }{\sim } \frac{1}{2A},\quad {b}_{2n}\left(j\right) \underset{n\to \infty }{\sim } \frac{1}{2B\left(j\right)}.\end{equation} \tag{ A.34 }$

In turn this gives the asymptotic behavior of the Fock coefficients:

$\begin{equation}\vert \langle 4n\vert {\psi }_{\text{approx}}\rangle {\vert }^{2} \underset{n\to \infty }{\sim } \frac{{u}^{4n}}{2\sqrt{\pi }N\left(u\right)},\quad \vert \langle 2n\vert {j}_{\text{approx}}\rangle {\vert }^{2} \underset{n\to \infty }{\sim } \frac{{u}^{2n}}{2\sqrt{\pi }N\left(u,j\right)}.\end{equation} \tag{ A.35 }$

In both cases the coefficients are asymptotically equivalent to a geometric or thermal distribution which is usually parametrized by the average photon number $\overline{n}$ as follows

$\begin{equation}{p}_{n}^{\text{thermal}}=\frac{1}{\overline{n}+1}{\left(\frac{\overline{n}}{\overline{n}+1}\right)}^{n}.\end{equation} \tag{ A.36 }$

We can therefore deduce the average photon number of the equivalent asymptotic thermal distribution to the approximate GKP and sensor states using

$\begin{equation}u=\frac{\overline{n}}{\overline{n}+1} {\Rightarrow} \overline{n}=\frac{u}{1-u}=\frac{{\mathrm{e}}^{-2{{\Delta}}^{2}}}{1-{\mathrm{e}}^{-2{{\Delta}}^{2}}}.\end{equation} \tag{ A.37 }$

These thermal distributions for the different Δ's considered are also shown in figures A1 and A2. One can see a good agreement of the general trend although oscillations of the order of the probabilities themselves persist. Note also that for small Δ, $\overline{n}\sim \frac{1}{2{{\Delta}}^{2}}$ , consistent with approximate expressions derived in other literature (see main text).
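The comparison distribution is straightforward to generate. The sketch below evaluates n̄ from equation (A.37), the thermal probabilities of equation (A.36), and the small-Δ estimate 1/(2Δ²) for the three values of Δ used in figure A2; the function names are illustrative.

```python
import math

def mean_photon_number(delta):
    """n_bar = u / (1 - u) with u = exp(-2 delta^2), i.e. 1 / (exp(2 delta^2) - 1)."""
    return 1.0 / math.expm1(2.0 * delta ** 2)

def thermal_distribution(delta, n_max):
    """Thermal probabilities p_n of equation (A.36) for n = 0..n_max."""
    n_bar = mean_photon_number(delta)
    ratio = n_bar / (n_bar + 1.0)  # equals u
    return [ratio ** n / (n_bar + 1.0) for n in range(n_max + 1)]

for delta in (0.3, 0.4, 0.5):
    print(f"delta = {delta}: n_bar = {mean_photon_number(delta):.3f}, "
          f"1/(2 delta^2) = {1.0 / (2.0 * delta ** 2):.3f}")
```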

Appendix B.: Decoders for repeated GKP error correction with finite squeezing

For computational efficiency in simulations (as well as for our formulation of a classical decoder) we use the approximation ${\mathfrak{F}}_{V}$ for the GKP ancillas. This form of the states can be viewed as applying a reverse Villain approximation to the approximate $\mathfrak{F}$ -states in equation (17). The reverse Villain approximation, which is tight for a ⩾ 4π², reads

$\begin{equation}\sum _{n\in \mathbb{Z}}{\mathrm{e}}^{-a{n}^{2}+bn}\approx {\mathrm{e}}^{-\frac{a}{2{\pi }^{2}}} {\mathrm{e}}^{\frac{{b}^{2}}{4a}+\frac{a}{2{\pi }^{2}}\mathrm{cos}\left(\frac{\pi }{a}b\right)}.\end{equation} \tag{ B.1 }$
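To get a feeling for equation (B.1), one can compare both sides numerically. The sketch below does this at a = 4π² for b = 0 and for a small offset b = 0.1a, i.e. close to a peak of the periodic structure; it makes no statement about the quality of the approximation between the peaks. The truncation n_max and the function names are illustrative choices.

```python
import math

def gaussian_sum(a, b, n_max=50):
    """Left-hand side of (B.1): sum_n exp(-a n^2 + b n), truncated."""
    return sum(math.exp(-a * n * n + b * n) for n in range(-n_max, n_max + 1))

def reverse_villain(a, b):
    """Right-hand side of (B.1)."""
    c = a / (2.0 * math.pi ** 2)
    return math.exp(-c) * math.exp(b * b / (4.0 * a) + c * math.cos(math.pi * b / a))

a = 4.0 * math.pi ** 2
for b in (0.0, 0.1 * a):
    lhs, rhs = gaussian_sum(a, b), reverse_villain(a, b)
    print(f"b/a = {b / a:.1f}: sum = {lhs:.6f}, approximation = {rhs:.6f}")
```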

Using equation (B.1) and $\frac{{{\Delta}}^{2}}{{{\Delta}}^{4}+1}\approx {{\Delta}}^{2}$ , we then have

$\begin{align}\hfill {\mathfrak{F}}_{V}\vert \overline{0}\rangle & ={\int }_{\mathbb{R}}\mathrm{d}q {\psi }^{0}\left(q\right)\vert q\rangle ,\quad {\psi }^{0}\left(q\right)\propto {\mathrm{e}}^{-\frac{{{\Delta}}^{2}}{2}{q}^{2}} {\mathrm{e}}^{\frac{1}{\pi {{\Delta}}^{2}}\mathrm{cos}\left(\sqrt{\pi }q\right)},\hfill \\ \hfill {\mathfrak{F}}_{V}\vert \overline{+}\rangle & ={\int }_{\mathbb{R}}\mathrm{d}q {\psi }^{+}\left(q\right)\vert q\rangle ,\quad {\psi }^{+}\left(q\right)\propto {\mathrm{e}}^{-\frac{{{\Delta}}^{2}}{2}{q}^{2}} {\mathrm{e}}^{\frac{1}{4\pi {{\Delta}}^{2}}\mathrm{cos}\left(2\sqrt{\pi }q\right)}.\hfill \end{align} \tag{ B.2 }$

Using these ancillas, the Green's function for M rounds of error correction (without active feedback between rounds), with outcomes denoted as M-dimensional vectors $\vec{\mathfrak{p}}$ and $\vec{\mathfrak{q}}$ , can be written as

$\begin{equation}{G}^{M}\left({q}_{\text{out}}{\leftarrow}{q}_{\text{in}}\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)={\int }_{{\mathbb{R}}^{M+1}}\mathrm{d}\vec{q}\, \mathrm{exp}\left(-S\left[\vec{q}\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]\right)\delta \left({q}_{\text{out}}-{q}_{M}\right)\delta \left({q}_{\text{in}}-{q}_{0}\right),\end{equation} \tag{ B.3 }$

with one-dimensional 'action' $S\left[\vec{q}\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]$ :

$\begin{align}S\left[\vec{q}\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]& =\sum _{m=1}^{M}\left[\frac{{{\Delta}}^{2}}{2}{\left({q}_{m}-{q}_{m-1}\right)}^{2}-\frac{1}{{{\Delta}}^{2}\pi }\mathrm{cos}\left(\sqrt{\pi }\left({q}_{m}-{q}_{m-1}\right)\right)\right]\\ & \quad +\sum _{m=1}^{M}\left[\frac{{{\Delta}}^{2}}{2}{\left({\mathfrak{q}}_{m}-{q}_{m}\right)}^{2}-\frac{1}{4\pi {{\Delta}}^{2}}\mathrm{cos}\left(2\sqrt{\pi }\left({\mathfrak{q}}_{m}-{q}_{m}\right)\right)\right]\\ & \quad +\mathrm{i}\sum _{m=1}^{M}{\mathfrak{p}}_{m}\left({q}_{m}-{q}_{m-1}\right).\end{align} \tag{ B.4 }$

We can readily interpret the terms in the last equation which depend on the increments q_m − q_{m−1} as a kinetic energy term T, generating dynamics in the position variable, while the terms which depend on ${\mathfrak{q}}_{m}-{q}_{m}$ form a potential energy term −U, pinning the position variable to the measured values ${\mathfrak{q}}_{m}$ . We note that in the potential energy, the $\mathrm{cos} 2\sqrt{\pi }\left({\mathfrak{q}}_{m}-{q}_{m}\right)$ term is dominant when Δ is small and hence we can omit the wide parabolic potential proportional to Δ². Similarly, for the kinetic energy, when Δ is small, expanding $\mathrm{cos} \sqrt{\pi }\left({q}_{m}-{q}_{m-1}\right)$ gives heavy-mass terms with mass $\sim \frac{1}{{{\Delta}}^{2}}$ and the light-mass quadratic term ∼Δ² contributes little. We see that, aside from the imaginary term, the action in this approximation is the same as in the Hamiltonian developed for stochastic noise in the reverse Villain approximation in reference [48], identifying ${\sigma }_{0}^{2}={{\Delta}}^{2}$ with σ₀ the standard deviation in the Gaussian displacement model in equation (24). Note however that here the dynamics occurs at the level of wave functions, whereas the description in reference [48] took place at the level of the probabilities (for shift errors). The interesting difference lies in the pure imaginary term making the action S complex. If the path integral is approximated by taking a single 'classical' path which minimizes Re(S), then this phase factor only gives an additional phase to this path. However, the pure imaginary term contributes a phase to each path, so that the total sum over paths q_in → q_out can be different, due to interference, from the case in which these phases are absent. Note that this term comes from the Z-error correction step with outcome ${\mathfrak{p}}_{m}$ , which puts a feedback error on the data oscillator.
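To make the structure of the action concrete, the following minimal sketch evaluates equation (B.4) for a given discrete path, returning the kinetic and potential contributions as the real part and the outcome-dependent phase as the imaginary part. The function name `action` and its argument layout are assumptions made for this illustration.

```python
import numpy as np

SQRT_PI = np.sqrt(np.pi)

def action(q, q_meas, p_meas, delta):
    """Complex action S of (B.4) for a path q = (q_0, ..., q_M).

    q_meas and p_meas hold the M measurement outcomes of rounds 1..M.
    """
    q = np.asarray(q, dtype=float)
    dq = np.diff(q)                                   # q_m - q_{m-1}
    r = np.asarray(q_meas, dtype=float) - q[1:]       # measured value minus q_m
    kinetic = np.sum(0.5 * delta**2 * dq**2
                     - np.cos(SQRT_PI * dq) / (np.pi * delta**2))
    potential = np.sum(0.5 * delta**2 * r**2
                       - np.cos(2 * SQRT_PI * r) / (4 * np.pi * delta**2))
    phase = np.sum(np.asarray(p_meas, dtype=float) * dq)
    return kinetic + potential + 1j * phase

# A path that stays at the origin with all q-outcomes equal to zero.
S = action(q=np.zeros(4), q_meas=np.zeros(3), p_meas=[0.3, -0.1, 0.2], delta=0.3)
print(S)
```

For a constant path the increments vanish, so the outcomes $\mathfrak{p}_m$ contribute no phase and S is real.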

We formulate a decoder which provides an approximate, classical tracking of the dynamics of the wave function of the encoded state in q and p. This can be viewed, to some extent, as selecting a single optimal classical path of a quantum path integral. We believe that similar ideas could be applied to simplified tracking of the Wigner function for the purpose of decoding: this may be of interest when we want to study the effects of a fuller noise model, which includes, say, photon loss on the cavity mode during ancilla GKP state preparation and photon loss on the ancilla prior to measurement.

The purpose of a classical approximation¹² to this path integral is to determine the outgoing wave function without calculating the entire evolution, i.e. without executing the (M − 1)-dimensional integral. In this classical approximation we would have

$\begin{equation}{\psi }_{\text{out}}\left({q}_{\text{out}}\right)\approx \int \mathrm{d}{q}_{\text{in}} \mathrm{exp}\left(-S\left[{\vec{q}}_{\text{opt}}\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]\right){\psi }_{\text{in}}\left({q}_{\text{in}}\right).\end{equation} \tag{ B.5 }$

We imagine doing a final measurement of $\hat{q}$ on the outgoing wave function ψ_out and will be interested in evaluating equation (18). For the classical approximation we have

$\begin{equation}\mathbb{P}\left(\overline{Z}={\left(-1\right)}^{b}\right)={\int }_{{I}_{b}}\mathrm{d}q\vert {\psi }_{\text{out}}\left(q\right){\vert }^{2}={\int }_{{I}_{b}}\mathrm{d}q\int \mathrm{d}{q}_{\text{in}}\int \mathrm{d}{q}_{\text{in}}^{\prime } {\mathrm{e}}^{-S\left[{\vec{q}}_{\text{opt}}\left(q{\leftarrow}{q}_{\text{in}}\right)\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]-{S}^{{\ast}}\left[{\vec{q}}_{\text{opt}}\left(q{\leftarrow}{q}_{\text{in}}^{\prime }\right)\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]}{\psi }_{\text{in}}\left({q}_{\text{in}}\right){\psi }_{\text{in}}^{{\ast}}\left({q}_{\text{in}}^{\prime }\right),\end{equation} \tag{ B.6 }$

where we have explicitly indicated how the optimal path depends on the initial position q_in and the final position q. We note that the action in equation (B.4) has the property ${S}^{{\ast}}\left[{q}_{0},\dots , {q}_{M}\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]=S\left[{q}_{M},\dots , {q}_{0}\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]$ , i.e. it is the action of the time-reversed path. Thus when q'_in = q_in, we have ${\mathrm{e}}^{-S\left[{\vec{q}}_{\text{opt}}\left(q{\leftarrow}{q}_{\text{in}}\right)\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]-{S}^{{\ast}}\left[{\vec{q}}_{\text{opt}}\left(q{\leftarrow}{q}_{\text{in}}\right)\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]}={\mathrm{e}}^{-2\text{Re}\left(S\left[{\vec{q}}_{\text{opt}}\left(q{\leftarrow}{q}_{\text{in}}\right)\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]\right)}$ . In words: when we take a path along a closed loop from q_in → q → q_in, there is no phase accumulation. However, in equation (B.6) there are certainly contributions with q_in ≠ q'_in. For the purpose of developing an efficient classical decoder, we apply a stochastic approximation to equation (B.6), keeping only the diagonal terms, i.e.

$\begin{equation}{\mathbb{P}}_{\text{class}}\left(b\vert {\psi }_{\text{in}},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)=N{\int }_{{I}_{b}}\mathrm{d}q\int \mathrm{d}{q}_{\text{in}} {\mathrm{e}}^{-2\text{Re}\left(S\left[{\vec{q}}_{\text{opt}}\left(q{\leftarrow}{q}_{\text{in}}\right)\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]\right)}\vert {\psi }_{\text{in}}\left({q}_{\text{in}}\right){\vert }^{2},\end{equation} \tag{ B.7 }$

where N is a normalization to make ${\mathbb{P}}_{\text{class}}\left(\cdot \right)$ a probability. Since this normalization does not play a role in the use of this expression in decoding, we do not need to determine it. To evaluate equation (B.7), one can generate q uniformly at random in the interval I_b and generate q_in according to |ψ_in|². Given q and q_in, if we can evaluate the classical path ${\vec{q}}_{\text{opt}}$ between these points and hence compute the weight ${\mathrm{e}}^{-2\text{Re}\left(S\left[{\vec{q}}_{\text{opt}}\left(q{\leftarrow}{q}_{\text{in}}\right)\vert \vec{\mathfrak{q}},\vec{\mathfrak{p}}\right]\right)}$ corresponding to this path, then we can stochastically estimate equation (B.7). However, as was observed in reference [48], this classical-path approach is not computationally simple, as the dynamics of the q-variable can be chaotic because it takes place in a random potential induced by the measurement outcomes $\vec{\mathfrak{q}},\vec{\mathfrak{p}}$ . Hence, instead of sampling q_in and the endpoint q, we sample q_in from |ψ_in(q_in)|² and then apply a forward minimization technique, similar to that in reference [48], to the function Re(S[·]) to find an approximately optimal path ${\vec{q}}_{\text{opt}}$ given q_in, leading to a final value for q_out. We then determine whether q_out lies in I_b and repeat to gather statistics. In fact, we observe numerically that this strategy shows the same performance as fixing an initial ${q}_{\text{in}}=\underset{q}{\mathrm{a}\mathrm{r}\mathrm{g}\mathrm{m}\mathrm{a}\mathrm{x}}\vert {\psi }_{\text{in}}\left(q\right){\vert }^{2}$ to which we apply the forward minimization. The probability to land in the I_b interval is denoted as ${\mathbb{P}}_{\text{forward}}\left(b\vert {\psi }_{\text{in}},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)$ .
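The forward strategy described above can be sketched as follows. This is only an illustration under explicit assumptions: the minimization is done greedily, round by round, over a finite grid of candidate positions (a stand-in chosen here, not necessarily the procedure of reference [48]), and the set I_b is taken to be the points whose nearest multiple of $\sqrt{\pi}$ is even (b = 0) or odd (b = 1). Since Re(S) in equation (B.4) does not depend on $\vec{\mathfrak{p}}$, only the q-outcomes enter. All function and parameter names are hypothetical.

```python
import numpy as np

SQRT_PI = np.sqrt(np.pi)

def step_cost(q_new, q_prev, q_meas_m, delta):
    """Contribution of a single round to Re(S) in (B.4)."""
    dq = q_new - q_prev
    r = q_meas_m - q_new
    kinetic = 0.5 * delta**2 * dq**2 - np.cos(SQRT_PI * dq) / (np.pi * delta**2)
    potential = 0.5 * delta**2 * r**2 - np.cos(2 * SQRT_PI * r) / (4 * np.pi * delta**2)
    return kinetic + potential

def forward_path(q_in, q_meas, delta, half_width=3 * SQRT_PI, n_grid=2001):
    """Greedy forward minimization of Re(S), one round at a time (assumed scheme)."""
    offsets = np.linspace(-half_width, half_width, n_grid)
    path = [float(q_in)]
    for q_meas_m in q_meas:
        candidates = path[-1] + offsets
        costs = step_cost(candidates, path[-1], q_meas_m, delta)
        path.append(float(candidates[np.argmin(costs)]))
    return np.array(path)

def logical_bit(q_out):
    """Assumed form of I_b: parity of the nearest multiple of sqrt(pi)."""
    return int(np.rint(q_out / SQRT_PI)) % 2

path = forward_path(q_in=0.0, q_meas=[0.1, -0.2, 0.05], delta=0.3)
print(path, logical_bit(path[-1]))
```

Repeating this for many samples of q_in and counting how often `logical_bit` returns b gives a sample estimate of the forward probability.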

To adapt this forward decoder to the corrective displacement in each round as shown in figure 7, the action in equation (B.4) is modified by substituting ${q}_{m}\to {q}_{m}+{\mathfrak{q}}_{m}$ for each round (and the addition of a term $\mathrm{i}{q}_{m}{\mathfrak{p}}_{m}$ which is irrelevant for this decoder).

The memoryless decoder presented in figure 10 in the main text is implemented by decomposing each measurement outcome ${\mathfrak{q}}_{m}={\mathfrak{l}}_{m}^{q}\sqrt{\pi }+{\mathfrak{n}}_{m}^{q}2\sqrt{\pi }+{\mathfrak{e}}_{m}^{q}$ and ${\mathfrak{p}}_{m}={\mathfrak{l}}_{m}^{p}\sqrt{\pi }+{\mathfrak{n}}_{m}^{p}2\sqrt{\pi }+{\mathfrak{e}}_{m}^{p}$ with ${\mathfrak{l}}_{m}^{q/p}\in {\mathbb{Z}}_{2}$ (yes/no logical shift), ${\mathfrak{n}}_{m}^{q/p}\in \mathbb{Z}$ (number of stabilizer shifts), $\vert {\mathfrak{e}}_{m}^{q/p}\vert {< }\frac{\sqrt{\pi }}{2}$ (minimal shift error) and applying a corrective shift of ${\alpha }_{m}=\frac{-{\delta }_{q,m}+\mathrm{i}{\delta }_{p,m}}{\sqrt{2}}$ with ${\delta }_{q/p,m}={\mathfrak{n}}_{m}^{q/p}2\sqrt{\pi }+{\mathfrak{e}}_{m}^{q/p}$ after the QEC round. Note that this is not the same correction as in figure 7. This correction is motivated by a stochastic-shift error model and ensures that we keep the photon number low by applying an appropriate number of stabilizer displacements while correcting the perceived error.
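The decomposition used by the memoryless decoder can be written out as a short sketch. The helper names `decompose` and `corrective_shift` are introduced here for illustration, and the treatment of outcomes lying exactly halfway between two multiples of $\sqrt{\pi}$ (left to the rounding rule of `numpy.rint`) is an assumption.

```python
import numpy as np

SQRT_PI = np.sqrt(np.pi)

def decompose(outcome):
    """Split an outcome into (l, n, e) with outcome = l*sqrt(pi) + n*2*sqrt(pi) + e."""
    k = int(np.rint(outcome / SQRT_PI))   # nearest multiple of sqrt(pi)
    e = outcome - k * SQRT_PI             # minimal shift error, |e| <= sqrt(pi)/2
    l = k % 2                             # logical-shift bit in Z_2
    n = (k - l) // 2                      # number of stabilizer shifts
    return l, n, e

def corrective_shift(q_meas_m, p_meas_m):
    """Displacement alpha_m = (-delta_q + i delta_p) / sqrt(2) for one round."""
    _, n_q, e_q = decompose(q_meas_m)
    _, n_p, e_p = decompose(p_meas_m)
    delta_q = n_q * 2 * SQRT_PI + e_q
    delta_p = n_p * 2 * SQRT_PI + e_p
    return (-delta_q + 1j * delta_p) / np.sqrt(2)

print(decompose(3 * SQRT_PI + 0.1))
print(corrective_shift(0.1, -0.2))
```

Note that the logical part l is deliberately left out of the corrective shift, in line with the definition of δ above.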

Let us now further discuss how we test and evaluate our decoders.

Decoding of repeated GKP error correction. The experimentalist implementing the repeated rounds of error correction learns the value of $\vec{\mathfrak{p}},\vec{\mathfrak{q}}$ and the outcome of the final perfect homodyne measurement of $\hat{q}$ , but in general she should make decoding decisions without knowing the input state¹³. However, she can play the game in which she assumes that the input wave function is ${\psi }_{\text{in}}^{0}\left(q\right)$ (or, alternatively, ${\psi }_{\text{in}}^{1}\left(q\right)$ ) and then determine whether the dynamics, given her measurement data $\vec{\mathfrak{q}},\vec{\mathfrak{p}}$ , would lead to a read-out of $\overline{Z}=1$ or $\overline{Z}=-1$ . Naturally this may not be a deterministic process, so she can calculate the probabilities $\mathbb{P}\left(0\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)$ ( $\overline{Z}=1$ ) and $\mathbb{P}\left(1\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)$ ( $\overline{Z}=-1$ ). This defines a maximum-likelihood full-density matrix decoder. When using this decoder, the experimentalist evaluates the (normalized) Green's function in equation (B.3), given her measurement data. She then flips the final experimentally measured logical outcome bit (modeled as a perfect homodyne measurement of $\hat{q}$ ) whenever $\mathbb{P}\left(1\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right){ >}\mathbb{P}\left(0\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)$ . As the logical error rate estimate for this decoder we take

$\begin{equation}{\overline{P}}_{\text{MLD}}=\int \mathrm{d}\vec{\mathfrak{q}}\int \mathrm{d}\vec{\mathfrak{p}}\, \mathbb{P}\left(\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)\mathrm{min}\left(\mathbb{P}\left(0\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right),\mathbb{P}\left(1\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)\right).\end{equation} \tag{ B.8 }$

This estimate assumes that the error rate is (roughly) the same when the input wavefunction is ${\psi }_{\text{in}}^{1}\left(q\right)$ and that a superposition of such inputs behaves similarly, without logical interference.

For systems of many modes, this decoding method will not typically be efficient (even for a single mode it comes down to a sizable computational effort), hence the goal of decoding is to infer errors without tracking the entire wavefunction. As we argued before, it turns out to be most computationally efficient to use a forward strategy and estimate ${\mathbb{P}}_{\text{forward}}\left(0\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)$ . With this probability in hand, let $f\left(\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)=\underset{b}{\text{argmax}}\left({\mathbb{P}}_{\text{forward}}\left(b\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)\right)$ be the bit indicating whether to flip (f = 1) or not (f = 0), so that, as for the maximum-likelihood decoder, the outcome is flipped when the estimated probability of outcome 1 exceeds that of outcome 0. The logical error estimate that we consider is then

$\begin{equation}{\overline{P}}_{\text{forward}}=\int \mathrm{d}\vec{\mathfrak{q}}\int \mathrm{d}\vec{\mathfrak{p}}\, \mathbb{P}\left(\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)\left[{\delta }_{f\left(\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right),0}\mathbb{P}\left(1\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)+{\delta }_{f\left(\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right),1}\mathbb{P}\left(0\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)\right].\end{equation} \tag{ B.9 }$

Again, in using only this estimate, it is assumed (and not a priori given) that the evolution of the input state ${\psi }_{\text{in}}^{1}\left(q\right)$ or an arbitrary superposition will have the same logical error rate, and that assuming a different input state would not lead the decoder to different conclusions. The logical error probability for the passive decoder, which does not use any syndrome measurement data but declares that the output state is the same as the input, is defined as

$\begin{equation}{\overline{P}}_{\text{passive}}=\int \mathrm{d}\vec{\mathfrak{q}}\int \mathrm{d}\vec{\mathfrak{p}}\, \mathbb{P}\left(\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right)\mathbb{P}\left(1\vert {\psi }_{\text{in}}^{0},\vec{\mathfrak{q}},\vec{\mathfrak{p}}\right).\end{equation} \tag{ B.10 }$

Thus, in equation (B.10), the logical error is determined solely by the probability to get outcome 1 in the final q measurement. We have numerically estimated the logical error rates of these three decoders, given in equations (B.8)–(B.10), as a function of M for Δ = 0.3 and Δ = 0.4, see figure 10 in the main text. The logical error rates for adapted versions of these decoders including corrective displacements after each round are displayed in figure 11.
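When the outcome records are sampled from their distribution, the integrals in equations (B.8)–(B.10) reduce to sample averages. The sketch below shows this bookkeeping for hypothetical per-record inputs: `p1` holds the exact probability of outcome 1 for each sampled record and `p1_forward` the corresponding forward estimate. Both arrays and the function name are assumptions of this example, and the decoder is taken to flip when the forward estimate favours outcome 1.

```python
import numpy as np

def logical_error_estimates(p1, p1_forward):
    """Sample-average versions of (B.8), (B.9) and (B.10)."""
    p1 = np.asarray(p1, dtype=float)
    p0 = 1.0 - p1
    # Flip decision of the forward decoder: 1 if outcome 1 is judged more likely.
    flip = (np.asarray(p1_forward, dtype=float) > 0.5).astype(int)
    p_mld = np.mean(np.minimum(p0, p1))             # (B.8)
    p_forward = np.mean(np.where(flip == 0, p1, p0))  # (B.9)
    p_passive = np.mean(p1)                         # (B.10)
    return p_mld, p_forward, p_passive

# Three hypothetical measurement records.
print(logical_error_estimates(p1=[0.1, 0.8, 0.4], p1_forward=[0.2, 0.3, 0.9]))
```

By construction the maximum-likelihood estimate can never exceed the other two, since it takes the smaller of the two probabilities record by record.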

Appendix C.: Hamiltonian engineering via rotating wave approximations

The goal of this appendix is to discuss the underpinnings and the 'beyond' of the commonly-invoked rotating wave approximation of a Hamiltonian of the form H = H₀ + V with

$\begin{equation}{H}_{0}={\omega }_{a}\left({a}^{{\dagger}}a+\frac{1}{2}\right)+{\omega }_{b}\left({b}^{{\dagger}}b+\frac{1}{2}\right)+{\omega }_{c}\left({c}^{{\dagger}}c+\frac{1}{2}\right),\quad V=\sum _{k}{\lambda }_{k}{{\Phi}}^{k},\quad {\Phi}=\alpha {\hat{q}}_{a}+\beta {\hat{q}}_{b}+\gamma {\hat{q}}_{c}.\end{equation} \tag{ C.1 }$

First, in the absence of time-dependent drives, the physical basis for an RWA of a Φ^k-term can be motivated in at least two ways. One is to move to a rotating frame in which the Hamiltonian remains time-independent, but is in a form amenable to Schrieffer–Wolff degenerate perturbation theory, so that terms which either (1) contain an unequal number of creation and annihilation operators, and/or (2) contain an equal number of creation and annihilation operators of modes which are sufficiently far detuned, are seen as perturbations to detuned Fock energy levels. For example, in equation (C.1), to argue about the perturbative effect of terms which contain an unequal number of creation and annihilation operators, we observe that they act as off-diagonal elements in the Fock basis of H₀, changing the total number of excitations. Hence if we view H₀ as block-diagonal with blocks formed by a given total number of excitations in modes a, b and c, separated by a gap min(ω_a, ω_b, ω_c), then the effect of such excitation-number changing terms can be examined by Schrieffer–Wolff perturbation theory. In lowest-order perturbation theory, one projects V onto these blocks labeled by the number of excitations, so that off-diagonal terms have no effect, while Kerr and cross-Kerr terms remain, as well as other excitation-number preserving terms (e.g. a^† b^† c²). Given the GHz frequencies of the ω_i's versus the relative strength of the Josephson-induced coupling, keeping things to this lowest order is a good approximation and we replace V by V_eff, which omits the terms which do not preserve the number of excitations. This, strictly speaking, is the rotating-wave approximation.

As a next approximation, to handle terms which do not preserve excitation number in any of the particular modes, we go to the rotating frame at ω_c for all three modes so that ${\tilde {H}}_{0}={{\Delta}}_{ac}\left({a}^{{\dagger}}a+\frac{1}{2}\right)+{{\Delta}}_{bc}\left({b}^{{\dagger}}b+\frac{1}{2}\right)$ with Δ_ic = ω_i − ω_c and ${\tilde {V}}_{\text{eff}}$ = V_eff. We imagine that ${\tilde {V}}_{\text{eff}}$ is expanded in terms which are products of creation and annihilation operators of the modes. The Hamiltonian ${\tilde {H}}_{0}$ has energy eigenspaces |x⟩_a ⊗ |y⟩_b ⊗ |ψ⟩_c with Fock states x, y = 0, 1, ..., each of which has the degeneracy of the oscillator (c) space, and each space is at least min(|Δ_ba|, |Δ_ca|, |Δ_bc|) away from another space. The perturbation ${\tilde {V}}_{\text{eff}}$ has both diagonal parts with respect to these eigenspaces (e.g. Kerr and cross-Kerr) as well as off-diagonal terms which map between the spaces. In lowest-order perturbation theory, one again projects ${\tilde {V}}_{\text{eff}}$ onto these eigenspaces, so that off-diagonal terms have no effect and Kerr and cross-Kerr terms remain. This is then the full RWA. To go beyond this, the effect of the off-diagonal terms can be estimated in second or higher-order perturbation theory to obtain an effective Hamiltonian H_eff which is diagonal in the Fock basis using Schrieffer–Wolff perturbation theory (see e.g. [95–98]). The spectrum of H_eff approximates that of H when we are in the perturbative regime, in which the perturbation is small compared to the detunings, ||V_eff|| ≪ min(|Δ_ij|)/2. In second and higher-order perturbation theory, i.e. beyond RWA, U H_eff U^† provides an approximation for $\tilde {H}$ where the unitary U is the perturbatively expanded Schrieffer–Wolff transformation which provides a correction to the Fock eigenbasis. To get an approximation to the original Hamiltonian, one would finally rotate back to the original frame.

An alternative analysis could be based on the Magnus expansion: to apply this, we move to a rotating frame for each mode at its own frequency for the Hamiltonian in equation (C.1) such that the previously mentioned off-diagonal terms become rapidly time-dependent: as a consequence their effect averages out over sufficiently long times. To observe the dynamics in this rotating frame, we consider the Schrödinger equation for the state $\vert \tilde {\psi }\left(t\right)\rangle ={U}_{0}^{{\dagger}}\left(t\right)\vert \psi \left(t\right)\rangle$ with U₀ = exp(−i H₀ t), while |ψ(t)⟩ obeys the Schrödinger equation with H. The state $\vert \tilde {\psi }\left(t\right)\rangle$ will evolve according to the rotating-frame (or interaction-frame) Hamiltonian $\tilde {H}={U}_{0}^{{\dagger}}H{U}_{0}+\mathrm{i}\frac{\mathrm{d}{U}_{0}^{{\dagger}}}{\mathrm{d}t}{U}_{0}$ . For example, when V = λ[a² b^† c^† + h.c.], one has

$\begin{equation}\tilde {H}=\lambda \left[{a}^{2}{b}^{{\dagger}}{c}^{{\dagger}} {\mathrm{e}}^{-\mathrm{i}\left(2{\omega }_{a}-{\omega }_{b}-{\omega }_{c}\right)t}+\mathrm{h}.\mathrm{c}.\right].\end{equation} \tag{ C.2 }$

For time-dependent Hamiltonians, the Magnus expansion [99] or Magnus–Taylor expansion [100] then forms a convenient representation of the effective dynamics. For the time-evolution (in the rotating frame) from an initial time t = 0 to a final time T, the Magnus expansion reads

$\begin{equation}U\left(T,0\right)=\mathcal{T} \mathrm{exp}\left(-\mathrm{i}{\int }_{0}^{T} \mathrm{d}t\tilde {H}\left(t\right)\right)=\mathrm{exp}\left(-\mathrm{i}\overline{H}\left(T\right)\right),\end{equation} \tag{ C.3 }$

with $\overline{H}\left(T\right)={\sum }_{k=1}^{\infty }{\overline{H}}^{\left(k\right)}\left(T\right)$ and the first two terms equal

$\begin{equation*}{\overline{H}}^{\left(1\right)}\left(T\right)={\int }_{0}^{T} \mathrm{d}t\tilde {H}\left(t\right),\quad {\overline{H}}^{\left(2\right)}\left(T\right)=-\frac{\mathrm{i}}{2}{\int }_{0}^{T} \mathrm{d}{t}^{\prime }{\int }_{0}^{{t}^{\prime }} \mathrm{d}t\left[\tilde {H}\left({t}^{\prime }\right),\tilde {H}\left(t\right)\right],\end{equation*}$

while for k ⩾ 2 one gets increasingly higher-order commutators [99]. For the Hamiltonian in equation (C.1), terms which are diagonal in the Fock basis are clearly time-independent and will be present in ${\overline{H}}^{\left(1\right)}\left(T\right)$ . As an example of a rotating term, consider a simple time-dependent Hamiltonian $\tilde {V}$(t) = A exp(iΔt) + h.c. where A is some product of creation and annihilation operators of some modes and Δ is a detuning. For this Hamiltonian, the strength of the lowest-order term decays inversely with Δ, i.e.

$\begin{equation}{\Vert}{\overline{V}}^{\left(1\right)}\left(T\right){\Vert}{\leqslant}\frac{2}{{\Delta}}\left({\Vert}A{\Vert}+{\Vert}{A}^{{\dagger}}{\Vert}\right)\mathrm{sin}\left({\Delta}T/2\right){\leqslant}\frac{4}{{\Delta}}{\Vert}A{\Vert},\end{equation} \tag{ C.4 }$

scaling with the perturbative parameter ||A||/Δ.
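The bound in equation (C.4) can be checked numerically in a toy setting. In the sketch below the operator A is replaced by a random finite-dimensional matrix (an assumption made purely for illustration, since products of creation and annihilation operators are unbounded), and the first-order Magnus term is evaluated in closed form using $\int_0^T \mathrm{d}t\, e^{\mathrm{i}\Delta t} = (e^{\mathrm{i}\Delta T}-1)/(\mathrm{i}\Delta)$.

```python
import numpy as np

def first_order_magnus(A, detuning, T):
    """Closed form of int_0^T dt [A exp(i*detuning*t) + h.c.]."""
    phase_integral = (np.exp(1j * detuning * T) - 1.0) / (1j * detuning)
    term = A * phase_integral
    return term + term.conj().T

def worst_norm(A, detuning, times):
    """Largest spectral norm of the first-order term over the given final times."""
    return max(np.linalg.norm(first_order_magnus(A, detuning, T), 2) for T in times)

rng = np.random.default_rng(0)
A = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))  # toy stand-in for A
norm_A = np.linalg.norm(A, 2)
times = np.linspace(0.1, 20.0, 200)

for detuning in (1.0, 10.0, 100.0):
    print(detuning, worst_norm(A, detuning, times), 4 * norm_A / detuning)
```

Each printed line lists the detuning, the largest norm found over the sampled times, and the bound 4||A||/Δ, so the 1/Δ suppression can be read off directly.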

In general, a sufficient condition for the convergence of the Magnus expansion is that ${\int }_{0}^{T} \mathrm{d}t{\Vert}\tilde {H}\left(t\right){\Vert}{< }\pi$ , but, as in degenerate perturbation theory, it can still serve as a useful asymptotic series expansion when this condition is not met.

We thus see that both methods give us perturbative expansions whose validity depends on the strength of the perturbative parameter.

Let us now discuss the case when we actively drive one of the modes, say mode c. As before one can apply an RWA, dropping terms which do not preserve total excitation number to obtain an effective V_eff. If we assume that the only effect of the drive term is to create a coherent state with $\langle c\left(t\right)\rangle =\mathcal{E} {\mathrm{e}}^{-\mathrm{i}{\omega }_{p}t}$ in oscillator c, we can remove ω_c c^† c from H and replace c by ⟨c(t)⟩ everywhere. Given this time-dependence it then seems more convenient to use a Magnus expansion to analyze the higher-order effects of the non-resonant terms. For this we move to a rotating frame for the oscillators a and b at their own frequency, such that depending on the choice of the drive frequency ω_p some terms become time-independent. For example, a term like a^† bc is time-independent for the choice ω_p = ω_a − ω_b.
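The frequency-matching argument can be turned into a small bookkeeping helper. In the frame rotating at each mode's own frequency, every creation operator contributes a phase $e^{+\mathrm{i}\omega t}$ and every annihilation operator $e^{-\mathrm{i}\omega t}$, with the driven mode c entering through $\langle c(t)\rangle \propto e^{-\mathrm{i}\omega_p t}$. The function name and the numerical frequencies below are hypothetical.

```python
def rotating_frequency(net_excitations, frequencies):
    """Frequency Omega of the phase exp(i*Omega*t) carried by an operator monomial.

    net_excitations[mode] = (# creation operators) - (# annihilation operators).
    """
    return sum(net_excitations[mode] * frequencies[mode] for mode in net_excitations)

# Hypothetical frequencies (arbitrary units); "c" stands for the pump frequency
# omega_p, chosen here as omega_a - omega_b.
freqs = {"a": 7.0, "b": 5.0, "c": 2.0}

resonant = {"a": +1, "b": -1, "c": -1}      # a^dagger b c
off_resonant = {"a": -1, "b": -1, "c": -1}  # a b c

print(rotating_frequency(resonant, freqs))      # vanishes: the term is static
print(rotating_frequency(off_resonant, freqs))  # nonzero: the term rotates
```

Terms with vanishing frequency survive time averaging, while the others are candidates for the perturbative treatment discussed above.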

A more thorough quantitative analysis of the error induced by the RWA through perturbative or Magnus expansions would be desirable, as it plays into the accuracy of the CZ gate and is influenced by the number of photons in a GKP mode (as the latter influences the strength of the perturbation).

C.1. Time-dependent displacement frame

Imagine a Hamiltonian (in a rotating frame of the mode a) of the form $H={H}_{1}\left(a,{a}^{{\dagger}}\right)+\mathrm{i}\mathcal{E}\left(t\right){a}^{{\dagger}}-\mathrm{i}{\mathcal{E}}^{{\ast}}\left(t\right)a$ where $\mathcal{E}\left(t\right)$ is some time-dependent envelope of the drive and H₁(a, a^†) has some functional dependence on a and a^†. We consider the time-evolution of the vector $\vert \tilde {\psi }\left(t\right)\rangle ={U}_{0}^{{\dagger}}\left(0,t\right)\vert \psi \left(t\right)\rangle$ with ${U}_{0}\left(0,t\right)=\mathcal{T} \mathrm{exp}\left(-\mathrm{i}{\int }_{0}^{t} \mathrm{d}{t}^{\prime }\left[\mathrm{i}\mathcal{E}\left({t}^{\prime }\right){a}^{{\dagger}}-\mathrm{i}{\mathcal{E}}^{{\ast}}\left({t}^{\prime }\right)a\right]\right)$ which evolves with the Hamiltonian $\tilde {H}\left(t\right)={U}_{0}^{{\dagger}}H{U}_{0}+\mathrm{i}\frac{\partial {U}_{0}^{{\dagger}}}{\partial t}{U}_{0}$ . In words, we consider the evolution in the time-dependent displacement frame given by the time-dependent drive. When after some final time T the total frame evolution is U₀(0, T) = I, the time-dependent Hamiltonian $\tilde {H}\left(t\right)$ will describe the time-evolution of the actual Schrödinger state |ψ(t)⟩ over the entire period of time T. We can write U₀(0, t) = D(β(t)) where $\beta \left(t\right)={\int }_{0}^{t} \mathrm{d}{t}^{\prime }\mathcal{E}\left({t}^{\prime }\right)$ , so that we evaluate $\tilde {H}\left(t\right)={H}_{1}\left(a+\beta ,{a}^{{\dagger}}+{\beta }^{{\ast}}\right)+\mathrm{i}\mathcal{E}\left(t\right){\beta }^{{\ast}}-\mathrm{i}{\mathcal{E}}^{{\ast}}\left(t\right)\beta$ and the last term can be omitted as it gives rise to an irrelevant phase.
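As a simple illustration of the displacement β(t), the sketch below integrates a hypothetical real drive envelope $\mathcal{E}(t) = \mathcal{E}_0 \sin(2\pi t/T)$ with the trapezoidal rule. This envelope is chosen here only because it has zero net area, so that β(T) = 0 and the frame returns to the identity at the final time; the envelope, the amplitude `E0` and the duration `T` are all assumptions of this example.

```python
import numpy as np

def displacement(envelope, t):
    """Cumulative trapezoidal integral beta(t) = int_0^t dt' E(t')."""
    dt = np.diff(t)
    increments = 0.5 * (envelope[1:] + envelope[:-1]) * dt
    return np.concatenate(([0.0], np.cumsum(increments)))

E0, T = 1.0, 1.0                       # hypothetical amplitude and pulse duration
t = np.linspace(0.0, T, 2001)
envelope = E0 * np.sin(2 * np.pi * t / T)
beta = displacement(envelope, t)

print("beta(T)   =", beta[-1])         # frame returns to the identity
print("beta(T/2) =", beta[1000])       # largest excursion of the frame
```

For this envelope the exact result is $\beta(t) = \frac{\mathcal{E}_0 T}{2\pi}\left(1-\cos(2\pi t/T)\right)$, which can be used to check the numerical integration.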