Fermion-to-qubit mappings with varying resource requirements for quantum simulation

Mark Steudtner; Stephanie Wehner

doi:10.1088/1367-2630/aac54f

1. Introduction

Simulating quantum systems on a quantum computer is one of the most promising applications of small scale quantum computers [1]. Significant efforts have gone into the theoretical development of simulation algorithms [2–6], and their experimental demonstrations [7–12]. Resource estimates [13–15], such as for example for FeMoCo, a model for the nitrogenase enzyme, indicate that simulations of relevant chemical systems may be achieved with relatively modest quantum computing resources [16] in comparison to many standard quantum algorithms [17, 18].

One essential component in realizing simulations of fermionic models on quantum computers is the representation of such models in terms of qubits and quantum gates. Following initial simulation schemes for fermions hopping on a lattice [19], more recent proposals used the Jordan–Wigner [20] transform [3, 7, 21, 22], the Verstraete-Cirac mapping [23], or the Bravyi–Kitaev transform [2] to find a suitable representation. Specifically, the task of all such representations is two-fold. First, we seek a mapping from states in the fermionic Fock space of N sites to the space of n qubits. The fermionic Fock space is spanned by 2^N basis vectors $| {\nu }_{1},\,\ldots ,\,{\nu }_{N}\rangle$ where ν_j ∈ {0, 1} indicates the presence (ν_j = 1) or absence (ν_j = 0) of a spinless fermionic particle at orbital j³ . Such a mapping ${\bf{e}}:{{\mathbb{Z}}}_{2}^{\otimes N}\to {{\mathbb{Z}}}_{2}^{\otimes n}$ is also called an encoding [24]. An example of such an encoding is the trivial one in which n = N and qubits are used to represent the binary string ${\boldsymbol{\nu }}={({\nu }_{1},...,{\nu }_{N})}^{\top }$ . That is

$\begin{eqnarray}&&| {\boldsymbol{\omega }}\rangle =| {\bf{e}}({\boldsymbol{\nu }})\rangle =\underset{j=1}{\overset{n}{\displaystyle \bigotimes }}| {\omega }_{j}\rangle ,\end{eqnarray} \tag{ 1 }$

where ω_j = ν_j in the standard basis $\{| 0\rangle ,| 1\rangle \}$ .

Second, we need a way to simulate the dynamics of Fermions on these N orbitals. These dynamics can be modeled entirely in terms of the annihilation and creation operators c_j and ${c}_{j}^{\dagger }$ that satisfy the anticommutation relations

$\begin{eqnarray}&&{[{c}_{i}^{},{c}_{j}^{}]}_{+}=0,\quad {[{c}_{i}^{\dagger },{c}_{j}^{\dagger }]}_{+}=0,\quad {[{c}_{i},{c}_{j}^{\dagger }]}_{+}={\delta }_{{ij}},\end{eqnarray} \tag{ 2 }$

with [A, B]₊ = AB + BA. Following these relations, the operators act on the fermionic Fock space as

$\begin{eqnarray}&&{c}_{{i}_{m}}^{\dagger }\ {c}_{{i}_{1}}^{\dagger }...\ {c}_{{i}_{m-1}}^{\dagger }{c}_{{i}_{m}}^{\dagger }{c}_{{i}_{m+1}}^{\dagger }...\ {c}_{{i}_{M}}^{\dagger }| {\rm{\Theta }}\rangle =0,\end{eqnarray} \tag{ 3 }$

$\begin{eqnarray}&&{c}_{{i}_{m}}^{\ }\ {c}_{{i}_{1}}^{\dagger }...\ {c}_{{i}_{m-1}}^{\dagger }{c}_{{i}_{m+1}}^{\dagger }...\ {c}_{{i}_{M}}^{\dagger }| {\rm{\Theta }}\rangle =0,\end{eqnarray} \tag{ 4 }$

$\begin{eqnarray}&&{c}_{{i}_{m}}^{\ }\ {c}_{{i}_{1}}^{\dagger }...\ {c}_{{i}_{m-1}}^{\dagger }{c}_{{i}_{m}}^{\dagger }{c}_{{i}_{m+1}}^{\dagger }...\ {c}_{{i}_{M}}^{\dagger }| {\rm{\Theta }}\rangle =\,{(-1)}^{m-1}\ {c}_{{i}_{1}}^{\dagger }...\ {c}_{{i}_{m-1}}^{\dagger }{c}_{{i}_{m+1}}^{\dagger }...\ {c}_{{i}_{M}}^{\dagger }| {\rm{\Theta }}\rangle ,\end{eqnarray} \tag{ 5 }$

$\begin{eqnarray}&&{c}_{{i}_{m}}^{\dagger }\ {c}_{{i}_{1}}^{\dagger }...\ {c}_{{i}_{m-1}}^{\dagger }{c}_{{i}_{m+1}}^{\dagger }...\ {c}_{{i}_{M}}^{\dagger }| {\rm{\Theta }}\rangle \,=\,{(-1)}^{m-1}\ {c}_{{i}_{1}}^{\dagger }...\ {c}_{{i}_{m-1}}^{\dagger }{c}_{{i}_{m}}^{\dagger }{c}_{{i}_{m+1}}^{\dagger }...\ {c}_{{i}_{M}}^{\dagger }| {\rm{\Theta }}\rangle ,\end{eqnarray} \tag{ 6 }$

where $| {\rm{\Theta }}\rangle$ is the fermionic vacuum and $\{{i}_{1}\,,...,\,{i}_{M}\}\subseteq \{1\,,...,\,N\}$ . Mappings of the operators c_j to qubits typically use the Pauli matrices X, Z, and Y acting on one qubit, characterized by their anticommutation relations ${[{P}_{i},{P}_{j}]}_{+}=2{\delta }_{{ij}}{\mathbb{I}}$ for all ${P}_{i}\in { \mathcal P }=\{X,Z,Y\}$ . An example of such a mapping is the Jordan–Wigner transform [20] given by

$\begin{eqnarray}&&{c}_{j}\hat{=}{Z}^{\otimes j-1}\otimes {\sigma }^{-}\otimes {{\mathbb{I}}}^{\otimes n-j},\end{eqnarray} \tag{ 7 }$

$\begin{eqnarray}&&{c}_{j}^{\dagger }\hat{=}{Z}^{\otimes j-1}\otimes {\sigma }^{+}\otimes {{\mathbb{I}}}^{\otimes n-j},\end{eqnarray} \tag{ 8 }$

where

$\begin{eqnarray}&&{\sigma }^{-}=| 0\,\rangle \,\langle \,1| =\displaystyle \frac{1}{2}(X+{\rm{i}}{Y}),\end{eqnarray} \tag{ 9 }$

$\begin{eqnarray}&&\,{\sigma }^{+}=| 1\,\rangle \,\langle \,0| =\displaystyle \frac{1}{2}(X-{\rm{i}}{Y}).\end{eqnarray} \tag{ 10 }$

It is easily verified that together with the trivial encoding (1) this transformation satisfies the desired properties (2)–(6) and can hence be used to represent fermionic models with qubit systems.

In order to assess the suitability of an encoding scheme for the simulation of fermionic models on a quantum computer, a number of parameters are of interest. The first is the total number of qubits n needed in the simulation. Second, we may care about the gate size of the operators c_j and ${c}_{j}^{\dagger }$ when mapped to qubits. In its simplest form, this problem concerns the total number of qubits on which these operators do not act trivially, that is, the number of qubits L, on which an operator acts as ${P}_{j}\in { \mathcal P }$ instead of the identity ${\mathbb{I}}$ , sometimes called the Pauli length. Different transformations can lead to dramatically different performance with respect to these parameters. For both the Jordan–Wigner as well as the Bravyi–Kitaev transform n = N, but we have L = O(n) for the first, while $L=O(\mathrm{log}n)$ for the second. We remark that in experimental implementations we typically do not only care about the absolute number L, but rather the specific gate size and individual difficulty of the qubit gates each of which may be easier or harder to realize in a specific experimental architecture. For error-corrected quantum simulation, the cost in T-gates is as important to optimize as the circuit depth [25], and quantum devices with restricted connectivity even require mappings tailored to them [26, 27]. Finally, we remark that instead of looking for a mapping for individual operators ${c}_{j}^{(\dagger )}$ we may instead opt to map pairs (or higher order terms) of such operators at once, or even look to represent sums of such operators.

1.1. Results

Here, we propose a general family of mappings of fermionic models to qubit systems and quantum gates that allow us to trade-off the necessary number of qubits n against the difficulty of implementation as parametrized by L, or more complicated quantum gates such as CPhase. Ideally, one would of course like both the number of qubits, as well as the the gate size to be small. We show that our mappings can lead to significant savings in qubits for a variety of examples (see table 1) as compared to the Jordan–Wigner transform for instance, at the expense of greater complexity in realizing the required gates. The latter may lead to an increased time required for the simulation depending on which gates are easy to realize in a particular quantum computing architecture.

Table 1. Overview of mappings presented in this paper, listed by the complexity of their code functions, their qubit savings, qubit requirements (n), properties of the resulting gates and first appearance. Mappings can be compared with respect to the size of plain words (N) and their targeted Hamming weight K. We also refer to different methods that are not listed, as they do not rely on codes in any way [30, 31].

Mapping	En-/decoding type	Qubits saved	n(N, K)	Resulting gates	Origin
Jordan–Wigner $\|$ Parity transform	Linear/linear	None	N	Length-O(n) Pauli strings	[20, 24]
Bravyi–Kitaev transform	Linear/linear	None	N	Length-O(log n) Pauli strings	[2]
Checksum codes	Linear/ affine linear	O(1)	N − 1	Length-O(n) Pauli strings	Here
Binary addressing codes	Nonlinear/nonlinear	O(2^n/K)	$\mathrm{log}({N}^{K}/K!)$	$(O(n))$ -controlled gates	Here
Segment codes	Linear/nonlinear	$O(n/K)$	$N/\left(1+\tfrac{1}{2K}\right)$	$(O(K))$ -controlled gates	Here

At the heart of our efforts is an entirely general construction of the creation and annihilation operators in (3) given an arbitrary encoding ${\bf{e}}$ and the corresponding decoding ${\bf{d}}$ . As one might expect, this construction is not efficient for every choice of encoding ${\bf{e}}$ or decoding ${\bf{d}}$ . However, for linear encodings ${\bf{e}}$ , but possibly nonlinear decodings ${\bf{d}}$ , they can take on a very nice form. While in principle any classical code with the same properties can be shown to yield such mappings, we provide an appealing example of how a classical code of fixed Hamming weight [28] can be used to give an interesting mapping.

Two other approaches allow us to be more modest with the algorithmic depth in either accepting a qubit saving that is linear with N, or just saving a fixed amount of qubits for hardly any cost at all.

In previous works, trading quantum resources has been addressed for general algorithms [29], and quantum simulations [30–32]. In the two works of Moll et al and Bravyi et al, qubit requirements are reduced with a scheme that is different from ours. A qubit Hamiltonian is first obtained with e.g. the Jordan–Wigner transform, then unitary operations are applied to it in order taper qubits off successively. The paper by Moll et al provides a straight-forward method to calculate the Hamiltonian, that can be used to reduce the amount of qubits to a minimum, but the number of Hamiltonian terms scales exponentially with the particle number. The notion that our work is based on, was first introduced in [31] by Bravyi et al, for linear en- and decodings. With the generalization of this method, we hope to make the goal of qubit reduction more attainable in reducing the effort to do so. The reduction method is mediated by nonlinear codes, of which we provide different types to choose from. The transform of the Hamiltonian is straight-forward from there on, and we give explicit recipes for arbitrary codes. We can summarize our contributions as follows.

We show that for any encoding ${\bf{e}}\,:{{\mathbb{Z}}}_{2}^{\otimes N}\to {{\mathbb{Z}}}_{2}^{\otimes n}$ there exists a mapping of fermionic models to quantum gates. For the special case that this encoding is linear, our procedure can be understood as a slightly modified version of the perspective taken in [24]. This gives a systematic way to employ classical codes for obtaining such mappings.
Using particle conservation symmetry, we develop 3 types of codes that save a constant, linear and exponential amount of qubits (see table 1 and sections 3.1.1–3.1.3). An example from classical coding theory [28] is used to obtain significant qubit savings (here called the binary addressing code), at the expense of increased gate difficulty (unless the architecture would easily support multi-controlled gates).
The codes developed are demonstrated on two examples from quantum chemistry and physics.
- 1.
  The Hamiltonian of the well-studied hydrogen molecule in minimal basis is re-shaped into a two-qubit problem, using a simple code.
- 2.
  A Fermi–Hubbard model on a 2 × 5 lattice and periodic boundary conditions in the lateral direction is considered. We parametrize and compare the sizes of the resulting Hamiltonians, as we employ different codes to save various amounts of qubits. In this way, the trade-off between qubit savings and gate complexity is illustrated (see table 2).

2. Background

To illustrate the general use of (possibly non linear) encodings to represent fermionic models, let us first briefly generalize how existing mappings can be phrased in terms of linear encodings in the spirit of [24]. Under consideration in representing the dynamics is a mapping for second-quantized Hamiltonians of the form

$\begin{eqnarray}H & = & \displaystyle \sum _{l=0}^{\infty }\ \displaystyle \sum _{\displaystyle \genfrac{}{}{0em}{}{{\boldsymbol{a}}\in {[N]}^{\otimes l}}{{\boldsymbol{b}}\in {{\mathbb{Z}}}_{2}^{\otimes l}}}{h}_{{\boldsymbol{ab}}}\,\displaystyle \prod _{i=1}^{l}{({c}_{{a}_{i}}^{\dagger })}^{{b}_{i}}{({c}_{{a}_{i}}^{})}^{1+{b}_{i}\mathrm{mod}2}\\ & = & \displaystyle \sum _{l}\displaystyle \sum _{\displaystyle \genfrac{}{}{0em}{}{{\boldsymbol{a}},{\boldsymbol{b}}}{{\rm{with}}\ {h}_{{\boldsymbol{ab}}}\ne 0}}{\hat{h}}_{{\boldsymbol{ab}}},\end{eqnarray} \tag{ 11 }$

where ${h}_{{\boldsymbol{ab}}}$ are complex coefficients, chosen in a way as to render H Hermitian. We illustrate the use of such a mapping in the context of quantum simulation in appendix A. For our convenience, we use length-l N-ary vectors ${\boldsymbol{a}}={({a}_{1},...,{a}_{l})}^{\top }\in {[N]}^{\otimes l}$ to parametrize the orbitals on which a term ${\hat{h}}_{{\boldsymbol{ab}}}$ is acting, and write $[N]=\{1\,,...,\,N\}$ . A similar notation will be employed for binary vectors of length l, with ${\boldsymbol{b}}={({b}_{1},...,{b}_{l})}^{\top }\in {{\mathbb{Z}}}_{2}^{\otimes l},{{\mathbb{Z}}}_{2}=\{0,1\}$ , deciding whether an operator is a creator or annihilator by the rules ${({c}_{i}^{(\dagger )})}^{1}={c}_{i}^{(\dagger )}$ and ${({c}_{i}^{(\dagger )})}^{0}=1$ .

Every term ${\hat{h}}_{{\boldsymbol{ab}}}$ is a linear operation ${{ \mathcal F }}_{N}\to {{ \mathcal F }}_{N}$ , with ${{ \mathcal F }}_{N}$ being the Fock space restricted on N orbitals, the direct sum of all possible antisymmetrized M-particle Hilbert spaces ${{ \mathcal H }}_{N}^{M}\,:{{ \mathcal F }}_{N}={\bigoplus }_{m=0}^{N}{{ \mathcal H }}_{N}^{m}$ . Conventional mappings transform states of the Fock space ${{ \mathcal F }}_{N}$ into states on N qubits, carrying over all linear operations as well ${ \mathcal L }{({{ \mathcal F }}_{N})\to { \mathcal L }(({{\mathbb{C}}}^{2})}^{\otimes N})$ .

Before we start presenting conventional transformation schemes, we need to make a few remarks on transformed Hamiltonians and notations pertaining to them. First of all, we identify the set of gates ${\{{ \mathcal P },{\mathbb{I}}\}}^{\otimes n}={\{X,Y,Z,{\mathbb{I}}\}}^{\otimes n}$ with the term Pauli strings (on n qubits). The previously mentioned Jordan–Wigner transform, obviously has the power to transform (11) into a Hamiltonian that is a weighted sum of Pauli strings on N qubits. General transforms, however, might involve other types of gates. We however have the choice to decompose these into Pauli strings. One might want to do so when using standard techniques for Hamiltonian simulation. In the following, we will denote the correspondence of second-quantized operators or states B to their qubit counterparts C by: $B\hat{=}C$ . For convenience, we will also omit identities in Pauli strings and rather introduce qubit labels, e.g. $X\otimes {\mathbb{I}}\otimes X={X}_{1}\otimes {X}_{3}=({\bigotimes }_{i\in \{\mathrm{1,3}\}}{X}_{i})$ and write ${{\mathbb{I}}}^{\otimes n}={\mathbb{I}}$ . A complete table of notations can be found in appendix G.

Consider a linear encoding of N fermionic sites into n = N qubits given by a binary matrix A such that

$\begin{eqnarray}&&| {\boldsymbol{\omega }}\rangle =| {\bf{e}}({\boldsymbol{\nu }})\rangle =| A{\boldsymbol{\nu }}\ \mathrm{mod}\ 2\rangle \hat{=}\left(\displaystyle \prod _{j=1}^{N}{({c}_{j}^{\dagger })}^{{\nu }_{j}}\right)| {\rm{\Theta }}\rangle \end{eqnarray} \tag{ 12 }$

and A is invertible, i.e. $({{AA}}^{-1}\ \mathrm{mod}\ 2)={\mathbb{I}}$ . Note that in this case, the decoding given by ${\boldsymbol{\nu }}={\bf{d}}({\boldsymbol{\omega }})=({A}^{-1}{\boldsymbol{\omega }}\ \mathrm{mod}\ 2)$ is also linear. It is known that any such matrix A, subsequently also yields a mapping of the fermionic creation and annihilation operators to qubit gates [24]. To see how these are constructed, let us start by noting that they must fulfill the properties given in (3)–(6) and (2), which motivates the definition of a parity, a flip and an update set below:

1.
${c}_{{i}_{m}}^{(\dagger )}$ anticommutes with the first $m-1$ operators and thus acquires phase ${(-1)}^{m-1}$ .
2.
A creation operator ${c}_{{i}_{m}}^{\dagger }$ might be absent (present) in between ${c}_{{i}_{m-1}}^{\dagger }$ and ${c}_{{i}_{m+1}}^{\dagger }$ , leading the rightmost operator ${c}_{{i}_{m}}^{(\dagger )}$ to map the entire state to zero since ${c}_{{i}_{m}}| {\rm{\Theta }}\rangle =0$ $({c}_{{i}_{m}}^{\dagger }{c}_{{i}_{m}}^{\dagger }=0)$ .
3.
Given that the state was not annihilated, the occupation of site i_m has to be changed. This means a creation operator ${c}_{{i}_{m}}^{\dagger }$ has to be added or removed between ${c}_{{i}_{m-1}}^{\dagger }$ and ${c}_{{i}_{m+1}}^{\dagger }$ .

These rules tell us what the transform of an operator ${c}_{j}^{(\dagger )}$ has to inflict on a basis state (12). In order to implement the phase shift of the first rule, a series of Pauli-Z operators is applied on qubits, whose numbers are in the parity set (with respect to j ∈ [N]), P(j) ⫅ [N]. Following the second rule we project onto the ±1 subspace of the Z-string on qubits indexed by another [N] subset, the so-called flip set of j, F(j). The update set of j, U(j) ⫅ [N] labels the qubits to be flipped completing the third rule using an X-string

$\begin{eqnarray}&&{({c}_{j}^{\dagger })}^{b}{({c}_{j}^{})}^{b+1\mathrm{mod}2}\\ &&\quad \hat{=}\displaystyle \frac{1}{2}\left(\mathop{\displaystyle \bigotimes }\limits_{k\in U(j)}{X}_{k}\right)\left({\mathbb{I}}-{(-1)}^{b}\mathop{\displaystyle \bigotimes }\limits_{l\in F(j)}{Z}_{l}\right)\mathop{\displaystyle \bigotimes }\limits_{m\in P(j)}{Z}_{m},\end{eqnarray} \tag{ 13 }$

with $b\in {{\mathbb{Z}}}_{2}$ . P(j), F(j) and U(j) depend on the matrices A and A⁻¹ as well as the parity matrix R. The latter is a (N × N) binary matrix which has its lower triangle filled with ones, but not its diagonal. For the matrix entries this means R_ij = θ_ij, with θ_ij as the discrete version of the Heaviside function

$\begin{eqnarray}{\theta }_{{ij}}=\left\{\begin{array}{ll}0 & i\leqslant j\\ 1 & i\gt j.\end{array}\right.\end{eqnarray} \tag{ 14 }$

The set members are obtained in the following fashion:

1.
P(j) contains all column numbers in which the jth row of matrix $({{RA}}^{-1}\ \mathrm{mod}\ 2)$ has non-zero entries.
2.
F(j) contains the column labels of non-zero entries in the jth row of A⁻¹.
3.
U(j) contains all row numbers in which the jth column of A has non-zero entries.

Note that this definition of the sets differs from their original appearance in [24, 33], where diagonal elements are not included. In this way, our sets are not disjoint, which leads to Z-cancellations and appearance of Pauli-Y operators, but we have generalized the sets for arbitrary invertible matrices, and provided a pattern for other transforms later. In fact, we recover these linear transforms from the general case in appendix F. There we also show explicitly that these operators abide by (2)–(6).

2.1. Jordan–Wigner, parity and Bravyi–Kitaev transform

As an illustration, we present popular examples of these linear transformations, note again that all of these will have n = N. The Jordan–Wigner transform is a special case for $A={\mathbb{I}}$ , leading to the direct mapping. The operator transform gives L = O(N) Pauli strings as

$\begin{eqnarray}&&{({c}_{j}^{\dagger })}^{b}{({c}_{j}^{})}^{b+1\mathrm{mod}2}\hat{=}\displaystyle \frac{1}{2}({X}_{j}+i{(-1)}^{b}\,{Y}_{j})\mathop{\displaystyle \bigotimes }\limits_{m\lt j}{Z}_{m}.\end{eqnarray} \tag{ 15 }$

In the parity transform [24], we have L = O(N) X-strings:

$\begin{eqnarray}{A}^{-1}=\left[\begin{array}{cccc}1 & & & \\ 1 & 1 & & \\ & \ddots & \ddots & \\ & & 1 & 1\end{array}\right],\quad A=\left[\begin{array}{cccc}1 & & & \\ 1 & 1 & & \\ \vdots & \vdots & \ddots & \\ 1 & 1 & \cdots & 1\end{array}\right],\end{eqnarray} \tag{ 16 }$

$\begin{eqnarray}&&{({c}_{j}^{\dagger })}^{b}{({c}_{j}^{})}^{b+1\mathrm{mod}2}\ \\ &&\quad \hat{=}\displaystyle \frac{1}{2}({Z}_{j-1}\otimes {X}_{j}-i{(-1)}^{b}\,{Y}_{j})\underset{m=j+1}{\overset{N}{\displaystyle \bigotimes }}{X}_{m}.\end{eqnarray} \tag{ 17 }$

The Bravyi–Kitaev transform [2] is defined by a matrix A [24, 33] that has non-zero entries according to a certain binary tree rule, achieving $L=O(\mathrm{log}N)$ .

2.2. Saving qubits by exploiting symmetries

Our goal is to be able to trade quantum resources, which is done by reducing degrees of freedom by exploiting symmetries. For that purpose, we provide a theoretical foundation to characterize the latter.

Parity, Jordan–Wigner and Bravyi–Kitaev transforms encode all ${{ \mathcal F }}_{N}$ states and provide mappings for every ${ \mathcal L }({{ \mathcal F }}_{N})$ operator. Unfortunately,they require us to own a N-qubit quantum computer, which might be unnecessary. In fact, the only operator we want to simulate is the Hamiltonian, which usually has certain symmetries. Taking these symmetries into account enables us to perform the same task with n ≤ N qubits instead. Symmetries usually divide the ${{ \mathcal F }}_{N}$ into subspaces, and the idea is to encode only one of those. Let ${ \mathcal B }$ be a basis spanning a subspace $\mathrm{span}({ \mathcal B })\subseteq {{ \mathcal F }}_{N}$ be associated with a Hamiltonian (11), where for every $l,{\boldsymbol{a}},{\boldsymbol{b}};$ ${\hat{h}}_{{\boldsymbol{ab}}}\,:\mathrm{span}({ \mathcal B })\to \mathrm{span}({ \mathcal B })$ . Usually, Hamiltonian symmetries generate many such (distinct) subspaces. Under consideration of additional information about our problem, like particle number, parity or spin polarization,we select the correct subspace. Note that particle number conservation is by far the most prominent symmetry to take into account. It is generated by Hamiltonians that are linear combinations of products of ${c}_{i}^{\dagger }{c}_{j}^{\,}\,| \,i,j\in [N]$ . These Hamiltonians, originating from first principles, only exhibit terms conserving the total particle number; ${\hat{h}}_{{\boldsymbol{ab}}}\,:{{ \mathcal H }}_{N}^{M}\to {{ \mathcal H }}_{N}^{M}$ . From all the Hilbert spaces ${{ \mathcal H }}_{N}^{M}$ , one considers the space with the particle number matching the problem description.

These symmetries will be utilized in the next section: we develop a language that allows for encodings ${\boldsymbol{e}}$ that reduce the length of the binary vectors ${\boldsymbol{e}}({\boldsymbol{\nu }})$ as compared to ${\boldsymbol{\nu }}$ . This means that the state ${\boldsymbol{\nu }}$ will be encoded in $n\leqslant N$ qubits, since each digit saved corresponds to a qubit eliminated. As suggested by Bravyi et al [31], qubit savings can be achieved under the consideration of non-square, invertible matrices A. However, we will see below that using transformations based on nonlinear encodings and decodings ${\boldsymbol{d}}$ (the inverse transform defined by A⁻¹ before), we can eliminate a number of qubits that scales with the system size. For linear codes on the other hand, we find a mere constant saving.

3. General transformations

We here show how second-quantized operators and states, Hamiltonian symmetries and the fermionic basis ${ \mathcal B }$ are fused into a simple description of occupation basis states. While in this section all general ideas are presented, we would like to refer the reader to the appendices for details: to appendix B in particular, which holds the proof of the underlying techniques. Fermionic basis states are represented by binary vectors ${\boldsymbol{\nu }}\in {{\mathbb{Z}}}_{2}^{\otimes N}$ , with its components implicating the occupation of the corresponding orbitals. Basis states inside the quantum computer, on the other hand, are represented by binary vectors on a smaller space ${\boldsymbol{\omega }}\in {{\mathbb{Z}}}_{2}^{\otimes n}$ . These vectors are code words of the former ${\boldsymbol{\nu }}$ , where the binary code connecting all ${\boldsymbol{\nu }}$ and ${\boldsymbol{\omega }}$ is possibly nonlinear. In the end, an instance of such a code will be sufficient to describe states and operators, in a similar way than the matrix pair (A, A⁻¹) governs the conventional transforms already presented. We now start by defining such codes and connect them to the state mappings.

Let $\mathrm{span}({ \mathcal B })$ be a subspace of ${{ \mathcal F }}_{N}$ , as defined previously. For $n\geqslant \mathrm{log}| { \mathcal B }|$ , we define two binary vector functions ${\boldsymbol{d}}\,:{{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2}^{\otimes N},{\boldsymbol{e}}\,:{{\mathbb{Z}}}_{2}^{\otimes N}\to {{\mathbb{Z}}}_{2}^{\otimes n}$ , where we regard each component as a binary function ${\boldsymbol{d}}={({d}_{1},...,{d}_{N})}^{\top }\,| \,{d}_{i}:{{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2}$ . Furthermore we introduce the binary basis set ${ \mathcal V }\subseteq {{\mathbb{Z}}}_{2}^{\otimes N}$ , with

$\begin{eqnarray}&&{\boldsymbol{\nu }}\in { \mathcal V },\quad {\rm{only}}\,{\rm{if}}\quad \left(\displaystyle \prod _{i=1}^{N}{({c}_{i}^{\dagger })}^{{\nu }_{i}}\right)| {\rm{\Theta }}\rangle \in { \mathcal B }.\end{eqnarray} \tag{ 18 }$

All elements in ${ \mathcal B }$ shall be represented in ${ \mathcal V }$ . If for all ${\boldsymbol{\nu }}\in { \mathcal V }$ the binary functions ${\boldsymbol{e}}$ and ${\boldsymbol{d}}$ satisfy ${\boldsymbol{d}}({\boldsymbol{e}}({\boldsymbol{\nu }}))={\boldsymbol{\nu }}$ , and for all ${\boldsymbol{\omega }}\in {{\mathbb{Z}}}_{2}^{\otimes n}\,:\,{\boldsymbol{d}}({\boldsymbol{\omega }})\in { \mathcal V }$ , then we call the two functions encoding and decoding, respectively. An encoding-decoding pair ( ${\boldsymbol{e}},{\boldsymbol{d}}$ ) forms a code.

We thus have obtained a general form of encoding, in which qubit states only represent the subspace $\mathrm{span}({ \mathcal B })$ . The decoding, on the other hand, translates the qubit basis back to the fermionic one:

$\begin{eqnarray}&&| {\boldsymbol{\omega }}\rangle := \underset{j=1}{\overset{n}{\displaystyle \bigotimes }}| {\omega }_{j}\rangle \ \hat{=}\ \left(\displaystyle \prod _{i=1}^{N}{({c}_{i}^{\dagger })}^{{d}_{i}({\boldsymbol{\omega }})}\right)| {\rm{\Theta }}\rangle .\end{eqnarray} \tag{ 19 }$

We intentionally keep the description of these functions abstract, as the code used might be nonlinear, i.e. it cannot be described with matrices $A,{A}^{-1}$ . Nonlinearity is thereby predominantly encountered in decoding rather than in encoding functions, as we will see in the examples obtained later.

For any code ( ${\boldsymbol{e}},{\boldsymbol{d}}$ ), we will now present the transform of fermionic operators into qubit gates. Before we can do so however, two issues are to be addressed. Firstly, one observes that we cannot hope to find a transformation recipe for a singular fermionic operator ${c}_{j}^{(\dagger )}$ . The reason for this is that the latter operator changes the occupation of the jth orbital. As a consequence, a state with the occupation vector ${\boldsymbol{\nu }}$ is mapped to $({\boldsymbol{\nu }}+{{\boldsymbol{u}}}_{{\boldsymbol{j}}}\ \mathrm{mod}\ 2)$ , where ${{\boldsymbol{u}}}_{{\boldsymbol{j}}}$ is the unit vector of component j; ${({u}_{j})}_{i}={\delta }_{{ij}}$ . The problem is that since we have trimmed the basis, $({\boldsymbol{\nu }}+{{\boldsymbol{u}}}_{{\boldsymbol{j}}}\ \mathrm{mod}\ 2)$ will probably not be in ${ \mathcal V }$ , which means this state is not encoded⁴ . The action of ${c}_{j}^{(\dagger )}$ is, thus, not defined. We can however obtain a recipe for the non-vanishing Hamiltonian terms ${\hat{h}}_{{\boldsymbol{a}}{\boldsymbol{b}}}$ as they do not escape the encoded space being $(\mathrm{span}({ \mathcal B })\to \mathrm{span}({ \mathcal B }))$ -operators. Note that this issue is never encountered in the conventional transforms, as they encode the entire Fock space.

Secondly, we are yet to introduce a tool to transform fermionic operators into quantum gates. The structure of the latter has to be similar to the linear case, as they mimic the same dynamics as presented in section 2. In general, a gate sequence will commence with some kind of projectors into the subspace with the correct occupation, as well as operators implementing parity phase shifts. The sequence should close with bit flips to update the state. The task is now to determine the form of these operators. The issue boils down to finding operators that extract binary information from qubit states, and map it onto their phase. In other words, we need to find linear operators associated with e.g. the binary function d_j, such that it maps basis states $| {\boldsymbol{\omega }}\rangle \to {(-1)}^{{d}_{j}({\boldsymbol{\omega }})}| {\boldsymbol{\omega }}\rangle$ . In any case, we must recover the case of Pauli strings on their respective sets when considering linear codes. For our example, this means the linear case yields the operator $({\bigotimes }_{m\in F(j)}{Z}_{m})$ . Using general codes, we are lead to define the extraction superoperation ${\mathfrak{X}}$ , which maps binary functions to quantum gates on n qubits:

$\begin{eqnarray}&&{\mathfrak{X}}\,:({{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2})\to { \mathcal L }({({{\mathbb{C}}}^{2})}^{\otimes n}).\end{eqnarray} \tag{ 20 }$

The extraction superoperator is defined for all binary vectors ${\boldsymbol{\omega }}\in {{\mathbb{Z}}}_{2}^{\otimes n}$ and binary functions $f,g\,:{{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2}$ as:

$\begin{eqnarray}&&{\mathfrak{X}}[f]| {\boldsymbol{\omega }}\rangle ={(-1)}^{f({\boldsymbol{\omega }})}| {\boldsymbol{\omega }}\rangle \\ &&{\rm{(Extraction}}\,{\rm{property)}},\end{eqnarray} \tag{ 21 }$

$\begin{eqnarray}&&{\mathfrak{X}}[{\boldsymbol{\omega }}\,\to \,f({\boldsymbol{\omega }})+g({\boldsymbol{\omega }})\ \mathrm{mod}\ 2]={\mathfrak{X}}[f]\ {\mathfrak{X}}[g]\\ &&{\rm{(Exponentiation}}\,{\rm{identity)}},\end{eqnarray} \tag{ 22 }$

$\begin{eqnarray}&&{\mathfrak{X}}[{\boldsymbol{\omega }}\to b]={(-1)}^{b}\,{\mathbb{I}}\quad | \ b\in {{\mathbb{Z}}}_{2}\\ &&{\rm{(Extracting}}\,{\rm{constant}}\,{\rm{functions)}},\end{eqnarray} \tag{ 23 }$

$\begin{eqnarray}&&{\mathfrak{X}}[{\boldsymbol{\omega }}\to {\omega }_{j}]={Z}_{j}\quad | \ j\in [n]\\ &&{\rm{(Extracting}}\,{\rm{linear}}\,{\rm{functions)}},\end{eqnarray} \tag{ 24 }$

$\begin{eqnarray}&&{\mathfrak{X}}\left[{\boldsymbol{\omega }}\to \displaystyle \prod _{j\in { \mathcal S }}{\omega }_{j}\right]={{\rm{C}}}^{k}\,{\rm{PHASE}}({i}_{1}\,,\,...,\,{i}_{k+1})\\ &&\ \ \ \ \ \ {\rm{with}}\ \ { \mathcal S }=\{{i}_{s}{\}}_{s=1}^{k+1}\subseteq [n],\ \ k\in [n-1]\\ &&\ \ \ \ \ \ {\rm{(Extracting}}\,{\rm{non-linear}}\,{\rm{functions).}}\end{eqnarray} \tag{ 25 }$

Note that the first two properties imply that the operators ${\mathfrak{X}}[f],{\mathfrak{X}}[g]$ commute and all operators are diagonal in the computational basis. Given that binary functions have a polynomial form, we are now able to construct operators by extracting every binary function possible, for example

$\begin{eqnarray}&&{\mathfrak{X}}[{\boldsymbol{\omega }}\to 1+{\omega }_{1}+{\omega }_{1}{\omega }_{2}\ \mathrm{mod}\ 2]\\ &&\quad =\,{\mathfrak{X}}[{\boldsymbol{\omega }}\to 1]\ {\mathfrak{X}}[{\boldsymbol{\omega }}\to {\omega }_{1}]\ {\mathfrak{X}}[{\boldsymbol{\omega }}\to {\omega }_{1}{\omega }_{2}]\end{eqnarray} \tag{ 26 }$

$\begin{eqnarray}&&\quad =-{Z}_{1}\,{\rm{CPHASE}}(1,2).\end{eqnarray} \tag{ 27 }$

We firstly we have used (22) to arrive at (26), and then reach (27) by applying the properties (23)–(25) to the respective sub-terms. This might however not be the final Hamiltonian, since the simulation algorithm might require us to reformulate the Hamiltonian as a sum of weighted Pauli strings [4, 5]. In that case, need to decompose all controlled gates. The cost for this decomposition is an increase in the number of Hamiltonian terms, for instance we find ${\rm{CPHASE}}(i,j)=\tfrac{1}{2}({\mathbb{I}}+{Z}_{i}+{Z}_{j}-{Z}_{i}\otimes {Z}_{j})$ . In general, (24) and (25) can be replaced by an adjusted definition:

$\begin{eqnarray} & & {\mathfrak{X}}\left[{\boldsymbol{\omega }}\to \displaystyle \prod _{j\,\in { \mathcal S }}{\omega }_{j}\right]=\left.{\mathbb{I}}-2\displaystyle \prod _{j\,\in { \mathcal S }}\displaystyle \frac{1}{2}({\mathbb{I}}-{Z}_{j})\quad \right|\ { \mathcal S }\subseteq [n]\\ & & {\rm{(Extracting}}\,{\rm{non}} \mbox{-} {\rm{constant}}\,{\rm{functions).}}\end{eqnarray} \tag{ 28 }$

We will be able to define the operator mappings introducing the parity and update functions, ${\boldsymbol{p}}$ and ${{\boldsymbol{\varepsilon }}}^{{\boldsymbol{q}}}$ :

$\begin{eqnarray}&&{\boldsymbol{p}}\,:{{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2}^{\otimes N},\quad {p}_{j}({\boldsymbol{\omega }})=\displaystyle \sum _{i=1}^{j-1}{d}_{i}({\boldsymbol{\omega }})\ \mathrm{mod}\ 2,\end{eqnarray} \tag{ 29 }$

$\begin{eqnarray} & & {{\boldsymbol{\varepsilon }}}^{{\boldsymbol{q}}}\,:{{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2}^{\otimes n},\quad {\rm{with}}\ {\boldsymbol{q}}\in {{\mathbb{Z}}}_{2}^{\otimes N}\\ & & {{\boldsymbol{\varepsilon }}}^{{\boldsymbol{q}}}({\boldsymbol{\omega }})={\boldsymbol{e}}({\boldsymbol{d}}({\boldsymbol{\omega }})+{\boldsymbol{q}}\ \ \mathrm{mod}\ 2)+{\boldsymbol{\omega }}\ \ \mathrm{mod}\ 2.\end{eqnarray} \tag{ 30 }$

Finally, we have collected all the means to obtain the operator mapping for weight-l operator sequences as they occur in (11):

$\begin{eqnarray}&&\displaystyle \prod _{i=1}^{l}{({c}_{{a}_{i}}^{\dagger })}^{{b}_{i}}{({c}_{{a}_{i}}^{})}^{1+{b}_{i}\mathrm{mod}2}\hat{=}{{ \mathcal U }}^{{\boldsymbol{a}}}\left(\displaystyle \prod _{v=1}^{l-1}\,\displaystyle \prod _{w=v+1}^{l}{(-1)}^{{\theta }_{{a}_{v}{a}_{w}}}\right)\\ &&\quad \times \displaystyle \prod _{x=1}^{l}\displaystyle \frac{1}{2}\left({\mathbb{I}}-\left[\displaystyle \prod _{y=x+1}^{l}{(-1)}^{{\delta }_{{a}_{x}{a}_{y}}}\right]{(-1)}^{{b}_{x}}\ {\mathfrak{X}}[{d}_{{a}_{x}}]\right){\mathfrak{X}}[\,{p}_{{a}_{x}}],\end{eqnarray} \tag{ 31 }$

where θ_ij is defined in (14) and δ_ij is the Kronecker delta. In this expression, we find various projectors, parity operators with corrections for occupations that have changed before the update operator is applied. The update operator ${{ \mathcal U }}^{{\boldsymbol{a}}}$ , is characterized by the ${{\mathbb{Z}}}_{2}^{\otimes N}$ -vector ${\boldsymbol{q}}={\sum }_{i=1}^{l}{{\boldsymbol{u}}}_{{{\boldsymbol{a}}}_{{\boldsymbol{i}}}}\ \mathrm{mod}\ 2$

$\begin{eqnarray}&&{{ \mathcal U }}^{{\boldsymbol{a}}}=\displaystyle \sum _{{\boldsymbol{t}}\,\in {{\mathbb{Z}}}_{2}^{\otimes n}}\left[\underset{i=1}{\overset{n}{\displaystyle \bigotimes }}{({X}_{i})}^{{t}_{i}}\right]\displaystyle \prod _{j=1}^{n}\displaystyle \frac{1}{2}({\mathbb{I}}+{(-1)}^{{t}_{j}}\ {\mathfrak{X}}[{\varepsilon }_{j}^{\,{\boldsymbol{q}}}]).\end{eqnarray} \tag{ 32 }$

This is a problem: when summing over the entire ${{\mathbb{Z}}}_{2}^{\otimes n}$ , one has to expect an exponential number of terms. As a remedy, one can arrange the resulting operations into controlled gates, or rely on codes with a linear encoding. If the encoding can be defined using a binary (n × N)-matrix A, ${\boldsymbol{e}}({\boldsymbol{\nu }})=(A{\boldsymbol{\nu }}\ \mathrm{mod}\ 2)$ , the update operator reduces to

$\begin{eqnarray}&&{{ \mathcal U }}^{{\boldsymbol{a}}}=\underset{i=1}{\overset{n}{\displaystyle \bigotimes }}{({X}_{i})}^{\displaystyle \sum _{j}{A}_{{ij}}{q}_{j}\mathrm{mod}2}.\end{eqnarray} \tag{ 33 }$

In appendix B, we show that (31)–(33) satisfy the conditions (2)–(6). Note that the update operator is also important for state preparation: let us assume that our qubits are initialized all in their zero state, $({\bigotimes }_{i\in [n]}| 0\rangle )$ , then the fermionic basis state associated with the vector ${\boldsymbol{\nu }}$ is obtained by applying the update operator ${{ \mathcal U }}^{{\boldsymbol{a}}}$ . Here the vector ${\boldsymbol{a}}$ contains all occupied orbitals, such that ${\boldsymbol{q}}={\boldsymbol{\nu }}$ . Even for nonlinear encodings the state preparation can done with Pauli strings: as the initial state is a product state of all zeros, we can replace operators ${\mathfrak{X}}[{\boldsymbol{\omega }}\to {\prod }_{i\in { \mathcal S }\subseteq [n]}\,{\omega }_{i}]$ by ${\mathbb{I}}$ .

In the following we will turn our attention to the most fruitful symmetry to take into account: particle conservation symmetry. While code families accounting for this symmetry are explored in the next subsection, alternatives to the mapping of entire Hamiltonian terms are discussed for such codes in appendix C.

3.1. Particle-number conserving codes

In the following, we will present three types of codes that save qubits by exploiting particle number conservation symmetry, and possibly the conservation of the total spin polarization. Particle-number conserving Hamiltonians are highly relevant for quantum chemistry and problems posed from first principles. We therefore set out to find codes in which ${\boldsymbol{\nu }}\in { \mathcal V }$ have a constant Hamming weight ${{\rm{w}}}_{{\rm{H}}}({\boldsymbol{\nu }})=K$ . Since the Hamming weight is defined as ${{\rm{w}}}_{{\rm{H}}}({\boldsymbol{\nu }})={\sum }_{m}{\nu }_{m}$ , it yields the total occupation number for the vectors ${\boldsymbol{\nu }}$ . In order to simulate systems with a fixed particle number, we are thus interested to find codes that implement code words of constant Hamming weight. Note that the fixed Hamming weight K does not necessarily need to coincide with the total particle number M. A code with the latter property might also be interesting for systems with additional symmetries. Most importantly, we have not taken into account the spin-multiplicity yet. As the particles in our system are fermions, every spatial site will typically have an even number of spin configurations associated with it. Orbitals with the same spin configurations naturally denote subsets of the total amount of orbitals, much like the suits in a card deck. An absence of magnetic terms as well as spin–orbit interactions leaves the Hamiltonian to conserve the number of particles inside all those suits. Consequently, we can append several constant-weight codes to each other. Each of those subcodes encodes thereby the orbitals inside one suit. In electronic system with only Coulomb interactions for instance, we can use two subcodes ( ${{\boldsymbol{e}}}^{\diamond }$ , ${{\boldsymbol{d}}}^{\diamond }$ ) and ( ${{\boldsymbol{e}}}^{\spadesuit}$ , ${{\boldsymbol{d}}}^{\spadesuit}$ ), to encode all spin-up, and spin-down orbitals, respectively. The global code ( ${\boldsymbol{e}},{\boldsymbol{d}}$ ), encoding the entire system, is obtained by appending the subcode functions e.g. ${\boldsymbol{d}}({{\boldsymbol{\omega }}}^{{\bf{1}}}\oplus {{\boldsymbol{\omega }}}^{{\bf{2}}})={{\boldsymbol{d}}}^{\diamond }({{\boldsymbol{\omega }}}^{{\bf{1}}})\oplus {{\boldsymbol{d}}}^{\spadesuit}({{\boldsymbol{\omega }}}^{{\bf{2}}}).$ Appending codes like this will help us to achieve higher savings at a lower gate cost.

The codes that we now introduce (see also again table 1), fulfill the task of encoding only constant-weight words differently well. The larger ${ \mathcal V }$ , the less qubits will be eliminated, but we expect the resulting gate sequences to be more simple. Although not just words of that weight are encoded, we treat K as a parameter—the targeted weight.

3.1.1. Checksum codes

A slim, constant amount of qubits can be saved with the following n = N − 1, affine linear codes. Checksum codes encode all the words with either even or odd Hamming weight. As this corresponds to exactly half of the Fock space, one qubit is eliminated. This means we disregard the last component when we encode ${\boldsymbol{\nu }}$ into words with one digit less. The decoding function then adds the missing component depending on the parity of the code words. The code for K odd is defined as

$\begin{eqnarray}{\boldsymbol{d}}({\boldsymbol{\omega }})=\left[\begin{array}{ccc}1 & & \\ & \ddots & \\ & & 1\\ 1 & \cdots & 1\end{array}\right]{\boldsymbol{\omega }}+\left(\begin{array}{c}0\\ \vdots \\ 0\\ 1\end{array}\right)\,\mathrm{mod}\,2,\end{eqnarray} \tag{ 34 }$

$\begin{eqnarray}{\boldsymbol{e}}({\boldsymbol{\nu }})=\left[\begin{array}{cccc}1 & & & 0\\ & \ddots & & \vdots \\ & & 1 & 0\end{array}\right]{\boldsymbol{\nu }}\,\mathrm{mod}\,2.\end{eqnarray} \tag{ 35 }$

In the even-K version, the affine vector ${{\boldsymbol{u}}}_{{\boldsymbol{N}}}$ , added in the decoding, is removed. Since encoding and decoding function are both at most affine linear, the extracted operators will all be Pauli strings, with at most a minus sign. The advantage of the checksum codes is that they do not depend on K. They can be used even in cases of smaller saving opportunities, like $K\approx N/2$ . We can employ these codes even for Hamiltonians that conserve only the Fermion parity. This makes them important for effective descriptions of superconductors [34].

3.1.2. Codes with binary addressing

We present a concept for heavily nonlinear codes for large qubit savings, $n=\lceil \mathrm{log}({N}^{K}/K!)\rceil$ , [28]. In order to conserve the maximum amount of qubits possible, we choose to encode particle coordinates as binary numbers in ${\boldsymbol{\omega }}$ . To keep it simple, we here consider the example of weight-one binary addressing codes, and refer the reader to appendix D for K > 1. In K = 1, we recognize the qubit savings to be exponential, so consider N = 2ⁿ. Encoding and decoding functions are defined by means of the binary enumerator, $\mathrm{bin}\,:{{\mathbb{Z}}}_{2}^{\otimes n}\to {\mathbb{Z}}$ , with $\mathrm{bin}({\boldsymbol{\omega }})={\sum }_{j=1}^{n}{2}^{j-1}{\omega }_{j}$

$\begin{eqnarray}&&{d}_{j}({\boldsymbol{\omega }})=\displaystyle \prod _{i=1}^{n}({\omega }_{i}+1+{q}_{i}^{\,j})\ \mathrm{mod}\ 2,\end{eqnarray} \tag{ 36 }$

$\begin{eqnarray}&&{\boldsymbol{e}}({\boldsymbol{\nu }})=[{{\boldsymbol{q}}}^{{\bf{1}}}| {{\boldsymbol{q}}}^{{\bf{2}}}| \cdots | {{\boldsymbol{q}}}^{{}^{{{\bf{2}}}^{{\boldsymbol{n}}}}}]{\boldsymbol{\nu }}\ \ \mathrm{mod}\ 2,\end{eqnarray} \tag{ 37 }$

where ${{\boldsymbol{q}}}^{{\boldsymbol{j}}}\in {{\mathbb{Z}}}_{2}^{\otimes n}$ is implicitly defined by $\mathrm{bin}({{\boldsymbol{q}}}^{{\boldsymbol{j}}})+1=j$ . An input ${\boldsymbol{\omega }}$ will by construction render only the jth component of (36) non-zero, when ${{\boldsymbol{q}}}^{{\boldsymbol{j}}}={\boldsymbol{\omega }}$ ⁵ .

The exponential qubit saving comes at a high cost: the product over each component of ${\boldsymbol{\omega }}$ implies multi-controlled gates on the entire register. This is likely to cause connectivity problems. Note that decomposing the controlled gates will in general be practically prohibited by the sheer amount of resulting terms. On top of those drawbacks, we also expect the encoding function to be nonlinear for K > 1.

3.1.3. Segment codes

We introduce a type of scaleable $n=\lceil N/\left(1+\tfrac{1}{2K}\right)\rceil$ codes to eliminate a linear amount of qubits. The idea of segment codes is to cut the vectors ${\boldsymbol{\nu }}$ into smaller, constant-size vectors ${\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}\in {{\mathbb{Z}}}_{2}^{\otimes \hat{N}}$ , such that ${\boldsymbol{\nu }}={\bigoplus }_{i}{\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ . Each such segment ${\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ is encoded by a subcode. Although we have introduced the concept already, this segmentation is independent from our treatment of spin 'suits'. In order to construct a weight K global code, we append several instances of the same subcode. Each of these subcodes codes is defined on $\hat{n}$ qubits, encoding $\hat{N}=\hat{n}+1$ orbitals. We deliberately have chosen to only save one qubit per segment in order to keep the segment size $\hat{N}(K)$ small.

We now turn our attention to the construction of these segment codes. As shown in appendix E, the segment sizes can be set to $\hat{n}=2K$ and $\hat{N}=2K+1$ . As the global code is supposed to encode all ${\boldsymbol{\nu }}\in {{\mathbb{Z}}}_{2}^{\otimes N}$ with Hamming weight K, each segment must encode all vectors from Hamming weight zero up to weight K. In this way, we guarantee that the encoded space contains the relevant, weight K subspace. This construction follows from the idea that each block contains equal or less than K particles, but might as well be empty. For each segment, the following de- and encoding functions are found for $\hat{{\boldsymbol{\omega }}}\in {{\mathbb{Z}}}_{2}^{\otimes \hat{n}},\hat{{\boldsymbol{\nu }}}\in {{\mathbb{Z}}}_{2}^{\otimes \hat{N}}$ :

$\begin{eqnarray}\hat{{\boldsymbol{d}}}(\hat{{\boldsymbol{\omega }}})=\left[\begin{array}{ccc}1 & & \\ & \ddots & \\ & & 1\\ 0 & ... & 0\end{array}\right]\hat{{\boldsymbol{\omega }}}+f(\hat{{\boldsymbol{\omega }}})\left(\begin{array}{c}1\\ \vdots \\ \vdots \\ 1\end{array}\right)\ \mathrm{mod}\ 2,\end{eqnarray} \tag{ 38 }$

$\begin{eqnarray}\hat{{\boldsymbol{e}}}(\hat{{\boldsymbol{\nu }}})=\left[\begin{array}{cccc}1 & & & 1\\ & \ddots & & \vdots \\ & & 1 & 1\end{array}\right]\hat{{\boldsymbol{\nu }}}\,\mathrm{mod}\,2,\end{eqnarray} \tag{ 39 }$

where $f\,:{{\mathbb{Z}}}_{2}^{\otimes \hat{n}}\to {{\mathbb{Z}}}_{2}$ is a binary switch. The switch is the source of nonlinearity in these codes. On an input $\hat{{\boldsymbol{\omega }}}$ with ${{\rm{w}}}_{{\rm{H}}}(\hat{{\boldsymbol{\omega }}})\gt K$ , it yields one, and zero otherwise.

There is just one problem: segment codes are not suitable for particle-number conserving Hamiltonians, according to the definition of the basis ${ \mathcal B }$ , that we would have for segment codes. The reason for this is that we have not encoded all states with ${{\rm{w}}}_{{\rm{H}}}({\boldsymbol{\nu }})\gt K$ . In this way, Hamiltonian terms ${\hat{h}}_{{\boldsymbol{ab}}}$ that exchange occupation numbers between two segments, can map into unencoded space. We can, however, adjust these terms, such that they only act non-destructively on states with at most K particles between the involved segment. This does not change the model, but aligns the Hamiltonian with the necessary condition that we have on ${ \mathcal B },{\hat{h}}_{{\boldsymbol{a}}{\boldsymbol{b}}}\,:\mathrm{span}({ \mathcal B })\,\to \mathrm{span}({ \mathcal B })$ . This is discussed in detail appendix E, where we also provide an explicit description of the binary switch mentioned earlier.

Using segment codes, the operator transforms will have multi-controlled gates as well: the binary switch is nonlinear. However, gates are controlled on at most an entire segment, which means there is no gate that acts on more than $2K$ qubits. This an improvement in gate locality, as compared to binary addressing codes.

4. Examples

4.1. Hydrogen molecule

In this subsection, we will demonstrate the Hamiltonian transformation on a simple problem. Choosing a standard example, we draw comparison with other methods for qubit reduction. As one of the simplest problems, the minimal electronic structure of the hydrogen molecule has been studied extensively for quantum simulation [3, 4] already. We describe the system as two electrons on 2 spatial sites. Because of the spin-multiplicity, we require 4 qubits to simulate the Hamiltonian in conventional ways. Using the particle conservation symmetry of the Hamiltonian, this number can be reduced. The Hamiltonian also lacks terms that mix spin-up and -down states, with the total spin polarization known to be zero. Taking into account these symmetries, one finds a total of 4 fermionic basis states: ${ \mathcal V }=\{(0,1,0,1),(0,1,1,0),(1,0,0,1),(1,0,1,0)\}$ . These can be encoded into two qubits by appending two instances of a (N = 2, n = 1, K = 1)-code. The global code is defined as :

$\begin{eqnarray}{\boldsymbol{d}}({\boldsymbol{\omega }})=\left[\begin{array}{cc}1 & \\ 1 & \\ & 1\\ & 1\end{array}\right]{\boldsymbol{\omega }}+\left(\begin{array}{c}1\\ 0\\ 1\\ 0\end{array}\right)\ \mathrm{mod}\ 2,\end{eqnarray} \tag{ 40 }$

$\begin{eqnarray}{\boldsymbol{e}}({\boldsymbol{\nu }})=\left[\begin{array}{cccc}0 & 1 & 0 & 0\\ 0 & 0 & 0 & 1\end{array}\right]{\boldsymbol{\nu }}\,\ \mathrm{mod}\ 2.\end{eqnarray} \tag{ 41 }$

The physical Hamiltonian

$\begin{eqnarray}H & = & -{h}_{11}({c}_{1}^{\dagger }{c}_{1}^{\ }+{c}_{3}^{\dagger }{c}_{3}^{\ })-{h}_{22}({c}_{2}^{\dagger }{c}_{2}^{\ }+{c}_{4}^{\dagger }{c}_{4}^{\ })\\ & & +{h}_{1331}\ {c}_{1}^{\dagger }{c}_{3}^{\dagger }{c}_{3}^{\ }{c}_{1}^{\ }+{h}_{2442}\ {c}_{2}^{\dagger }{c}_{4}^{\dagger }{c}_{4}^{\ }{c}_{2}^{\ }\\ & & +{h}_{1221}({c}_{1}^{\dagger }{c}_{4}^{\dagger }{c}_{4}^{\ }{c}_{1}^{\ }+{c}_{3}^{\dagger }{c}_{2}^{\dagger }{c}_{2}^{\ }{c}_{3}^{\ })\\ & & +({h}_{1221}-{h}_{1212})({c}_{1}^{\dagger }{c}_{2}^{\dagger }{c}_{2}^{\ }{c}_{1}^{\ }+{c}_{3}^{\dagger }{c}_{4}^{\dagger }{c}_{4}^{\ }{c}_{3}^{\ })\\ & & +{h}_{1212}({c}_{1}^{\dagger }{c}_{4}^{\dagger }{c}_{3}^{\ }{c}_{2}^{\ }+{c}_{2}^{\dagger }{c}_{3}^{\dagger }{c}_{4}^{\ }{c}_{1}^{\ })\\ & & +{h}_{1212}({c}_{1}^{\dagger }{c}_{3}^{\dagger }{c}_{4}^{\ }{c}_{2}^{\ }+{c}_{2}^{\dagger }{c}_{4}^{\dagger }{c}_{3}^{\ }{c}_{1}^{\ }),\end{eqnarray} \tag{ 42 }$

is transformed into the qubit Hamiltonian

$\begin{eqnarray}&&{g}_{1}\ {\mathbb{I}}+{g}_{2}\ {X}_{1}\otimes {X}_{2}+{g}_{3}\ {Z}_{1}+{g}_{4}\ {Z}_{2}+{g}_{5}\ {Z}_{1}\otimes {Z}_{2}.\end{eqnarray} \tag{ 43 }$

The real coefficients g_i are formed by the coefficients h_ijkl of (42). After performing the transformation, we find

$\begin{eqnarray}&&{g}_{1}=-{h}_{11}-{h}_{22}+\displaystyle \frac{1}{2}{h}_{1221}+\displaystyle \frac{1}{4}{h}_{1331}+\displaystyle \frac{1}{4}{h}_{2442},\end{eqnarray} \tag{ 44 }$

$\begin{eqnarray}&&{g}_{2}={h}_{1212},\end{eqnarray} \tag{ 45 }$

$\begin{eqnarray}&&{g}_{3}={g}_{4}=\displaystyle \frac{1}{2}{h}_{11}-\displaystyle \frac{1}{2}{h}_{22}+-\displaystyle \frac{1}{4}{h}_{1331}+\displaystyle \frac{1}{4}{h}_{2442},\end{eqnarray} \tag{ 46 }$

$\begin{eqnarray}&&{g}_{5}=-\displaystyle \frac{1}{2}{h}_{1221}+\displaystyle \frac{1}{4}{h}_{1331}+\displaystyle \frac{1}{4}{h}_{2442}.\end{eqnarray} \tag{ 47 }$

In previous works, conventional transforms have been applied to that problem Hamiltonian. Afterwards, the resulting 4-qubit-Hamiltonian has been reduced by hand in some way. In [11], the actions on two qubits are replaced with their expectation values after inspection of the Hamiltonian. In [30], on the other hand, the Hamiltonian is reduced to two qubits in a systematic fashion. Finally, the case is revisited in [31], where the problem is reduced below the combinatorical limit to one qubit. The latter two attempts have used Jordan–Wigner, the former the Bravyi–Kitaev transform first.

4.2. Fermi–Hubbard model

We present another example to illustrate the trade-off between qubit number and gate cost as well as circuit depth. For that purpose, we consider a simple toy Hamiltonian and demonstrate that a reduction of qubit requirements is theoretically possible. Although we do not want to claim that this scenario is realistic, we present a simple cost model with it, that hints the potential up-scaling of circuit depth and simulation cost, as the number of qubits decreases: we therefore consider the total sum of Pauli lengths of every term, which gives us an idea of the number of two-qubit gates required, and the number of Hamiltonian terms, as we decompose controlled gates (28), which should give us an idea of possible T-gate requirements and simulation depth. Let us start now to describe the model. We consider a small lattice with periodic boundary conditions in the lateral direction. The system shall contain 10 spatial sites, doubled by the spin-multiplicity. The problem Hamiltonian is

$\begin{eqnarray}H & = & -t\displaystyle \sum _{\langle i,j\rangle \in E}({c}_{i}^{\dagger }{c}_{j}^{\ }+{c}_{j}^{\dagger }{c}_{i}^{\ })\\ & & +U\displaystyle \sum _{j=1}^{10}{c}_{j}^{\dagger }{c}_{j}^{\ }\,{c}_{10+j}^{\dagger }{c}_{10+j}^{\ },\end{eqnarray} \tag{ 48 }$

with its real coefficients t, U. It exhibits hopping terms along the edges E of the graph in figure 1. The sketch on the left of this figure shows the connection graph of the first 10 orbitals. The other 10 orbitals are connected in the same fashion, and each such site is interacting with its counterpart from the other graph. We aim to populate this model with four Fermions, where the total spin polarization is zero. Two conventional transforms and two transforms based on our codes are compared by the amount of qubits necessary, as well as the size of the transformed Hamiltonian. Note that besides eigenenergies, one might also be interested in obtaining the values of correlation functions, e.g. $\langle {c}_{i}^{\dagger }{c}_{j}^{}\rangle$ , which is done by measuring (qubit) operators obtained with the transform (48). The only difference is that if a correlator maps into unencoded space, it is to be set to zero. As benchmarks, we decompose controlled gates and count the number of resulting Pauli strings. The sum of their total weight constitutes the gate count. Having these two disconnected graphs is an invitation to us to append two codes acting on sites 1–10 and 11–20 respectively. for this example, we consider the following codes:

1.
Jordan–Wigner and Bravyi–Kitaev transform: for comparison, we employ these conventional transforms on our system, with which we do not save qubits. The resulting terms are best obtained by the transforming every Fermion operator in (48) by (13), where the flip, parity and update sets, F(j), P(j), U(j) are determined by the choice of matrices A and A⁻¹, which are binary tree matrices in the the case of the Bravyi-Kitev transform, and identity matrices for the Jordan–Wigner transform.
2.
Checksum code ⊕ checksum code: knowing that the particle number is conserved, and that spin cannot be flipped, we are free to save 2 qubits in constraining the parity of both, spin-up and -down particles, alike. This is done in appending two (N = 10) checksum codes, where each that acts on only spin-up (spin-down) orbitals, so indices 1–10 (11–20). The code resulting from appending two even checksum codes is linear, and encoding and decoding function feature the matrices
$\begin{eqnarray}\left[\begin{array}{cccccccc}1 & & & 0 & & & & \\ & \ddots & & \vdots & & & & \\ & & 1 & 0 & & & & \\ & & & & 1 & & & 0\\ & & & & & \ddots & & \vdots \\ & & & & & & 1 & 0\end{array}\right],\ \left[\begin{array}{cccccc}1 & & & & & \\ & \ddots & & & & \\ & & 1 & & & \\ 1 & \cdots & 1 & & & \\ & & & 1 & & \\ & & & & \ddots & \\ & & & & & 1\\ & & & 1 & \cdots & 1\end{array}\right].\end{eqnarray} \tag{ 49 }$
However, as not the entire Fock space is encoded, we need to perform the operator transform according to (31), where the update operator is defined by (33), where A refers here to the first matrix.
3.
Segment code ⊕ segment code: knowing the particle number in one 'spin suite' to be 2, we can for both, spin-up and -down orbitals, append two K = 2 segment codes to each other. This equals a total of 4 segment codes, saving 4 qubits. The resulting global code $({\boldsymbol{e}},{\boldsymbol{d}})$ is defined by
$\begin{eqnarray}&&{\boldsymbol{e}}\left(\underset{i=1}{\overset{4}{\displaystyle \bigoplus }}{\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}\right)=\underset{i=1}{\overset{4}{\displaystyle \bigoplus }}\hat{{\boldsymbol{e}}}({\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}),\end{eqnarray} \tag{ 50 }$

$\begin{eqnarray}{\boldsymbol{d}}({\boldsymbol{\omega }})=\left(\left[\begin{array}{cccc}1 & & & \\ & 1 & & \\ & & 1 & \\ & & & 1\end{array}\right]\otimes \left[\begin{array}{cccc}1 & & & 1\\ & \ddots & & \vdots \\ & & 1 & 1\end{array}\right]\right){\boldsymbol{\omega }}\ \mathrm{mod}\ 2,\end{eqnarray} \tag{ 51 }$
where $\hat{{\boldsymbol{e}}}$ are the encodings of the subcodes (39), and ${\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ are occupations on the segements of the total orbital vector ${\boldsymbol{\nu }}={\bigoplus }_{i}{\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ . These segments are formed as suggested by the right-hand side of figure 1. For details on the decoding functions and Hamiltonian adjustments, please consider appendix E. The Hamiltonian transform is in the end carried out again by (31) and (33).
4.
Checksum code $\oplus$ segment code: a compromise between the above, in which the spin-up orbitals are transformed via a checksum code, and the spin-down orbitals are transformed via two segment codes. The global code used for the Hamiltonian transformation is the appendage of an (even-weight, N = 10) checksum code and two (K = 2) segment codes, including Hamiltonian adjustments on the spin-down orbitals.

**Figure 1.** Left: illustration of the Fermi–Hubbard model considered. Lines between two sites, like 1 and 2, indicate the appearance of the term $t({c}_{1}^{\dagger }{c}_{2}^{\ }+{c}_{2}^{\dagger }{c}_{1}^{\ })$ in the Hamiltonian (48). Periodic boundary conditions link sites 1 and 5 as well as 6 and 10. Sites 11–20 follow the same graph. Right: segmenting of the system; the two blocks are infringed. The gray links are to be adjusted.
Download figure:
Standard image High-resolution image

Note that from the combinatorial perspective, we could encode the problem with 11 qubits. However, if we append two K = 2 binary addressing codes to each other, the resulting Hamiltonian is on 14 qubits already. The problem is that the resulting Hamiltonian for this case cannot be expressed with decomposed controlled gates due to the high number of resulting terms.

Indeed, table 2 suggests that decomposing the controlling gates might easily lead to very large Hamiltonians with a multitude of very small terms. The gate decomposition appears therefore undesirable. We in general recommend to rather decompose large controlled gates as shown in [35]. However, one also notices that an elimination of up to two qubits comes at a low cost: the amount of gates is not higher than in the Bravyi–Kitaev transform. As soon as we employ segment codes on the other hand, the Hamiltonian complexity rises with the amount of qubits eliminated.

Table 2. Relaxing the qubit requirements for the Hamiltonian (48), where various mappings trade different amounts of qubits. The notation ⊕ is used as two codes for different graphs are appended. We compare different mappings by the amount of qubits. We make comparrisons by the number of Hamiltonian terms and the total weight of the resulting Pauli strings.

Mapping	Qubits	Gates	Terms
Jordan–Wigner transform	20	232	74
Bravyi–Kitaev transform	20	278	74
Checksum code ⊕ Checksum code	18	260	74
Checksum code ⊕ Segment code	17	4425	876
Segment code ⊕ Segment code	16	9366	1838

5. Conclusion and future work

In this work, we have introduced new methods to reduce the number of qubits required for simulating fermionic systems in second quantization. We see the virtue of the introduced concepts in the fact that it takes into account symmetries on a simple but non-abstract level. We merely concern ourselves with objects as simple as binary vectors, but attribute the physical interpretation of orbital occupations to them. At this level, the mentioned symmetries are easy to apply and exploit. The accounting for the complicated antisymmetrization of the many-body wave function on the other hand is done in the fermionic operators, which to transform we have provided recipes for. In these operator transforms we see room for improvement: we for instance lack a proper gate composition for update operators of nonlinear encodings at this point. We on the other hand have the extraction superoperator ${\mathfrak{X}}$ return only conventional (multi)-controlled phase gates. Nonlinear codes would on the other hand benefit from a gate set that includes gates with negative control, i.e. with the (−1) eigenvalue conditioned on $| 0\rangle$ eigenspaces of certain qubits involved. We consider our work to be relevant for quantum simulation with near-term devices, with a limited number of qubits at disposal. Remarks about asymptotic scaling are thus missing in this work, but would be interesting. Also, we have centered our investigations around quantum computers with qubits. The idea behind the generalized operator transforms, however, can possibly be adapted to multi-level systems (qudits). The operator transforms of segment and binary addressing codes, for instance, might simplify in such a setup, if generalized Pauli operators are available in some form.

Apart from the codes presented, we have laid the foundation for the reader to invent their own. As supplementary material, we include a program to transform arbitrary Hamiltonians from a second-quantized form into Pauli string form, using user-defined codes. In this way we hope that in the long term, many more entries will be added to table 1. The extension of this work to a more general setting for symmetries, in which the latter are generated by groups or sets of operators that commute with the Hamiltonian, is an open task. Furthermore, we are certain that table 1 can be extended into another way: gate relaxations for transforms with n > N have already been shown [2, 23, 26, 36], and we are currently working in that direction.

Acknowledgments

We would like to thank Kenneth Goodenough, Victoria Lipinska, Thinh Le Phuc and Valentina Caprara Vivoli for helpful discussions on how to present our results. We would also like to thank CWJ Beenakker for his support. MS was supported by the Netherlands Organization for Scientific Research (NWO/OCW) and an ERC Synergy Grant. SW was supported by STW Netherlands, an NWO VIDI Grant and an ERC Synergy Grant.

Appendix A.: On quantum simulation

At this point, we discuss quantum simulation in the context of our transformations. Amongst other things, we describe the most simple algorithm for Hamiltonian simulation, and proceed by investigating feasibility issues with our transforms Let us start by explaining how this work fits into the larger frame.

The transformations we have developed are going to be useful to trade quantum resources for quantum simulation of fermionic systems, independent from the concrete quantum algorithms chosen for simulation of the problem. For those problems from quantum chemistry and many-body physics we are usually given a fermionic system and its Hamiltonian. One is then to determine the system's ground state and ground state energy, sometimes parts of its spectrum. Where classical computation is infeasible, we simulate the system inside a quantum computer, on which the problem can be solved with existing algorithms. With either transform (see table 1), the fermionic system is therefore mapped to a system of n qubits. With the operator transform, H turns into ${\mathsf{H}}$ , a sum of weighted ${ \mathcal L }({({{\mathbb{C}}}^{2})}^{\otimes n})$ gates, Pauli strings at best. We then apply algorithms like quantum phase estimation [3, 37–39], variational quantum eigensolvers [9, 11, 12, 40] , or adiabatic simulations [6]. All of those algorithms receive ansatz states as inputs and in some way prepare (eigen-) states, while also outputting their energy. The ground state is the state with the lowest energy, and can be then manipulated as it is inside the quantum registers after the simulation. For the remainder of this appendix, we discuss implications of the simulation algorithms onto our transforms Thus we outline some principles, these algorithms rely on: algorithms might require us to simulate the time evolution of our encoded system according to ${\mathsf{H}}$ . For that purpose, we need to know how to transform the time evolution operator $\exp ({\rm{i}}{\mathsf{Ht}})$ , where ${\mathsf{t}}$ is a time step, into gate sequences. Maybe we even need to apply those evolution conditionally, means as an operation controlled on an auxiliary qubit (register). We thus need to embed ${\mathsf{H}}$ into an algorithm for Hamiltonian simulation.

Let us now be a bit more concrete, and select such an algorithm. Despite the wide range of theoretical proposals for Hamiltonian simulation algorithms [41–43], only the perhaps simplest scheme appears to be experimentally feasible for digital quantum simulation at the moment. Note that it can only be applied to Hamiltonians that are a sum of Pauli strings with real weights

$\begin{eqnarray}&&{\mathsf{H}}=\displaystyle \sum _{\sigma \in {\{X,Y,Z,{\mathbb{I}}\}}^{\otimes n}}\ {\theta }_{\sigma }\times \sigma \qquad {\rm{with}}\,{\rm{all}}\quad {\theta }_{\sigma }\in {\mathbb{R}}.\end{eqnarray} \tag{ A1 }$

The idea is to approximate $\exp ({\rm{i}}{\mathsf{Ht}})$ , by sequences of the exponentiated Pauli strings $\exp ({\rm{i}}{\theta }_{\sigma }\,{\mathsf{s}}\,\sigma )$ , where ${\mathsf{s}}$ is a time slice of ${\mathsf{t}}$ . This method is commonly referred to as Trotterization. The numbers, signs and values of the time slices ${\mathsf{s}}$ , as well as the ordering of the exponentiated strings, govern the error of the simulation—strategies to minimize that error can be learned from the works of Suzuki [44, 45]. Note that we do not specify whether the Hamiltonian simulation is performed in an analog or digital fashion, however, not all strings σ are feasible to be implemented in an analog fashion. The digital gadget for the exponentiation of Pauli strings, on the other hand, is well known [46]. See figure A1 for an example. We are therefore able to approximately perform a (conditional) simulated time evolution with ${\mathsf{H}}$ of the form (A1). Using algorithms like variational eigensolvers, where we do not simulate the time evolution but estimate the Hamiltonian expectation value by measuring its terms, we are in principle not tied to the structure of (A1). However, it is more convenient. Equation (A1) gives us two constraints on how to transform (11).

**Figure A1.** Implementing $\exp ({\rm{i}}\phi \,{X}_{j}\otimes {X}_{k}\otimes {Z}_{l}\otimes {Z}_{m})$ , conditional on qubit 'phase'. $\phi ={\mathsf{s}}\,{\theta }_{{}_{{X}_{j}\otimes {X}_{k}\otimes {Z}_{l}\otimes {Z}_{m}}}$ is a real rotation angle, where ${\mathsf{s}}$ , is a time slice, and ${\theta }_{{}_{{X}_{j}\otimes {X}_{k}\otimes {Z}_{l}\otimes {Z}_{m}}}$ is the Hamiltonian weight of the string ${X}_{j}\otimes {X}_{k}\otimes {Z}_{l}\otimes {Z}_{m}$ , as in (A1).
Download figure:
Standard image High-resolution image

**Figure A1.** Implementing $\exp ({\rm{i}}\phi \,{X}_{j}\otimes {X}_{k}\otimes {Z}_{l}\otimes {Z}_{m})$ , conditional on qubit 'phase'. $\phi ={\mathsf{s}}\,{\theta }_{{}_{{X}_{j}\otimes {X}_{k}\otimes {Z}_{l}\otimes {Z}_{m}}}$ is a real rotation angle, where ${\mathsf{s}}$ , is a time slice, and ${\theta }_{{}_{{X}_{j}\otimes {X}_{k}\otimes {Z}_{l}\otimes {Z}_{m}}}$ is the Hamiltonian weight of the string ${X}_{j}\otimes {X}_{k}\otimes {Z}_{l}\otimes {Z}_{m}$ , as in (A1).
Download figure:
Standard image High-resolution image

The first constraint is that we need to decompose every fermionic operator into Pauli strings, using (28). The total number of Pauli strings resulting can be a problematically high when the underlying codes are highly nonlinear. For Trotterization that means a tremendous increase in length due to the abundance of sequenced Pauli string gadgets, many of them with very small rotation angles (ϕ in figure A1).

The second constraint seems trivial at first: in order to simulate a Hamiltonian, it has to be hermitian. More precisely, it has to be hermitian on the entire ${({{\mathbb{C}}}^{2})}^{\otimes n}$ , so the coefficient θ_σ have to be real. We on the other hand might not even need the entire ${({{\mathbb{C}}}^{2})}^{\otimes n}$ to encode our physical system. Non-hermicities, meaning complex coefficients θ_σ, can occur whenever one is careless with the remainder of the qubit space, when the code space is left or states are encoded in an ambiguous way. We here list a few pitfalls that can cause non-hermitian terms to occur after the transform and discuss how to avoid them.

Issues may be caused by codes that are not one-to-one. A one-to-one code $({\boldsymbol{e}},{\boldsymbol{d}})$ has the property: ${\boldsymbol{e}}({\boldsymbol{d}}({\boldsymbol{\omega }}))={\boldsymbol{\omega }}$ for all ${\boldsymbol{\omega }}\in {{\mathbb{Z}}}_{2}^{\otimes n}$ . Although we have excluded the one-to-one property from the definition of the codes (taing into account the next item), it assures the hermiticity of the transformed Hamiltonian.
The encoded basis ${ \mathcal B }$ has a size that is in between 2ⁿ and 2ⁿ⁻¹, so n qubits provide too much Hilbert space by default. However, we can always add a state to the basis that is mapped to zero by all terms ${\hat{h}}_{{\boldsymbol{ab}}}$ . This state, represented by ν, can have several partners on the code space ${\boldsymbol{\omega }}$ , for which ${\boldsymbol{d}}({\boldsymbol{\omega }})={\boldsymbol{\nu }}$ (i.e. not be mapped one-to-one). For example for particle-number conserving Hamiltonians, we can balance these dimensional mismatches using the vacuum state in such a way, since ${c}_{i}^{\dagger }{c}_{j}^{\ }| {\rm{\Theta }}\rangle =0$ .
We encounter this problem when using a code with a Hamiltonian, that is not feasible with it. The segment codes for instance are feasible only for certain adjusted particle-number-conserving Hamiltonians, as we shall see in appendix E.

Appendix B.: General operator mappings

The goal of this appendix is to verify that the fermionic mode is accurately represented by our qubit system. This is divided into three steps: step one is to analyze the action of Hamiltonian terms on the fermionic basis. In the second step, we verify parity and projector parts of (31) to work like the original operators in step one, disregarding the occupational update for a moment. Conditions for this state update are subsequently derived. The update operator (32) is shown to fulfill these conditions in the third step, thus concluding the proof.

B.1. Hamiltonian dynamics

In order to verify that the gate sequences (31) are mimicking the Hamiltonian dynamics adequately, we verify that the resulting terms have the same effect on the Hamiltonian basis. This is done on the level of second quantization with respect to the notation (18): no transition into a qubit system is made. This step serves the sole purpose to quantify the effect of the Hamiltonian terms on the states. To that end, we begin by studying the effect of a singular fermionic operator ${c}_{j}^{(\dagger )}$ on a pure state, before considering an entire term ${\hat{h}}_{{\boldsymbol{ab}}}$ on a state in ${ \mathcal B }$ . As a preliminary, we note that (3)–(6) follow directly from (2), when considering that

$\begin{eqnarray}&&{c}_{j}^{\ }{c}_{j}^{\ }={c}_{j}^{\dagger }{c}_{j}^{\dagger }={c}_{j}^{\ }| {\rm{\Theta }}\rangle =0.\end{eqnarray} \tag{ B1 }$

The relations (3)–(6) indicate how singular operators act on pure states in general. We now become more specific and apply these rules to a state $({\prod }_{i}{({c}_{i}^{\dagger })}^{{\nu }_{i}})| {\rm{\Theta }}\rangle$ , that is not necessarily in ${ \mathcal B }$ , but is described by an occupation vector ${\boldsymbol{\nu }}\in {{\mathbb{Z}}}_{2}^{\otimes N}$ . The effect of an annihilation operator on such a state is considered first:

$\begin{eqnarray}&&{c}_{j}^{\,}\left[\displaystyle \prod _{i=1}^{N}{({c}_{i}^{\dagger })}^{{\nu }_{i}}\right]| {\rm{\Theta }}\rangle =\left[\displaystyle \prod _{i\lt j}{(-{c}_{i}^{\dagger })}^{{\nu }_{i}}\right]\ \ {c}_{j}^{\,}{({c}_{j}^{\dagger })}^{{\nu }_{j}}\ \ \left[\displaystyle \prod _{k\gt j}{({c}_{k}^{\dagger })}^{{\nu }_{k}}\right]| {\rm{\Theta }}\rangle \ \end{eqnarray} \tag{ B2 }$

$\begin{eqnarray}&&=\,\left[\displaystyle \prod _{i\lt j}{(-{c}_{i}^{\dagger })}^{{\nu }_{i}}\right]\ \ \displaystyle \frac{1}{2}[1-{(-1)}^{{\nu }_{j}}]\ \ \left[\displaystyle \prod _{k\gt j}{({c}_{k}^{\dagger })}^{{\nu }_{k}}\right]| {\rm{\Theta }}\rangle \ \end{eqnarray} \tag{ B3 }$

$\begin{eqnarray}&&=\,\left[\displaystyle \prod _{i\lt j}{(-1)}^{{\nu }_{i}}\right]\ \ \displaystyle \frac{1}{2}[1-{(-1)}^{{\nu }_{j}}]\ \ \left[\displaystyle \prod _{k=1}^{N}{({c}_{k}^{\dagger })}^{{\nu }_{k}+{\delta }_{{jk}}\mathrm{mod}2}\right]| {\rm{\Theta }}\rangle \ .\end{eqnarray} \tag{ B4 }$

A short explanation on what has happened: in (B2), c_j has anticommuted with all creation operator ${c}_{i}^{\dagger }$ that have indexes i < j. Depending on the component ν_j, a creation operator ${c}_{j}^{\dagger }$ might now be to the right of the annihilator c_j. If the creation operator is not encountered, we may continue the anticommutations of c_j until it meets the vacuum and annihilates the state by ${c}_{j}| {\rm{\Theta }}\rangle =0$ . Using the anticommutation relations (2), we therefore replace ${c}_{j}^{\,}{({c}_{j}^{\dagger })}^{{\nu }_{j}}$ with $\tfrac{1}{2}[1-{(-1)}^{{\nu }_{j}}]$ when going from (B2) to (B3). Finally, the terms are rearranged in (B4): conditional sign changes of the anticommutations are factored out of the new state with an occupation that is now described by the binary vector $({\boldsymbol{\nu }}+{{\boldsymbol{u}}}_{{\boldsymbol{j}}}\ \mathrm{mod}\ 2)$ rather than ${\boldsymbol{\nu }}$ . When considering to apply a creation operator ${c}_{j}^{\dagger }$ on the former state, the result is similar. Alone at step (B3), we have to replace ${c}_{j}^{\dagger }{({c}_{j}^{\dagger })}^{{\nu }_{j}}$ by $\tfrac{1}{2}[1+{(-1)}^{{\nu }_{j}}]$ instead, as now the case of appearance of the creation operator leads to annihilation: ${c}_{j}^{\dagger }{c}_{j}^{\dagger }=0$ . We thus find

$\begin{eqnarray}&&{c}_{j}^{\dagger }\left[\displaystyle \prod _{i=1}^{N}{({c}_{i}^{\dagger })}^{{\nu }_{j}}\right]| {\rm{\Theta }}\rangle =\left[\displaystyle \prod _{i\lt j}{(-1)}^{{\nu }_{i}}\right]\ \ \displaystyle \frac{1}{2}[1+{(-1)}^{{\nu }_{j}}]\ \ \left[\displaystyle \prod _{k=1}^{N}{({c}_{k}^{\dagger })}^{{\nu }_{k}+{\delta }_{{jk}}\mathrm{mod}2}\right]| {\rm{\Theta }}\rangle .\end{eqnarray} \tag{ B5 }$

We now turn our attention to the actual goal, effect of a Hamiltonian term from (11) on a state in ${ \mathcal B }$ (this means its occupation vector ${\boldsymbol{\nu }}$ is in ${ \mathcal V }$ ). We therefore consider a generic operator sequence ${\prod }_{i=1}^{l}{({c}_{{a}_{i}}^{\dagger })}^{{b}_{i}}{({c}_{{a}_{i}}^{})}^{1+{b}_{i}\mathrm{mod}2}$ , parametrized by some N-ary vector ${\boldsymbol{a}}\in {[N]}^{\otimes l}$ and a binary vector ${\boldsymbol{b}}\in {{\mathbb{Z}}}_{2}^{\otimes l}$ , for some length l. With (B4) and (B5), we now have the means to consider the effect such a sequence of annihilation and creation operators. The two relations will be repeatedly utilized in an inductive procedure, as every single operator ${({c}_{{a}_{i}}^{\dagger })}^{{b}_{i}}{({c}_{{a}_{i}}^{})}^{1+{b}_{i}\mathrm{mod}2}$ of ${\prod }_{i=1}^{l}{({c}_{{a}_{i}}^{\dagger })}^{{b}_{i}}{({c}_{{a}_{i}}^{})}^{1+{b}_{i}\mathrm{mod}2}$ will act on a basis state, one after another. The state's occupation is updated after every such operation. For convenience, we define:

$\begin{eqnarray}&&{{\boldsymbol{\nu }}}^{({\boldsymbol{i}})}\in {{\mathbb{Z}}}_{2}^{\otimes N}\quad | \ i\in \{0,\,...,\,l\},\end{eqnarray} \tag{ B6 }$

$\begin{eqnarray}&&{{\boldsymbol{\nu }}}^{({\boldsymbol{l}})}={\boldsymbol{\nu }}\in { \mathcal V },\end{eqnarray} \tag{ B7 }$

$\begin{eqnarray}&&{{\boldsymbol{\nu }}}^{({\boldsymbol{i}}-{\bf{1}})}={{\boldsymbol{\nu }}}^{({\boldsymbol{i}})}+{{\boldsymbol{u}}}_{{{\boldsymbol{a}}}_{{\boldsymbol{i}}}}\ \ \mathrm{mod}\ 2.\end{eqnarray} \tag{ B8 }$

Now, the procedure starts:

$\begin{eqnarray}&&\left[\,\displaystyle \prod _{i=1}^{l}{({c}_{{a}_{i}}^{\dagger })}^{{b}_{i}}{({c}_{{a}_{i}}^{})}^{1+{b}_{i}\mathrm{mod}2}\right]\ \left[\,\displaystyle \prod _{k=1}^{N}{({c}_{k}^{\dagger })}^{{\nu }_{k}}\right]| {\rm{\Theta }}\rangle \end{eqnarray} \tag{ B9 }$

$\begin{eqnarray}&&=\,\left[\,\displaystyle \prod _{i=1}^{l-1}{({c}_{{a}_{i}}^{\dagger })}^{{b}_{i}}{({c}_{{a}_{i}}^{})}^{1+{b}_{i}\mathrm{mod}2}\right]\ \displaystyle \frac{1}{2}[1-{(-1)}^{{b}_{l}}{(-1)}^{{\nu }_{{a}_{l}}}]{(-1)}^{{\displaystyle \sum }_{j\lt {a}_{l}}{\nu }_{j}}\left[\displaystyle \prod _{k=1}^{N}{({c}_{k}^{\dagger })}^{{\nu }_{k}+{\delta }_{{a}_{l}k}\mathrm{mod}2}\right]| {\rm{\Theta }}\rangle \end{eqnarray} \tag{ B10 }$

$\begin{eqnarray}&&=\,\left[\displaystyle \frac{1}{2}[1-{(-1)}^{{b}_{l}}{(-1)}^{{\nu }_{{a}_{l}}^{(l)}}]{(-1)}^{{\displaystyle \sum }_{j\lt {a}_{l}}{\nu }_{j}^{(l)}}\right]\ \left[\,\displaystyle \prod _{i=1}^{l-1}{({c}_{{a}_{i}}^{\dagger })}^{{b}_{i}}{({c}_{{a}_{i}}^{})}^{1+{b}_{i}\mathrm{mod}2}\right]\ \left[\displaystyle \prod _{k=1}^{N}{({c}_{k}^{\dagger })}^{{\nu }_{k}^{(l-1)}}\right]| {\rm{\Theta }}\rangle \end{eqnarray} \tag{ B11 }$

$\begin{eqnarray}&&=\,\left[\,\displaystyle \prod _{i=1}^{l}\mathop{\underbrace{\displaystyle \frac{1}{2}[1-{(-1)}^{{b}_{i}}{(-1)}^{{\nu }_{{a}_{i}}^{(i)}}]}}\limits_{{\rm{projector}}\,{\rm{eigenvalues}}}\mathop{\underbrace{{(-1)}^{{\displaystyle \sum }_{j\lt {a}_{i}}{\nu }_{j}^{(i)}}}}\limits_{{\rm{parity}}\,{\rm{signs}}}\right.]\ \mathop{\underbrace{\left[\,\displaystyle \prod _{k=1}^{N}{({c}_{k}^{\dagger })}^{{\nu }_{k}^{(0)}}\right]| {\rm{\Theta }}\rangle }}\limits_{{\rm{updated}}\,{\rm{state}}}.\end{eqnarray} \tag{ B12 }$

We again explain what has happened: first, the rightmost operator, which is either ${c}_{{a}_{l}}^{\ }$ or ${c}_{{a}_{l}}^{\dagger }$ depending on the parameter b_l, acts on the state according to either (B4) or (B5). We therefore combine the two relations for the absorption of this operator ${({c}_{{a}_{l}}^{\dagger })}^{{b}_{l}}{({c}_{{a}_{l}}^{})}^{1+{b}_{l}\mathrm{mod}2}$ in (B10). In the same fashion, all the remaining operators of the sequence are one-after-another absorbed into the state. The new state is described by the vector ${{\boldsymbol{\nu }}}^{({\boldsymbol{l}}-{\bf{1}})}$ after the update. And the cycle begins anew with ${({c}_{{a}_{l-1}}^{\dagger })}^{{b}_{l-1}}{({c}_{{a}_{l-1}}^{})}^{1+{b}_{l-1}\mathrm{mod}2}$ . From (B11) on, we use the notations (B6)–(B8) to describe partially updated occupations. By the end of this iteration, the occupation of the state is changed to $\,{{\boldsymbol{\nu }}}^{({\bf{0}})}={\boldsymbol{\nu }}+{\boldsymbol{q}}\ \mathrm{mod}\ 2\$ , with the total change $\ {\boldsymbol{q}}={\sum }_{i}{{\boldsymbol{u}}}_{{{\boldsymbol{a}}}_{{\boldsymbol{i}}}}\ \mathrm{mod}\ 2$ . Also, the coefficients of (B12) take into account sign changes from anticommutations ('parity signs' in (B12)) and the eigenvalues of the applied projections. In its entirety, (B12) denotes the resulting state, and is the main ingredient for the next step.

B.2. Parity operators and projectors

We are given the operator transform (31) and the state transform (19). We want to show the that the Fermion system is adequately simulated, which means to show that the effect (B12) is replicated by (31) acting on $| {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle$ . This is the goal of the next two steps. We start by evaluating the application of (31) on that state, up to the update operator ${{ \mathcal U }}^{a}$ . This means that the operators applied implement two things only: the parity signs of (B12), and the projection onto the correct occupational state. Note that these parity operators and projectors are applied before the update operator in (31):

$\begin{eqnarray}&&\mathop{\underbrace{{{ \mathcal U }}^{{\boldsymbol{a}}}}}\limits_{{\rm{update}}\,{\rm{operator}}}\mathop{\underbrace{\left(\displaystyle \prod _{v=1}^{l-1}\,\displaystyle \prod _{w=v+1}^{l}{(-1)}^{{\theta }_{{a}_{v}{a}_{w}}}\right)}}\limits_{{\rm{parity}}\,{\rm{operators}}}\displaystyle \prod _{x=1}^{l}\mathop{\underbrace{\displaystyle \frac{1}{2}\left({\mathbb{I}}-\left[\displaystyle \prod _{y=x+1}^{l}{(-1)}^{{\delta }_{{a}_{x}{a}_{y}}}\right]{(-1)}^{{b}_{x}}\ {\mathfrak{X}}[{d}_{{a}_{x}}]\right)}}\limits_{{\rm{projectors}}}\mathop{\underbrace{{\mathfrak{X}}[\,{p}_{{a}_{x}}]}}\limits_{{\rm{parity}}\,{\rm{operators}}}.\end{eqnarray} \tag{ B13 }$

We now commence our evaluation:

$\begin{eqnarray}&&{{ \mathcal U }}^{{\boldsymbol{a}}}\left[\left(\displaystyle \prod _{v=1}^{l-1}\,\displaystyle \prod _{w=v+1}^{l}{(-1)}^{{\theta }_{{a}_{v}{a}_{w}}}\right)\displaystyle \prod _{x=1}^{l}\displaystyle \frac{1}{2}\left({\mathbb{I}}-\left[\displaystyle \prod _{y=x+1}^{l}{(-1)}^{{\delta }_{{a}_{x}{a}_{y}}}\right]{(-1)}^{{b}_{x}}\ {\mathfrak{X}}[{d}_{{a}_{x}}]\right){\mathfrak{X}}[\,{p}_{{a}_{x}}]\right]\quad | {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \end{eqnarray} \tag{ B14 }$

$\begin{eqnarray}&&=\ {{ \mathcal U }}^{{\boldsymbol{a}}}\left[\left(\displaystyle \prod _{v=1}^{l-1}\,\displaystyle \prod _{w=v+1}^{l}{(-1)}^{{\theta }_{{a}_{v}{a}_{w}}}\right)\displaystyle \prod _{x=1}^{l}\displaystyle \frac{1}{2}\left(1-\left[\displaystyle \prod _{y=x+1}^{l}{(-1)}^{{\delta }_{{a}_{x}{a}_{y}}}\right]{(-1)}^{{b}_{x}}\ {(-1)}^{{d}_{{a}_{x}}({\boldsymbol{e}}({\boldsymbol{\nu }}))}\right){(-1)}^{{p}_{{a}_{x}}({\boldsymbol{e}}({\boldsymbol{\nu }}))}\right]\quad | {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \end{eqnarray} \tag{ B15 }$

$\begin{eqnarray}&&=\ {{ \mathcal U }}^{{\boldsymbol{a}}}\left[\left(\displaystyle \prod _{v=1}^{l-1}\,\displaystyle \prod _{w=v+1}^{l}{(-1)}^{{\theta }_{{a}_{v}{a}_{w}}}\right)\displaystyle \prod _{x=1}^{l}\displaystyle \frac{1}{2}\left(1-\left[\displaystyle \prod _{y=x+1}^{l}{(-1)}^{{\delta }_{{a}_{x}{a}_{y}}}\right]{(-1)}^{{b}_{x}}\ {(-1)}^{{\nu }_{{a}_{x}}}\right){(-1)}^{{\displaystyle \sum }_{j\lt {a}_{x}}{\nu }_{j}}\right]\quad | {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \end{eqnarray} \tag{ B16 }$

$\begin{eqnarray}&&=\ {{ \mathcal U }}^{{\boldsymbol{a}}}\ \left[\displaystyle \prod _{x=1}^{l}\displaystyle \frac{1}{2}\left(1-{(-1)}^{{b}_{x}}\ {(-1)}^{{\nu }_{{a}_{x}}+{\displaystyle \sum }_{y=x+1}^{l}{\delta }_{{a}_{x}{a}_{y}}}\right){(-1)}^{{\displaystyle \sum }_{j\lt {a}_{x}}{\nu }_{j}+{\displaystyle \sum }_{y=x+1}^{l}{\theta }_{{a}_{x}{a}_{y}}}\right]\quad | {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \end{eqnarray} \tag{ B17 }$

$\begin{eqnarray}&&=\,\left[\displaystyle \prod _{x=1}^{l}\displaystyle \frac{1}{2}(1-{(-1)}^{{b}_{x}}\ {(-1)}^{{\nu }_{{a}_{x}}^{(x)}}){(-1)}^{{\displaystyle \sum }_{j\lt {a}_{x}}{\nu }_{j}^{(x)}}\right]\quad {{ \mathcal U }}^{{\boldsymbol{a}}}\ | {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle .\end{eqnarray} \tag{ B18 }$

Let us describe what has happened: in (B15), the extraction property (21) is used, and we arrive at (B16) after using the property ${\boldsymbol{d}}({\boldsymbol{e}}({\boldsymbol{\nu }}))={\boldsymbol{\nu }}$ and the definition of the parity function. From there we go to (B17) when we merge the two products and perform rearrangements that make it easy to cast all delta and theta functions into the components of the partially updated occupations ${{\boldsymbol{\nu }}}^{({\boldsymbol{i}})}$ , (B18).

Comparing (B18) to (B12), we notice to have successfully mimicked the same sign changes and and projections, as the coefficients in both relations match. Now it is only left to show that the state update is executed correctly. Naively, one would think that we would need to show that

$\begin{eqnarray}&&{{ \mathcal U }}^{{\boldsymbol{a}}}\ | {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \hat{=}\left[\,\displaystyle \prod _{k=1}^{N}{({c}_{k}^{\dagger })}^{{\nu }_{k}^{(0)}}\right]| {\rm{\Theta }}\rangle ,\end{eqnarray} \tag{ B19 }$

but this is too strong a statement. It is in fact sufficient to demand

$\begin{eqnarray}&&{{ \mathcal U }}^{{\boldsymbol{a}}}| {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle =| {\boldsymbol{e}}({{\boldsymbol{\nu }}}^{({\bf{0}})})\rangle \ =\ | {\boldsymbol{e}}({\boldsymbol{\nu }}+{\boldsymbol{q}}\ \mathrm{mod}\ 2)\rangle .\end{eqnarray} \tag{ B20 }$

For ${{\boldsymbol{\nu }}}^{({\bf{0}})}\in { \mathcal V }$ , (B19) and (B20) is equivalent. However, it might be the case that ${{\boldsymbol{\nu }}}^{({\bf{0}})}\notin { \mathcal V }$ , so ${{\boldsymbol{\nu }}}^{({\bf{0}})}$ is not encoded. This mean that (B19) is not fulfilled, since ${\boldsymbol{d}}({\boldsymbol{e}}({{\boldsymbol{\nu }}}^{({\bf{0}})}))\ne {{\boldsymbol{\nu }}}^{({\bf{0}})}$ . It is however not necessary to include ${{\boldsymbol{\nu }}}^{({\bf{0}})}$ in the encoding, as for ${{\boldsymbol{\nu }}}^{({\bf{0}})}\notin { \mathcal V }$ , the state will vanish anyways: we know from ${\hat{h}}_{{\boldsymbol{ab}}}\,:\mathrm{span}({ \mathcal B })\to \mathrm{span}({ \mathcal B })$ , that in this case ${\hat{h}}_{{\boldsymbol{ab}}}$ must act destructively on that basis state, ${\hat{h}}_{{\boldsymbol{ab}}}\,({\prod }_{k}{({c}_{k}^{\dagger })}^{{\nu }_{k}})| {\rm{\Theta }}\rangle =0$ . This detail is implemented by the projector part of the transformed sequence (31). These projectors are, as we have just shown, working faithfully like (B12), for the transformed sequence acting on every $| {\boldsymbol{\nu }}\rangle$ with ${\boldsymbol{\nu }}\in { \mathcal V }$ . Hence (B20) is a sufficient condition for the updated state. The proof is completed once we have verified that (B20) is satisfied with the update operator defined as in (32). This is done during the next step.

B.3. Update operator

The missing piece of the proof is to check that (32) and (33) fulfill the condition (B20). We start by verifying the condition (B20) for (33), which we have presented as special case of (32) for linear encoding functions: ${\boldsymbol{e}}({\boldsymbol{\nu }}+{{\boldsymbol{\nu }}}^{{\prime} }\ \mathrm{mod}\ 2)={\boldsymbol{e}}({\boldsymbol{\nu }})+{\boldsymbol{e}}({{\boldsymbol{\nu }}}^{{\prime} })\ \mathrm{mod}\ 2$ . Using that property, one can in fact derive (33) from (32) directly. We now apply (33) to $| {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle$ , but firstly we note that

$\begin{eqnarray}&&{X}_{j}| {\boldsymbol{\omega }}\rangle =| {\boldsymbol{\omega }}+{{\boldsymbol{u}}}_{{\boldsymbol{j}}}\ \mathrm{mod}\ 2\rangle ,\end{eqnarray} \tag{ B21 }$

where ${{\boldsymbol{u}}}_{{\boldsymbol{j}}}$ is the jth unit vector of ${{\mathbb{Z}}}_{2}^{\otimes n}$ . Using (B21) and the linearity of ${\boldsymbol{e}}$ , we find:

$\begin{eqnarray}&&{{ \mathcal U }}^{{\boldsymbol{a}}}| {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle =\left[\underset{i=1}{\overset{n}{\displaystyle \bigotimes }}{({X}_{i})}^{{\displaystyle \sum }_{j}{A}_{{ij}}{q}_{j}\mathrm{mod}2}\right]| {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \end{eqnarray} \tag{ B22 }$

$\begin{eqnarray}&&=\left[\underset{i=1}{\overset{n}{\displaystyle \bigotimes }}{({X}_{i})}^{{\boldsymbol{e}}({\boldsymbol{q}})}\right]| {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \end{eqnarray} \tag{ B23 }$

$\begin{eqnarray}&&=| {\boldsymbol{e}}({\boldsymbol{\nu }})+{\boldsymbol{e}}({\boldsymbol{q}})\ \mathrm{mod}\ 2\rangle \end{eqnarray} \tag{ B24 }$

$\begin{eqnarray}&&=| {\boldsymbol{e}}({\boldsymbol{\nu }}+{\boldsymbol{q}}\ \mathrm{mod}\ 2)\rangle ,\end{eqnarray} \tag{ B25 }$

which shows (B20) for linear encodings.

We now turn our attention to general encodings and prove the same expression for update operators as defined in (32):

$\begin{eqnarray}&&{{ \mathcal U }}^{{\boldsymbol{a}}}| {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle =\left(\displaystyle \sum _{{\boldsymbol{t}}\,\in {{\mathbb{Z}}}_{2}^{\otimes n}}\left[\underset{i=1}{\overset{n}{\displaystyle \bigotimes }}{({X}_{i})}^{{t}_{i}}\right]\displaystyle \prod _{j=1}^{n}\displaystyle \frac{1}{2}({\mathbb{I}}+{(-1)}^{{t}_{j}}\ {\mathfrak{X}}[{\varepsilon }_{j}^{\,{\boldsymbol{q}}}])\right)\ | {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \end{eqnarray} \tag{ B26 }$

$\begin{eqnarray}&&=\left(\displaystyle \sum _{{\boldsymbol{t}}\,\in {{\mathbb{Z}}}_{2}^{\otimes n}}\left[\underset{i=1}{\overset{n}{\displaystyle \bigotimes }}{({X}_{i})}^{{t}_{i}}\right]\displaystyle \prod _{j=1}^{n}\mathop{\underbrace{\displaystyle \frac{1}{2}(1+{(-1)}^{{t}_{j}+{\varepsilon }_{j}^{{\boldsymbol{q}}}({\boldsymbol{e}}({\boldsymbol{\nu }}))})}}\limits_{\qquad {\delta }_{{t}_{j}^{}{\varepsilon }_{j}^{{\boldsymbol{q}}}({\boldsymbol{e}}({\boldsymbol{\nu }}))}}\right)| {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \end{eqnarray} \tag{ B27 }$

$\begin{eqnarray}&&=\left(\underset{i=1}{\overset{n}{\displaystyle \bigotimes }}{({X}_{i})}^{{\varepsilon }_{i}^{{\boldsymbol{q}}}({\boldsymbol{e}}({\boldsymbol{\nu }}))}\right)\ | {\boldsymbol{e}}({\boldsymbol{\nu }})\rangle \end{eqnarray} \tag{ B28 }$

$\begin{eqnarray}&&=| {\boldsymbol{e}}({\boldsymbol{\nu }})+{{\boldsymbol{\varepsilon }}}^{{\boldsymbol{q}}}({\boldsymbol{e}}({\boldsymbol{\nu }}))\ \mathrm{mod}\ 2\rangle \end{eqnarray} \tag{ B29 }$

$\begin{eqnarray}&&=| {\boldsymbol{e}}({\boldsymbol{\nu }})+{\boldsymbol{e}}({\boldsymbol{\nu }})+{\boldsymbol{e}}({\boldsymbol{d}}({\boldsymbol{e}}({\boldsymbol{\nu }}))+{\boldsymbol{q}}\ \mathrm{mod}\ 2)\ \mathrm{mod}\ 2\rangle \end{eqnarray} \tag{ B30 }$

$\begin{eqnarray}&&=| {\boldsymbol{e}}({\boldsymbol{\nu }}+{\boldsymbol{q}}\ \mathrm{mod}\ 2)\rangle ,\end{eqnarray} \tag{ B31 }$

which completes the proof. We swiftly recap what has happened: in (B26), we have plugged the definition of (32) into the left-hand side of (B20). In between this equation and (B27), we have evaluated the expectation values of the extracted operators ${\mathfrak{X}}[{\varepsilon }_{j}^{{\boldsymbol{q}}}]$ . From that line to the next, the ${{\mathbb{Z}}}_{2}^{\otimes n}$ -sum is collapsed over the condition ${\boldsymbol{t}}={{\boldsymbol{\varepsilon }}}^{{\boldsymbol{q}}}({\boldsymbol{e}}({\boldsymbol{\nu }}))$ . We go from (B28) to (B29) by applying (B21). Once we insert the definition (30) into (B29), it becomes obvious that the condition (B20) is fulfilled. Thus, the entire operator transform is now proven.

Appendix C.: Transforming particle-number conserving Hamiltonians

In this appendix, we examine the richest symmetry to exploit for qubit savings: particle conservation. We begin by introducing the most relevant class of Hamiltonians that exhibit this symmetry, but ultimately the main goal of this appendix is to simplify the operator transform for all such Hamiltonians. Motivated by the compartmentalized recipes of the conventional mappings, (13), we suggest alternatives to the transform (31), that do not depend on the sequence length l.

Let us start by noting how easy it is to state that a Hamiltonian the total number of particles: a Hamiltonian like (11), conserves the total number of particles when every term ${\hat{h}}_{{\boldsymbol{ab}}}$ has as many creation operators as it has annihilation operators. The lengths l, implicit in the sequences ${\hat{h}}_{{\boldsymbol{ab}}}$ that occur in the Hamiltonian, are thereby determined by the field theory or model, that underlies the problem. The coefficients ${h}_{{\boldsymbol{ab}}}$ , on the other hand, are determined by the set of basis functions used. For first-principle problems in quantum chemistry and solid state physics, we usually encounter particle-number-conserving Hamiltonians with terms of weight that is at most l = 4:

$\begin{eqnarray}&&H=\displaystyle \sum _{{ij}}{t}_{{ij}}\ {c}_{i}^{\dagger }{c}_{j}^{\ }+\displaystyle \sum _{{ijkl}}{U}_{{ijkl}}\ {c}_{i}^{\dagger }{c}_{j}^{\dagger }{c}_{k}^{\ }{c}_{l}^{\ },\end{eqnarray} \tag{ C1 }$

where U_ijkl, t_ij are complex coefficients of the interaction and single particle terms, respectively. In the notation of (11), these coefficients correspond to h_{(i, j, k, l)(1,1,0,0)} and h_{(i, j)(1,0)}. The (l = 4) interaction terms usually originate from either magnetism and/or the Coulomb interaction. Even for these (l = 4)-terms, the operator transform (31) is quite bulky, and we in general would like to have a transform that is independent of l. Before we begin to discuss such transform recipes however, we need to set up some preliminaries. First of all, we need to find a suitable code ( ${\boldsymbol{e}},{\boldsymbol{d}}$ ), as discussed in the main part. Ideally, we would encode only the Hilbert space with the correct number of particles, M, but Hilbert spaces of other particle numbers can also be included. Assuming that the Hamiltonian visits every state with the same particle number, we must encode entire Hilbert spaces ${{ \mathcal H }}_{N}^{m}$ only. Secondly, we need to reorder the fermionic operators inside the Hamiltonian terms ${\hat{h}}_{{\boldsymbol{ab}}}$ . The reason for this is, that our goal can only be achieved by finding recipes for smaller sequences of constant length. In order to transform the Hamiltonian terms then, we need to invoke the anticommutation relations (2) to introduce an order in ${\hat{h}}_{{\boldsymbol{ab}}}$ , such that these small sequences appear as consecutive, distinct blocks. As we shall see, these blocks will have the shape ${c}_{i}^{\dagger }{c}_{j}^{\ }$ . So every ${\hat{h}}_{{\boldsymbol{ab}}}$ needs to be reordered, such that every even operator is a creation operator, and every odd operator an annihilator. For the (l = 4)-terms in (C1), this reordering means ${c}_{i}^{\dagger }{c}_{j}^{\dagger }{c}_{k}^{\ }{c}_{l}^{\ }\ \to \ {c}_{i}^{\dagger }{c}_{l}^{\ }{c}_{j}^{\dagger }{c}_{k}^{\ }-{\delta }_{{jl}}\ {c}_{i}^{\dagger }{c}_{k}^{\ }$ .

Let us quickly sketch the idea behind that reordering and introduce some nomenclature: instead of considering Hamiltonian terms, we realize that also the terms ${c}_{i}^{\dagger }{c}_{j}^{\ }$ also conserve the particle number: ${{ \mathcal H }}_{N}^{m}\to {{ \mathcal H }}_{N}^{m}$ . Let us act with ${c}_{i}^{\dagger }{c}_{j}^{\ }$ on an encoded state. We consider a state that is not annihilated by ${c}_{i}^{\dagger }{c}_{j}^{\ }$ . Its particle number is reduced by one through c_j, but then immediately restored by ${c}_{i}^{\dagger }$ . In fact, for a general sequence of that arrangement, every even operator restores the particle number in this way and every odd reduces it. We therefore call the subspace, in which we find the state after an even (odd) number of operators, the even (odd) subspace. Since all l must be even for the Hamiltonian to have particle conservation symmetry, the even subspace is the one encoded. The odd subspace, on the other hand, has one particle less, so it is ${{ \mathcal H }}_{N}^{(M-1)}$ , if the even one is ${{ \mathcal H }}_{N}^{M}$ .

C.1. Encoding the two spaces separately

In this ordering, one can find a recipe for a singular creation or annihilation operator. The strategy is to consider a second code for the odd subspace. As before ( ${\boldsymbol{e}},{\boldsymbol{d}}$ ) denotes the code for the even subspace, and now ( ${{\boldsymbol{e}}}^{{\prime} },{{\boldsymbol{d}}}^{{\prime} }$ ) is encoding the odd subspace. The idea is that after an odd operator (which in this ordering is an annihilation operator), the state is updated into the odd subspace. With every even operator (which is a creation operator), the state is updated from the odd subspace back into the even one. We find:

$\begin{eqnarray}&&{c}_{j}^{\dagger }\hat{=}\displaystyle \frac{1}{2}\ {\bar{{ \mathcal U }}}^{(j)}\ ({\mathbb{I}}+{\mathfrak{X}}[{d}_{j}])\ {\mathfrak{X}}[{p}_{j}],\end{eqnarray} \tag{ C2 }$

$\begin{eqnarray}&&{c}_{j}^{\ }\hat{=}\displaystyle \frac{1}{2}\ {{ \mathcal U }}^{(j)}\ ({\mathbb{I}}-{\mathfrak{X}}[{d}_{j}^{{\prime} }]){\mathfrak{X}}[{p}_{j}^{{\prime} }].\end{eqnarray} \tag{ C3 }$

In (C3), ${{ \mathcal U }}^{(j)}$ is defined as in (32), but its counterpart from (C2) is defined by

$\begin{eqnarray}&&{\bar{{ \mathcal U }}}^{(j)}=\displaystyle \sum _{{\boldsymbol{t}}\,\in {{\mathbb{Z}}}_{2}^{\otimes n}}\left[\underset{i=1}{\overset{n}{\displaystyle \bigotimes }}{({X}_{i})}^{{t}_{i}}\right]\displaystyle \prod _{i=1}^{n}\displaystyle \frac{1}{2}({\mathbb{I}}+{(-1)}^{{t}_{i}}\ {\mathfrak{X}}[{\varepsilon }_{k}^{{\prime} \ {{\boldsymbol{u}}}_{{\boldsymbol{j}}}}]),\end{eqnarray} \tag{ C4 }$

with the primed functions ${{\boldsymbol{\varepsilon }}}^{{\prime} {\boldsymbol{q}}},{{\boldsymbol{p}}}^{{\prime} }$ defined like (30) and (29), but with ( ${{\boldsymbol{e}}}^{{\prime} }$ , ${{\boldsymbol{d}}}^{{\prime} }$ ) in place of ( ${\boldsymbol{e}}$ , ${\boldsymbol{d}}$ ).

This method relies on n qubits being feasible to simulate the odd subspace in. That is, however, not always the case. The basis set of ${{ \mathcal H }}_{N}^{M-1}$ is in general larger than ${{ \mathcal H }}_{N}^{M}$ , when M > N/2. In this way, the odd subspace can also be larger and even be infeasible to simulate with just n qubits. As a solution, one changes the ordering into odd operators being creation operators, and even ones being annihilators, like ${c}_{k}^{\ }{c}_{i}^{\dagger }\,{c}_{l}^{\ }{c}_{j}^{\dagger }$ . This causes the odd subspace to become ${{ \mathcal H }}_{N}^{(M+1)}$ , which has a smaller basis set than ${{ \mathcal H }}_{N}^{M}$ . For that case ( ${\boldsymbol{e}}$ , ${\boldsymbol{d}}$ ) become the code for the odd subspace, and ( ${{\boldsymbol{e}}}^{{\prime} }$ , ${{\boldsymbol{d}}}^{{\prime} }$ ) will be associated to the even subspace in (C2) and (C3).

The obvious disadvantage is that two codes have to be employed at once. However, the checksum code for instance (section 3.1.1 in the main part), comes in two different flavors already, which can be used as codes for even and odd subspaces, respectively.

C.2. Encoding the building blocks

The building blocks ${c}_{i}^{\dagger }{c}_{j}^{\ }$ are guaranteed to conserve the particle number, so the even subspace is conserved. As a consequence, one may consider the possibility to transform the operators as the pairs we have rearranged them into. In this way, we still have a certain compartmentalization of (31). Two special cases are to be taken into account: when i > j, an additional minus sign has to be added, as compared to the i < j case. Also, when i = j, all parity operators cancel and the projectors coincide. We find:

$\begin{eqnarray}{c}_{i}^{\dagger }{c}_{j}\hat{=}\left\{\begin{array}{ll}\tfrac{1}{4}\ {(-1)}^{{\theta }_{{ij}}}\ {{ \mathcal U }}^{(i,j)}\ {\mathfrak{X}}[{p}_{i}]\ {\mathfrak{X}}[{p}_{j}]\ ({\mathbb{I}}+{\mathfrak{X}}[{d}_{i}])({\mathbb{I}}-{\mathfrak{X}}[{d}_{j}]) & i\ne j\\ \tfrac{1}{2}(1-{\mathfrak{X}}[{d}_{j}]) & i=j,\end{array}\right.\end{eqnarray} \tag{ C5 }$

with ${{ \mathcal U }}^{(i,j)}$ being the l = 2 version of (32), and ${\boldsymbol{p}}$ and ${{\boldsymbol{\varepsilon }}}^{{\boldsymbol{q}}}$ defined as usual by (29) and (30).

Appendix D.: Multi-weight binary addressing codes based on dissections

With binary addressing codes, that is codes that are similar to the one presented in section 3.1.2 in the main part, even an exponential amount of qubits can be saved for systems with low particle number, but at the expense of complicated gates. For this appendix, we firstly recap the situation of section 3.1.2 and clarify what binary addressing means. Firstly, some nomenclature is introduced. We then generalize the concept of binary addressing codes to weight K codes, using results from [28]. As an example, we explicitly obtain the K = 2 code.

Suppose we have a system with N = 2^r orbitals, and one particle in it. Our goal is to encode the basis state, where the particle is on orbital y ∈ [2^r], as a binary number in r qubits. In this way, the state with occupational vector ${{\boldsymbol{u}}}_{{\boldsymbol{y}}}$ is encoded as $| {{\boldsymbol{q}}}^{{\boldsymbol{y}},{\boldsymbol{r}}}\rangle$ , with ${{\boldsymbol{q}}}^{{\boldsymbol{y}},{\boldsymbol{r}}}\in {{\mathbb{Z}}}_{2}^{\otimes r}$ and $y=\mathrm{bin}({{\boldsymbol{q}}}^{{\boldsymbol{y}},{\boldsymbol{r}}})+1$ . Probing an unknown basis state, a decoding will now have components of the form

$\begin{eqnarray}&&{\boldsymbol{\omega }}\to \displaystyle \prod _{i\in [r]}({\omega }_{i}+{q}_{i}^{y,r}+1)\ \mathrm{mod}\ 2.\end{eqnarray} \tag{ D1 }$

Such binary functions output 1 only when ${\boldsymbol{\omega }}={{\boldsymbol{q}}}^{{\boldsymbol{y}},{\boldsymbol{r}}}$ . In our nomenclature, we say that in the basis state $| {{\boldsymbol{q}}}^{{\boldsymbol{y}},{\boldsymbol{r}}}\rangle$ , the particle has the coordinate y. We refer to codes that store particle coordinates in binary form, as binary addressing codes.

In the K = 1 case from the main part, the code words just contain the binary representation of one coordinate. The question is now how to generalize the binary addressing codes. For multi-weight codes, we have to have K sub-registers to store the addresses of K particles. Naively, one would want to store the coordinate of each particle in its respective sub-register in binary form, as we have done for K = 1. This however, holds a problem. As the particles are indistinguishable, the stored coordinates would be interchangeable, the code would not be one-to-one. For the binary numbers ${{\boldsymbol{\omega }}}^{1}$ and ${{\boldsymbol{\omega }}}^{2}$ , that represent a coordinate each, this would mean ${\boldsymbol{d}}({{\boldsymbol{\omega }}}^{1}\oplus {{\boldsymbol{\omega }}}^{2})={\boldsymbol{d}}({{\boldsymbol{\omega }}}^{2}\oplus {{\boldsymbol{\omega }}}^{1})$ . That strategy not only complicates the operator transform, it also leads to a certain qubit overhead, as each plain word has as many code words as there are permutations of K items. Since this naive idea leaves us unconvinced, we abandon it and search for one-to-one codes instead. The key is to consider the coordinates to be in a certain format and this is where [28] comes into play. We proceed by using some relevant concepts of that paper.

Let us consider the coordinates of K particles to be given in the N-ary vector ${\boldsymbol{x}}=({x}_{1}\,,\,...,\,{x}_{K})$ . Between those coordinates, we have imposed an ordering x_i > x_j as i > j. Particles cannot share the same orbital, so we are excluding the cases where two coordinates are equal. Using results from [28], we transform the latter into coordinates that lack such an ordering, and where each component is an integer from a different range:

$\begin{eqnarray}&&{\boldsymbol{x}}\ \to \ {\boldsymbol{y}}={({y}_{1},...,{y}_{K})}^{\top }\qquad {\rm{with}}\quad {\boldsymbol{y}}\in \underset{m=1}{\overset{K}{\displaystyle \bigotimes }}\left[\lceil\displaystyle \frac{N}{m}\rceil\right].\end{eqnarray} \tag{ D2 }$

Through that transform, each vector ${\boldsymbol{y}}$ corresponds to a valid vector ${\boldsymbol{x}}$ , and there is no duplication. We now represent the ${\boldsymbol{y}}$ -coordinates by binary numbers in the code words ${\boldsymbol{\omega }}\in {{\mathbb{Z}}}_{2}^{\otimes n}$ , where $n={\sum }_{m=1}^{K}\lceil \mathrm{log}\tfrac{N}{m}\rceil$ :

$\begin{eqnarray}&&{\boldsymbol{\omega }}=\underset{m=1}{\overset{K}{\displaystyle \bigoplus }}{{\boldsymbol{q}}}^{{{\boldsymbol{y}}}_{{\boldsymbol{m}}},\lceil\displaystyle \frac{N}{m}\rceil}\qquad {\rm{with}}\quad {{\boldsymbol{q}}}^{{\boldsymbol{i}},{\boldsymbol{j}}}\in {{\mathbb{Z}}}_{2}^{\otimes j}\quad {\rm{and}}\quad \mathrm{bin}({{\boldsymbol{q}}}^{{\boldsymbol{i}},{\boldsymbol{j}}})+1=i.\end{eqnarray} \tag{ D3 }$

A geometric interpretation of the process portrays the vector ${\boldsymbol{x}}$ as a set of coordinates in a K-dimensional, discrete vector space. The vectors allowed by the ordering form thereby a multi-dimensional tetrahedron. The states outside the tetrahedron do not correspond to a valid ${ \mathcal V }$ vector, so encoding each coordinate x_i in $\lceil \mathrm{log}N\rceil$ qubits would be redundant. We therefore dissect the tetrahedron, and rearrange it into a brick, as it is referred to in [28]. What is actually done is to apply symmetry operations (like point-reflections) on the vector space until the tetrahedron is deformed into the desired shape a K-dimensional, rectangular volume. The fact that the vectors to encode are now all inside a hyper-rectangle is what we wanted to achieve. We can now clip the ranges of the coordinate axes (to $[\lceil \mathrm{log}\tfrac{N}{m}\rceil ]$ ) to exclude vectors the vectors outside the brick. As the values on the axes correspond to non-binary addresses, this means that the qubit space is trimmed as well, and we have eliminated all states based on not-allowed coordinates. This is where we now reconnect to our task of finding a code: the ${\boldsymbol{e}}$ - and ${\boldsymbol{d}}$ -functions have to take into account the reshaping process, as only the coordinates ${\boldsymbol{x}}$ have a physical interpretation and can be decoded. The binary addresses in the code words, on the other hand, are representatives of ${\boldsymbol{y}}$ . With with binary logic, the two coordinates have to be reconnected. We illustrate this abstract process on the example of the $(K=2)$ -code.

D.1. Weight-two binary addressing code

As an example, we present the weight-two binary addressing code on N = 2^r orbitals. The integer r will determine the size of the entire qubit system $n=2r-1$ , with two registers of size r and r − 1.

With the two registers, a binary vector ${\boldsymbol{\omega }}={\boldsymbol{\alpha }}\oplus {\boldsymbol{\beta }}$ with ${\boldsymbol{\alpha }}\in {{\mathbb{Z}}}_{2}^{\otimes r}$ and ${\boldsymbol{\beta }}\in {{\mathbb{Z}}}_{2}^{\otimes (r-1)}$ is defining the qubit basis. In two-dimensions, the brick turns into a rectangle and the tetrahedron into triangle. The decoding function takes binary addresses of the rectangular ${\boldsymbol{y}}$ , and transforms them into coordinates in the triangle ${\boldsymbol{x}}$ . The ordering condition implies hereby where to dissect the rectangle: figure D1 may serve as a visual aid, disregarding the excluded cases of ${y}_{1}={y}_{2}$ , we find for ${y}_{1}\in [N],{y}_{2}\in [N/2]$ and ${\boldsymbol{x}}\in {[N]}^{\otimes 2}$ :

$\begin{eqnarray}({x}_{1},{x}_{2})=\left\{\begin{array}{ll}({y}_{1},N/2+{y}_{2}) & {\rm{for}}\quad {y}_{1}\lt N/2+{y}_{2}\\ ({y}_{1},N/2-{y}_{2}+1) & {\rm{for}}\quad {y}_{1}\gt N/2+{y}_{2}.\end{array}\right.\end{eqnarray} \tag{ D4 }$

This decoding is translated into a binary functions as follows: the coordinate y₁ is represented by the binary vector ${\boldsymbol{\alpha }}$ and y₂ by ${\boldsymbol{\beta }}$ . For each component defined by the binary vector ${\boldsymbol{b}}\in {{\mathbb{Z}}}_{2}^{\otimes r}$ , we have

$\begin{eqnarray}{d}_{j}({\boldsymbol{\alpha }}\oplus {\boldsymbol{\beta }}) & = & S({\boldsymbol{\alpha }},{\boldsymbol{\beta }})\displaystyle \prod _{i=1}^{r}({\alpha }_{i}+{q}_{i}^{\,j,r}+1)+(1+S({\boldsymbol{\alpha }},{\boldsymbol{\beta }}))(1+T({\boldsymbol{\alpha }},{\boldsymbol{\beta }}))\displaystyle \prod _{i=1}^{r}({\alpha }_{i}+{q}_{i}^{\,j,r})\\ & & +(1+S({\boldsymbol{\alpha }},{\boldsymbol{\beta }}))(1+T({\boldsymbol{\alpha }},{\boldsymbol{\beta }}))\displaystyle \prod _{k=1}^{r-1}({\beta }_{k}+{q}_{k}^{\,j,r})+S({\boldsymbol{\alpha }},{\boldsymbol{\beta }})\displaystyle \prod _{k=1}^{r-1}({\beta }_{k}+{q}_{k}^{\,j,r}+1)\ \ \mathrm{mod}\ 2,\end{eqnarray} \tag{ D5 }$

with ${{\boldsymbol{q}}}^{{\boldsymbol{j}},{\boldsymbol{r}}}=({q}_{1}^{\,j,r},{q}_{2}^{\,j,r}\,,\,...,\,{q}_{{2}^{r}}^{\,j,r})$ as defined in (D3) and we have employed two binary functions S and $T:({{\mathbb{Z}}}_{2}^{\otimes r},{{\mathbb{Z}}}_{2}^{\otimes (r-1)})\to {{\mathbb{Z}}}_{2}$ . Here, S compares the binary numbers to determine if the coordinates are left of the dissection (a black tile in figure D1)

$\begin{eqnarray}&&S({\boldsymbol{\alpha }},{\boldsymbol{\beta }})={\alpha }_{r}\displaystyle \sum _{j=1}^{r-1}\left[\displaystyle \prod _{r-1\geqslant i\gt j}({\alpha }_{i}+{\beta }_{i}+1)\right](1+{\alpha }_{j}){\beta }_{j}+1+{\alpha }_{r}\ \ \mathrm{mod}\ 2.\end{eqnarray} \tag{ D6 }$

The binary function T, on the other hand, is checking whether a set of coordinates is on a diagonal position (diagonally marked tiles). These excluded cases are mapped to ${(0)}^{\otimes r}$ altogether

$\begin{eqnarray}&&T({\boldsymbol{\alpha }},{\boldsymbol{\beta }})={\displaystyle \prod }_{i}({\alpha }_{i}+{\beta }_{i})\ \mathrm{mod}\ 2.\end{eqnarray} \tag{ D7 }$

This concludes the decoding function. Unfortunately, the amount of logic elements in the decoding will complicate the weight-two codes quite a bit, and the encoding function is hardly better. The reason for this is to find in the ordering condition: the update operations are conditional on whether we change the ordering of the coordinates represented by ${\boldsymbol{\alpha }}$ and ${\boldsymbol{\beta }}$ . This is reflected in a nonlinear encoding function: we remind us that the encoding function is a map ${\boldsymbol{e}}:{{\mathbb{Z}}}_{2}^{\otimes {2}^{r}}\to {{\mathbb{Z}}}_{2}^{\otimes (2r-1)}$ , and with ${\boldsymbol{\nu }}\in {{\mathbb{Z}}}_{2}^{\otimes {2}^{r}}$ we find

$\begin{eqnarray*}{\boldsymbol{e}}({\boldsymbol{\nu }}) & = & \displaystyle \sum _{j=2}^{{2}^{r-1}}\ \displaystyle \sum _{i=1}^{j-1}({{\boldsymbol{q}}}^{{\boldsymbol{i}},{\boldsymbol{r}}}+{{\boldsymbol{I}}}^{{\boldsymbol{r}}}\ \mathrm{mod}\ 2)\oplus ({{\boldsymbol{q}}}^{{\boldsymbol{j}},{\boldsymbol{r}}-{\bf{1}}}+{{\boldsymbol{I}}}^{{\boldsymbol{r}}-{\bf{1}}}\ \mathrm{mod}\ 2){\nu }_{i}{\nu }_{j}\\ & & +\displaystyle \sum _{j={2}^{r-1}+1}^{{2}^{r}}\ \displaystyle \sum _{i=1}^{{2}^{r-1}}({{\boldsymbol{q}}}^{{\boldsymbol{i}},{\boldsymbol{r}}}+{{\boldsymbol{I}}}^{{\boldsymbol{r}}}\ \mathrm{mod}\ 2)\oplus ({{\boldsymbol{q}}}^{{\boldsymbol{j}}-{{\bf{2}}}^{{\boldsymbol{r}}-{\bf{1}}},{\boldsymbol{r}}-{\bf{1}}}){\nu }_{i}{\nu }_{j}\\ & & +\displaystyle \sum _{j={2}^{r-1}+2}^{{2}^{r}}\ \displaystyle \sum _{i={2}^{r-1}+1}^{j-1}({{\boldsymbol{q}}}^{{\boldsymbol{i}},{\boldsymbol{r}}})\oplus ({{\boldsymbol{q}}}^{{\boldsymbol{j}}-{{\bf{2}}}^{{\boldsymbol{r}}-{\bf{1}}},{\boldsymbol{r}}-{\bf{1}}}){\nu }_{i}{\nu }_{j}\quad \ \mathrm{mod}\ 2,\end{eqnarray*}$

with ${{\boldsymbol{q}}}^{{\boldsymbol{i}},{\boldsymbol{j}}}$ as defined in (D3), and ${{\boldsymbol{I}}}^{{\boldsymbol{j}}}={(1)}^{\otimes j}={{\boldsymbol{q}}}^{{{\bf{2}}}^{{\boldsymbol{j}}},{\boldsymbol{j}}}$ .

**Figure D1.** Visualization of the two-dimensional vector space: a valid vector is represented as a colored tile. The left gray tiles and the black ones constitute the triangle, defining all valid vectors ${\boldsymbol{x}}={({x}_{1},{x}_{2})}^{\top }$ . The marked diagonal tiles are to be excluded from the encoded space. The black tiles and the gray ones on the right of this diagonal form the brick, containing all ${\boldsymbol{y}}={({y}_{1},{y}_{2})}^{\top }$ vectors.
Download figure:
Standard image High-resolution image

**Figure D1.** Visualization of the two-dimensional vector space: a valid vector is represented as a colored tile. The left gray tiles and the black ones constitute the triangle, defining all valid vectors ${\boldsymbol{x}}={({x}_{1},{x}_{2})}^{\top }$ . The marked diagonal tiles are to be excluded from the encoded space. The black tiles and the gray ones on the right of this diagonal form the brick, containing all ${\boldsymbol{y}}={({y}_{1},{y}_{2})}^{\top }$ vectors.
Download figure:
Standard image High-resolution image

The dissecting of tetrahedrons can be generalized for codes of weight larger than two (see again [28]), but as one increases the number of dissections, the code functions are complicated even further.

Appendix E.: Segment codes

In this appendix, we provide detailed information on the segment codes. We firstly concern ourselves with the segmentation of the global code, including a derivation of the segment sizes. In another subsection we construct the segment codes themselves. The last subsection is dedicated to the adjustments one has to make to Hamiltonian, such that segment codes become feasible to use.

E.1. Segment sizes

At this point we want to sketch the idea behind the segment sizes ( $\hat{N},\hat{n}$ ) stated during section 3.1.3 in the main part, but first of all we would like to clearly set up the situation.

We consider vectors ${\boldsymbol{\nu }}\in {{\mathbb{Z}}}_{2}^{\otimes N}$ to consist of $\hat{m}$ smaller vectors ${\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ of length $\hat{n}+1$ , such that ${\boldsymbol{\nu }}={\bigoplus }_{i=1}^{\hat{m}}{\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ . We call those vectors ${\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ segments of ${\boldsymbol{\nu }}$ . The goal is now to find a code ( ${\boldsymbol{e}},{\boldsymbol{d}}$ ) to encode a basis ${ \mathcal V }$ which contains all vectors ${\boldsymbol{\nu }}$ with Hamming weight K. For that purpose we relate the segment ${\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ to a segment of the code space, $\hat{{{\boldsymbol{\omega }}}^{i}}$ , for all i ∈ [N]. The code space segments constitute the code words in a fashion similar to the previous segmentation of ${\boldsymbol{\nu }}$ : ${\boldsymbol{\omega }}={\bigoplus }_{i=1}^{\hat{m}}\hat{{{\boldsymbol{\omega }}}^{i}}$ . However, the length of those binary vectors $\hat{{{\boldsymbol{\omega }}}^{i}}$ is $\hat{n}$ , such that with $n=\hat{m}\hat{n}$ and $N=\hat{m}(\hat{n}+1)$ , the problem is reduced by $\hat{m}$ qubits as compared to conventional transforms. We now introduce the subcodes ( $\hat{{\boldsymbol{e}}}:{{\mathbb{Z}}}_{2}^{\otimes (\hat{n}+1)}\to {{\mathbb{Z}}}_{2}^{\otimes \hat{n}},\hat{{\boldsymbol{d}}}:{{\mathbb{Z}}}_{2}^{\otimes \hat{n}}\to {{\mathbb{Z}}}_{2}^{\otimes (\hat{n}+1)}$ ), with which we encode the ith segment ${\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ as $\hat{{{\boldsymbol{\omega }}}^{i}}$ (see figure E1). Note that we require the subcodes to inherit all the code properties. In this way we guarantee the code properties of the global code ( ${\boldsymbol{e}},{\boldsymbol{d}}$ ) when appending $\hat{m}$ instances of the same subcode:

$\begin{eqnarray}&&{\boldsymbol{d}}\left(\underset{i=1}{\overset{\hat{m}}{\displaystyle \bigoplus }}\hat{{{\boldsymbol{\omega }}}^{i}}\right)=\underset{i=1}{\overset{\hat{m}}{\displaystyle \bigoplus }}\hat{{\boldsymbol{d}}}(\hat{{{\boldsymbol{\omega }}}^{i}}),\qquad {\boldsymbol{e}}\left(\underset{i=1}{\overset{\hat{m}}{\displaystyle \bigoplus }}{\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}\right)=\underset{i=1}{\overset{\hat{m}}{\displaystyle \bigoplus }}\hat{{\boldsymbol{e}}}({\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}).\end{eqnarray} \tag{ E1 }$

**Figure E1.** Visualization of (E1) for $\hat{n}=4$ . The global code ( ${\boldsymbol{e}},{\boldsymbol{d}}$ ) relates the occupation vectors to the global code words ${\boldsymbol{\nu }}\leftrightarrow {\boldsymbol{\omega }}$ . The an instance of the subcode ( $\hat{{\boldsymbol{e}}},\hat{{\boldsymbol{d}}}$ ) relates ith block in ${\boldsymbol{\nu }},{\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ , to the ith segment in the code words, $\hat{{{\boldsymbol{\omega }}}^{i}}$ .
Download figure:
Standard image High-resolution image

**Figure E1.** Visualization of (E1) for $\hat{n}=4$ . The global code ( ${\boldsymbol{e}},{\boldsymbol{d}}$ ) relates the occupation vectors to the global code words ${\boldsymbol{\nu }}\leftrightarrow {\boldsymbol{\omega }}$ . The an instance of the subcode ( $\hat{{\boldsymbol{e}}},\hat{{\boldsymbol{d}}}$ ) relates ith block in ${\boldsymbol{\nu }},{\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}}$ , to the ith segment in the code words, $\hat{{{\boldsymbol{\omega }}}^{i}}$ .
Download figure:
Standard image High-resolution image

The orbital number being an integer multiple of the block size is of course an idealized scenario. One will probably have to add a few other components in order to compensate for dimensional mismatches.

We now set out to find the smallest segment size $\hat{n}$ . It should be clear that $\hat{n}$ is a function of the targeted Hamming weight K: this means K determines which segment codes are suitable for the system. The reason for this is that we need to encode all vectors with weight 0 to K inside every segment, taking into account for the up to K particles on the orbitals inside one segment. In order to include weight K vectors, the size of each segment must be at least K. If the segment size would be exactly K, on the other hand, we end up encoding the entire Fock space again. In doing so, we are not making any qubit savings. The segments must thus be larger than K. In other words, we look for an integer $\hat{n}\gt K$ , where the sum of all combinations $\hat{{\boldsymbol{\nu }}}\in {{\mathbb{Z}}}_{2}^{\otimes (\hat{n}+1)}$ with ${{\rm{w}}}_{{\rm{H}}}(\hat{{\boldsymbol{\nu }}})\leqslant K$ is smaller equal ${2}^{\hat{n}}$

$\begin{eqnarray}&&{2}^{\hat{n}}\ \geqslant \ \displaystyle \sum _{k=0}^{K}\left(\displaystyle \genfrac{}{}{0em}{}{\hat{n}+1}{k}\right).\end{eqnarray} \tag{ E2 }$

In the case $\hat{n}=2K$ , the condition is fulfilled as identity, since exactly half of all ${2}^{\hat{n}+1}$ combinations are included in the sum.

E.2. Subcodes

This subsection offers a closer look at the construction of the segment subcodes ( $\hat{{\boldsymbol{e}}},\hat{{\boldsymbol{d}}}$ ). Let us start by considering the decoding $\hat{{\boldsymbol{d}}}$ in order to explore the nature of the binary switch $f(\hat{{\boldsymbol{\omega }}})$ , that occurs in (38). One observes the two (affine) linear $({{\mathbb{Z}}}_{2}^{\otimes \hat{n}}\to {{\mathbb{Z}}}_{2}^{\otimes (\hat{n}+1)})$ -maps

$\begin{eqnarray}\hat{{\boldsymbol{\omega }}}\to \left[\begin{array}{ccc}1 & & \\ & \ddots & \\ & & 1\\ 0 & ... & 0\end{array}\right]\hat{{\boldsymbol{\omega }}}\,\mathrm{mod}\,2,\qquad \hat{{\boldsymbol{\omega }}}\ \to \ \left[\begin{array}{ccc}1 & & \\ & \ddots & \\ & & 1\\ 0 & ... & 0\end{array}\right]\hat{{\boldsymbol{\omega }}}+\left(\begin{array}{c}1\\ \vdots \\ \vdots \\ 1\end{array}\right)\ \mathrm{mod}\ 2.\end{eqnarray} \tag{ E3 }$

to produce together all the vectors with weight equal or smaller than K, if we input all $\hat{{\boldsymbol{\omega }}}$ with ${{\rm{w}}}_{{\rm{H}}}(\hat{{\boldsymbol{\omega }}})\leqslant K$ into the first, and the remaining cases with ${{\rm{w}}}_{{\rm{H}}}(\hat{{\boldsymbol{\omega }}})\gt K$ into the second one. Note that the last component is always zero in outputs of the first function and one in the second. Therefore, the inverse of both maps is always a linear map with the matrix $[\ {\mathbb{I}}\ | {{\boldsymbol{I}}}^{\hat{{\boldsymbol{n}}}}]$ . We take this inverse as encoding (39), and the two maps (E3) are merged into the decoding (38). In order to switch between these two maps we define the binary function $f(\hat{{\boldsymbol{\omega }}})\,:{{\mathbb{Z}}}_{2}^{\otimes \hat{n}}\to {{\mathbb{Z}}}_{2}$ such that

$\begin{eqnarray}f(\hat{{\boldsymbol{\omega }}})=\left\{\begin{array}{ll}1 & {\rm{for}}\ \ {{\rm{w}}}_{{\rm{H}}}(\hat{{\boldsymbol{\omega }}})\gt K\\ 0 & {\rm{otherwise}}.\end{array}\right.\end{eqnarray} \tag{ E4 }$

In general, one can define this binary switch in a brute-force way by

$\begin{eqnarray}&&f(\hat{{\boldsymbol{\omega }}})=\displaystyle \sum _{k=K+1}^{2K}\displaystyle \sum _{\displaystyle \genfrac{}{}{0em}{}{{\boldsymbol{t}}\in {{\mathbb{Z}}}_{2}^{\otimes 2K}}{{{\rm{w}}}_{{\rm{H}}}({\boldsymbol{t}})=k}}\displaystyle \prod _{m=1}^{2K}({\hat{\omega }}_{m}+1+{t}_{m})\ \mathrm{mod}\ 2.\end{eqnarray} \tag{ E5 }$

For the case K = 1 ( $\hat{n}=2$ ), the switch equals $f({\boldsymbol{\omega }})={\omega }_{1}{\omega }_{2}$ , and for the code we recover a version of binary addressing codes, where the vector (0, 0, 0) is encoded.

$\begin{eqnarray}&&\hat{{\boldsymbol{d}}}(\hat{{\boldsymbol{\omega }}})=\left(\begin{array}{c}{\hat{\omega }}_{1}\,({\hat{\omega }}_{2}+1)\\ ({\hat{\omega }}_{1}+1)\,{\hat{\omega }}_{2}\\ {\hat{\omega }}_{1}{\hat{\omega }}_{2}\end{array}\right)\ \mathrm{mod}\ 2,\qquad \hat{{\boldsymbol{e}}}(\hat{{\boldsymbol{\nu }}})=\left[\begin{array}{ccc}1 & 0 & 1\\ 0 & 1 & 1\end{array}\right]\hat{{\boldsymbol{\nu }}}\ \ \mathrm{mod}\ 2.\end{eqnarray} \tag{ E6 }$

In the K = 2 ( $\hat{n}=4$ ) case, this binary switch is found to be $f(\hat{{\boldsymbol{\omega }}})={\hat{\omega }}_{1}{\hat{\omega }}_{2}{\hat{\omega }}_{3}+{\hat{\omega }}_{1}{\hat{\omega }}_{2}{\hat{\omega }}_{4}+{\hat{\omega }}_{1}{\hat{\omega }}_{3}{\hat{\omega }}_{4}+{\hat{\omega }}_{2}{\hat{\omega }}_{3}{\hat{\omega }}_{4}+{\hat{\omega }}_{1}{\hat{\omega }}_{2}{\hat{\omega }}_{3}{\hat{\omega }}_{4}\ \mathrm{mod}\ 2$ .

E.3. Hamiltonian adjustments

As mentioned in section 3.1.3, in the main part, segment codes are not automatically compatible with all particle-number-conserving Hamiltonians. We show here, how certain adjustments can be made to these Hamiltonians, such that their action on the space ${{ \mathcal H }}_{N}^{K}$ is not changed, but segment codes become feasible to describe them with. In order to understand this issue, we begin by examining the encoded space. For that purpose we reprise the situation of (E1), where we have append $\hat{m}$ instances of the same subcode. With segment codes, the basis ${ \mathcal V }$ contains vectors with Hamming weights from 0 to $\hat{m}K$ . We have encoded all possible vectors ${\boldsymbol{\nu }}$ with $0\leqslant {{\rm{w}}}_{{\rm{H}}}({\boldsymbol{\nu }})\leqslant K$ , but although we have some, not all vectors with ${{\rm{w}}}_{{\rm{H}}}({\boldsymbol{\nu }})\gt K$ are encoded. We can illustrate that point rather quickly: each segment has length $2K+1$ , but the subcode encodes vectors $\hat{{\boldsymbol{\nu }}}$ with only ${{\rm{w}}}_{{\rm{H}}}(\hat{{\boldsymbol{\nu }}})\leqslant K$ . The (global) basis ${ \mathcal V }$ is thus deprived of vectors ${\boldsymbol{\nu }}=({\bigoplus }_{i}{\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}})$ where for any segment $i,{{\rm{w}}}_{{\rm{H}}}({\hat{{\boldsymbol{\nu }}}}^{{\boldsymbol{i}}})\gt K$ .

We now turn our attention to terms, which, when present in a Hamiltonian, make segment codes infeasible to use. Note, that ${ \mathcal V }$ -vectors with ${{\rm{w}}}_{{\rm{H}}}({\boldsymbol{\nu }})\ne K$ , are not corresponding to fermionic states we are interested in. In particular it is a certain subset of states with ${{\rm{w}}}_{{\rm{H}}}({\boldsymbol{\nu }})\gt K$ , which can lead out of the encoded space (into the states previously mentioned) when acted upon with certain fermionic operators. Let us consider the operator ${c}_{i}^{\dagger }{c}_{j}^{\ }$ as an example, where i and j are in different segments (let us call these segments A and B). Now a basis state as depicted in figure E2, is not annihilated by ${c}_{i}^{\dagger }{c}_{j}^{\ }$ , and leads into a state with 3 particles in segment A. The problem is that the initial state is encoded in the $(K=2)$ segment codes, whereas the updated state (with the 3 particles in A) is not. In general, operators ${\hat{h}}_{{\boldsymbol{ab}}}$ , that change occupations in between segments, will cause some basis states with ${{\rm{w}}}_{{\rm{H}}}({\boldsymbol{\nu }})\gt K$ to leave the encoded space. We can however adjust these terms ${\hat{h}}_{{\boldsymbol{ab}}}\to {\hat{h}}_{{\boldsymbol{ab}}}^{{\prime} }$ , such that ${\hat{h}}_{{\boldsymbol{ab}}}^{{\prime} }:\mathrm{span}({ \mathcal B })\to \mathrm{span}({ \mathcal B })$ , where ${ \mathcal B }$ is the basis encoded by the segment codes. We now sketch the idea behind those adjustments, before we reconsider the situation of figure E2. Note that after these adjustments have been made to all Hamiltonian terms in question, the segment codes are compatible with the new Hamiltonian. The idea is to switch those terms off for states, that already have K particles inside the segments, to which particles will be added. We have to take care to do this in a way that leaves the Hamiltonian hermitian on the level of second quantization, i.e. we have to adjust the terms ${\hat{h}}_{{\boldsymbol{ab}}}^{\ }$ and ${\hat{h}}_{{\boldsymbol{ab}}}^{\dagger }$ into ${\hat{h}}_{{\boldsymbol{ab}}}^{{\prime} }$ and ${({\hat{h}}_{{\boldsymbol{ab}}}^{\dagger })}^{{\prime} }$ , such that ${\hat{h}}_{{\boldsymbol{ab}}}^{{\prime} }+{({\hat{h}}_{{\boldsymbol{ab}}}^{\dagger })}^{{\prime} }$ is hermitian. For the K = 2 code of figure E2, we can make the following adjustments:

**Figure E2.** (Filled) circles represent (occupied) fermionic orbitals, where K = 2 segment codes are used in the indicated blocks. This occupational case is problematic for the codes, as the operator ${c}_{i}^{\dagger }{c}_{j}^{\ }$ acting on this state leaves the encoded space.
Download figure:
Standard image High-resolution image

$\begin{eqnarray}&&{c}_{i}^{\dagger }{c}_{j}^{\ }\ \to \ \left(1-\displaystyle \sum _{l,k\lt l\,\in \,{\rm{}}\,{\rm{B}}}{c}_{k}^{\dagger }{c}_{k}^{\ }{c}_{l}^{\dagger }{c}_{l}^{\ }\right){c}_{i}^{\dagger }{c}_{j}^{\ }\left(1-\displaystyle \sum _{w,v\lt w\,\in \,{\rm{}}\,{\rm{A}}}{c}_{v}^{\dagger }{c}_{v}^{\ }{c}_{w}^{\dagger }{c}_{w}^{\ }\right).\end{eqnarray} \tag{ E7 }$

Appendix F.: Conventional mappings

We now revisit the conventional transforms from section 2 in the main part, and discuss all notations that have been introduced to express it close to the appealing nomenclature of [24, 33]. In particular, we show that the relation (13) is recovered as a special case from (31) and (33). After that, we verify that such constructions satisfy the fermionic anticommutation relations. For now, however, we would like to restate the situation: a linear n = N code, encoding the entire Fock space, is mediated by the quadratic matrices A and ${A}^{-1}$ , such that ${\boldsymbol{e}}({\boldsymbol{\nu }})=(A{\boldsymbol{\nu }}\ \mathrm{mod}\ 2)$ and ${\boldsymbol{d}}({\boldsymbol{\omega }})=({A}^{-1}{\boldsymbol{\omega }}\ \mathrm{mod}\ 2)$ . The matrices are required to be each others inverses, so

$\begin{eqnarray}&&\displaystyle \sum _{j=1}^{N}{A}_{{ij}}\ {({A}^{-1})}_{{jk}}\ \mathrm{mod}\ 2={\delta }_{{ik}}.\end{eqnarray} \tag{ F1 }$

We now explain the form of the parity, update and flip sets. As the code is linear, the extraction operator is retrieving only Pauli strings following (22) and (24). One finds:

$\begin{eqnarray}&&{\mathfrak{X}}[{d}_{i}]={\mathfrak{X}}\left[{\boldsymbol{\omega }}\to \displaystyle \sum _{j}{({A}^{-1})}_{{ij}}{\omega }_{j}\ \mathrm{mod}\ 2\right]=\mathop{\displaystyle \bigotimes }\limits_{j\in [N]}{({Z}_{j})}^{{({A}^{-1})}_{{ij}}}=\mathop{\displaystyle \bigotimes }\limits_{j\in F(i)}{Z}_{j},\end{eqnarray} \tag{ F2 }$

$\begin{eqnarray}&&{\mathfrak{X}}[{p}_{i}]={\mathfrak{X}}\left[{\boldsymbol{\omega }}\to \displaystyle \sum _{j\lt i}\displaystyle \sum _{k}{({A}^{-1})}_{{jk}}{\omega }_{k}\ \mathrm{mod}\ 2\right]=\mathop{\displaystyle \bigotimes }\limits_{k\in [N]}{({Z}_{j})}^{{({{RA}}^{-1})}_{{ik}}}=\mathop{\displaystyle \bigotimes }\limits_{k\in P(i)}{Z}_{k},\end{eqnarray} \tag{ F3 }$

where P(i) and F(i) are the parity and flip sets with respect to i, as we defined them in section 2. The update sets U(i) are obtained from update operators of linear encodings:

$\begin{eqnarray}&&{{ \mathcal U }}^{{\boldsymbol{a}}}=\mathop{\displaystyle \bigotimes }\limits_{i}{({X}_{i})}^{{e}_{i}({\boldsymbol{q}})}=\mathop{\displaystyle \bigotimes }\limits_{i}{({X}_{i})}^{{\displaystyle \sum }_{j}{A}_{{ij}}{q}_{j}}=\displaystyle \prod _{k\in [l]}\mathop{\displaystyle \bigotimes }\limits_{i\in [N]}{({X}_{i})}^{{A}_{{{ia}}_{k}}}=\displaystyle \prod _{k\in [l]}\ \mathop{\displaystyle \bigotimes }\limits_{i\in U({a}_{k})}{X}_{i}.\end{eqnarray} \tag{ F4 }$

In order to derive (13), we would like to point out the commutation relations between Pauli strings $({\bigotimes }_{u\in U(i)}{X}_{u}),({\bigotimes }_{v\in F(j)}{Z}_{v})$ and $({\bigotimes }_{w\in P(k)}{Z}_{w})$ . These will prove useful in verifying the fermionic commutation relations later. For commutations of update- and flip set strings we find:

$\begin{eqnarray}&&\left(\mathop{\displaystyle \bigotimes }\limits_{u\in U(i)}{X}_{u}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{v\in F(j)}{Z}_{v}\right)=\mathop{\displaystyle \bigotimes }\limits_{w\in [N]}{({X}_{w})}^{{A}_{{iw}}}{({Z}_{w})}^{{({A}^{-1})}_{{wj}}}=\mathop{\displaystyle \bigotimes }\limits_{w\in [N]}{(-1)}^{{A}_{{iw}}{({A}^{-1})}_{{wj}}}{({Z}_{w})}^{{({A}^{-1})}_{{wj}}}{({X}_{w})}^{{A}_{{iw}}}\end{eqnarray} \tag{ F5 }$

$\begin{eqnarray}&&={(-1)}^{\displaystyle \sum _{w}{A}_{{iu}}{({A}^{-1})}_{{wj}}}\left(\mathop{\displaystyle \bigotimes }\limits_{v\in F(j)}{Z}_{v}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{u\in U(i)}{X}_{u}\right)={(-1)}^{{\delta }_{{ij}}}\left(\mathop{\displaystyle \bigotimes }\limits_{v\in F(j)}{Z}_{v}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{u\in U(i)}{X}_{u}\right).\end{eqnarray} \tag{ F6 }$

We have used the relation (F1) for the above. Similarly, for commutations of update and parity strings we have:

$\begin{eqnarray}&&\left(\mathop{\displaystyle \bigotimes }\limits_{u\in U(i)}{X}_{u}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{w\in P(j)}{Z}_{w}\right)={(-1)}^{{\theta }_{{ij}}}\left(\mathop{\displaystyle \bigotimes }\limits_{w\in P(j)}{Z}_{w}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{u\in U(i)}{X}_{u}\right).\end{eqnarray} \tag{ F7 }$

Finally, we combine (F2)–(F4) with the operator from (31). Using (F5) and (F7) to move every update string $({\bigotimes }_{u\in U({a}_{j})}{X}_{u})$ in between the projectors and parity strings of a_j and a_j+1, we get

$\begin{eqnarray}&&{{ \mathcal U }}^{{\boldsymbol{a}}}\left(\displaystyle \prod _{v=1}^{l-1}\,\displaystyle \prod _{w=v+1}^{l}{(-1)}^{{\theta }_{{a}_{v}{a}_{w}}}\right)\left(\displaystyle \prod _{x=1}^{l}\displaystyle \frac{1}{2}\left({\mathbb{I}}-\left[\displaystyle \prod _{y=x+1}^{l}{(-1)}^{{\delta }_{{a}_{x}{a}_{y}}}\right]{(-1)}^{{b}_{x}}\ {\mathfrak{X}}[{d}_{{a}_{x}}]\right){\mathfrak{X}}[\,{p}_{{a}_{x}}]\right)\end{eqnarray} \tag{ F8 }$

$\begin{eqnarray}&&=\displaystyle \prod _{x=1}^{l}\left(\displaystyle \frac{1}{2}\left(\mathop{\displaystyle \bigotimes }\limits_{u\in U({a}_{x})}{X}_{u}\right)\left({\mathbb{I}}-{(-1)}^{{b}_{x}}\mathop{\displaystyle \bigotimes }\limits_{v\in F({a}_{x})}{Z}_{v}\right)\mathop{\displaystyle \bigotimes }\limits_{w\in P({a}_{x})}{Z}_{w}\right),\end{eqnarray} \tag{ F9 }$

which is a sequence of the operators (13). The transform of a singular operator is c_j^(†) is thus derived from (31). Although we have already shown that (31) satisfies (3)–(6), but we now want to show that (13) fulfills the anticommutation relations (2) in particular. In doing so, we generally distinguish the cases i = j and $i\ne j$ . For ${[{c}_{j}^{(\dagger )},{c}_{j}^{(\dagger )}]}_{+}$ , we consult (F5) and find

$\begin{eqnarray}&&{c}_{j}^{\dagger }{c}_{j}^{\dagger }={c}_{j}^{\ }{c}_{j}^{\ }\hat{=}\displaystyle \frac{1}{4}\left({\mathbb{I}}-\mathop{\displaystyle \bigotimes }\limits_{v\in F(j)}{Z}_{v}\right)\left({\mathbb{I}}+\mathop{\displaystyle \bigotimes }\limits_{w\in F(j)}{Z}_{w}\right)=0.\end{eqnarray} \tag{ F10 }$

We notice that for $i\ne j$ , the gate transform of ${c}_{i}^{\ }{c}_{j}^{\ }$ $({c}_{i}^{\dagger }{c}_{j}^{\dagger })$ properly differs by a minus sign from the transform of ${c}_{j}^{\ }{c}_{i}^{\ }$ $({c}_{j}^{\dagger }{c}_{i}^{\dagger })$ due to (F7). We want to make this observation explicit for the $i\ne j$ case of ${[{c}_{i}^{},{c}_{j}^{\dagger }]}_{+}$ :

$\begin{eqnarray}&&{c}_{i}^{\ }{c}_{j}^{\dagger }\ \hat{=}\ \displaystyle \frac{1}{4}\left(\mathop{\displaystyle \bigotimes }\limits_{u\in U(i)}{X}_{u}\right)\left({\mathbb{I}}-\mathop{\displaystyle \bigotimes }\limits_{v\in F(i)}{Z}_{v}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{w\in P(i)}{Z}_{w}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{{u}^{{\prime} }\in U(j)}{X}_{{u}^{{\prime} }}\right)\left({\mathbb{I}}+\mathop{\displaystyle \bigotimes }\limits_{{v}^{{\prime} }\in F(j)}{Z}_{{v}^{{\prime} }}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{{w}^{{\prime} }\in P(j)}{Z}_{{w}^{{\prime} }}\right)\end{eqnarray} \tag{ F11 }$

$\begin{eqnarray}&&=\displaystyle \frac{{(-1)}^{{\theta }_{{ij}}}}{4}\left(\mathop{\displaystyle \bigotimes }\limits_{u\in U(i)}{X}_{u}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{{u}^{{\prime} }\in U(j)}{X}_{{u}^{{\prime} }}\right)\left({\mathbb{I}}-\mathop{\displaystyle \bigotimes }\limits_{v\in F(i)}{Z}_{v}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{w\in P(i)}{Z}_{w}\right)\left({\mathbb{I}}+\mathop{\displaystyle \bigotimes }\limits_{{v}^{{\prime} }\in F(j)}{Z}_{{v}^{{\prime} }}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{{w}^{{\prime} }\in P(j)}{Z}_{{w}^{{\prime} }}\right)\end{eqnarray} \tag{ F12 }$

$\begin{eqnarray}&&=\displaystyle \frac{{(-1)}^{{\theta }_{{ij}}+{\theta }_{{ji}}}}{4}\left(\mathop{\displaystyle \bigotimes }\limits_{{u}^{{\prime} }\in U(j)}{X}_{{u}^{{\prime} }}\right)\left({\mathbb{I}}+\mathop{\displaystyle \bigotimes }\limits_{{v}^{{\prime} }\in F(j)}{Z}_{{v}^{{\prime} }}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{{w}^{{\prime} }\in P(j)}{Z}_{{w}^{{\prime} }}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{u\in U(i)}{X}_{u}\right)\left({\mathbb{I}}-\mathop{\displaystyle \bigotimes }\limits_{v\in F(i)}{Z}_{v}\right)\left(\mathop{\displaystyle \bigotimes }\limits_{w\in P(i)}{Z}_{w}\right)\end{eqnarray} \tag{ F13 }$

$\begin{eqnarray}&&\hat{=}-{c}_{j}^{\dagger }{c}_{i}^{\ }.\end{eqnarray} \tag{ F14 }$

At last, we find by explicit construction :

$\begin{eqnarray}&&{c}_{j}^{\dagger }{c}_{j}^{\ }\ \hat{=}\ \displaystyle \frac{1}{2}\left({\mathbb{I}}-\mathop{\displaystyle \bigotimes }\limits_{v\in F(j)}{Z}_{v}\right),\qquad {c}_{j}^{\ }{c}_{j}^{\dagger }\ \hat{=}\ \displaystyle \frac{1}{2}\left({\mathbb{I}}+\mathop{\displaystyle \bigotimes }\limits_{v\in F(j)}{Z}_{v}\right).\end{eqnarray} \tag{ F15 }$

Thus, we find ${[{c}_{i}^{},{c}_{j}^{\dagger }]}_{+}\hat{=}{\delta }_{{ij}}{\mathbb{I}}$ , and our construction (13) is in compliance with all relations in (2).

Appendix G.: Notations

Symbol	Type	Informal definition
$[...]$		Set of integers from 1 to argument
$\hat{=}$		Correspondence between fermionic operators/states to qubit counterparts
${\boldsymbol{a}}$	${[N]}^{\otimes l}$	Length-l N-ary vector describing orbitals in ${\hat{h}}_{{\boldsymbol{ab}}}$
${A}^{(-1)}$		$(N\times N)$ binary matrix defining a conventional encoding (decoding)
${\boldsymbol{b}}$	${{\mathbb{Z}}}_{2}^{\otimes l}$	Length-l binary vector determining operator types in ${\hat{h}}_{{\boldsymbol{ab}}}$
${ \mathcal B }$		Basis of a space of Fermions on N orbitals, smaller than the Fock space
${c}_{j}^{(\dagger )}$	${ \mathcal L }({{ \mathcal F }}_{N})$	Fermionic annihilation (creation) operator
${({{\mathbb{C}}}^{2})}^{\otimes n}$		Vector space of n-qubit states
${\boldsymbol{d}}$	${{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2}^{\otimes N}$	Decoding function
${\boldsymbol{e}}$	${{\mathbb{Z}}}_{2}^{\otimes N}\to {{\mathbb{Z}}}_{2}^{\otimes n}$	Encoding function
${{\boldsymbol{\varepsilon }}}^{{\boldsymbol{q}}}$	${{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2}^{\otimes n}$	Update function important for nonlinear encodings, see (30)
${{ \mathcal F }}_{N}$		Fock space restricted on N orbitals
$F(j)$	subset of [n]	Flip set with respect to orbital j, see (13)
${\hat{h}}_{{\boldsymbol{ab}}}$	$\mathrm{span}({ \mathcal B })\to \mathrm{span}({ \mathcal B })$	Term in a fermionic Hamiltonian, see (11)
${{ \mathcal H }}_{N}^{M}$		Antisymmetrized Hilbert space of M indistinguishable particles on N orbitals
${\mathbb{I}}$		Identity operator on arbitrary spaces
K	[N]	Targeted Hamming weight of a code
l		Length of a sequence of fermionic operators in ${\hat{h}}_{{\boldsymbol{ab}}}$
L	[n]	Weight of a Pauli string
${ \mathcal L }(...)$		Linear operators on argument vector space
M	[N]	Total particle number in a system of N orbitals
n		Number of qubits
N		Number of orbitals
${\boldsymbol{\nu }}$	${ \mathcal V }\subseteq {{\mathbb{Z}}}_{2}^{\otimes N}$	N-orbital occupation vector representing a fermionic basis state, see (18)
${\boldsymbol{\omega }}$	${{\mathbb{Z}}}_{2}^{\otimes n}$	Binary vector representing a product state in the n-qubit basis, see (19)
${\boldsymbol{p}}$	${{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2}^{\otimes N}$	Parity function, implementing signs for parity operators
$P(j)$	subset of [n]	Parity set of orbital j, see (13)
${ \mathcal P }$		Set of single-qubit Pauli operators $\{X,Y,Z\}$
${\boldsymbol{q}}$	${{\mathbb{Z}}}_{2}^{\otimes N}$	Change of a vector ${\boldsymbol{\nu }}$ by a term ${\hat{h}}_{{\boldsymbol{ab}}}$
R		Binary $(N\times N)$ matrix, with the lower triangle (including diagonal) filled with ones
θ_ij	${[N]}^{\otimes 2}\to {{\mathbb{Z}}}_{2}$	Discrete version of the Heaviside function, see (14)
${{\boldsymbol{u}}}_{{\boldsymbol{j}}}$	${{\mathbb{Z}}}_{2}^{\otimes N}$ or ${{\mathbb{Z}}}_{2}^{\otimes n}$	Binary unit vector, just component j is one
U(j)	subset of [n]	Update set of orbital j, see (13)
${{ \mathcal U }}^{{\boldsymbol{a}}}$	${ \mathcal L }({({{\mathbb{C}}}^{2})}^{\otimes n})$	Update operator with respect to an occupation of ${\boldsymbol{a}}$ , see (32) and (33)
${ \mathcal V }$	subset of ${{\mathbb{Z}}}_{2}^{\otimes N}$	Defines the occupation vectors ${\boldsymbol{\nu }}$ implementing the basis ${ \mathcal B }$ , see (18)
${{\rm{w}}}_{{\rm{H}}}(...)$	${{\mathbb{Z}}}_{2}^{\otimes N}\to [N]\cup \{0\}$	Hamming weight of a binary vector, sum of its components
${\mathfrak{X}}$	$({{\mathbb{Z}}}_{2}^{\otimes n}\to {{\mathbb{Z}}}_{2})\to { \mathcal L }({({{\mathbb{C}}}^{2})}^{\otimes n})$	Extraction superoperator, maps binary functions into quantum gates, see (20)–(28)
${{\mathbb{Z}}}_{2}$		Binary digits $\{0,1\}$