
Optimal control of Bose–Einstein condensates in three dimensions


Published 9 November 2015 © 2015 IOP Publishing Ltd and Deutsche Physikalische Gesellschaft
Citation: J-F Mennemann et al 2015 New J. Phys. 17 113027. DOI: 10.1088/1367-2630/17/11/113027


Abstract

Ultracold gases promise many applications in quantum metrology, simulation and computation. In this context, optimal control theory (OCT) provides a versatile framework for the efficient preparation of complex quantum states. However, due to the high computational cost, OCT of ultracold gases has so far mostly been applied to one-dimensional (1D) problems. Here, we realize computationally efficient OCT of the Gross–Pitaevskii equation to manipulate Bose–Einstein condensates in all three spatial dimensions. We study various realistic experimental applications where 1D simulations can only be applied approximately or not at all. Moreover, we provide a stringent mathematical footing for our scheme and carefully study the creation of elementary excitations and their minimization using multiple control parameters. The results are directly applicable to recent experiments and might thus be of immediate use in the ongoing effort to employ the properties of the quantum world for technological applications.


Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Over the last decade, the ever increasing experimental toolbox of atomic, molecular and optical physics has led to an exciting improvement in the control and understanding of complex quantum systems [1]. Recently, this has resulted in an important paradigm shift. While quantum systems were previously mostly studied to check the validity of theoretical models, interest has now increased in their manipulation for specific technological applications. Prototypical examples for this paradigm shift are atomic interferometers for quantum-enhanced metrology [2–4], atomic field probes [5] and microscopes [6, 7], inertial sensors [8], atomic clocks [9], or applications in quantum computing [10, 11] and quantum simulation [12].

In many cases, these applications rely on the controlled preparation of a well-defined quantum many-body state with particular properties. One of the key experimental challenges is thus the efficient transfer of a system to such a state. Optimal control theory (OCT) is a mathematical tool to devise control strategies for this transfer [13]. It is well studied in many physical systems, ranging from atoms and molecules to solid-state systems [14–18].

In this work, we apply it to the control of a dilute atomic Bose–Einstein condensate (BEC), a system which is well described by the three-dimensional (3D) Gross–Pitaevskii equation (GPE) [19, 20]. Such BECs form a versatile experimental platform for the storage, manipulation and probing of interacting quantum fields with high precision [1]. In a seminal work, Hohenester et al [21] demonstrated that OCT [22] provides a highly efficient way to realize the transfer of a BEC to a target state, vastly outperforming simpler schemes. In this context, it has also been shown that OCT is robust against fluctuations and decoherence, and can specifically take into account experimental constraints [23]. This has recently led to first experimental demonstrations [24, 25].

OCT of BECs has so far mostly been used in one-dimensional (1D) settings, as the number of necessary spatial discretization points, and hence the numerical cost, scales exponentially with the number of dimensions [26]. However, most experimental situations can only approximately be described by a 1D model, which has so far limited the applicability of OCT to real-life situations.

In the following we demonstrate the first OCT of a BEC in all three spatial dimensions and study the physical insights that come with this advance. As we demonstrate through a number of examples, extending optimal control of BECs from 1D to 3D is the crucial step to bridge the gap from limited proof-of-principle experiments to a general applicability of this powerful technique. For example, we show how collective excitations are inevitably created as a result of the control. These excitations are directly connected to the nonlinear nature of the GPE and can only be fully captured and minimized in a 3D treatment including multiple control parameters.

Finally, we provide a detailed description of all necessary numerical details to make our work the starting point for the application of the presented technique in many future experiments.

2. The control problem

We start with a brief review of OCT, as well as of the description of BECs in terms of the GPE.

2.1. Gross–Pitaevskii equation

The mean-field dynamics of a BEC is described by the GPE

$$\mathrm{i}\hslash\,\partial_t\psi = \Big({-}\frac{\hslash^2}{2m}\,\Delta + V_{\boldsymbol{\lambda}} + g\,|\psi|^2\Big)\psi, \tag{1}$$

where $\psi \equiv \psi ({\boldsymbol{r}},t)$ denotes a complex-valued wave function, with initial condition $\psi ({\bf{r}},0)\equiv {\psi }_{0}\in {L}^{2}({{\mathbb{R}}}^{3};{\mathbb{C}}).$ Here, ${V}_{{\boldsymbol{\lambda }}}\equiv V({\boldsymbol{r}},{\boldsymbol{\lambda }}(t))$ is an external potential that is characterized by a single or several control parameters denoted by the vector ${\boldsymbol{\lambda }}.$ Assuming that the wave function is normalized to unity, the coupling constant $g=4\pi N{\hslash }^{2}{a}_{{\rm{s}}}/m$ is defined by the mass m, the s-wave scattering length ${a}_{{\rm{s}}}$ and the number N of atoms in the BEC. For example, for ultracold gases of ${}^{87}$Rb atoms, the atomic mass is given by $m=1.44\times {10}^{-25}\;{\rm{kg}}$ and the s-wave scattering length by ${a}_{{\rm{s}}}=5.24\;{\rm{nm}}.$ Measuring length in units of ${l}_{0}=1\;\mu{\rm{m}}$, mass in units of the atomic mass and time in units of ${t}_{0}={{ml}}_{0}^{2}/{\hslash },$ equation (1) can be written as

$$\mathrm{i}\,\partial_t\psi = \Big({-}\tfrac{1}{2}\Delta + V_{\boldsymbol{\lambda}} + g\,|\psi|^2\Big)\psi, \tag{2}$$

which is the starting point for our considerations below.
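As an illustration of this rescaling, the following short Matlab snippet converts the physical constants quoted above into the scaled units of equation (2); the expression g = 4πN a_s/l0 for the dimensionless coupling is our own restatement of the rescaling and should be checked against the conventions of a concrete implementation.

    % Converting physical constants to the dimensionless units of equation (2).
    hbar = 1.054571817e-34;   % reduced Planck constant [J s]
    m    = 1.44e-25;          % mass of a 87Rb atom [kg]
    a_s  = 5.24e-9;           % s-wave scattering length [m]
    l0   = 1e-6;              % length unit [m]
    N    = 5000;              % atom number used in the examples below
    t0 = m*l0^2/hbar;         % time unit, approximately 1.37 ms
    g  = 4*pi*N*a_s/l0;       % dimensionless coupling, approximately 329
    fprintf('t0 = %.3g s, g = %.1f\n', t0, g);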

2.2. Optimal control problem

We seek to find an optimal time-evolution of the m-component control parameter

$$\boldsymbol{\lambda}\,:\ [0,T]\to\mathbb{R}^m,\qquad \boldsymbol{\lambda}(t) = \big(\lambda_1(t),\ldots,\lambda_m(t)\big),$$

which steers the system from the initial state ψ0 at time zero to a desired state ψd at final time T. Without loss of generality we assume that ψ0 and ψd are ground state solutions of the stationary GPE corresponding to the smooth external potentials ${V}_{{{\boldsymbol{\lambda }}}_{0}}$ and ${V}_{{{\boldsymbol{\lambda }}}_{T}}$ at times t = 0 and t = T, respectively, with fixed parameters ${{\boldsymbol{\lambda }}}_{0},{{\boldsymbol{\lambda }}}_{T}.$ To find the time evolution we apply well-known techniques from OCT [21]. As cost functional, we use

$$J({\boldsymbol{\lambda}},\psi) = \frac{1}{2}\Big(1 - \big|\langle\psi_{\rm d},\psi(T)\rangle\big|^{2}\Big) + \frac{\gamma}{2}\int_0^T \big|\partial_t{\boldsymbol{\lambda}}(t)\big|^{2}\,\mathrm{d}t, \tag{3}$$

where $\langle u,v\rangle ={\displaystyle \int }_{{{\mathbb{R}}}^{3}}u{({\bf{r}})}^{*}v({\bf{r}})\;d{\bf{r}}$ denotes the standard scalar product of $u,v\in {L}^{2}({{\mathbb{R}}}^{3};{\mathbb{C}}).$ The definition (3) is the generalization of the functional used in [21, 23, 26–30] to a multi-component control parameter ${\boldsymbol{\lambda }}.$ The first term in J measures the proximity of ψ to the desired state ψd at the end of the steering process. The expression $\tilde{{\mathcal{F}}}(\psi )=1-{| \langle {\psi }_{{\rm{d}}},\psi \rangle | }^{2}$ is known as the infidelity and provides a measure for the difference of ψ and ψd. More precisely, it quantifies the squared L2-norm of the component of ψ that is orthogonal to ψd. The second term regularizes the control trajectory to account for the fact that parameters can never be changed infinitely fast in a real experiment. Here, $\gamma \gt 0$ sets the penalty for fast variations of ${\boldsymbol{\lambda }}(t).$ For our examples below we find that already a very small value $\gamma =1\times {10}^{-6}$ yields a satisfactory regularization.
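To make the structure of (3) concrete, a minimal Matlab sketch for evaluating the cost functional on a uniform grid could look as follows; the arrays psid and psiT are toy placeholders standing in for the desired state and the computed final state, and the penalty integral is approximated by a simple finite difference quadrature.

    % Evaluating the cost functional (3) -- minimal sketch with toy data.
    n = 64; L = 20; dx = L/n; dVol = dx^3;            % toy spatial grid
    [x,y,z] = ndgrid(-L/2 + (0:n-1)*dx);
    psid = exp(-(x.^2 + y.^2 + z.^2)/2);
    psid = psid/sqrt(sum(abs(psid(:)).^2)*dVol);      % desired state, normalized
    psiT = psid.*exp(0.05i*x);                        % slightly perturbed final state
    T = 9; Nt = 901; dt = T/(Nt-1); gam = 1e-6;       % time grid and penalty weight
    t = linspace(0,T,Nt).';
    lambda = [0.25*sin(pi*t/T) + t/T, ...             % two control components
             -0.25*sin(pi*t/T) + t/T];
    overlap = sum(conj(psid(:)).*psiT(:))*dVol;       % <psi_d, psi(T)>
    Jcost = 0.5*(1 - abs(overlap)^2) ...              % infidelity term
          + 0.5*gam*sum(sum(diff(lambda,1,1).^2))/dt; % penalty term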

Our goal is to minimize $J({\boldsymbol{\lambda }},\psi )$ subject to the constraint that ψ solves the GPE (equation (2)) with the initial condition given by the respective ψ0. To this end, one introduces the Lagrange function

$$L({\boldsymbol{\lambda}},\psi,p) = J({\boldsymbol{\lambda}},\psi) + \mathrm{Re}\int_0^T \Big\langle p,\ \mathrm{i}\,\partial_t\psi - \Big({-}\tfrac{1}{2}\Delta + V_{\boldsymbol{\lambda}} + g\,|\psi|^{2}\Big)\psi\Big\rangle\,\mathrm{d}t, \tag{4}$$

where $p({\bf{r}},t)$ acts as a generalized Lagrange multiplier [22]. At a local minimum $({\boldsymbol{\lambda }},\psi ,p)$ of J, all three variational derivatives ${D}_{p}L({\boldsymbol{\lambda }},\psi ,p)[\delta p],{D}_{\psi }L({\boldsymbol{\lambda }},\psi ,p)[\delta \psi ]$ and ${D}_{{\boldsymbol{\lambda }}}L({\boldsymbol{\lambda }},\psi ,p)[\delta {\boldsymbol{\lambda }}]$ vanish for all admissible variations $\delta p,\delta \psi $ and $\delta {\boldsymbol{\lambda }},$ respectively. The corresponding three conditions constitute the optimality system

$$\mathrm{i}\,\partial_t\psi = \Big({-}\tfrac{1}{2}\Delta + V_{\boldsymbol{\lambda}} + g\,|\psi|^{2}\Big)\psi, \tag{5a}$$

$$\mathrm{i}\,\partial_t p = \Big({-}\tfrac{1}{2}\Delta + V_{\boldsymbol{\lambda}} + 2g\,|\psi|^{2}\Big)p + g\,\psi^{2}p^{*}, \tag{5b}$$

$$\gamma\,\partial_t^{2}{\boldsymbol{\lambda}}(t) = -\mathrm{Re}\,\big\langle p,\ \big(\partial_{\boldsymbol{\lambda}}V_{\boldsymbol{\lambda}}\big)\,\psi\big\rangle, \tag{5c}$$

together with the initial and terminal conditions

$$\psi(\cdot,0) = \psi_0, \tag{6a}$$

$$p(\cdot,T) = \mathrm{i}\,\langle\psi_{\rm d},\psi(T)\rangle\,\psi_{\rm d}, \tag{6b}$$

$$\boldsymbol{\lambda}(0) = \boldsymbol{\lambda}_0,\qquad \boldsymbol{\lambda}(T) = \boldsymbol{\lambda}_T. \tag{6c}$$

In general, no analytical solutions are available for (5) with  (6). Here we use an iterative method to find a numerical approximation of the solution. For this purpose it is useful to introduce the reduced cost functional

$$\hat{J}({\boldsymbol{\lambda}}) = J({\boldsymbol{\lambda}},\psi_{\boldsymbol{\lambda}}), \tag{7}$$

where ${\psi }_{{\boldsymbol{\lambda }}}$ denotes the unique solution of the GPE for a given control parameter curve ${\boldsymbol{\lambda }}.$ The goal is to find a local (or, preferably, even global) minimizer ${{\boldsymbol{\lambda }}}^{*}$ of $\hat{J}.$

The most straightforward iterative procedure that can be employed is the method of steepest descent,

$$\boldsymbol{\lambda}^{k+1} = \boldsymbol{\lambda}^{k} - \alpha_k\,\nabla\hat{J}(\boldsymbol{\lambda}^{k}). \tag{8}$$

To determine an appropriate step size αk, we perform a line search in each iteration:

$$\alpha_k \approx \operatorname*{arg\,min}_{\alpha>0}\,\hat{J}\Big(\boldsymbol{\lambda}^{k} - \alpha\,\nabla\hat{J}(\boldsymbol{\lambda}^{k})\Big). \tag{9}$$

Here the upper index denotes the iteration step. A comment is due on the use of the gradient ${\rm{\nabla }}\hat{J}({{\boldsymbol{\lambda }}}^{k})$ in (8). Recall that the gradient of $\hat{J}$ at ${\boldsymbol{\lambda }}$ with respect to a specific inner product (·,·)X on the space X of admissible variations $\delta {\boldsymbol{\lambda }}$ is the uniquely determined element ${\rm{\nabla }}\hat{J}\in X$ such that ${({\rm{\nabla }}\hat{J},\delta {\boldsymbol{\lambda }})}_{X}={D}_{{\boldsymbol{\lambda }}}\hat{J}({\boldsymbol{\lambda }})[\delta {\boldsymbol{\lambda }}]$ for all admissible variations $\delta {\boldsymbol{\lambda }}\in X.$ The gradient thus depends sensitively on the choice of the inner product (·,·)X on X. It has been pointed out already in [27] that any admissible variation $\delta {\boldsymbol{\lambda }}$ must have a finite value in the penalty term, i.e., its weak time derivative ${\partial }_{t}\delta {\boldsymbol{\lambda }}$ must be square-integrable on $(0,T),$ and must respect the boundary conditions in (6c), i.e., $\delta {\boldsymbol{\lambda }}(0)=\delta {\boldsymbol{\lambda }}(T)=0.$ A natural choice for (·,·)X is thus the ${H}_{0}^{1}(0,T,{{\mathbb{R}}}^{m})$-scalar product,

$$({\boldsymbol{\lambda}}_1,{\boldsymbol{\lambda}}_2)_X = \int_0^T \partial_t{\boldsymbol{\lambda}}_1(t)\cdot\partial_t{\boldsymbol{\lambda}}_2(t)\,\mathrm{d}t. \tag{10}$$

A calculation, which we present in the appendix, shows that this choice of (·,·)X yields

$$-\partial_t^{2}\big[\nabla\hat{J}({\boldsymbol{\lambda}})\big](t) = -\gamma\,\partial_t^{2}{\boldsymbol{\lambda}}(t) - \mathrm{Re}\,\big\langle p(t),\ \big(\partial_{\boldsymbol{\lambda}}V_{\boldsymbol{\lambda}}\big)\,\psi(t)\big\rangle, \tag{11a}$$

$$\big[\nabla\hat{J}({\boldsymbol{\lambda}})\big](0) = \mathbf{0}, \tag{11b}$$

$$\big[\nabla\hat{J}({\boldsymbol{\lambda}})\big](T) = \mathbf{0}, \tag{11c}$$

wherein ψ and p are the solutions of (5a) with (6a) and of (5b) with (6b), respectively. By definition, ${\rm{\nabla }}\hat{J}$ vanishes at the boundaries t = 0 and t = T, and so the iteration (8) preserves the boundary conditions (6c). We emphasize that the seemingly canonical choice of (·,·)X as the standard L2-scalar product would not allow one to specify boundary data for ${\rm{\nabla }}\hat{J},$ which would result in a severe loss of stability of the optimization algorithm.

2.3. Implementation

In the situations considered below we found that the method of steepest descent (see equation (8)) works reliably. However, the number of iterations needed to ensure convergence of the algorithm can be reduced significantly by using more advanced methods. In fact, our solver is based on the nonlinear conjugate gradient scheme of Hager and Zhang [31], which has also been employed in [27] for optimal control of the 1D GPE. We stress that all inner products and norms related to the nonlinear conjugate gradient scheme need to be expressed in terms of the inner product given in equation (10).

The reduced cost functional (7) needs to be evaluated several times per iteration. Moreover, at the beginning of each iteration a gradient vector needs to be determined using equation (11a). Solutions to the time-dependent GPE (5a) and the adjoint equation (5b) are obtained via the time-splitting spectral method [32]. Initial and desired final states for a given potential are found by imaginary time propagation.

In order to accelerate the solution of the optimal control problem we perform all computations on the graphics processing unit (GPU) of a powerful graphics card. To speed up the calculations and ensure convergence of the algorithm we start each optimization with a coarse spatial grid and a relatively large time step Δt. The result for ${\boldsymbol{\lambda }}$ is used as an input for another round of optimization on a finer grid. This procedure is repeated until the algorithm converges to a final time-evolution for ${\boldsymbol{\lambda }}.$ A detailed description of our implementation is given in the appendix.

3. Examples

In the following we demonstrate the results of our scheme by considering three applications of increasing complexity, which are directly connected to recent experiments.

3.1. Harmonic oscillator potential

In the first application we study a BEC in an elongated harmonic potential. Initially, the trap frequencies are chosen such that the condensate is aligned along the y direction. Using a suitable time-evolution of the trap frequencies, we aim to rotate the condensate by π/2, while keeping it in the ground state of the external potential.

An example of the transition is visualized in figure 1. It can be understood as a toy example of a broad class of experimental protocols in which the trapping geometry is changed, e.g. to mode match different traps [34], to (de)compress a trap [35], to load an optical lattice [36] or to transfer condensates into dynamical potentials for atomtronics [37, 38]. Conceptually similar pulsed manipulations are also performed to focus BECs in time-of-flight expansion [39].


Figure 1. Two-parameter optimal control of an elongated harmonic potential. The timescale of the control is T = 9 ms. Initially aligned along the y-direction, the condensate is dynamically transformed to be aligned along the x-direction. The black isosurface corresponds to the external trapping potential that is controlled using OCT, the blue isosurfaces visualize the atomic density. Note that for clarity only the lower half of the potential is shown. Also, here and throughout this work any trivial potential offset has been removed for simplicity and easier visualization. Its only effect is an overall phase shift of the wave function which is of no relevance to the optimization procedure. Animations of the full dynamics are available online [33].


3.1.1. Trapping potential

The harmonic potential in this example is given by

$$V_{\boldsymbol{\lambda}}(\mathbf{r}) = \tfrac{1}{2}\Big(\omega_x^{2}\,x^{2} + \omega_y^{2}\,y^{2} + \omega_z^{2}\,z^{2}\Big),$$

wherein the frequencies ωx and ωy can be set independently via the control parameters λ1 and λ2. More precisely, we transform the external potential from an initial configuration with $\omega_x = \omega_x^{\rm i}$ and $\omega_y = \omega_y^{\rm i}$ at time t = 0 to a final configuration with $\omega_x = \omega_x^{\rm f}$ and $\omega_y = \omega_y^{\rm f}$ at the final time t = T. To this end, we parametrize ${\omega }_{x}$ and ωy as

$$\omega_x(t) = \big(1-\lambda_1(t)\big)\,\omega_x^{\rm i} + \lambda_1(t)\,\omega_x^{\rm f},\qquad \omega_y(t) = \big(1-\lambda_2(t)\big)\,\omega_y^{\rm i} + \lambda_2(t)\,\omega_y^{\rm f},$$

with

$$\lambda_1(0) = \lambda_2(0) = 0,\qquad \lambda_1(T) = \lambda_2(T) = 1.$$

We note that these parametrizations, as all others discussed below, are chosen as an example and can easily be adjusted to the parameters accessible in a specific experimental realization.

3.1.2. Numerical simulations

In the following simulations the number of atoms is N = 5000, the final time is set to T = 9 ms and ${\omega }_{z}=5\;{\rm{kHz}}.$ The initial configuration of the trapping potential is given by ${\omega }_{x}^{i}=5\;{\rm{kHz}}$ and ${\omega }_{y}^{i}=0.75\;{\rm{kHz}},$ the final configuration by ${\omega }_{x}^{f}=0.75\;{\rm{kHz}}$ and ${\omega }_{y}^{f}=5\;{\rm{kHz}}.$

Before we discuss the result of the optimal control algorithm we first consider a numerical simulation as a benchmark, in which the control parameters λ1 and λ2 are varied linearly. The corresponding time-evolution of the trap frequencies ωx and ωy is depicted in figure 2(a).


Figure 2. Two-parameter control of an elongated harmonic oscillator potential. The computational domain is chosen as $([-10,10]\times [-10,+10]\times [-2.5,2.5])\;\mu {{\rm{m}}}^{3}.$ In the finest discretization level we use $128\times 128\times 32$ grid points and a time step size of $\triangle t=0.001$ ms. Left column: without optimal control. Right column: optimal control. See text for details.


In order to investigate the overlap of ψ with ψd beyond the end of the control we continue the time-evolution with ${\boldsymbol{\lambda }}(t)={\boldsymbol{\lambda }}(T)$ for $t\gt T.$ We proceed analogously in the other examples. As can be seen from figure 2(b) the infidelity decreases only slightly until t = T and shows a strong oscillation for $t\gt T.$ This behavior of the infidelity indicates that the final state differs significantly from the desired state ψd. This is also strikingly visualized by example snapshots of the density at time ${t}^{*}=22$ ms in figures 2(c)–(e).

Next, we consider the result of the optimal control algorithm. Using ${{\boldsymbol{\lambda }}}^{0}(t)=[0.25\,\mathrm{sin}(\pi t/T)+t/T,\;-0.25\,\mathrm{sin}(\pi t/T)+t/T]$ for $t\in [0,T]$ as a starting point, the algorithm converges to a solution that reduces the cost functional by four orders of magnitude. The time-evolution of the frequencies ωx and ωy is shown in figure 2(f), the time-evolution of the corresponding infidelity in figure 2(g). It can clearly be seen that the infidelity strongly decreases until the end of the control at t = T. Moreover, the infidelity remains at a very low level for $t\gt T,$ indicating that the desired final state has been reached with high precision. Consequently, the deviations of the density from the density of the desired state at time $t={t}^{*}$ are very close to zero, as can be seen from figures 2(h)–(j). We note at this point that the evolution of the 3D wave functions can naturally only be described here in limited detail. A supplementary video that visualizes these dynamics in greater detail is available online [33].

3.2. Loading of a toroidal trap

In the second application we consider the loading of a toroidal trap as shown in figure 3. Such toroidal traps have recently been employed to realize atomic analogues of electrical circuits to study superflow and hysteresis [40–44].


Figure 3. Loading of a toroidal trap using two-parameter optimal control. Animations of the full dynamics are available online [33].


3.2.1. Trapping potential

The trapping potential is given by a slightly elongated harmonic potential and a Gaussian function centered at the origin of our coordinate system [40],

$$V_{\boldsymbol{\lambda}}(\mathbf{r}) = \tfrac{1}{2}\Big(\omega_x^{2}\,x^{2} + \omega_y^{2}\,y^{2} + \omega_z^{2}\,z^{2}\Big) + V_0\,\exp\!\Big({-}\frac{2\,(x^{2}+y^{2})}{w_0^{2}}\Big).$$

In an experiment this Gaussian function could for example correspond to a blue-detuned laser beam realizing a repulsive dipole potential.

As illustrated in figure 3 we consider the transformation of the potential from an initial harmonic configuration with ωx = ωxi and V0 = 0 at time t = 0 to a toroidal configuration with ωx = ωxf and ${V}_{0}={V}_{0}^{*}$ at the final time t = T. Hence, a suitable parameterization of ωx and V0 is given by

$$\omega_x(t) = \big(1-\lambda_1(t)\big)\,\omega_x^{\rm i} + \lambda_1(t)\,\omega_x^{\rm f}, \tag{12a}$$

$$V_0(t) = V_0^{*}\,\chi\big(\lambda_2(t)\big), \tag{12b}$$

where

$$\lambda_1(0) = \lambda_2(0) = 0,\qquad \lambda_1(T) = \lambda_2(T) = 1.$$

In equation (12b), χ plays the role of a saturation function. The use of the saturation function ensures that V0 remains positive—and thus experimentally realizable—for any possible choice of λ2. This does not restrict the original control problem, as every experimentally realizable trajectory ${V}_{0}(t)\geqslant 0$ can be parametrized through a suitable ${\lambda }_{2}(t)$ in ${V}_{0}^{*}\chi ({\lambda }_{2}(t)).$ In fact, we choose all parameters for the external potential to be close to previous experimental realizations. However, our approach also allows us to optimize more general situations where the parametrization of the trapping potential is more complicated [45].

Similar saturation functions are commonly used in control theory to realize limits on control parameters. In our particular case χ is implemented using a piecewise cubic Hermite interpolating polynomial (PCHIP). Its functional form is shown in figure 4. The interpolation points are chosen such that χ always remains positive. Moreover, $\chi (0)=0$ and $\chi (1)=1.$
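A saturation function of this type can be generated in a few lines of Matlab; the interpolation nodes below are purely illustrative and not the values used for figure 4.

    % Illustrative saturation function via a piecewise cubic Hermite polynomial.
    nodes_x = [-2 -1 0 1 2 3];                 % hypothetical interpolation nodes
    nodes_y = [0.02 0.02 0 1 1.2 1.2];         % chosen so chi(0)=0, chi(1)=1, chi>=0
    chi = @(s) ppval(pchip(nodes_x, nodes_y), s);
    % pchip is shape preserving between nodes, so chi remains nonnegative here
    s = linspace(-2, 3, 500);
    plot(s, chi(s)); xlabel('\lambda_2'); ylabel('\chi(\lambda_2)');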


Figure 4. Saturation function used in the toroidal trap and splitting examples.


3.2.2. Numerical simulations

The following simulations are carried out using ${V}_{0}^{*}=h\times 30\;{\rm{kHz}},$ ${w}_{0}=5\;\mu {\rm{m}},$ T = 9 ms and N = 5000. The frequencies ${\omega }_{y}=2.5\;{\rm{kHz}}$ and ${\omega }_{z}=5\;{\rm{kHz}}$ are kept constant during the simulation. The initial configuration of the confinement potential is characterized by ${\omega }_{x}^{i}=1\;{\rm{kHz}}$ and ${V}_{0}(t=0)=0,$ whereas the final configuration is given by ${\omega }_{x}^{f}=2.5\;{\rm{kHz}}$ and ${V}_{0}(T)={V}_{0}^{*}.$

As in the previous example we consider first the case where the parameters ωx and V0 are changed linearly (see figure 5(a)). Figure 5(b) reveals that the associated infidelity does not drop at all until t = T. For $t\gt T$ we observe a slight decrease of the infidelity. This can be attributed to the fact that, as time evolves, the density of the condensate becomes more evenly distributed in the toroidal trapping potential, bringing its wave function closer to ψd. However, as can be seen from figures 5(c)–(e), the final wave function still differs strongly from the wave function of the desired state at ${t}^{*}=22$ ms.


Figure 5. Loading of a toroidal trap using two-parameter control. The computational domain is given by $([-8,8]\times [-8,8]\times [-2.5,2.5])\;\mu {{\rm{m}}}^{3}.$ In the finest discretization level we use $128\times 128\times 40$ grid points and a time step size of $\triangle t=0.001$ ms. Left column: linear variation of the control parameters. Right column: optimal control of the control parameters. See text for details.


Let us now discuss the result of the optimal control algorithm. An optimal time-evolution of the control parameters is given in figure 5(f). Intuitively, this control can be understood as the result of two separate stages. During the first half of the control, the trap frequency ωx is increased, while the limits imposed on λ2 prohibit any change of V0. During the second half, on the other hand, V0 is adjusted to its final value, while ωx is only subject to small corrections.

Until the end of the control this leads to a drop in the infidelity by approximately three orders of magnitude, as visualized in figure 5(g). Furthermore, the infidelity remains bounded by $3\times {10}^{-3}$ for $t\gt T,$ which is well below the measurement sensitivity in typical experiments. Consequently, only slight deviations from the desired wavefunction at time ${t}^{*}=22$ ms can be observed in figures 5(h)–(j).

3.3. Splitting

In terms of technological applications, a particularly noteworthy realization of BECs is achieved using atom chips [47, 48]. On these chips, micro-fabricated wires allow the precise manipulation of BECs using static, radio-frequency and microwave fields. As a third application we thus consider the splitting of a single condensate into two identical halves using such an atom chip [46]. A visualization is presented in figure 6. This splitting protocol has recently been used to study the non-equilibrium dynamics of quantum gases, revealing subtle effects such as prethermalization [49–52], generalized statistical ensembles [53] and the light-cone-like emergence of thermal correlations [54, 55]. Moreover, it forms the basic building block for integrated matter-wave interferometers [56, 57].


Figure 6. The splitting of a Bose–Einstein condensate, as realized by a radial deformation of an initially harmonic potential into a double well [46]. The two gases in the final picture are completely decoupled, with no more overlap between the respective wave functions. Animations of the full dynamics are available online [33].


3.3.1. Trapping potential

In the experiments the splitting is realized by dressing the static magnetic trapping potential with a strong near-field radio-frequency (RF) field. The unscaled static potential is given by ${V}_{\mathrm{static}}={g}_{{\rm{F}}}{\mu }_{B}{m}_{{\rm{F}}}| {\bf{B}}| ,$ with the magnetic field ${\bf{B}}=({B}_{x},{B}_{y},{B}_{z})$ being well approximated by the famous Ioffe–Pritchard form

$$\mathbf{B}(\mathbf{r}) = B_1\begin{pmatrix}x\\0\\-z\end{pmatrix} + \frac{B_2}{2}\begin{pmatrix}-xy\\[2pt] y^{2}-\tfrac{1}{2}\big(x^{2}+z^{2}\big)\\[2pt] -yz\end{pmatrix} + \begin{pmatrix}0\\B_0\\0\end{pmatrix}.$$

The parameters are given by ${B}_{0}={\hslash }{\omega }_{0}/{m}_{{\rm{F}}}{g}_{{\rm{F}}}{\mu }_{B},{B}_{1}=\sqrt{m{\omega }_{\perp }^{2}{B}_{0}/{m}_{{\rm{F}}}{g}_{{\rm{F}}}{\mu }_{B}}$ and ${B}_{2}=m{\omega }_{\parallel }^{2}/{m}_{{\rm{F}}}{g}_{{\rm{F}}}{\mu }_{B}.$ In the following simulations we consider ${}^{87}$ Rb atoms which are trapped in the $5{{\rm{S}}}_{1/2}\;F=2,{m}_{{\rm{F}}}=2$ state where gF = 1/2. The trap parameters are

The resulting dressed-state potential is given by [58]

$$V_{\mathrm{dressed}}(\mathbf{r}) = \tilde{m}_F\,\hslash\,\sqrt{\Delta_{\mathrm{RF}}(\mathbf{r})^{2} + \Omega_{\mathrm{Rabi}}(\mathbf{r})^{2}},\qquad \Delta_{\mathrm{RF}} = \frac{g_{\rm F}\mu_B\,|\mathbf{B}|}{\hslash} - \omega_{\mathrm{RF}},\qquad \Omega_{\mathrm{Rabi}} = \frac{g_{\rm F}\mu_B\,B_{\mathrm{RF}\perp}}{2\hslash},$$

with ${\tilde{m}}_{F}=2,$ ${\omega }_{\mathrm{RF}}$ the frequency of the RF radiation and ${B}_{\mathrm{RF}\perp}$ denoting the component of the linearly polarized dressing field ${{\bf{B}}}_{\mathrm{RF}}$ that is aligned perpendicular to the static field. As in [54] we use a detuning of ${{\rm{\Delta }}}_{\mathrm{RF}}(0)=-2\pi \times 30\;\mathrm{kHz}$ from the ${m}_{{\rm{F}}}=2\to {m}_{{\rm{F}}}=1$ transition for the simulation. The Rabi-frequency is parameterized by the control parameter λ,

$$\Omega_{\mathrm{Rabi}}(t) = \Omega_{\mathrm{Rabi}}^{\rm f}\,\chi\big(\lambda(t)\big),$$

wherein

$$\lambda(0) = 0,\qquad \lambda(T) = 1,$$

and $\Omega_{\mathrm{Rabi}}^{\rm f}$ denotes the final value of the Rabi-frequency.

The control parameter λ mimics the situation in experiments, where the double well potential is controlled by changing the RF field amplitude through an RF current in a wire. For λ = 0 we recover the static harmonic potential, whereas λ = 1 corresponds to a fully separated double well with no wave function overlap between the two halves of the system. Since the Rabi-frequency is strictly positive in experiments we employ the same saturation function χ as in the previous example (see figure 4).

As the trapping potential is significantly changed during the splitting, the atoms are radially displaced from their equilibrium position in the harmonic trap. Consequently, strong dipole and breathing oscillations are usually observed in experiments. This poses a strong limitation to the use of such systems as interferometers [56]. The minimization of such excitations is therefore one of the main motivations for our optimization.

3.3.2. Numerical simulations: single-parameter control

We illustrate the splitting procedure for N = 2000 atoms and T = 6 ms.

In a first step we again consider the case where the Rabi-frequency is increased linearly (see figure 7(a)). This procedure is similar to the one that is typically used in experiments [49, 53]. At the final time t = T the infidelity has only decreased slightly as can be seen from figure 7(b). Moreover, the infidelity shows the expected strong oscillations for $t\gt T.$ A snapshot of the density at time ${t}^{*}=22.5$ ms is illustrated in figures 7(c)–(e), revealing that there is a large discrepancy between the computed state ψ and the desired state ψd.


Figure 7. Splitting of a BEC using single-parameter optimal control. Left column: linear variation of the control parameter. Right column: optimal control of the control parameter. We note that the time-evolution of ${| \langle {\psi }_{{\rm{d}}},\delta {\psi }_{2}(t)\rangle | }^{2}$ in (g) has been scaled and slightly shifted in time to account for the unknown phase and amplitude of the excitation. The computational domain is given by $([-4,4]\times [-15,+15]\times [-2,2])\;\mu {{\rm{m}}}^{3}$ which is discretized by $96\times 128\times 48$ grid points in the finest discretization level. The corresponding time step is $\triangle t=0.001$ ms. See text for details.


Next, we consider the result of the optimal control algorithm. We find that, irrespective of the specific choice of the initial guess λ0, the algorithm always converges to approximately the same minimizer of the cost functional. The corresponding time-evolution of the Rabi-frequency is shown in figure 7(f). We observe that the Rabi-frequency remains zero for the first few milliseconds. In fact, only about three milliseconds of the optimization time T are used for the transformation of the external potential. This behavior persists even if we increase the optimization time T, with the Rabi-frequency vanishing for an even longer initial period of time. The precise timescale depends on the parameters of the trap, as the optimization algorithm tries to find a compromise between longitudinal and radial directions.

Interestingly, our 3D control qualitatively resembles the result of a previous 1D optimization that included beyond mean-field effects to model the distribution of atoms into the two final gases on the quantum level [57]. In both cases, the initial BEC is first rapidly split into two halves. Subsequently, these two halves are kept close enough to experience a tunnel coupling for a finite time-scale. This qualitative observation is very interesting, as reducing relative number fluctuations can help to significantly enhance the sensitivity of such interferometers. A detailed study of how useful our control can be in this context will be a natural extension of this work.

As a result of the optimal control algorithm the infidelity at the final time T is reduced by more than two orders of magnitude (see figure 7(g)). However, for $t\gt T$ we again observe a strong oscillation. Snapshots of the density distribution at ${t}^{*}=22.5$ ms are given in figures 7(h)–(j).

3.3.3. Bogoliubov–de Gennes analysis

Interestingly, the 6 ms period of the very regular infidelity oscillation shown in figure 7(g) for the optimized splitting is approximately the same as the period of the infidelity oscillation depicted in figure 7(b) for the simple linear splitting. This suggests that the character of the oscillation is determined by the intrinsic properties of the BEC rather than by the splitting protocol.

Indeed, we demonstrate in the following that the oscillations are caused by collective excitations of the BEC, which are created during, but irrespective of the details of, the splitting process. To this end, we show that they are the result of a small deviation δψ from the desired state ψd, which can be described within the Bogoliubov–de Gennes (BdG) framework.

Let therefore ${\rm{\Phi }}({\bf{r}},t)=\phi ({\bf{r}}){{\rm{e}}}^{-{\rm{i}}\mu t/{\hslash }}$ denote an eigenstate solution of the GPE. Here, μ is the corresponding chemical potential and ϕ is a solution of the stationary GPE, ${H}_{0}\phi +g{| \phi | }^{2}\phi =\mu \phi ,$ with ${H}_{0}=-\frac{{\hslash }^{2}}{2m}{\rm{\Delta }}+V.$ We consider a generic state ψ which deviates from the eigenstate solution by a small fluctuation δψ, i.e.,

$$\psi(\mathbf{r},t) = \Big(\phi(\mathbf{r}) + \delta\psi(\mathbf{r},t)\Big)\,\mathrm{e}^{-\mathrm{i}\mu t/\hslash}. \tag{13}$$

In a linear approximation (with respect to δ ψ) this small deviation is given by

$$\delta\psi(\mathbf{r},t) = u(\mathbf{r})\,\mathrm{e}^{-\mathrm{i}\omega t} + v^{*}(\mathbf{r})\,\mathrm{e}^{\mathrm{i}\omega t}, \tag{14}$$

where u, v and ω are defined via the solutions of the BdG equations [59, 60]

$$\hslash\omega\,u = \big(H_0 - \mu + 2g\,\phi^{2}\big)\,u + g\,\phi^{2}\,v,\qquad -\hslash\omega\,v = \big(H_0 - \mu + 2g\,\phi^{2}\big)\,v + g\,\phi^{2}\,u. \tag{15}$$

We want to investigate small fluctuations δψ corresponding to some of the lowest energy eigenvalues ${\hslash }\omega$ in equation (15). To this end, we proceed in a conceptually similar way to [61], where numerical methods are used to investigate the stability and decay rates of non-isotropic attractive BECs. As in [61] we consider the full 3D problem. However, for the discretization of the operators in (15) we employ a high-order finite difference discretization rather than working in a Fourier basis. By gradually increasing the spatial resolution of the finite difference discretization we are able to verify the convergence of the algorithm. A detailed description of our implementation is again given in the appendix.

As an example, we find that the first three eigenfrequencies converge towards ${\omega }_{1}=\pm 314.54\;{\rm{Hz}},$ ${\omega }_{2}=\pm 523.49\;{\rm{Hz}}$ and ${\omega }_{3}=\pm 734.26\;{\rm{Hz}}.$ Subsequently, the corresponding eigenfunctions $({u}_{i},{v}_{i})$ are normalized according to the norm [60]

$$\int_{\mathbb{R}^{3}}\Big(|u(\mathbf{r})|^{2} - |v(\mathbf{r})|^{2}\Big)\,\mathrm{d}\mathbf{r} = 1. \tag{16}$$

Knowing the frequencies ωi and amplitude functions ui and vi, it is possible to investigate the time-evolution of the excitations given by equation (14). It turns out that ${| \delta {\psi }_{i}(t)| }^{2}$ can be well described by a simple periodic oscillation in amplitude, while the shape remains mostly unchanged (see left column in figure 8). As ui and vi are purely real-valued functions, which approximately fulfill vi = −ui (see right column in figure 8) we find $\delta {\psi }_{i}({\boldsymbol{r}},{{\mathcal{T}}}_{i}/2)\approx \delta {\psi }_{i}({\boldsymbol{r}},{{\mathcal{T}}}_{i})$ and hence the effective oscillation periods are halved with respect to the eigenvalues found above, i.e. ${{\mathcal{T}}}_{\mathrm{eff},i}={{\mathcal{T}}}_{i}/2=\pi /{\omega }_{i}.$ In detail we find ${{\mathcal{T}}}_{\mathrm{eff},1}=9.99$ ms, ${{\mathcal{T}}}_{\mathrm{eff},2}=6.00$ ms and ${{\mathcal{T}}}_{\mathrm{eff},3}=4.28$ ms.


Figure 8. Solutions of the Bogoliubov–de Gennes equations using a 6th-order finite difference discretization for N = 2000 atoms. Left: density of the first three (scaled) excitations $\delta {\psi }_{1}({\bf{r}},t),\delta {\psi }_{2}({\bf{r}},t)$ and $\delta {\psi }_{3}({\bf{r}},t)$ at $t={{\mathcal{T}}}_{\mathrm{eff},1}/2,$ $t={{\mathcal{T}}}_{\mathrm{eff},2}/2$ and $t={{\mathcal{T}}}_{\mathrm{eff},3}/2.$ Right: normalized (with respect to the inner product (16)) amplitude functions u and v evaluated along the longitudinal direction at x = xs and z = 0. All functions are purely real-valued.


Note that the effective period of the second excitation is very close to the period of the oscillation of the infidelity observed above. Indeed, plotting the time-evolution of ${| \langle {\psi }_{{\rm{d}}},\delta {\psi }_{2}(t)\rangle | }^{2}$ along with the time-evolution of the infidelity in figure 7(g) demonstrates clearly that the oscillation of the infidelity is dominated by the second excitation. As further evidence, we extract the deviation of ψ from ψd from our simulation. A comparison shows again very good agreement with the time-evolution of $\delta {\psi }_{2}(t)$ (see the appendix).

The fact that only the second but not the first excitation contributes to the observations can be understood from symmetry arguments. The first excitation corresponds to an antisymmetric wave function with respect to the longitudinal direction, whereas the second excitation is symmetric. During the splitting process, the halving of the atom number in each of the two gases, as well as an overall change in the longitudinal trapping potential leads to a symmetric change in the extension of the BEC in this direction. If the control is unable to compensate for this change in extension, the second Bogoliubov–de Gennes mode is automatically excited.

This effect is especially pronounced for the linear splitting. In contrast to that, the optimal control algorithm can still reduce the infidelity at t = T, but even a small deviation of the wave function from the stationary state leads to a strong oscillation in the infidelity for $t\geqslant T.$

Once the wave function differs from the stationary state in the longitudinal direction it is impossible to stop the observed oscillation by a simple variation of the Rabi-frequency. The BEC will thus oscillate for $t\gt T$ after the end of the control.

A central role in this scenario is played by the longitudinal frequency ωy. The smaller ωy, the larger the extension of the condensate in the longitudinal direction. In analogy to a classical harmonic oscillator, this increases the susceptibility to small deviations from the equilibrium position. We have confirmed this intuition with additional simulations, finding an even more pronounced excitation of the second mode for smaller ωy.

This is particularly noteworthy with respect to experiments studying BECs in the 1D limit, where ${\omega }_{x,z}\gg {\omega }_{y}$ [53]. Intuitively, such experiments should be very well described through a 1D approximation, where only a reduced GPE for the x-direction has to be considered (see the appendix). Our results here show that such an approach will, in general, also lead to a strong breathing oscillation. Even if the 1D control is able to reach the 1D desired state with high precision, it does not necessarily describe the experimental reality and will thus fail in 3D.

3.3.4. Numerical simulations: two-parameter control

In the last part of this article we will show how the oscillations reported above can be eliminated using a more sophisticated control scheme that is made possible by the 3D character of our control and that involves a manipulation of the trapping potential along the longitudinal direction. In experiments on atom chips, this manipulation can, for example, be realized using additional wire structures, which provide longitudinal confinement independent of the main radial trapping structures [62].

In analogy to the previous examples, we consider the following parameterization of ${{\rm{\Omega }}}_{\mathrm{rabi}}$ and ωy:

$$\Omega_{\mathrm{Rabi}}(t) = \Omega_{\mathrm{Rabi}}^{\rm f}\,\chi\big(\lambda_1(t)\big), \tag{17a}$$

$$\omega_y(t) = \omega_y(0)\,\lambda_2(t), \tag{17b}$$

with

$$\lambda_1(0) = 0,\qquad \lambda_1(T) = 1,\qquad \lambda_2(0) = \lambda_2(T) = 1.$$

The only difference to the previous example is thus that the value of the longitudinal trap frequency ωy is now part of the control. We still fix ${\omega }_{y}(t=0)=2\pi \times 85\;{\rm{Hz}}$ and ${\omega }_{y}(t=T)=2\pi \times 85\;{\rm{Hz}}$ such that the initial and desired final states remain unchanged.

Using ${{\boldsymbol{\lambda }}}^{0}(t)=[t/T,1]$ for $t\in [0,T]$ as an initial guess the optimization algorithm converges to a solution which reduces the cost functional by more than three orders of magnitude. The time-evolution of the corresponding physical parameters is given in figure 9(a). As can be seen from figure 9(b) the infidelity remains very low for $t\geqslant T.$ Snapshots of the density distribution at time ${t}^{*}=22.5$ ms confirm that the deviation from the desired state is extremely small, see figures 9(d)–(f).


Figure 9. (a) Time-evolution of ${{\rm{\Omega }}}_{\mathrm{rabi}}$ and ωy corresponding to two-parameter optimal control of the splitting process. (b) Associated infidelity. (c) Comparison of the infidelities for the linear and the optimal single- and two-parameter control. (d)–(f) Snapshots of the density at time ${t}^{*}=22.5$ ms.


In the given example we have chosen $T={{\mathcal{T}}}_{\mathrm{eff},2}.$ In contrast to that, for a time $T\lt {{\mathcal{T}}}_{\mathrm{eff},2}$ we find significantly worse results. The minimum time scale T is thus set by the oscillation period of the excitation that the control aims to stop. This oscillation period is in turn set by the geometry of the trap. Each different experimental situation will thus require carefully chosen parameters for the control.

4. Conclusion and outlook

In this work we have presented the first optimal control of the GPE in 3D. As we have shown, this situation is inherently more difficult than the optimal control of the 1D GPE because of the nonlinear coupling of different coordinate directions. We have performed a detailed analysis of the resulting small excitations, which we were able to minimize by extending previous control schemes from a single to a multi-parameter control.

In contrast to 1D approximations, our 3D approach allows the study of realistic potentials. This will have a direct impact on the quality of experiments and will therefore provide an important step in the ongoing effort to use the properties of the quantum world for real-life applications. Importantly, our scheme is not limited to the examples discussed in this work but rather very flexible, with many more applications conceivable.

A straightforward extension of our numerical solver could include the treatment of excited states. This would allow the 3D study of a recent experiment, where the BEC was transferred to the first excited state of the trapping potential via a 1D optimal control sequence [24]. Based on our observations we expect an even stronger excitation of BdG-modes in such an experiment. In that context, another interesting application would be to replace the cigar-shaped confinement potentials used in the splitting and vibrational state inversion experiments by torus-shaped trapping potentials. Due to the different topology, the issues related to the excitation of small perturbations are expected to be strongly reduced.

Another obvious extension of this work could be to consider different cost functionals. More precisely, it would be interesting to investigate whether it is possible to reduce the optimization times T by using other cost functionals which are not based on the infidelity but rather on a conserved quantity like the total energy.

Finally, interesting further directions include the study of beyond mean-field effects using the multi-configurational time-dependent Hartree framework for bosons [63] or the optimization of finite temperature states.

Acknowledgments

We thank Jörg Schmiedmayer, Wolfgang Rohringer and Alexander Pikovski for helpful discussions. RML is supported by the Hertha-Firnberg Program of the Austrian Science Fund (FWF), Grant T402-N13. DM acknowledges Deutsche Forschungsgemeinschaft Collaborative Research Center TRR 109, Discretization in Geometry and Dynamics. TL acknowledges support by the FWF through the Doctoral Programme CoQuS (W1210) and by the Alexander von Humboldt Foundation through a Feodor Lynen Research Fellowship.

Appendix

A.1. Gradient of the reduced cost functional

In the main text we have introduced the cost functional $J({\boldsymbol{\lambda }},\psi )$ in (3) and the reduced cost functional $\hat{J}({\boldsymbol{\lambda }})=J({\boldsymbol{\lambda }},{\psi }_{{\boldsymbol{\lambda }}})$ in (7). Recall that, for a given control ${\boldsymbol{\lambda }}\;:(0,T)\to {{\mathbb{R}}}^{m}$ satisfying the boundary conditions (6c), ${\psi }_{{\boldsymbol{\lambda }}}$ is the solution to the initial value problem (5a) and (6a) for the GPE with the corresponding potential ${V}_{{\boldsymbol{\lambda }}}.$ Below, we argue why the H1-gradient of $\hat{J}$ is given by the component ${\boldsymbol{\Lambda }}\;:(0,T)\to {{\mathbb{R}}}^{m}$ of the solution $(\psi ,p,{\boldsymbol{\Lambda }})$ to the system consisting of (5a), (5b) and

$$-\partial_t^{2}{\boldsymbol{\Lambda}}(t) = -\gamma\,\partial_t^{2}{\boldsymbol{\lambda}}(t) - \mathrm{Re}\,\big\langle p(t),\ \big(\partial_{\boldsymbol{\lambda}}V_{\boldsymbol{\lambda}}\big)\,\psi(t)\big\rangle, \tag{18}$$

subject to the initial and terminal conditions (6a), (6b) and

$$\boldsymbol{\Lambda}(0) = \boldsymbol{\Lambda}(T) = \mathbf{0}. \tag{19}$$

Before discussing the gradient, we first calculate the variational derivative of $\hat{J}.$ As is customary in the context of optimization problems, we express the validity of the GPE (5a) in the form of a constraint Z = 0, with the constraint functional

$$Z({\boldsymbol{\lambda}},\psi) = \mathrm{i}\,\partial_t\psi - \Big({-}\tfrac{1}{2}\Delta + V_{\boldsymbol{\lambda}} + g\,|\psi|^{2}\Big)\psi.$$

By definition, ${\psi }_{{\boldsymbol{\lambda }}}$ satisfies $Z({\boldsymbol{\lambda }},{\psi }_{{\boldsymbol{\lambda }}})=0,$ hence

$$\hat{J}({\boldsymbol{\lambda}}) = J({\boldsymbol{\lambda}},\psi_{\boldsymbol{\lambda}}) = L({\boldsymbol{\lambda}},\psi_{\boldsymbol{\lambda}},p), \tag{20}$$

where L denotes the Lagrangian which was defined in equation (4) of the main text.

Equation (20) holds for arbitrary smooth functions $p\;:(0,T)\to {L}^{2}({{\mathbb{R}}}^{3};{\mathbb{C}}).$ For fixed p, differentiation of $\hat{J}$ in the direction $\delta {\boldsymbol{\lambda }}$ yields

$$D_{\boldsymbol{\lambda}}\hat{J}({\boldsymbol{\lambda}})[\delta{\boldsymbol{\lambda}}] = D_{\boldsymbol{\lambda}}L({\boldsymbol{\lambda}},\psi_{\boldsymbol{\lambda}},p)[\delta{\boldsymbol{\lambda}}] + D_{\psi}J({\boldsymbol{\lambda}},\psi_{\boldsymbol{\lambda}})[\delta\psi] + \mathrm{Re}\int_0^T\big\langle p,\ D_{\psi}Z({\boldsymbol{\lambda}},\psi_{\boldsymbol{\lambda}})[\delta\psi]\big\rangle\,\mathrm{d}t, \tag{21}$$

where $\delta \psi$ is the variation in ${\psi }_{{\boldsymbol{\lambda }}}$ induced by the variation $\delta {\boldsymbol{\lambda }}$ of ${\boldsymbol{\lambda }},$ i.e., it satisfies ${D}_{{\boldsymbol{\lambda }}}Z({\boldsymbol{\lambda }},{\psi }_{{\boldsymbol{\lambda }}})[\delta {\boldsymbol{\lambda }}]+{D}_{\psi }Z({\boldsymbol{\lambda }},{\psi }_{{\boldsymbol{\lambda }}})[\delta \psi ]=0$ and $\delta \psi (0)=0.$ For simplification of ${D}_{{\boldsymbol{\lambda }}}\hat{J},$ we choose p, which has been arbitrary up to this point, such that the last two terms in (21) cancel. Indeed, taking p as a solution to the terminal value problem (5b) and (6b), it follows that

$$D_{\psi}J({\boldsymbol{\lambda}},\psi_{\boldsymbol{\lambda}})[\delta\psi] + \mathrm{Re}\int_0^T\big\langle p,\ D_{\psi}Z({\boldsymbol{\lambda}},\psi_{\boldsymbol{\lambda}})[\delta\psi]\big\rangle\,\mathrm{d}t = 0.$$

To arrive at this result, we have performed an integration by parts with respect to time, using the terminal condition (6b) and the fact that $\delta \psi (0)=0$ thanks to the initial condition (6a). In view of these cancellations, equation (21) simplifies to

$$D_{\boldsymbol{\lambda}}\hat{J}({\boldsymbol{\lambda}})[\delta{\boldsymbol{\lambda}}] = \int_0^T\Big(\gamma\,\partial_t{\boldsymbol{\lambda}}\cdot\partial_t\delta{\boldsymbol{\lambda}} - \mathrm{Re}\,\big\langle p,\ \big(\partial_{\boldsymbol{\lambda}}V_{\boldsymbol{\lambda}}\big)\,\psi\big\rangle\cdot\delta{\boldsymbol{\lambda}}\Big)\,\mathrm{d}t. \tag{22}$$

We are now in a position to calculate the H1-gradient of $\hat{J}.$ Recall that the Sobolev space ${H}^{1}(0,T;{{\mathbb{R}}}^{m})$ consists of all square integrable functions ${\boldsymbol{\lambda }}\in {L}^{2}(0,T;{{\mathbb{R}}}^{m})$ that possess a weak derivative ${\partial }_{t}{\boldsymbol{\lambda }}\in {L}^{2}(0,T;{{\mathbb{R}}}^{m}).$ Functions ${\boldsymbol{\lambda }}\in {H}^{1}(0,T;{{\mathbb{R}}}^{m})$ are actually Hölder continuous, and therefore, they have well-defined boundary values at t = 0 and t = T. It is natural to consider the reduced cost functional $\hat{J}$ as defined on ${H}_{*}^{1}(0,T;{{\mathbb{R}}}^{m}),$ which is the affine subspace of functions ${\boldsymbol{\lambda }}\in {H}^{1}(0,T;{{\mathbb{R}}}^{m})$ that satisfy the boundary conditions (6c). Indeed, any admissible control ${\boldsymbol{\lambda }}:\;(0,T)\to {{\mathbb{R}}}^{m}$ must produce a finite value in the penalty term in J, which implies that ${\partial }_{t}{\boldsymbol{\lambda }}\in {L}^{2}(0,T;{{\mathbb{R}}}^{m}).$ The tangent space to ${H}_{*}^{1}(0,T;{{\mathbb{R}}}^{m}),$ i.e., the space of possible variations $\delta {\boldsymbol{\lambda }},$ is the linear subspace ${H}_{0}^{1}(0,T;{{\mathbb{R}}}^{m})$ of all functions ${\boldsymbol{\Lambda }}\in {H}^{1}(0,T;{{\mathbb{R}}}^{m})$ with vanishing boundary values, ${\boldsymbol{\Lambda }}(0)={\boldsymbol{\Lambda }}(T)=0.$ This is a Hilbert space with respect to the inner product

$$({\boldsymbol{\Lambda}}_1,{\boldsymbol{\Lambda}}_2) = \int_0^T \partial_t{\boldsymbol{\Lambda}}_1(t)\cdot\partial_t{\boldsymbol{\Lambda}}_2(t)\,\mathrm{d}t.$$

By definition, the gradient of $\hat{J}$ with respect to the inner product $(\cdot ,\cdot )$ is the uniquely determined element ${\boldsymbol{\Lambda }}\in {H}_{0}^{1}(0,T;{{\mathbb{R}}}^{m})$ such that $({\boldsymbol{\Lambda }},\delta {\boldsymbol{\lambda }})={D}_{{\boldsymbol{\lambda }}}\hat{J}({\boldsymbol{\lambda }})[\delta {\boldsymbol{\lambda }}]$ for all variations $\delta {\boldsymbol{\lambda }}\in {H}_{0}^{1}(0,T;{{\mathbb{R}}}^{m}).$ In view of (22), ${\boldsymbol{\Lambda }}$ satisfies

$$\int_0^T \partial_t{\boldsymbol{\Lambda}}\cdot\partial_t\delta{\boldsymbol{\lambda}}\,\mathrm{d}t = \int_0^T\Big(\gamma\,\partial_t{\boldsymbol{\lambda}}\cdot\partial_t\delta{\boldsymbol{\lambda}} - \mathrm{Re}\,\big\langle p,\ \big(\partial_{\boldsymbol{\lambda}}V_{\boldsymbol{\lambda}}\big)\,\psi\big\rangle\cdot\delta{\boldsymbol{\lambda}}\Big)\,\mathrm{d}t, \tag{23}$$

and the requirement ${\boldsymbol{\Lambda }}\in {H}_{0}^{1}(0,T;{{\mathbb{R}}}^{m})$ enforces the boundary conditions (19). To verify that the solution ${\boldsymbol{\Lambda }}$ to the boundary value problem (18) and (19) satisfies (23), it suffices to integrate by parts in the time integral on the left-hand side, using that $\delta {\boldsymbol{\lambda }}(0)=\delta {\boldsymbol{\lambda }}(T)=0.$

A.2. Algorithms and implementation

A.2.1. Numerical evaluation of the cost functional

The evaluation of the reduced cost functional (7) for a given control curve ${\boldsymbol{\lambda }}$ implicitly involves the computation of ${\psi }_{{\boldsymbol{\lambda }}},$ that is, the solution of the GPE. No analytical solutions are available in general, so we use a numerical approximation. For brevity of notation, we write ψ instead of ${\psi }_{{\boldsymbol{\lambda }}}$ in the following.

For the numerical computation of the first term in (3), that is $\frac{1}{2}\big(1-{| \langle {\psi }_{{\rm{d}}},\psi (T)\rangle | }^{2}\big),$ we have to solve the GPE (5a) with initial data (6a) for $t\in [0,T].$ Our simulations are performed on the spatial domain

$$\Omega = \Big({-}\tfrac{L_x}{2},\tfrac{L_x}{2}\Big)\times\Big({-}\tfrac{L_y}{2},\tfrac{L_y}{2}\Big)\times\Big({-}\tfrac{L_z}{2},\tfrac{L_z}{2}\Big),$$

with ${L}_{x},{L}_{y}$ and Lz chosen sufficiently large to capture the significant part of the rapidly decaying solution ψ.

For numerical discretization in time, we employ the following time-splitting spectral method [32]:

$$\psi(t_{n+1}) \approx \mathrm{e}^{-\mathrm{i}\,B^{+}\triangle t/2}\;\mathrm{e}^{-\mathrm{i}\,A\,\triangle t}\;\mathrm{e}^{-\mathrm{i}\,B^{-}\triangle t/2}\;\psi(t_n), \tag{24}$$

with operators $A=-\frac{1}{2}{\rm{\Delta }},$ ${B}^{\pm }={V}_{{{\boldsymbol{\lambda }}}^{(n+1/2)}}+g{| {\psi }_{n}^{\pm }| }^{2},$ and with ${t}_{n}=n\triangle t,$ $n=0,\ldots ,N-1$ such that $N\triangle t=T.$ Here ${{\boldsymbol{\lambda }}}^{(n+1/2)}=\frac{1}{2}\;({\boldsymbol{\lambda }}({t}_{n})+{\boldsymbol{\lambda }}({t}_{n+1})),$ and the choice of ${\psi }_{n}^{\pm }$ is given below. Thus, the nth time step consists of the following three sub-steps. First, solve ${\rm{i}}{\partial }_{t}\psi =({V}_{{{\boldsymbol{\lambda }}}^{(n+1/2)}}+g{| \psi ({t}_{n})| }^{2})\psi$ for a duration of $\triangle t/2$ with initial value $\psi ({t}_{n});$ thus ${\psi }_{n}^{-}=\psi ({t}_{n}).$ The result is used as initial value for the free Schrödinger equation ${\rm{i}}{\partial }_{t}\psi =-\frac{1}{2}{\rm{\Delta }}\psi ,$ which is then solved for a duration of $\triangle t;$ the result is ${\psi }_{n}^{+}.$ Finally, ${\rm{i}}{\partial }_{t}\psi =({V}_{{{\boldsymbol{\lambda }}}^{(n+1/2)}}+g{| {\psi }_{n}^{+}| }^{2})\psi$ is solved with initial value ${\psi }_{n}^{+},$ again for a duration of $\triangle t/2.$ The result of the third sub-step is taken as $\psi ({t}_{n+1}).$

The free Schrödinger equation is solved using the Fourier spectral method. To this end, the wave function ψ is interpolated by a trigonometric polynomial on the grid points of the Cartesian grid

$$x_{j_x} = -\tfrac{L_x}{2} + j_x\,\triangle x,\qquad y_{j_y} = -\tfrac{L_y}{2} + j_y\,\triangle y,\qquad z_{j_z} = -\tfrac{L_z}{2} + j_z\,\triangle z,$$

where $\triangle x={L}_{x}/{J}_{x}$ with ${j}_{x}=0,\ldots ,{J}_{x}-1$ etc. Thus, at time tn, the wave function $\psi ({t}_{n})$ is represented by a 3D array of complex numbers ${{\boldsymbol{\psi }}}^{(n)}\in {{\mathbb{C}}}^{{J}_{x},{J}_{y},{J}_{z}}.$

As Matlab code, the nth time step reduces to a few vectorized array operations (listing (25)).

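A minimal self-contained sketch of such a step, with our own variable names and illustrative parameters (the final three lines realize the three sub-steps of (24) and play the role of listing (25)):

    % Setup: grid, wavenumbers, free propagator and an illustrative potential.
    Jx = 64; Jy = 64; Jz = 32;                     % grid points per direction
    Lx = 20; Ly = 20; Lz = 5;                      % domain lengths (scaled units)
    dt = 1e-3; g = 329;                            % time step and coupling
    [x,y,z] = ndgrid(-Lx/2 + (0:Jx-1)*Lx/Jx, ...
                     -Ly/2 + (0:Jy-1)*Ly/Jy, ...
                     -Lz/2 + (0:Jz-1)*Lz/Jz);
    kx = (2*pi/Lx)*[0:Jx/2-1, -Jx/2:-1];           % Fourier wavenumbers
    ky = (2*pi/Ly)*[0:Jy/2-1, -Jy/2:-1];
    kz = (2*pi/Lz)*[0:Jz/2-1, -Jz/2:-1];
    [KX,KY,KZ] = ndgrid(kx,ky,kz);
    M = exp(-0.5i*dt*(KX.^2 + KY.^2 + KZ.^2));     % free propagator exp(-i*A*dt)
    V = 0.5*(x.^2 + y.^2 + 4*z.^2);                % V at t_{n+1/2} (illustrative)
    dVol = (Lx/Jx)*(Ly/Jy)*(Lz/Jz);
    psi = exp(-(x.^2 + y.^2 + 2*z.^2)/2);
    psi = psi/sqrt(sum(abs(psi(:)).^2)*dVol);      % normalized initial state
    % --- the nth time step, cf. equation (24) ---
    psi = exp(-0.5i*dt*(V + g*abs(psi).^2)).*psi;  % half step with B^-
    psi = ifftn(M.*fftn(psi));                     % full step with A
    psi = exp(-0.5i*dt*(V + g*abs(psi).^2)).*psi;  % half step with B^+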

The array ${\bf{M}}$ represents the action of the free Schrödinger propagator in Fourier space. Due to the vectorized implementation in Matlab this procedure is highly efficient.

The method is of second order in time and of spectral accuracy in space, provided that ψ0 and ${V}_{{\boldsymbol{\lambda }}}$ are sufficiently smooth. In comparison to a finite difference Crank–Nicolson scheme (see for example [21, 32]), the solution of a linear evolution equation is avoided, and fewer grid points J = Jx Jy Jz are needed to achieve the same quality of approximation for ψ.

Typically, the numerical costs for our implementation (25) are dominated by the fast Fourier transforms fftn and ifftn, which are of order ${\mathcal{O}}(J\mathrm{log}J).$ However, in some simulations (splitting), the costs for computing the external potential ${V}_{{{\boldsymbol{\lambda }}}^{(n+1/2)}}$ exceed those of the Fourier transforms.

For the numerical solution of the optimization problem, on the order of 10 to 100 evaluations of the cost functional are needed. The respective solution of the time-dependent GPE is performed on the graphics processing unit (GPU) of a powerful graphics card. Thanks to the vectorized implementation (25), it suffices to initialize the arrays ${\boldsymbol{\psi }}$ and ${\bf{V}}$ once at the beginning, using the Matlab command gpuArray. For handling the intermediate results, and for transferring the data back to the memory of the main processor at the end of the computation, we use the command gather. The trap potentials need to be updated in each time step. However, these calculations can be performed in a vectorized way on the GPU as well.
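A sketch of this offloading, reusing the arrays of the listing above (requires the Parallel Computing Toolbox; the potential is kept static here for brevity, whereas in the actual optimization it is rebuilt on the GPU in every step):

    psi = gpuArray(psi); V = gpuArray(V); M = gpuArray(M);
    for n = 1:1000                                 % all steps run on the GPU
        psi = exp(-0.5i*dt*(V + g*abs(psi).^2)).*psi;
        psi = ifftn(M.*fftn(psi));
        psi = exp(-0.5i*dt*(V + g*abs(psi).^2)).*psi;
    end
    psi = gather(psi);                             % copy result back to the host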

Finally, we compute $1/2\left(1-{| \langle {\psi }_{{\rm{d}}},\psi (T)\rangle | }^{2}\right)$ with $\psi (T)\approx {{\boldsymbol{\psi }}}^{(N)},$ using a quadrature formula. The integral $\displaystyle \frac{\gamma }{2}{\displaystyle \int }_{0}^{T}{| {\partial }_{t}{\boldsymbol{\lambda }}(t)| }^{2}\;{\rm{d}}t$ is computed by a quadrature formula as well, using a finite difference formula of second order for the approximation of the time derivative ${\partial }_{t}{\boldsymbol{\lambda }}.$

A.2.2. Numerical computation of the gradient

According to (11a), the H1-gradient ${\rm{\nabla }}\hat{J}({\boldsymbol{\lambda }})$ is obtained as the solution to the second order problem

$$-\partial_t^{2}\big[\nabla\hat{J}({\boldsymbol{\lambda}})\big](t) = \mathbf{r}({\boldsymbol{\lambda}})(t),\qquad \mathbf{r}({\boldsymbol{\lambda}}) := -\gamma\,\partial_t^{2}{\boldsymbol{\lambda}} - \mathrm{Re}\,\big\langle p,\ \big(\partial_{\boldsymbol{\lambda}}V_{\boldsymbol{\lambda}}\big)\,\psi\big\rangle, \tag{26}$$

subject to the boundary conditions $\left[{\rm{\nabla }}\hat{J}({\boldsymbol{\lambda }})\right](0)={\bf{0}}$ and $\left[{\rm{\nabla }}\hat{J}({\boldsymbol{\lambda }})\right](T)={\bf{0}}.$ The time derivatives are discretized by second order finite differences.
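For a single control component this amounts to one sparse tridiagonal solve per gradient evaluation; a minimal sketch with a placeholder right-hand side (our notation):

    % Solving -d^2/dt^2 gradJ = r with gradJ(0) = gradJ(T) = 0, cf. (26).
    Nt = 900; T = 9; dt = T/Nt;
    t = dt*(1:Nt-1).';                             % interior time grid points
    r = sin(pi*t/T);                               % placeholder for r(lambda)
    e = ones(Nt-1,1);
    D2 = spdiags([-e, 2*e, -e], -1:1, Nt-1, Nt-1)/dt^2;  % discrete -d^2/dt^2
    gradJ = [0; D2\r; 0];                          % boundary values built in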

To evaluate the right-hand side ${\bf{r}}({\boldsymbol{\lambda }}),$ the functions ψ and p need to be determined for $t\in [0,T].$ First, the state equation (5a) is solved as described above. Then, the adjoint equation (5b) is solved backwards in time, with the terminal condition (6b). For the solution of the adjoint equation, a time-splitting method is applied as well: we alternately solve the equations ${\rm{i}}{\partial }_{t}p={V}_{{\boldsymbol{\lambda }}}p+2g{| \psi | }^{2}p+g{\psi }^{2}{p}^{*}$ and ${\rm{i}}{\partial }_{t}p=-\frac{1}{2}{\rm{\Delta }}p.$ The free Schrödinger equation is discretized by the Fourier-spectral method, and the value of ${\partial }_{{\boldsymbol{\lambda }}}{V}_{{\boldsymbol{\lambda }}}$ at time $t=(n-1/2)\triangle t$ is computed by means of the complex-step derivative approximation [64].

For the integration of the adjoint equation on the time interval $[(n-1)\triangle t,n\triangle t],$ an approximation of the wave function ${\psi }^{(n-1/2)}=\frac{1}{2}\;({\psi }^{(n-1)}+{\psi }^{(n)})$ is needed. Since it is impossible to store the arrays ${{\boldsymbol{\psi }}}^{(n)}$ for every time step $n=0,\ldots ,N$ on the graphics card, the state equation is simultaneously solved backwards in time as well. The procedure is sketched in figure A1: the calculation of ${{\bf{p}}}^{(n-1)}$ involves only two instances of the wave function and the 'old' adjoint state—that is, ${{\boldsymbol{\psi }}}^{(n)},{{\boldsymbol{\psi }}}^{(n-1)},$ and ${{\bf{p}}}^{(n)}.$ As soon as the approximations of ${{\boldsymbol{\psi }}}^{(n-1)}$ and ${{\bf{p}}}^{(n-1)}$ are available, ${{\bf{r}}}^{(n-1)}$ can be computed as well. In this way it is enough to store four 3D arrays at each time step, plus the values of all available ${{\bf{r}}}^{(n)}$ with n = 0, ..., N (whose storage space is negligible).
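The backward propagation of the state equation exploits the structure of the splitting scheme: the potential sub-steps are pure phase factors that leave |ψ| unchanged, so each forward step can be undone exactly. A sketch, with arrays as in the listing of section A.2.1:

    % Undoing one splitting step: apply the inverse sub-steps in reverse order.
    psi = exp(0.5i*dt*(V + g*abs(psi).^2)).*psi;   % invert the B^+ half step
    psi = ifftn(conj(M).*fftn(psi));               % invert the free step
    psi = exp(0.5i*dt*(V + g*abs(psi).^2)).*psi;   % invert the B^- half step
    % psi now approximates the wave function one time step earlier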


Figure A1. Computation of the source term ${\bf{r}}$ needed to determine the gradient: the calculation of the array ${{\bf{p}}}^{(n-1)}$ involves only two instances of the wave function ${{\boldsymbol{\psi }}}^{(n)},{{\boldsymbol{\psi }}}^{(n-1)}$ and the current adjoint state ${{\bf{p}}}^{(n)}.$ As soon as the approximations of ${{\boldsymbol{\psi }}}^{(n-1)}$ and ${{\bf{p}}}^{(n-1)}$ are available, also ${{\bf{r}}}^{(n-1)}$ can be computed. At each time step only the gray shaded objects need to be stored in the memory of the graphics card. The storage space for ${{\bf{r}}}^{({\ell })}$ with ${\ell }=0,...,N$ is negligibly small.


A further difficulty in the numerical computation of the adjoint equation arises from the complex-conjugate quantity p* in $g{\psi }^{2}{p}^{*}.$ Without going into details, we refer to the implementation in [27], which can easily be applied to the 3D case. As in the case of the GPE, the computation of ${\bf{r}}$ can be significantly accelerated by using the graphics card. Still, the costs for the computation of the gradient are three to four times higher than for the evaluation of the cost functional.

A.2.3. Computation of the initial and desired final states

The initial and terminal states ψ0 and ψd are assumed to be ground state solutions of the stationary GPE. We compute them by imaginary time propagation [65, 66] (also known as normalized gradient flow): the time step $\triangle t$ in (25) is replaced by $-{\rm{i}}\triangle t,$ and the wave function ϕ is normalized after every time step. By using adaptive time stepping, we reach a sufficiently accurate solution at justifiable numerical cost.
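A sketch of this gradient flow, again reusing the grid, potential and coupling of the listing in section A.2.1 (fixed step size here; the actual implementation uses adaptive time stepping):

    tau  = -1i*dt;                                 % imaginary time step
    Mtau = exp(-0.5i*tau*(KX.^2 + KY.^2 + KZ.^2)); % now a decaying multiplier
    phi  = exp(-(x.^2 + y.^2 + z.^2)/2);           % initial guess
    phi  = phi/sqrt(sum(abs(phi(:)).^2)*dVol);
    for n = 1:2000
        phi = exp(-0.5i*tau*(V + g*abs(phi).^2)).*phi;
        phi = ifftn(Mtau.*fftn(phi));
        phi = exp(-0.5i*tau*(V + g*abs(phi).^2)).*phi;
        phi = phi/sqrt(sum(abs(phi(:)).^2)*dVol);  % renormalize after every step
    end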

A.2.4. Further details of the implementation

For the numerical solution of the considered optimal control problems we use a personal computer (Intel i7-4770K CPU @ 3.50 GHz, 8 threads) and Matlab. The parts with the highest numerical costs, that is, the solution of the partial differential equations and the computation of the external potentials, are performed on the graphics card (GeForce GTX TITAN), which accelerates the calculations significantly. The evaluation of the Fourier transform on the finest space discretization, for example, can be accelerated by a factor of 4–6. In this context, it is important to mention that the CPU-version of fftn in Matlab is parallelized as well and hence uses all cores available on the CPU.

In general it is useful to initially solve each optimal control problem with a small number of Fourier modes Jx, Jy, Jz and with a relatively large time step $\triangle t.$ Subsequently, the same optimal control problem is solved on a finer grid and with a smaller time step, whereby the control obtained in the previous computation is used as initial data ${{\boldsymbol{\lambda }}}^{0}.$ We repeat this procedure until the computed control curve no longer differs from the control curve of the finer discretization.

We consider a sequence of discretization parameters

$\left({J}_{x}^{({\ell })},{J}_{y}^{({\ell })},{J}_{z}^{({\ell })},{(\triangle t)}^{({\ell })}\right),\quad {\ell }=1,\ldots ,M,$

with ${J}_{x}^{({\ell }+1)}\gt {J}_{x}^{({\ell })},{J}_{y}^{({\ell }+1)}\gt {J}_{y}^{({\ell })},{J}_{z}^{({\ell }+1)}\gt {J}_{z}^{({\ell })}$ and ${(\triangle t)}^{({\ell }+1)}\lt {(\triangle t)}^{({\ell })}$ for ${\ell }=1,\ldots ,M-1.$ Typically, on the order of 10 to 100 iterations of the conjugate gradient method are needed to solve the optimal control problem on the coarsest grid, which takes a few minutes of computation time. The numerical cost of each single iteration increases rapidly with the discretization level. At the same time, fewer iterations are required once the control is close to the local minimum. By means of the described strategy, each of the presented optimal control problems can be solved within a few hours of computing time on the finest discretization level.
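A Matlab sketch of this cascadic strategy might look as follows; solve_oct is a hypothetical routine that runs the conjugate gradient iteration on a given discretization, and the listed mode numbers and time steps are merely illustrative values.

% Solve the OCT problem on successively finer discretizations, using the
% interpolated control of the previous level as initial guess.
levels = { struct('J',[32 48 16],  'dt',2e-6), ...
           struct('J',[64 96 32],  'dt',1e-6), ...
           struct('J',[96 128 48], 'dt',5e-7) };
t_old = []; lambda = [];            % no warm start on the coarsest level
for l = 1:numel(levels)
    t_new = 0:levels{l}.dt:T;       % time grid of level l
    if l > 1
        lambda = interp1(t_old, lambda, t_new, 'spline');  % warm start
    end
    lambda = solve_oct(levels{l}.J, t_new, lambda);
    t_old  = t_new;
end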

A.3. Numerical solution of the 3D Bogoliubov–de Gennes equations

For the numerical treatment of (15), we proceed analogously to [67]: the change of variables $u=\frac{1}{2}({w}_{1}-{w}_{2})$ and $v=\frac{1}{2}({w}_{1}+{w}_{2})$ transforms the system into

$\left({H}_{0}-\mu +3g{\phi }^{2}\right){w}_{1}={\hslash }\omega {w}_{2},\qquad \left({H}_{0}-\mu +g{\phi }^{2}\right){w}_{2}={\hslash }\omega {w}_{1}.\qquad (27)$

A double application of the operators decouples the eigenvalue problem: applying $\left({H}_{0}-\mu +g{\phi }^{2}\right)$ to the first equation in (27) and inserting the second (and vice versa) yields

$\left({H}_{0}-\mu +g{\phi }^{2}\right)\left({H}_{0}-\mu +3g{\phi }^{2}\right){w}_{1}=\lambda {w}_{1},\qquad (28a)$

$\left({H}_{0}-\mu +3g{\phi }^{2}\right)\left({H}_{0}-\mu +g{\phi }^{2}\right){w}_{2}=\lambda {w}_{2},\qquad (28b)$

where $\lambda ={{\hslash }}^{2}{\omega }^{2}.$ Clearly, it suffices to solve the first eigenvalue problem (28a).

The eigenvalue problem ${{Aw}}_{1}=\lambda {w}_{1}$ given in (28a) can only be solved using numerical methods. To this end, the operator $A=({H}_{0}-\mu +g{\phi }^{2})({H}_{0}-\mu +3g{\phi }^{2})$ is discretized via a 6th-order symmetric finite difference formula. Clearly, ϕ and μ need to be determined in advance and with high precision. Here, we solve the GPE in imaginary time,

${\partial }_{\tau }\phi =-\frac{1}{{\hslash }}\left({H}_{0}+g{\phi }^{2}\right)\phi ,$

using the same 6th-order finite difference discretization along with the imaginary time-stepping algorithm (see above). In this context, the second-order time-splitting method is replaced by the classical Runge–Kutta method of order 4. Subsequently, the chemical potential can be computed using the identity

$\mu =\int {\phi }^{*}\left({H}_{0}+g{\phi }^{2}\right)\phi \,{\rm{d}}{\bf{r}},$

which holds since ϕ is normalized to one. Once ϕ and μ have been determined, we solve the discretized eigenvalue problem (28a).
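In discretized form, this identity amounts to a single quadrature. A minimal 1D Matlab sketch, assuming phi is the converged and normalized ground state, dx the grid spacing, and apply_H0 a hypothetical routine applying the discretized single-particle Hamiltonian:

% Chemical potential from the converged ground state.
H0phi = apply_H0(phi);                                    % H_0 * phi
mu = real(sum(conj(phi) .* (H0phi + g*abs(phi).^2.*phi)) * dx);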

Naturally, we consider the same computational domain $([-4,4]\times [-15,+15]\times [-2,2])\;\mu {{\rm{m}}}^{3}$ that was used in the splitting example in section 3.3. As in that example, we employ ${J}_{x}=96,{J}_{y}=128$ and ${J}_{z}=48$ grid points in the respective coordinate directions (on the finest discretization level). The resulting large-scale eigenvalue problem is then solved efficiently by means of an iterative algorithm. For this purpose we employ the Matlab function eigs, which determines only the most relevant eigenvalues and their corresponding eigenfunctions: the algorithm yields the eigenvalues closest to a specified shift σ, which we set to a value slightly larger than zero. (We are only interested in the first few non-trivial solutions of (28a), corresponding to the eigenvalues of smallest magnitude.) The underlying algorithm of eigs requires the repeated solution of the linear system of equations

$(A-\sigma I){\bf{x}}={\bf{b}}\qquad (29)$

for a given right-hand side ${\bf{b}}.$ We employ the biconjugate gradients stabilized method (bicgstab), which is implemented in Matlab as well. Note that $A-\sigma I$ is ill-conditioned, which is why the bicgstab routine needs to be called with a preconditioner $M={M}_{1}{M}_{2},$ i.e. equation (29) is effectively replaced by ${M}^{-1}(A-\sigma I){\bf{x}}={M}^{-1}{\bf{b}}.$ We found that the algorithm converges reasonably fast when the factors M1 and M2 are given by the matrices L and U obtained from a sparse incomplete LU-factorization. Such an approximate factorization of $A-\sigma I$ can be computed using another Matlab function called ilu. For further information about the Matlab functions mentioned above we refer to the Matlab documentation and the literature cited therein. The time needed to compute a few eigenvalue–eigenvector pairs of the Bogoliubov–de Gennes equations depends strongly on the numbers of grid points ${J}_{x},{J}_{y}$ and ${J}_{z}.$ For the numbers reported above, the whole computation takes on the order of five hours using the CPU mentioned above.
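Put together, the shift-invert solve can be sketched in Matlab as follows; A_sp is assumed to be the sparse matrix representing the discretized operator A, and the shift, tolerances and drop tolerance are illustrative values rather than the ones used in our computations.

% Shift-invert eigensolve for A*w = lambda*w via eigs/bicgstab/ilu.
n  = size(A_sp, 1);
As = A_sp - sigma*speye(n);                     % shifted operator A - sigma*I
[Lf, Uf] = ilu(As, struct('type','crout', 'droptol',1e-4));  % preconditioner
afun = @(b) bicgstab(As, b, 1e-8, 500, Lf, Uf); % returns x = (A - sigma*I)\b
[W, D] = eigs(afun, n, 6, sigma);               % 6 eigenvalues closest to sigma
omega  = sqrt(diag(D)) / hbar;                  % frequencies, lambda = hbar^2*omega^2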

A.4. Extracting the excitation from the time-evolution of the wave function

The small perturbation which causes the oscillation of the infidelity in the splitting example can be extracted directly from the time-evolution of the wave function ψ. To this end, we assume that $\psi ({\bf{r}},t)$ and ${\psi }_{{\rm{d}}}({\bf{r}})$ are almost identical for t = T, i.e.,

$\psi ({\bf{r}},T)\approx {\psi }_{{\rm{d}}}({\bf{r}}).$

This assumption is in good agreement with our observations, where the minimum value of the infidelity is reached at this point in time. In analogy to equation (13) we define the difference $\triangle \psi := \psi ({\bf{r}},t)-{\rm{\Phi }}({\bf{r}},t),$ which leads to the result

$\triangle \psi ({\bf{r}},t)=\psi ({\bf{r}},t)-{{\rm{e}}}^{-{\rm{i}}\mu (t-T)/{\hslash }}{\psi }_{{\rm{d}}}({\bf{r}}).$

Here, we have introduced an additional phase factor ${{\rm{e}}}^{{\rm{i}}\mu T/{\hslash }}$ in order to take into account that we consider the time-evolution of $\triangle \psi $ starting at t = T. A snapshot of the density ${| \triangle \psi ({\bf{r}},t)| }^{2}$ for t = 9 ms is shown in figure A2. The density distribution closely resembles that of the second excitation depicted in figure 8.

Figure A2. Density of $\triangle \psi ({\bf{r}},t)$ at t = 9 ms.


A.5. 1D approximation for the splitting of a BEC

We briefly discuss the 1D approximation for the splitting example. In this case, the reduced GPE for the x-direction is given by

${\rm{i}}{\hslash }{\partial }_{t}\psi (x,t)=\left(-\frac{{{\hslash }}^{2}}{2m}{\partial }_{x}^{2}+V(x,t)+{g}_{1{\rm{d}}}{| \psi | }^{2}\right)\psi (x,t),$

where the effective 1D interaction strength ${g}_{1{\rm{d}}}$ is found by integrating out the two transversal dimensions [68, 69],

${g}_{1{\rm{d}}}=g\int {| \tilde{\phi }(y,z)| }^{4}\,{\rm{d}}y\,{\rm{d}}z.\qquad (30)$

Here, $\tilde{\phi }(y,z):= \phi (0,y,z)$ corresponds to the normalized ground-state solution of the 3D model in the $(x\equiv 0)$-plane.

With this approximation we find ${g}_{1{\rm{d}}}\approx h\times 1300.44$ Hz μm for N = 2000 atoms. This value is representative for the whole x-axis and also leads to reasonable results away from the center of the cloud, as can be seen from figures A3(a) and (b). We then follow the same procedure as in the 3D case to find an optimal control trajectory for the Rabi frequency.
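The evaluation of equation (30) is straightforward once the 3D ground state is available. A minimal Matlab sketch, assuming phi3d is the ground-state array on the grid defined by the vectors x, y and z:

% Effective 1D coupling constant g1d from the 3D ground state, eq. (30).
[~, ix0] = min(abs(x));                       % index of the plane x = 0
phit = squeeze(phi3d(ix0, :, :));             % phi(0, y, z)
phit = phit / sqrt(trapz(z, trapz(y, abs(phit).^2, 1)));   % normalize in 2D
g1d  = g * trapz(z, trapz(y, abs(phit).^4, 1));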

Figure A3. Initial state (a) and desired state (b) of the splitting example along the transversal x-direction. The blue solid lines correspond to the eigenstates of the 1D approximation using the effective coupling constant ${g}_{1{\rm{d}}}.$ Dashed lines correspond to the eigenstates of the full 3D GPE evaluated along the x-direction for shifted values of y and z. Here, ${\phi }_{3{\rm{D}},1}(x)={\phi }_{3{\rm{D}}}(x,7.5\;\mu {\rm{m}},1\;\mu {\rm{m}}),$ ${\phi }_{3{\rm{D}},2}(x)={\phi }_{3{\rm{D}}}(x,7.5\;\mu {\rm{m}},0\;\mu {\rm{m}}),$ ${\phi }_{3{\rm{D}},3}(x)={\phi }_{3{\rm{D}}}(x,0\;\mu {\rm{m}},1\;\mu {\rm{m}})$ and ${\phi }_{3{\rm{D}},4}(x)={\phi }_{3{\rm{D}}}(x,0\;\mu {\rm{m}},0\;\mu {\rm{m}}).$ Each wave function has been normalized to unity. (c) Optimal control of the Rabi frequency corresponding to the 1D approximation. (d) Infidelity (1D–1D) corresponding to the one-dimensional model and infidelity (1D–3D) obtained when the same trajectory of the Rabi frequency is applied in a simulation using the 3D model.
