Vortex detection in atomic Bose–Einstein condensates using neural networks trained on synthetic images

Myeonghyeon Kim; Junhwan Kwon; Tenzin Rabga; Y Shin

doi:10.1088/2632-2153/ad03ad

1. Introduction

Quantum vortices are topological defects in superfluid systems, characterized by phase singularities in the superfluid order parameter. These vortices exhibit quantized circulation of particles around them due to the integer phase winding, leading to a depleted particle density at their cores, reminiscent of the eye of a tornado. Quantum vortices play an important role in various superfluid transport phenomena, and extensive research has been conducted on their dynamic properties in different superfluid systems such as liquid helium [1], superconductors [2], and atomic Bose–Einstein condensates (BECs) [3]. Atomic BECs, in particular, offer several advantages in the study of quantum vortex dynamics, as they can be effectively described by the Gross–Pitaevskii equation (GPE) within the mean-field approximation, allowing direct comparisons between theoretical predictions and experimental observations [4]. Additionally, resonant imaging techniques allow for direct visualization of quantum vortices in atomic BECs during a time-of-flight (ToF) expansion, where the vortex core size becomes larger than the imaging device's resolution [5–7]. Due to these advantages, there has been active research on the properties of quantum vortices in atomic BECs [4, 8] as well as investigations of related phenomena such as rotating BECs [3, 9–12], vortex shedding [13–17], 2D superfluidity [18–21], turbulence [22–27], and spontaneous defect formation [28–31]. These studies have significantly advanced our understanding of the superfluid physics associated with these intriguing topological defects.

When imaging vortices in a BEC, the vortex cores must be identified through appropriate image analysis. One simple method is manual counting by a human observer; however, this can be exceedingly time-consuming when dealing with large datasets, especially in BEC samples containing numerous vortices. To address this issue, previous studies have developed several image processing algorithms for automated detection of density-depleted cores. In rotating BECs, because vortices form a lattice structure with distinct separation, Gaussian and Laplacian filters can readily locate local density minima, which coincide with single vortex cores [32]. However, in turbulent BECs, vortex distribution is irregular, and vortices may not be well-separated, leading to multiple vortices within a single density-depleted region. In such cases, the number of vortices can be estimated by measuring the area of the density-depleted regions from a binarized image of the density using a given threshold [16, 29]. However, accurately determining the locations of vortex cores remains a challenge even with this method.

Recent advances in artificial intelligence and neural network technology have enabled the use of machine learning algorithms in cold atom research. These algorithms have been used to optimize experimental parameters [33–36] and analyze experimental and numerical data [37–45], providing high accuracy and efficiency. Metz et al [45] recently demonstrated the successful application of a convolutional neural network (CNN) to detect vortices in numerically generated BEC images. This successful debut of the CNN algorithm in numerical studies of BECs encourages its use in real experiments. However, several challenges must be addressed when using the CNN algorithm to analyze experimental data. The actual experimental images have a lower resolution than the numerical images because of the limitations of the experimental setup. Additionally, during ToF expansion before imaging, BEC samples undergo changes that can lead to blurring of the vortex core density gradient and, in some cases, the production of density ripples around the core [7].

Another significant challenge is to obtain a sufficiently large and labeled dataset to train CNN. Machine learning algorithms often require a substantial amount of labeled data for effective training, but collecting such data in real-world experiments can be time-consuming and costly, which is particularly relevant in atomic BEC experiments. Recently, to tackle the issue of obtaining a costly labeled training set for machine learning algorithms, the use of synthetic data has been proposed [46–48]. This is especially attractive as the GPE can generate synthetic BEC images for CNN training with parameters that are similar to those of the experimental setup. Moreover, an automated algorithm can be used to accurately identify the ground truth of synthetic images, i.e. vortex positions, eliminating the need for manual labeling and providing efficiency and reduced bias.

In this paper, we present our work on the use of CNNs trained on synthetic images to detect vortices in experimental BEC images. To address the issue of low resolution in experimental images, we employ an upsampling process and evaluate the CNN's performance by varying the upsampling size. Additionally, we optimize the CNN's performance in recognizing vortices in the experimental BEC images by adjusting image variables in the training dataset, such as the noise strength and the criterion for defining the BEC boundary. Our results demonstrate that, with proper parameter tuning, CNNs trained on synthetic images can accurately and effectively detect vortices in experimental BEC images, providing a useful tool for analyzing quantum vortices in real experimental data.

The remainder of this paper is organized as follows. Section 2 provides a detailed description of the experimental setup used to generate BECs with vortices and the GPE simulation process to create synthetic BEC images that closely resemble the experimental conditions. Additionally, we explain the CNN training procedure using these simulated images. In section 3, we present the results of applying CNN to detect vortices in experimental images. We analyze the effect of tuning parameters on CNN's performance and discuss how these parameters affect the accuracy of vortex detection in the experimental data. Finally, in section 4, we summarize our findings and provide an outlook on the use of machine learning algorithms for vortex detection in the study of vortex dynamics in atomic BECs.

2. Methods

2.1. Experimental image data

We experimentally produce BECs with multiple vortices by rapidly cooling an atomic thermal gas, causing it to undergo a phase transition into the superfluid phase. During the quench process, the superfluid order parameter of the system develops independently in local areas, leading to the emergence of topological defects at the interfaces between these phase domains. This phenomenon is called the Kibble–Zurek mechanism [49, 50]. In the case of a superfluid system, these topological defects appear as quantum vortices.

The experiment begins with the preparation of a thermal gas of ⁸⁷Rb atoms in an optical dipole trap (ODT) with a highly oblate and elongated geometry, as described in [51]. The trapping potential of the ODT is expressed as

$\begin{equation} V(x,y,z) = \frac{1}{2}m\omega^2 R_x^2 \bigg[ ( \frac{x}{R_x} )^2 +( \frac{y}{R_y})^{3.9} + ( \frac{z}{R_z} )^2 \bigg], \end{equation} \tag{ 1 }$

where $\omega \approx 2\pi \times 7$ Hz is the trapping frequency for the x direction, m is the mass of a ⁸⁷Rb atom, and $\{R_x, R_y, R_z\} \approx \{65, 244, 2.8\}\,\mu\textrm{m}$ are the Thomas–Fermi radii of the final condensate. The ODT provides strong confinement along the z direction and this highly oblate geometry causes the vortex lines to be energetically aligned along the tightly confining z-axis.

We rapidly cool the thermal gas by decreasing the trap depth of the ODT from $1.15 U_c$ to $0.27 U_c$ , where U_c is the critical trap depth at which the sample starts to undergo a phase transition. During the quench, the temperature of the sample is regulated by the depth of the trap through evaporation. To facilitate the formation of quantum vortices, we keep the sample in the ODT for an additional 1.25 s after the quench. We then release the trapping potential to allow the BEC to expand. During this expansion, the vortex cores expand faster, becoming visible in the subsequent imaging. After a 40 ms ToF expansion, we perform absorption imaging of the BEC along the z-axis to capture the density distribution of the BEC, which allows us to detect the presence and spatial arrangement of quantum vortices. The cooling speed is controlled by the decrease time of the trap depth, which ranges from 0.8 s to 11 s, resulting in a variable vortex number in the BEC. For our fastest quench, the vortex number exceeds 60 [30, 31]. The typical number of atoms in the condensate is approximately $N = 9.6\times 10^6$ .

In figure 1(a), we display examples of experimental images in which the vortex cores are identified as localized areas of reduced density. The images are presented as 2D optical depth (OD) arrays and have been cropped to a size of 120 × 40 pixels, with each pixel being approximately 4.3 µm in size. This cropping ensures that the BEC sample is centered and enclosed within the image. To remove any outlier pixels that may have been caused by noise during the calculation of OD, we trim values that differ by more than four standard deviations from the surrounding values, replacing them with the average value of neighboring pixels. We then normalize the images so that the minimum pixel value is set to zero and the maximum is set to one, creating a consistent scale for analysis.

2.2. Generating synthetic training sets

The dynamics of weakly interacting atomic BECs can be effectively described by the GPE [4]. This equation takes the form

$\begin{equation} \mathrm i\hbar \frac{\partial}{\partial t} \psi(x,y,z,t) = \bigg[ -\frac{\hbar^2}{2m}\nabla^2 + V(x,y,z) + \frac{4\pi \hbar^2 N a_\mathrm s}{m} \psi ^2 \bigg] \psi(x,y,z,t), \end{equation} \tag{ 2 }$

where ψ is the condensate wave function and $a_\mathrm s$ is the scattering length of the atoms. The BEC dynamics in the z direction is highly suppressed due to tight confinement, so we simplify the equation by integrating out the z-dependent part of the wave function, resulting in an effective equation for a 2D complex wave function $\psi(x,y)$ . The numerical solution of the 2D GPE is obtained on a grid space of 1024 × 512 with a cell size of 0.54 µm, which has a much higher resolution than our experimental images. The healing length of the condensate in the experiment was 0.26 µm and such high resolution is necessary for the GPE simulation. After determining the complex wave function of the system from the GPE, we calculate the density profile of the BEC by taking the squared norm of the wave function $|\psi(x,y)|^2$ .

To create synthetic images of BECs with quantum vortices, we first calculate the ground state of the system using the imaginary-time evolution of the GPE [52, 53]. Then, we imprint phase singularities onto the condensate wave function. The phase singularity mask is given by $\theta(x,y) = \sum_{i = 1}^{N_v} s_i \arctan(x-x_i,y-y_i)$ , where $(x_i,y_i)$ is the position of the ith phase singularity and $s_i = \pm 1$ is the sign of phase winding randomly chosen. The number of singularities, N_v , imprinted on the condensate ranges from 0 to 100, and their locations are randomly distributed throughout the sample area. After phase imprinting, we evolve the condensate with the new phase profile for a short hold time using the real-time evolution of the GPE, during which phase singularities develop into vortices in the system. Then, to account for the effects of the ToF imaging process, where the density profile around the vortex cores are slightly altered, we allow the sample to expand by suddenly releasing the trapping potential. In the free expansion, we gradually reduce the interaction strength for the same ToF time to simulate the effect due to the fast expansion along the z direction [5, 6]. Finally, we create a density image through the squared norm of the wavefunction. Figure 1(b) shows typical simulated BEC images.

After obtaining a synthetic BEC image, we label it with the positions of the vortices in the simulated BEC sample. We know the initial positions of the phase singularities on the condensate, but the vortices may have shifted and dissipated during the evolution after the phase-imprinting sequence. To identify the vortex positions, we use a brute-force algorithm based on the phase information of the wave function [45]. The algorithm searches for all local density minima as potential vortices and calculates the integral of the phase gradient $\nabla (\textrm{arg}(\psi) )$ along the circumference of each candidate point. If the integral is non-zero, the point is identified as a vortex. To ensure that only the vortices within the BEC are accurately labeled, we define a sample area criterion using a cutoff density, denoted by ρ_cut, and exclude any singularity in the outer region of the sample with density $\rho \unicode{x2A7D} \rho_\textrm{cut}$ from the labeled vortex list. Figure 2(a) shows the change of the sample area enclosed by the contour ( $\rho = \rho_\textrm{cut}$ ) as the value of ρ_cut is adjusted. Here, the sample density is normalized so that the maximum density, $\textrm{max}( |\psi(x,y)|^2)$ , is set to one.

**Figure 2.** Tuning parameters for the training set of synthetic images. (a) Determining the effective sample area: The region where the BEC density exceeds a specified cutoff density ρ_cut is considered as the effective sample area. Images illustrate different boundaries of the BEC for various density cutoff values ρ_cut. Each image was normalized to have a maximum value of unity. Only phase singularities within this defined area are labeled as true vortices. (b) Introducing noise: to replicate realistic experimental conditions, Gaussian random values with a standard deviation σ_n were added to each pixel of the normalized simulated images. Images with different noise levels are shown. The images were normalized again after adding the noise.
Download figure:
Standard image High-resolution image

The last step in creating a synthetic training set is to pre-process the simulated images to make them look like the ones obtained in the experiment. Initially, the simulated images are cropped to 960 × 320 pixels and then downsized to 120 × 40 pixels, so that they have the same resolution and size as the experimental images. After that, Gaussian noise is added to each pixel of the image with a uniform noise strength across the image. The Gaussian noise has a zero mean and a standard deviation denoted as σ_n. By adjusting the tuning parameter σ_n, the noise level of the images can be changed, as shown in figure 2(b). Finally, the image is normalized to set the minimum and maximum pixel values to zero and one, respectively, completing the pre-processing sequence (figure 1(b)).

2.3. CNN training and evaluation

Our vortex detection algorithm is based on the CNN model proposed by Metz et al [45]. This CNN architecture consists of seven convolutional layers and three maxpool layers. The stride of the layers is 1, except for the first two maxpool layers, which have a stride of 2 to reduce the output size. The CNN takes greyscale density images as a single input channel, and it has three output channels, each reduced both in height and width by a factor of four compared to the input channel. Each 4 × 4 grid cell of the input, serving as the unit window for vortex detection, is coarsened into a set of three values for the output channels, which represent the probability of a vortex core being in the grid cell and the rescaled x and y positions of the vortex core within the grid cell, respectively. When the size of the input image is $\mathrm{H} \times \mathrm{W}$ , the resulting output tensor has dimensions of $\mathrm{H}/4 \times \mathrm{W}/4 \times 3$ , which are denoted as $\widetilde{Y}_{ijk}$ , where i and j are the position indices of a 4 × 4 grid cell, and k is the index of the output channels. For ground-truth data of the vortex positions, the array Y_ijk is created, where $Y_{ij1} = 1$ when a grid cell contains a vortex and $Y_{ij1} = 0$ when a grid cell contains no vortex. The loss function used for training the CNN is defined as follows:

$\begin{align} L = \sum_{\mathrm{batch}}\sum_{i,j} \Bigg[ -w_{1}Y_{ij1}\log(\widetilde{Y}_{ij1}) -(1-Y_{ij1})\log(1-\widetilde{Y}_{ij1}) + w_{2}Y_{ij1} ( {(Y_{ij2}-\widetilde{Y}_{ij2})}^2+{(Y_{ij3}-\widetilde{Y}_{ij3})}^2 ) \Bigg], \end{align} \tag{ 3 }$

where w₁ and w₂ are hyperparameters that control the weighting of each term. In this study, w₁ and w₂ are set to 10, as in [45].

We apply upsampling to the experimental and synthetic images to feed them to the CNN. Nearest-neighbor upsampling is used, where each pixel in the original image is repeated $n_\textrm{up}$ times in both the row and column directions without any interpolation. The original size of the images is 120 × 40 pixels, and after upsampling, they become $120n_\textrm{up}\times40n_\textrm{up}$ in size. The value of $n_\textrm{up}$ can be adjusted to change the resolution of the images, but it should be noted that the images with different $n_\textrm{up}$ values look the same because no interpolation is used during the upsampling process.

The CNN is trained on a dataset of 2000 synthetic images, with a batch size of 100. The training process is conducted for 300 epochs, with an initial learning rate of $\eta = 10^{-3}$ . After 300 epochs, the learning rate is decreased to 10⁻⁴. To ensure a fair comparison of the performance of CNNs trained with various parameter combinations, the early stopping method is used with a patience value of 50 epochs, without fixing the training epoch. This means that, while monitoring the loss function, if it does not show further reduction over the course of 50 epochs, the algorithm regards it as having reached its minimum and concludes the training process.

The performance of the CNN is evaluated by comparing the number of vortices identified by the machine to the number counted by a human observer. To quantify the discrepancy between the machine and human counts, we calculate the root mean squared error (RMSE) value that is given by

$\begin{equation} \mathrm{RMSE} = \bigg( \frac{\sum_{i}^{n}{[ N_{\mathrm{m}}(i)-N_{\mathrm{h}}(i) ]^2}}{n} \bigg)^{1/2}, \end{equation} \tag{ 4 }$

where $N_\textrm{m}(i)$ is the machine count for the ith image, $N_\textrm{h}(i)$ is the corresponding human count, and n = 350 is the number of experimental images used for the test. Out of the 350 experimental images, the distribution of $N_\textrm{h}$ is as follows: 56.2% have 0 to 20 vortices, 18.9% have 20 to 40 vortices, 20.6% have 40 to 60 vortices, and 4.3% have 60 to 80 vortices. The RMSE value provides an estimate of the average difference between the machine and human counts per image. The confidence threshold is set to 0.5 for the CNN's vortex detection.

3. Results and discussion

3.1. Parameter tuning of synthetic training sets

The performance of the machine for vortex detection is affected by the tuning parameters such as the upsampling size $n_\textrm{up}$ for imaging resolution, the cutoff density ρ_cut, and the noise strength σ_n in the formation of synthetic training sets. In this section, we explain how the machine performance is changed by the three parameters. We train various CNNs with different parameter settings and compare their RMSE values to determine the optimal parameter combination for accurate vortex detection.

We first investigate the effect of image resolution on machine performance by adjusting the upsampling size $n_\textrm{up}$ from one to five, while keeping $\sigma_\textrm{n} = 0.07$ and $\rho_\textrm{cut} = 0.01$ fixed. The results of the evaluation are shown in figure 3. As the size of the upsampling increases from $n_\textrm{up} = 1$ to 2, the machine performance improves significantly, reducing the RMSE value by a factor of about 3. Further increases in $n_\textrm{up}$ result in gradual improvements in detection performance, and for $n_\textrm{up}\gt3$ , the RMSE value converges to approximately 3. The scatter plots of figure 3(a) compare the vortex number counts $N_\textrm{m}$ and $N_\textrm{h}$ for various $n_\textrm{up}$ to show the details of the machine performance. When $n_\textrm{up} = 1$ , i.e. without upsampling, the machine exhibits poor performance, particularly for cases with a high vortex number, indicating that the image resolution is not sufficient to resolve vortices when clustered with high density (figure 3(b)). It is interesting to note that since the 4 × 4 grid cell is used as the unit window for single vortex detection in the CNN model, $n_\textrm{up} = 4$ corresponds to having a resolution of 1 pixel in the bare experimental image. This seems to explain the RMSE convergence for $n_\textrm{up} \unicode{x2A7E} 3$ . The results for various $n_\textrm{up}$ suggest that we can utilize the CNN detection algorithm effectively for low-resolution experimental images by simple upsampling without interpolation. The required upsampling size may be smaller in experimental settings with better resolution.

**Figure 3.** Performance of the CNN with varying upsampling size $n_\textrm{up}$ . (a) Root mean squared error (RMSE) values for different $n_\textrm{up}$ settings. $\sigma_\textrm{n} = 0.07$ and $\rho_\textrm{cut} = 0.01$ . The scatter plots in the upper and right sides compare the vortex number count $N_\textrm{m}$ obtained using the CNN and the vortex number count $N_\textrm{h}$ determined by a human observer, for different $n_\textrm{up}$ values. The dotted lines and gray regions in the plots indicate the $N_\textrm{m} = N_\textrm{h}$ line and $\pm 10\%$ of $N_\textrm{h}$ , respectively. (b) Detection results for an experimental image using CNNs trained on images upsampled at different $n_\textrm{up}$ sizes. Detected vortices are indicated by white dots.
Download figure:
Standard image High-resolution image

Next, we explore the effect of the parameters σ_n and ρ_cut on machine performance. We measure the RMSE value for a range of noise strength σ_n from 0.01 to 0.15, for different ρ_cut values and a fixed $n_\textrm{up} = 4$ . The results shown in figure 4(a), demonstrate a U-shaped dependence on the noise strength σ_n, with an optimal noise strength of around 0.07, regardless of the density cutoff value used. At $\sigma_\textrm{n} = 0.07$ and $\rho_\textrm{cut} = 0.01$ , the RMSE has a minimum value of approximately 3. Interestingly, a CNN trained with low-noise images tends to underestimate the number of vortices, although it often overestimates for images with a few vortices (figure 4(c)). This behavior appears counterintuitive because it is expected that, trained with low-noise images, the machine might interpret strong noises as vortices, leading to overcounting. When the machine is trained with high-noise images, it shows an underestimate of the vortex number for the entire experimental image test set, reflecting its low sensitivity to vortices, ignoring them as noises (figures 4(b) and (c)).

**Figure 4.** Performance of the CNN with varying noise strength σ_n and cutoff density ρ_cut. (a) RMSE values as a function of σ_n for different ρ_cut values. The RMSE has a minimum value of ${\approx}3$ at $\sigma_\textrm{n} = 0.07$ and $\rho_\textrm{cut} = 0.01$ , suggesting an optimal combination of noise strength and density cutoff for accurate vortex detection. (b) CNN detection results for $\sigma_\textrm{n} = 0.02, 0.07, 0.15$ with fixed $\rho_\textrm{cut} = 0.01$ , and (c) the corresponding scatter plots of $N_\textrm{h}$ and $N_\textrm{m}$ . (d) CNN detection results for $\rho_\textrm{cut} = 0.01, 0.15$ with fixed $\sigma_\textrm{n} = 0.08$ , and (e) the corresponding scatter plots. The CNN trained with images with high ρ_cut tends to miss vortices at the boundary region of the BEC sample. The dotted lines and the gray regions in (c) and (e) indicate the $N_\textrm{m} = N_\textrm{h}$ line and $\pm 10\%$ of $N_\textrm{h}$ , respectively, same as in figure 3(a). The upsampling size was fixed at $n_\textrm{up} = 4$ .
Download figure:
Standard image High-resolution image

**Figure 4.** Performance of the CNN with varying noise strength σ_n and cutoff density ρ_cut. (a) RMSE values as a function of σ_n for different ρ_cut values. The RMSE has a minimum value of ${\approx}3$ at $\sigma_\textrm{n} = 0.07$ and $\rho_\textrm{cut} = 0.01$ , suggesting an optimal combination of noise strength and density cutoff for accurate vortex detection. (b) CNN detection results for $\sigma_\textrm{n} = 0.02, 0.07, 0.15$ with fixed $\rho_\textrm{cut} = 0.01$ , and (c) the corresponding scatter plots of $N_\textrm{h}$ and $N_\textrm{m}$ . (d) CNN detection results for $\rho_\textrm{cut} = 0.01, 0.15$ with fixed $\sigma_\textrm{n} = 0.08$ , and (e) the corresponding scatter plots. The CNN trained with images with high ρ_cut tends to miss vortices at the boundary region of the BEC sample. The dotted lines and the gray regions in (c) and (e) indicate the $N_\textrm{m} = N_\textrm{h}$ line and $\pm 10\%$ of $N_\textrm{h}$ , respectively, same as in figure 3(a). The upsampling size was fixed at $n_\textrm{up} = 4$ .
Download figure:
Standard image High-resolution image

The sample-area criterion, represented by the cutoff density ρ_cut in the synthetic images, also has a significant impact on the CNN performance. The density cutoff ρ_cut determines how close to the sample boundary is a vortex labeled in the synthetic image data. For noise strengths of 0.03 or higher, the CNN's performance improves as the cutoff density cutoff decreases, even to 0.01 (figure 4(a)). As shown in figure 4(d), the CNN trained for higher ρ_cut tends not to count density dips near the sample boundary as vortices. It is worth noting that Metz et al [45] found that a CNN works efficiently with $\rho_\textrm{cut} = 0.15$ to analyze simulated BEC images. The optimal value of the cutoff density may differ under various task conditions.

3.2. Noise strength mixing method

The peculiar dependence of the machine performance on noise strength σ_n observed in the previous section motivates further investigation to find a more effective implementation of signal noise in the synthetic training dataset. Our current approach to creating a synthetic dataset involves adding Gaussian noise with a uniform strength across the entire image. However, it should be noted that noise in actual experimental images is the result of a complex combination of multiple factors, and its strength can vary spatially over the sample and fluctuate for different realizations. The primary source is atom-shot noise, which is caused by the quantized nature of atoms and leads to fluctuations in atom density. In the simplest scenario of a non-interacting ideal gas, the counting of atoms, N, within a specific volume follows a Poisson distribution, resulting in a variance of atom number $(\Delta N)^2 \sim \bar{N}$ with $\bar{N}$ being the mean value of N. Even when interactions among particles are taken into account, atom-shot noise remains a significant source of noise for atom image data [54, 55]. Additionally, the inherent Poisson statistics associated with photon counting in the imaging process introduce another unavoidable source of noise as well as other technical noises.

To gain insight into the noise characteristics of our experimental images, we measure the noise strength and its spatial variations over the image. For each pixel, we calculate the difference δ, between its value and the average value of its four nearest and four next-nearest neighboring pixels. We then estimate the noise strength σ_n in a region of interest by taking the standard deviation of δ in the region. Figures 5(a)–(c) show the probability distributions of δ obtained from three different regions in the image; a high-atom-density center region, a low-atom-density sample boundary region, and a background region without atoms, respectively. The noise distributions are found to be well modeled by Gaussian profiles, and the measured σ_n values increase with increasing atom density, which is in agreement with the predictions from atom-shot and photon-shot noises. In particular, the noise strength in the high-density center region is $\sigma_\textrm{n} = 0.073$ , close to the optimal noise strength of 0.07 for synthetic images, which explains the optimization. Additionally, figure 5(d) displays the histogram of the noise strength in the high-density region across all experimental images, showing that the noise strength fluctuates from one image to another, centered around 0.073.

**Figure 5.** Characteristics of noise in experimental images. (a)–(c) Normalized noise distributions in three regions of an image: (a) center region with high atom density, (b) sample boundary region with low atom density, and (c) background region without atoms. Noise δ was measured as the difference between the pixel's own value and the average value of its four nearest and four next-nearest neighboring pixel values. The noise strength σ_n was estimated by calculating the standard deviation of δ. Dashed lines represent Gaussian curves with standard deviations set to σ_n. The insets indicate the corresponding regions in the image. (d) Frequency counts of σ_n in the high-density center region for the 350 experimental images.
Download figure:
Standard image High-resolution image

The proposition that the best training dataset is one collected from real task situations suggests that it is desirable to create a synthetic image dataset that includes all the noise characteristics of experimental images. However, it is difficult to incorporate every noise factor into synthetic images through a comprehensive theoretical analysis. As an alternative, we construct a training set with synthetic images that have different noise strengths while maintaining uniform Gaussian noise. We train the CNN using a set of 2000 training images with varying noise levels, ranging from 0.00 to 0.07, where $n_\textrm{up} = 4$ and $\rho_\textrm{cut} = 0.01$ . The resulting RMSE value is 2.49, which is approximately 15% lower than the minimum RMSE obtained when using the optimally selected uniform noise strength $\sigma_\textrm{n} = 0.07$ (figure 6(a)). This improvement indicates that the non-uniform noise strength of the experimental images can be better addressed by the noise strength mixing method.

**Figure 6.** Performance of the CNN trained using a noise strength mixing method. (a) Scatter plot between $N_\textrm{h}$ and $N_\textrm{m}$ , where CNN was trained on synthetic images with a range of σ_n values from 0.00 to 0.07 and a fixed $\rho_\textrm{cut} = 0.01$ . The dotted line and the gray region indicate the $N_\textrm{m} = N_\textrm{h}$ line and $\pm 10\%$ of $N_\textrm{h}$ , respectively. (b) CNN detection results for four experimental image examples. (c) Examples of complex structures in turbulent BECs. The regions corresponding to these examples are marked by white boxes in (b). The vortices detected by CNN are indicated by white dots.
Download figure:
Standard image High-resolution image

Turbulent BECs contain a large number of vortices distributed in a disorganized manner (figure 6(b)) and they lead to experimental images with intricate patterns, such as linear chains of density dips created by multiple vortices and faint cracks at the sample boundaries that resemble vortices (figure 6(c)). This complexity makes it hard to accurately count the vortices, even for a human observer. Nevertheless, our vortex detection algorithm performs well, with an RMSE of ≈ 3, which is more than satisfactory for the purposes of this study.

4. Summary and outlook

We have presented an effective machine learning-based vortex detection algorithm for experimental images of atomic BECs. By training a CNN with synthetic images generated from the GPE, we achieved accurate and efficient vortex detection. We explored the effects of three key parameters in the synthetic training set, namely, the upsampling size, noise strength, and cutoff density, to optimize the CNN performance. We found that the low-resolution of experimental images can be overcome by the upsampling process, while a diverse training set with various noise strengths improved the algorithm's ability to distinguish real vortices from noise-induced artifacts. Additionally, adjusting the cutoff density enabled us to accurately label the vortices within the sample area, thereby refining the CNN's performance near the sample boundary.

Our vortex detection method has a number of advantages for the experimental study of vortex dynamics in atomic BECs. CNN's capability to accurately detect vortices in low-resolution experimental images, combined with its high speed, makes it possible to quickly process a large amount of image data, which could be beneficial for statistical investigations and correlation studies on turbulent BECs [56–60]. Furthermore, the synthetic training dataset can be easily adjusted to different experimental and imaging configurations, allowing for broad and rapid applications of the algorithm to various experimental studies. Finally, we anticipate that the algorithm can be extended to analyze time-resolved data and track vortex trajectories in dynamic BEC systems. The combination of accuracy, efficiency, and flexibility makes the algorithm a valuable tool for future studies of vortices and, possibly, other topological defects in BEC systems.

Acknowledgments

This work was supported by the National Research Foundation of Korea (Grant No. NRF-2023R1A2C3006565) and the Institute for Basic Science in Korea (Grant No. IBS-R009-D1).

Data availability statement

All data that support the findings of this study are included within the article (and any supplementary files).

Vortex detection in atomic Bose–Einstein condensates using neural networks trained on synthetic images

Article metrics

Submit

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

Peer review information

Abstract

1. Introduction