Supercriticality of the Dynamo Limits the Memory of the Polar Field to One Cycle

, , and

Published 2021 May 25 © 2021. The American Astronomical Society. All rights reserved.
, , Citation Pawan Kumar et al 2021 ApJ 913 65 DOI 10.3847/1538-4357/abf0a1

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/913/1/65

Abstract

The polar magnetic field precursor is considered to be the most robust and physics-based method for the prediction of the next solar cycle strength. However, to make a reliable prediction of a cycle, is the polar field at the solar minimum of the previous cycle enough or do we need the polar field of many previous cycles? To answer this question, we performed several simulations using Babcock–Leighton-type flux-transport dynamo models with a stochastically forced source for the poloidal field (α term). We show that when the dynamo is operating near the critical dynamo transition or only weakly supercritical, the polar field of cycle n determines the amplitude of the next several cycles (at least three). However, when the dynamo is substantially supercritical, this correlation of the polar field is reduced to one cycle. This change in the memory of the polar field from multiple to one cycle with the increase of the supercriticality of the dynamo is independent of the importance of various turbulent transport processes in the model. Our this conclusion contradicts the existing idea. We further show that when the dynamo operates near the critical transition, it produces frequent extended episodes of weaker activity, resembling the solar grand minima. The occurrence of grand minima is accompanied by the multicycle correlation of the polar field. The frequency of grand minima decreases with the increase of supercriticality of the dynamo.

Export citation and abstract BibTeX RIS

1. Introduction

The large-scale magnetic field of the Sun oscillates with varying amplitudes. This field is believed to be generated through a large-scale dynamo operating inside the convection zone (CZ; Moffatt 1978; Charbonneau 2014, 2020). This dynamo is driven by helical convection and differential rotation. In the αΩ dynamo scenario (which is applicable for the Sun), the magnetic field grows when the dynamo number

Equation (1)

exceeds a critical value, where η0 is the turbulent magnetic diffusivity, ΔΩ is angular velocity variation, and α0 is the measure of the α effect (Krause & Rädler 1980). In the original αΩ dynamo model, however, the helical convective α generates a poloidal field from the toroidal one. In recent years, it has been realized that the poloidal field in the Sun is primarily generated through the decay of tilted bipolar magnetic regions (BMRs), the so-called the Babcock–Leighton process (Babcock 1961; Leighton 1964; Dasi-Espuig et al. 2010; Kitchatinov & Olemskoy 2011; Muñoz-Jaramillo et al. 2013; Priyal et al. 2014).

With increasing age, stars spin down through magnetic braking (Skumanich 1972), thereby decreasing their ability to generate a magnetic field. Thus, there must be a critical age beyond which each star ceases to produce a large-scale magnetic field. Stellar observations indeed find pieces of evidence of a violation of the rotation–age relation of some stars (Reinhold & Gizon 2015). Further, there is an upper value in the rotation period for each spectral type (Rengarajan 1984; also see Metcalfe et al. 2016). These observations can be explained by the ceasing of the large-scale dynamo above a certain rotation period; see the discussion in Kitchatinov & Nepomnyashchikh (2017) and Cameron & Schüssler (2017). Thus, the solar dynamo is possibly operating not too far from the critical dynamo transition. Three-dimensional numerical simulations (Karak et al. 2015b) and nonlinear mean-field models (Kitchatinov & Olemskoy 2010) showed that a slightly subcritical dynamo is also possible when the initial condition for the magnetic field is strong. On the other hand, explaining the deviation of gyrochronology in the dynamo model, Kitchatinov & Nepomnyashchikh (2017) predicted that the solar dynamo is about 10% supercritical.

Besides understanding the physics of dynamo action, the prediction of the solar magnetic cycle has become increasingly important due to its effect on space and Earth's climate (Petrovay 2020). It has been realized that in the Babcock–Leighton dynamo, the prediction is possible if we have knowledge of the polar field at the preceding minimum (Schatten et al. 1978; Choudhuri et al. 2007). It is the polar field that is transported to the deep CZ where the shear induces a toroidal field for the following cycle (Charbonneau & Dikpati 2000; Jiang et al. 2007; Charbonneau & Barlet 2011). Yeates et al. (2008) showed that when diffusive transport dominates over the advective transport via meridional flow, the memory of the polar field is limited to one cycle. However, when the advective transport dominates, the memory can last for multiple cycles. Limited observations of polar faculae hint at a one-cycle memory (Muñoz-Jaramillo et al. 2013). If this result is true in the Sun, then this means that to make a reliable prediction, we must use a diffusion-dominated dynamo model for which we need knowledge of the polar field of the previous cycle.

The solar dynamo is nonlinear. As differential rotation does not change with the solar cycle (except for a tiny variation in the form of torsional oscillation), we expect that the Ω effect, i.e., poloidal → toroidal, is not heavily nonlinear. Some global convection simulations, however, show a considerable variation of differential rotation (e.g., Karak et al. 2015a; Käpylä et al. 2016). In contrast, the flux emergence and the Babcock–Leighton process i.e., toroidal → poloidal part, could be nonlinear. BMR tilt quenching (Karak & Miesch 2017; Lemerle & Charbonneau 2017; Jha et al. 2020), active region inflow (Martin-Belda & Cameron 2017), and the latitudinal variation of BMR, the so-called latitudinal quenching (Jiang 2020; Karak 2020), are possible candidates for the nonlinearity in the latter process. As the flux emergence, BMR tilt, and meridional flow involve some variation, the toroidal → poloidal part also involves some randomness. The randomness in this part tries to reduce the memory of the polar flux. In this study, we explore the importance of nonlinearity in the toroidal → poloidal part on the memory of the polar flux. Using stochastically forced kinematic Babcock–Leighton dynamo models, we show that in addition to the stochasticity, it is the nonlinearity in the toroidal → poloidal part that determines the memory of the polar flux. When the dynamo is operating in the highly supercritical regime, the correlation between the toroidal to the polar flux of the same cycle is lost, and the multiple-cycle memory of the polar flux is limited to one cycle only. This one-cycle memory is independent of the relative importance of diffusion or advection processes, which is a clear contradiction of Yeates et al. (2008).

2. Models

We use two types of Babcock–Leighton dynamo models, namely, the flux-transport and time-delay dynamo models.

2.1. Flux-transport Dynamos

In the flux-transport dynamo model (Charbonneau 2010; Karak et al. 2014), we solve the following equations for axisymmetric magnetic fields:

Equation (2)

Equation (3)

where A and B are the potential of the poloidal magnetic field ( B p ) and the toroidal magnetic field, respectively, such that ${{\boldsymbol{B}}}_{{\rm{t}}{\rm{o}}{\rm{t}}{\rm{a}}{\rm{l}}}={{\boldsymbol{B}}}_{p}+B{\hat{{\boldsymbol{e}}}}_{\phi }$, with ${{\boldsymbol{B}}}_{p}={\boldsymbol{\nabla }}\times A{\hat{{\boldsymbol{e}}}}_{{\boldsymbol{\phi }}}$, $s=r\sin \theta $ with θ being the colatitude, ${\boldsymbol{v}}={v}_{r}{\hat{{\boldsymbol{e}}}}_{{\boldsymbol{r}}}+{v}_{\theta }{\hat{{\boldsymbol{e}}}}_{\theta }$ is the meridional circulation, Ω is the angular velocity, ηp and ηt are the diffusivities of the poloidal and toroidal fields, respectively, and Sα is the parameter that captures the Babcock–Leighton process for the generation of the poloidal field from toroidal one. We use six models, namely Models I, II, III, IV, V, and VI.

2.1.1. Models I–II

For Models I and II, we use the local prescription of α, which was done in the Surya code (Nandy & Choudhuri 2002; Chatterjee et al. 2004; Choudhuri 2018). In these models, Sα = α B, where

Equation (4)

For the present study, we use the same model as given in Yeates et al. (2008). For Model I, we have used the parameters for the poloidal field diffusion: η2 = 1 × 1012 cm2 s−1 and η0 = 2 × 1012 cm2 s−1 and for the meridional circulation: v0 = 15 m s−1, while for Model II, we use the same diffusivities, but v0 = 26 m s−1. Yeates et al. (2008) call these two models the diffusion-dominated regime (their Run 1) and the advection-dominated regime (Run 2; see their Section 5.1).

2.1.2. Models III–VI

For Models III–VI, we use the nonlocal α prescription, as initially done in Dikpati & Charbonneau (1999), and followed by many authors; see Choudhuri et al. (2005) and Choudhuri & Hazra (2016) for a discussion on the different α prescriptions. Thus, in this model,

Equation (5)

where

Equation (6)

with γ = 30 and B0 = 4 × 104 G. Also, ηt = ηp = η, where

with ηRZ = 5 × 108 cm2 s−1 and ηsurf = 2 × 1012 cm2 s−1. Basically, we use the same model as given in the Reference Solution of Hotta & Yokoyama (2010) with only a change in the value of ηsurf. For Model III, we use η0 = 5 × 1010 cm2 s−1, while for Model IV, we use five times less diffusivity than in Model III. Model V is the same as Model III but the meridional circulation is switched off. In Model VI, η0 is varied, but α0 is fixed at 0.4 ms−1.

2.2. Time-delay Dynamo

Finally, we use a time-delay dynamo model that was developed in Wilmot-Smith et al. (2006); also see Hazra et al. (2014) for an application of this model. In this model, the equations for the toroidal and poloidal fields are truncated by removing the spatial dependences, and some time delays are introduced to mimic the finite times required to communicate the fields between the base of the convection zone (BCZ) to the surface through meridional flow and magnetic buoyancy. The following equations are solved in this model:

Equation (7)

Equation (8)

Here, T0 represents the time delay required for the generation of a toroidal field from the poloidal one through differential rotation, while T1 is the time delay involved in the production of a poloidal field from the toroidal field through the Babcock–Leighton process. ω and L are the contrast in differential rotation and length scale in the tachocline, respectively, τd is the diffusion timescale of the turbulent diffusion in the CZ, and α0 is the amplitude of the α (like the one in flux-transport dynamo models). We note that unlike in the previous delay dynamo model of Wilmot-Smith et al. (2006), here we have included the usual alpha quenching: $1/(1+{(B/{B}_{\mathrm{eq}})}^{2})$. The parameters for our study are as follows: T0 = 2, T1 = 0.5, w/L = −0.34, Beq = 1, α0 = 0.29, and τd = 15.

3. Results from Flux-transport Dynamo Models

We shall first present the results of our flux-transport dynamo models in detail.

3.1. Identifying Critical Dynamo Parameters

Let us identify the dynamo transitions in each model. As seen in Equation (1), the growth of the magnetic field is possible when D exceeds a critical value. In our models, this is possible either by increasing α0 or by decreasing η0. In Models I–V, we vary α0 while keeping other parameters the same. Figure 1 shows the mean ϕtor from these runs; ϕtor is computed over a layer r = 0.677R–0.726R and latitudinal extent 10°–45°, and normalized by the area and B0. In Model VI, we fix α0 at 0.4 m s−1 and decrease η0; Figure 1(c) shows this result. We find that the critical α0, defined as ${\alpha }_{0}^{\mathrm{crit}}$, for the dynamo transition in Models I, II, III, IV, and V are 6.2, 10, 0.38, 0.05, and 0.8 m s−1. In Model VI, where the critical η0, defined as ${\eta }_{0}^{\mathrm{crit}}$, is around 0.05 × 1012 cm2 s−1. As long as α0 is above these critical values (or η0 below ${\eta }_{0}^{\mathrm{crit}}$), the magnetic field shows a regular polarity reversal. Increasing α0 (or decreasing η0) makes the model more supercritical. In the supercritical regime, the magnetic field does not grow linearly because of the nonlinearity imposed in these kinematic dynamo models.

Figure 1.

Figure 1. (a) Variation of the toroidal flux ϕtor with the strength of the Babcock–Leighton α0 from Models I (solid) and II (dashed). The critical α0, ${\alpha }_{0}^{\mathrm{crit}}$ for Models I and II are 6.2 m s−1 and 10 m s−1. (b) Same as (a) but obtained from Model III and IV (shown in inset); note in these models ${\alpha }_{0}^{\mathrm{crit}}=0.38$ and 0.05 m s−1. (c) For Model VI, η0 is varied; note ${\eta }_{0}^{\mathrm{crit}}=0.05\times {10}^{12}$ cm2 s−1. (c) Inset: for Model V (no meridional circulation); ${\alpha }_{0}^{\mathrm{crit}}=0.8\,{\rm{m}}\,{{\rm{s}}}^{-1}$.

Standard image High-resolution image

Let us make a few comments on Figure 1. We observe that in Models III–V, the variation of magnetic field with α0 for ${\alpha }_{0}\gt {\alpha }_{0}^{\mathrm{crit}}$ is different from that in Models I and II. This difference is because of the different ways of including nonlinearity in these models. In Models I and II (Surya/local α), the nonlinearity is included in the magnetic buoyancy part in the following way. When the toroidal field in any latitude grid above the BCZ at intervals of time 8.8 × 105 s exceeds a certain value (B0 = 0.8 × 105 G), 50% of the flux at that grid is reduced and the same is deposited at a surface layer. This reduction of toroidal flux is the only nonlinearity in Models I and II. However, in Models III–VI, a nonlinearity of the form $1/[1+{(B/{B}_{0})}^{2}]$ is included in the Babcock–Leighton α term; see Equation (5).

3.2. Identifying the Correlation of the Polar Field

Now we include stochastic fluctuations in the Babcock–Leighton α to produce variable magnetic cycles at different regimes of the dynamo, i.e., at different dynamo parameters. To do so, in Equations (4) and (6), we replace α0 by α0(1 + σ(t, τcor)), where σ is the uniform random deviation within [−1, 1], and τcor is the time after which α0 is updated. The value of τcor is chosen in such a way that the ratio of cycle period to τcor remains the same. For reference, τcor = 2.3 yr and 1.5 yr, respectively, for Models I and II when α0 = 30 m s−1. Unless stated otherwise, in all stochastically forced simulations, we include 100% fluctuations. 1 Thus, the relative fluctuation level is kept the same. We perform simulations at different values of ${\hat{\alpha }}_{0}={\alpha }_{0}/{\alpha }_{0}^{\mathrm{crit}})$ (for Models I–V) and ${\hat{\eta }}_{0}={\eta }_{0}^{\mathrm{crit}}/{\eta }_{0}$ (for Model VI).

We realized when 100% fluctuations in α0 is added, Models I and II (Surya) tend to decay unless α0 is sufficiently above ${\alpha }_{0}^{\mathrm{crit}}$. Or in other words, ${\alpha }_{0}^{\mathrm{crit}}$ is increased in Models I and II when fluctuations are included. This does not happen in Models III–V (nonlocal α). This different behavior is due to the extensively different parameters and the treatment of magnetic buoyancy in these models. Therefore, with 100% fluctuations in α0, we obtain stable cycles in Models I and II only when ${\hat{\alpha }}_{0}\geqslant 2.0$. In Table 1, we present the correlations between the peaks of the polar flux and the peaks of the low-latitude toroidal flux at the base of CZ (ϕr is computed on the solar surface over the latitudinal extent 70°–89° and normalized by the area and B0).

Table 1. Correlation Coefficients between ϕr (n) and ϕtor of Different Cycles for Simulations at Different Values of ${\hat{\alpha }}_{0}$ and ${\hat{\eta }}_{0}$

Local α (Surya) Nonlocal α  Nonlocal α
  Model IModel II   Model IIIModel IVModel V  Model VI
${\hat{\alpha }}_{0}$ ϕr (n) & r (s.l.?) r (s.l.?)  ${\hat{\alpha }}_{0}$ ϕr (n) & r (s.l.?) r (s.l.?) r (s.l.?)  ${\hat{\eta }}_{0}$ ϕr (n) & r (s.l.?)
2.0 ϕtor (n)0.94 (Y)0.96 (Y) 1.0 ϕtor (n)0.92 (Y)0.74 (Y)0.86 (Y) 1.0 ϕtor (n)0.79 (Y)
  ϕtor (n+1)0.96 (Y)0.98 (Y)   ϕtor (n+1)0.99 (Y)0.99 (Y)0.94 (Y)   ϕtor (n+1)0.99 (Y)
  ϕtor (n+2)0.90 (Y)0.96 (Y)   ϕtor (n+2)0.91 (Y)0.78 (Y)0.77 (Y)   ϕtor (n+2)0.77 (Y)
  ϕtor (n+3)0.86 (Y)0.92 (Y)   ϕtor (n+3)0.89 (Y)0.68 (Y)0.60 (Y)   ϕtor (n+3)0.73 (Y)
2.5 ϕtor (n)0.71 (Y)0.69 (Y) 1.5 ϕtor (n)0.23 (Y)0.39 (Y)0.32 (Y) 1.5 ϕtor (n)0.50 (Y)
  ϕtor (n+1)0.80 (Y)0.87 (Y)   ϕtor (n+1)0.99 (Y)0.97 (Y)0.89 (Y)   ϕtor (n+1)0.99 (Y)
  ϕtor (n+2)0.54 (Y)0.79 (Y)   ϕtor (n+2)0.23 (Y)0.47 (Y)0.35 (Y)   ϕtor (n+2)0.48 (Y)
  ϕtor (n+3)0.47 (Y)0.59 (Y)   ϕtor (n+3)0.26 (Y)0.42 (Y)0.20 (Y)   ϕtor (n+3)0.49 (Y)
3.0 ϕtor (n)0.40 (Y)0.59 (Y) 2.0 ϕtor (n)−0.07 (N)0.19 (Y)0.18 (Y) 2.0 ϕtor (n)0.28 (Y)
  ϕtor (n+1)0.72 (Y)0.82 (Y)   ϕtor (n+1)0.99 (Y)0.97 (Y)0.89 (Y)   ϕtor (n+1)0.99 (Y)
  ϕtor (n+2)0.26 (Y)0.40 (Y)   ϕtor (n+2)−0.09 (N)0.29 (Y)0.25 (Y)   ϕtor (n+2)0.28 (Y)
  ϕtor (n+3)0.26 (Y)0.23 (Y)   ϕtor (n+3)0.18 (N)0.20 (Y)0.07 (N)   ϕtor (n+3)0.41 (Y)
4.0 ϕtor (n)0.27 (Y)0.37 (Y) 4.0 ϕtor (n)−0.39 (Y)−0.12 (N)0.02 (N) 4.0 ϕtor (n)0.03 (N)
  ϕtor (n+1)0.67 (Y)0.75 (Y)   ϕtor (n+1)0.99 (Y)0.95 (Y)0.88 (Y)   ϕtor (n+1)0.99 (Y)
  ϕtor (n+2)−0.01 (N)0.10 (N)   ϕtor (n+2)−0.40 (Y)0.00 (N)0.09 (N)   ϕtor (n+2)0.04 (N)
  ϕtor (n+3)0.10 (N)−0.02 (N)   ϕtor (n+3)0.25 (Y)0.05 (N)−0.01 (N)   ϕtor (n+3)0.33 (Y)
5.0 ϕtor (n)0.15 (N)0.31 (Y)         
  ϕtor (n+1)0.72 (Y)0.68 (Y)         
  ϕtor (n+2)−0.03 (N)0.03 (N)         
  ϕtor (n+3)0.10 (N)0.04 (N)         

Note. Here, ${\hat{\alpha }}_{0}={\alpha }_{0}/{\alpha }_{0}^{\mathrm{crit}}$ and ${\hat{\eta }}_{0}={\eta }_{0}^{\mathrm{crit}}/{\eta }_{0}$, where the superscript, "crit," represents the minimum (maximum) values of α0 (η0) needed for the dynamo action in each model; see Figure 1. In Models I–V, α0 is increased, while in Model VI, η0 is decreased in different runs. A total of 275 data (cycles) are used to obtain the correlations. The abbreviation Y or N represents whether the correlation is statistically significant or not based on the computation of the significance level (s.l. = (1 − p)100%). The threshold for significance is 95%. In all simulations, 100% fluctuations in α0 are included. However, due to statistical uncertainties in numerical realizations, the correlation coefficients are slightly different when runs are repeated with different realizations of random numbers.

Download table as:  ASCIITypeset image

We first consider the results from Models I and II, which are shown on the left columns of Table 1 and Figure 2. We observe that when ${\hat{\alpha }}_{0}$ is small, ϕr(n) correlates strongly with ϕtor of cycle n, n + 1, n + 2, and n + 3. These multiple-cycle correlations decrease with the increase of ${\hat{\alpha }}_{0}$. In the highly supercritical regime (large ${\hat{\alpha }}_{0}$), ϕr(n) strongly correlates with ϕtor of cycle n + 1 alone.

Figure 2.

Figure 2. Scatter plots between the peaks of the surface polar flux of cycle n, ϕr (n) with that of the low-latitude toroidal flux at the base of CZ, ϕtor of cycle (a) n, (b) n + 1, (c) n + 2, and (d) n + 3 from Models I (blue/circles) and II (red/asterisks). The top four and bottom four panels are obtained from runs at weakly supercritical (${\hat{\alpha }}_{0}=2$) and highly supercritical (${\hat{\alpha }}_{0}=4$) dynamos, respectively. For Model I data, a factor of 2 is multiplied to ϕr (n) in the top four panels, and in the bottom four panels, 2.8 and 1.04 are multiplied to ϕr (n) and ϕtor(n), respectively, for comparison.

Standard image High-resolution image

Let us now discuss the physics behind these correlations. Our discussion will be largely based on Jiang et al. (2007), Yeates et al. (2008), and Charbonneau & Barlet (2011). In the Babcock–Leighton models, the toroidal field gives rise to the poloidal field through the α effect (not the mean-field α but a simplified parameterization of the Babcock–Leighton process). In this process, there is nonlinearity and some randomness. The poloidal field generated in the low latitudes is first transported to high latitudes and then to the deep CZ through meridional circulation and diffusion. Finally, this poloidal field gives rise to the toroidal field for the following cycle through differential rotation. Thus, the dynamo chain can be written as follows

Equation (9)

Clearly, the correlation between ϕr(n) and ϕtor(n) is affected by randomness and nonlinearity. When the model operates near the critical α0, the nonlinearity in α is weak, and thus, the correlation between ϕr(n) and ϕtor(n) is determined only by the randomness included in the Babcock–Leighton α. The randomness that we have included in our model tries to reduce this correlation and thus the correlation between ϕr(n) and ϕtor(n) is not perfect even at the smallest α0 (${\hat{\alpha }}_{0}=2.0;$ Table 1). We emphasize that as long as α0 is not completely random, there is a significant correlation between ϕr(n) and ϕtor(n). This is what is seen in Table 1 for ${\hat{\alpha }}_{0}=2$ in Models I and II.

As the dynamo becomes supercritical, the dynamo growth rate and the nonlinearity increase. This nonlinearity, along with the randomness, spoils the linear relation in the chain: ϕtor(n) → ϕr(n). Hence, the correlation between ϕr(n) and ϕtor(n) reduces with the increase of α0. As seen in Table 1, simulations at ${\hat{\alpha }}_{0}\gt 3$ do not show this correlation strongly.

Next, the strong correlation between ϕr(n) and ϕtor(n + 1) in all the runs is easy to understand. As ϕr(n) → ϕtor(n + 1) involves only the Ω effect, which is deterministic in our model, the correlation between ϕr(n) and ϕtor(n + 1) must be strong in all runs. We note that this correlation holds in any model, as long as the poloidal field feeds back to the toroidal one through diffusion and/or advection (Charbonneau & Barlet 2011). This is what is seen in Table 1. There is a slight decrease of this n–(n + 1) correlation with the increase of α0. This is possibly due to the systematic decrease of the mean cycle period with the increase of α0; see Table 2. When the cycle period decreases, the transport time of the poloidal field effectively decreases. Thus, the less efficient transport of the poloidal field causes a decrease in the correlation between the polar flux and the next cycle toroidal flux.

Table 2. The Mean Cycle Period Tav (in Years) of the Simulations Presented in Table 1

Model:III  IIIIVV VI
 (Local α)  (Nonlocal α) (Nonlocal α)
   
${\hat{\alpha }}_{0}$ Tav Tav   ${\hat{\alpha }}_{0}$ Tav Tav Tav   ${\hat{\eta }}_{0}$ Tav
2.022.413.8 1.09.19.38.3 1.09.0
2.521.613.5 1.58.99.18.4 1.59.1
3.021.513.1 2.08.88.98.5 2.09.1
4.020.812.8 4.08.78.69.0 4.09.3
5.020.212.5        

Download table as:  ASCIITypeset image

Finally, the n–(n + 2) and n–(n + 3) correlations are linked to the part ϕtorϕr. If the nonlinearity is weak, then the correlations of ϕr(n) with ϕtor(n + 2) and ϕtor(n + 3) will be determined by the randomness in α and the efficiency of the magnetic field transport, and hence, there will be some correlation. As the dynamo is marginally supercritical at ${\hat{\alpha }}_{0}=2$, multiple-cycle correlations are seen in the top four panels of Figure 2. On the other hand, in the supercritical regime, these multiple-cycle correlations will disappear because the chain ϕtorϕr is spoiled by the nonlinearity and randomness. Thus, all n–(n + 2) and n–(n + 3) correlations are negligible at large ${\hat{\alpha }}_{0};$ see Table 1 and the bottom four panels of Figure 2.

In Figure 3, we observe that when the dynamo is near the critical value, it produces extended episodes of weaker activity, resembling the solar grand minima, and as the supercriticality increases, the frequency of grand minima decreases. In the highly supercritical regime, the model does not produce any grand minima even when the fluctuation level is increased (because the dynamo growth is large and the magnetic field quickly grows from the weaker value). During these grand-minimum phases, the model becomes very linear and causes multicycle correlations in the polar field. In the supercritical regime, as the model does not enter into any extended grand-minimum phase, the model remains nonlinear all the time and thus it cannot produce multicycle correlation. In fact, we have seen that if we exclude these grand-minimum phases from the data of the weakly supercritical regime (red-colored zones in Figure 3), then we observe that the multicycle correlation is reduced to one cycle. This supports our conclusion that when the dynamo is marginally supercritical (weakly nonlinear) and when the α0 is not completely random, there will be a good correlation between ϕtor(n) and ϕr(n). When this nn correlation holds, the dynamo chain will allow ϕr(n)–ϕtor(n + 2) and ϕr(n)–ϕtor(n + 3) correlations to persist.

Figure 3.

Figure 3. Time series of polar flux from Model I for (a) $\hat{\alpha }=2$ (weakly supercritical), (b) $\hat{\alpha }=2.5$, and (c) $\hat{\alpha }=4$ (highly supercritical). The red color highlights the extended weaker activity episodes.

Standard image High-resolution image

We recall that both simulations labeled diffusion-dominated and advection-dominated regimes in Yeates et al. (2008) were performed at α0 = 30 m s−1 (their Runs 1 and 2). They claimed that the one-cycle correlation found in diffusion-dominated simulation is due to the dominant contribution of the diffusion over the advection. We do not agree with this conclusion because if this is the case, then at smaller α0, the same correlations would also have been observed, which is not the case; see Table 1 and Figure 2. In reality, the supercriticality of their two models is different. As we have shown (Figure 1) that the critical α0 in their two models are 6.2 and 10 m s−1. Hence, α0 = 30 m s−1 correspond to ${\hat{\alpha }}_{0}=4.8387$ in Model I (their diffusion-dominated regime) and ${\hat{\alpha }}_{0}=3$ in Model II (their advection-dominated regime). Thus, what Yeates et al. (2008) call diffusion dominated is actually more highly supercritical than their advection-dominated model. We have seen in our study that with the increase of supercriticality, the model becomes more nonlinear (grand minima become very rare; see the Appendix and Figure 7), and the multiple-cycle correlations are reduced to one cycle.

Incidentally, Hazra et al. (2020) did not find multiple-cycle correlations even in their advection-dominated model (see insignificant correlation values for nn, n–(n + 2), and n–(n + 3) as given in their Tables 1 and 2, last columns)—a clear contradiction to Yeates et al. (2008). The reason for not finding multiple-cycle correlations in Hazra et al. (2020) is that they included a mean field α, in addition to the Babcock–Leighton source for the poloidal field and this additional α possibly made the model considerably supercritical. And, as we have shown that in the supercritical regime, even if the dynamo is advection dominated, the polar field memory is limited to only one cycle.

Karak & Nandy (2012) showed that when downward turbulent pumping in the poloidal field is included in the advection-dominated dynamo, the multiple-cycle correlation is reduced to one cycle. Actually, when pumping is included in the model, the dynamo becomes strong (because of the suppression of magnetic flux through the surface; see Cameron et al. 2012; Karak & Cameron 2016; Karak & Miesch 2018), and the dynamo transition happens at smaller α0. Thus, the same model that produced multiple-cycle correlations because of weak supercriticality becomes heavily supercritical with the inclusion of turbulent pumping. This is the hidden reason for a one-cycle correlation being displayed in the advection-dominated model with turbulent pumping.

We do not mean that the turbulent transports, namely, the turbulent diffusivity, pumping, and meridional flow, do not play any role in determining these correlations. The transport is necessary to connect the spatially segregated source regions of the poloidal and the toroidal fields. Through various transport processes, the polar field of a cycle (ϕr(n)) is strongly correlated with the amplitude of the next cycle's toroidal field (ϕtor(n + 1)). The generation of poloidal field and thus the connection between ϕtor(n) and ϕr(n) is also possible through turbulent transport. However, as we do not change any parameter of diffusion and meridional flow in each set of runs, our correlations are not affected by turbulent transport processes.

Now we examine the results from Models III and IV (Table 1). These models produce cycles even at ${\hat{\alpha }}_{0}=1$. This is possibly due to much smaller diffusivity and different α prescriptions. We observe that the n–(n + 1) cycle correlation is very strong in all the runs. For Models I and II, this was not the case, because the diffusivity of the poloidal field in the CZ was about 20 times higher than that in Models III and IV. This higher diffusion in Models I and II allowed some of the fluctuations in α to propagate to the same cycle toroidal flux. Despite these differences, we find similar behavior with the increase of ${\hat{\alpha }}_{0}$. That is, when the dynamo is only marginally supercritical (${\hat{\alpha }}_{0}$ near 1), multiple-cycle correlations exist. On the other hand, when the dynamo is supercritical, only a one-cycle correlation holds. This conclusion remains unaffected even when we increase the level of fluctuations. Table 3 shows the results at 150% and 200% fluctuation levels. If we increase the fluctuation levels too much, then the model tends to decay, particularly Models I and II in the weakly supercritical regime.

Table 3. Same as Table 1 but Only for Model IV and at Increased Fluctuation Levels

Nonlocal α
  150%200%
${\hat{\alpha }}_{0}$ ϕr (n) & r (s.l.?) r (s.l.?)
1.0 ϕtor (n)0.80 (Y)0.83 (Y)
  ϕtor (n+1)0.97 (Y)0.98 (Y)
  ϕtor (n+2)0.79 (Y)0.85 (Y)
  ϕtor (n+3)0.71 (Y)0.82 (Y)
1.5 ϕtor (n)0.40 (Y)0.35 (Y)
  ϕtor (n+1)0.97 (Y)0.96 (Y)
  ϕtor (n+2)0.48 (Y)0.42 (Y)
  ϕtor (n+3)0.43 (Y)0.37 (Y)
2.0 ϕtor (n)0.30 (Y)0.33 (Y)
  ϕtor (n+1)0.96 (Y)0.96 (Y)
  ϕtor (n+2)0.36 (Y)0.42 (Y)
  ϕtor (n+3)0.31 (Y)0.35 (Y)
4.0 ϕtor (n)0.03 (N)−0.08 (N)
  ϕtor (n+1)0.92 (Y)0.91 (Y)
  ϕtor (n+2)0.14 (N)0.07 (N)
  ϕtor (n+3)0.12 (N)0.18 (N)

Download table as:  ASCIITypeset image

We note that even the model without meridional flow, Model V, also shows the same behavior of the polar field with the increase of the supercriticality; see Table 1. This confirms that advection plays no role in setting the memory of the polar field beyond one cycle, which is in contradiction to Yeates et al. (2008).

Finally, we observe the results from Model VI. As noted earlier, the set of runs in Model VI, is the same as that in Model III, except α0 is fixed at 0.4 m s−1 and η0 is decreased (or ${\hat{\eta }}_{0}={\eta }_{0}^{\mathrm{crit}}/{\eta }_{0}$ is increased) in each run. When η0 is decreased, the model becomes less diffusion dominated. Interestingly, even when the model becomes less diffusive, we observe the shorter of memory of the polar flux. In Table 1, we observe that at smaller ${\hat{\eta }}_{0}$ (1 and 1.5), multiple-cycle correlations hold, while at larger ${\hat{\eta }}_{0}$, only n–(n + 1) correlation holds. The correlation plots for marginally supercritical (${\hat{\eta }}_{0}=1$) and highly supercritical (${\hat{\eta }}_{0}=4$) are shown in Figure 4.

Figure 4.

Figure 4. The format is the same as Figure 2 but obtained from Model V at α0 = 0.4 m s−1. The top four and bottom four panels are obtained from a weakly supercritical (${\hat{\eta }}_{0}=1$) and highly supercritical dynamos (${\hat{\eta }}_{0}=4$), respectively.

Standard image High-resolution image

4. Results from the Time-delay Dynamo Model

To demonstrate that the above conclusion is not limited to only the flux-transport dynamo models, but also generically to simplified Babcock–Leighton-type dynamo models, we present the results from a truncated low-order time-delay model. As discussed in Section 2.2, the (physically motivated) time delay between the poloidal and toroidal fields is captured by the parameters T0 and T1. In the same manner as we have done for the flux-transport dynamo models, we include stochastic fluctuations in the poloidal source; however, in this delay model, we first include 30% fluctuations in α0, appearing in Equation (8). We note that a previous study based on the same model (Hazra et al. 2014) also includes this amount of fluctuations in α0. The results for 45% and 100% fluctuations are shown in Figure 6. Again in this model, we find that, as we make the dynamo more and more supercritical by increasing α0 (Figures 5(a) and 6) or the diffusion time τd (Figure 5(b)), the correlations, nn, nn + 2, and nn + 3, are all diminished and only the nn + 1 correlation is retained.

Figure 5.

Figure 5. Results from the time-delay dynamo model: variations of the correlation coefficients of the peak poloidal fields of cycle n with the peak toroidal fields of cycle n (black filled circles), n + 1 (red triangles), n + 2 (blue squares), and n + 3 (green diamonds) with 30% fluctuations. In the left panel, τd is fixed at 15 and α0 is increased, while in the right panel, α0 is fixed at 0.29 and τd is increased.

Standard image High-resolution image
Figure 6.

Figure 6. Same as Figure 5(a) but at higher α fluctuations; dotted and dashed for 45% and 100% fluctuations, respectively.

Standard image High-resolution image

5. Conclusions and Discussions

By performing stochastically forced dynamo simulations at different parameter regimes and with different models, we identify the memory of the polar flux, explicitly, how the peak polar flux of cycle n, ϕr(n) is correlated with the toroidal flux of the same cycle (ϕr(n)) and subsequent cycles. We show that when the dynamo is barely above the transition or when the dynamo is not heavily supercritical, the nonlinearity in the Babcock–Leighton α is weak. In this case, the ϕtor(n) → ϕr(n) chain is only affected by the fluctuations in α, and as long as α is not completely stochastic, the polar flux is linearly correlated with the toroidal flux of the same cycle and many subsequent cycles. The dynamo in this weakly supercritical regime also produces occasional grand minima. On the other hand, when the dynamo is sufficiently above the critical dynamo transition, the nonlinearity and stochasticity in α spoil the linear correlation between ϕtor(n) and ϕr(n). This subsequently breaks all multiple-cycle correlations: ϕr(n) versus ϕtor(n), ϕr(n) versus ϕtor(n + 2) and ϕr(n) versus ϕtor(n + 3), and only the correlation ϕr(n) versus ϕtor(n + 1) survives.

In the flux-transport dynamo models, the polar flux is coupled to the toroidal flux of the following cycle through the meridional circulation and turbulent diffusivity. As long as this coupling is there, the polar flux is highly correlated with the following cycle toroidal flux; see Charbonneau & Barlet (2011) for a detailed study. This coupling is indeed observed in terms of the correlation between the surface polar flux and following cycle amplitude (Choudhuri et al. 2007; Wang & Sheeley 2009; Karak et al. 2018; Kitchatinov et al. 2018; Kumar et al. 2021). In our study, we have shown that when the dynamo is highly supercritical, the memory of the polar flux cannot be propagated to more than one cycle. This one-cycle memory is independent of the relative importance of the diffusion in the model. We do not agree with the explanation given in previous studies (Yeates et al. 2008; Karak & Nandy 2012; Hazra et al. 2020) which say that the multicycle memory of the polar field is determined by the relative importance of advection versus diffusion or turbulent pumping; see Section 3.2 and the Appendix for details.

Analyses of polar faculae counts (Muñoz-Jaramillo et al. 2013) show that the polar flux at the cycle minimum has no statistically significant correlation beyond one cycle. This, however, does not reflect the true feature of the solar cycle because of two reasons. First, the polar faculae data are too noisy and poorly binned. Hence, the weak multicycle correlation that might be present in the real Sun is not detected in these poor-quality data. Second, the observed polar faculae data are available only for 10 cycles (1907–2011) when the solar activity was relatively strong, and it does not cover any extended weak phase like the Maunder or Dalton minima. Basically, what we want to point out is that the observed data only represent a small subset of the whole solar cycle pattern. For example, if we take a small subset of the data presented in Figure 3, then we get a different result from what we get from the full data set. If we exclude the extended weaker field episodes like grand minima (marked by the red color), then the multicycle correlation is reduced to one cycle. When the model produces extended weak cycles, the nonlinearity becomes weak and the multicycle correlation of the polar field is unavoidable. As the Sun produces frequent grand minima and these are produced only when the dynamo is weakly supercritical, we can indirectly say that the solar dynamo is weakly supercritical. We need better and longer data of the polar field to confirm the multicycle correlation that is the outcome of a weakly supercritical dynamo. Due to the simplicity (e.g., ignoring the backreaction of the magnetic field on the flow and turbulent transport) and uncertainties of some parameters in our models, we, however, cannot provide the exact value of supercriticality of the solar dynamo from our simplified simulations. Thus, our prediction is in qualitative agreement with previous studies (e.g., Kitchatinov & Nepomnyashchikh 2017) that suggested that solar dynamo is weakly supercritical.

The authors sincerely thank Dibyendu Nandi, Kristof Petrovay, Arnab Rai Choudhuri, and Leonid Kitchatinov for reading this manuscript and providing valuable comments, and finding a few errors; all of these helped to improve the presentation of results. The authors also gratefully acknowledge the constructive comments and suggestions from an anonymous referee. Financial supports from the Department of Science and Technology (SERB/DST), India through the Ramanujan Fellowship (project No. SB/S2/RJN-017/2018) and ISRO/RESPOND (project No. SRO/RES/2/430/19-20) are acknowledged. V.V. acknowledges financial support from DST through the INSPIRE Fellowship. B.B.K. acknowledges the funding provided by the Alexander von Humboldt Foundation.

Appendix: Additional Material to Demonstrate the Issue in the Conclusion of Yeates et al. (2008)

Figure 7 demonstrates the time series of the polar flux of runs at α0 = 30 m s−1 in Models I (top) and II (bottom). We note that α0 = 30 m s−1 corresponds to $\hat{\alpha }=4.8$ and 3 in these two runs. Yeates et al. (2008) called these two runs diffusion and advection dominated. Their diffusion-dominated model (shown in the top panel of Figure 7) indeed has a larger supercriticality, while the advection-dominated model (lower panel) has a lower supercriticality, and thus it produces several grand-minimum-like weaker activity episodes. The model is close to a linear one when it is in these grand-minimum phases, and thus, it produces multicycle correlations. These multicycle correlations are not due to the dominant contribution of the advection over the diffusion in the model as suggested by Yeates et al. (2008).

Figure 7.

Figure 7. Time series of surface polar flux from Models I (top) and II (bottom) at α0 = 30 m s−1 (or $\hat{\alpha }=4.8$ and 3) which respectively correspond to the diffusion- and advection-dominated models of Yeates et al. (2008).

Standard image High-resolution image

Footnotes

  • 1  

    Yeates et al. (2008) called it 200% although they used the same level of fluctuations (private communication). We believe that it is just the convention of measurement of the percentage of fluctuation level.

Please wait… references are loading.
10.3847/1538-4357/abf0a1