Estimate of the Mass and Radial Profile of the Orphan–Chenab Stream's Dwarf-galaxy Progenitor Using MilkyWay@home

Eric J. Mendelsohn; Heidi Jo Newberg; Siddhartha Shelton; Lawrence M. Widrow; Jeffery M. Thompson; Carl J. Grillmair

doi:10.3847/1538-4357/ac498a

1. Introduction

A few dozen dwarf galaxies are known to orbit around the Milky Way. Over the course of billions of years, these galaxies tidally disrupt and stretch around the Milky Way into tidal streams. The positions and velocities of the stars that make up these streams therefore carry information about the Galaxy's gravitational field. As such, these dwarf galaxies act as gravitational probes for determining the distribution of gravitating mass in the Milky Way (Ibata et al. 2001; Johnston et al. 2002; Koposov et al. 2010; Newberg et al. 2010; Law & Majewski 2016; Bonaca & Hogg 2018; Ibata et al. 2021). For example, the path of the stream probes the Galactic potential, the transverse motion of the stream probes the time dependence of the potential, and the width of the stream probes the internal structure of the original progenitor (Willett 2010). This idea of mapping the Galactic potential with stellar streams is not new. However, the focus has mostly been on using streams to constrain the gravitational potential of the Galaxy and hence the structure of its dark halo (Koposov et al. 2010; Bovy et al. 2016; Bonaca & Hogg 2018). However, it is clear that the streams are also affected by the internal structure of their progenitors. Here, we fix the potential and use one well-known halo stream to probe the internal structure of its progenitor.

Belokurov et al. (2006) and Grillmair (2006) independently discovered this stellar stream while examining the Sagittarius stream. Due to the lack of a visible progenitor, the stream was named the Orphan Stream. The southern portion of the stream was later named Chenab (Shipp et al. 2018) before it was discovered that both pieces of the stream resulted from the tidal disruption of the same dwarf galaxy. Therefore, we will refer to the stream as the Orphan‒Chenab Stream (OCS). Preliminary measurements of the OCS found that it contained old, metal-poor stars possibly from the merger of a satellite galaxy with the Milky Way. Belokurov et al. (2007) placed a lower bound of 10⁵ M_⊙ on the total mass of the OCS's missing progenitor by studying the interactions of the OCS with the high-velocity clouds known as Complex A, the dwarf galaxy Ursa Major II (UMa II), and other globular clusters (Segue I, Ruprecht 106, and Palomar 1). However, it was later shown in Sales et al. (2008) that the OCS was kinematically separate from both Complex A and UMa II. In 2010, Newberg et al. (2010) calculated the orbit of the OCS and showed that a progenitor with a total mass of 2.5 × 10⁶ M_⊙ could be used to fit the tidal stream rather well. This mass is roughly 100 times smaller than most measured ultrafaint Dwarf (UFD) galaxies, and other estimates of the OCS's progenitor suggest a total mass on the order of 10⁸ to 10⁹ M_⊙ (Hendel et al. 2018; Fardal et al. 2019).

The disparity in the mass estimates of the progenitor is interesting because the measured velocity dispersions of stars in ultrafaint galaxies lead many to conclude that dwarf spheroidal galaxies, including ultrafaint dwarf galaxies (UFDs), have 10⁷ M_⊙ enclosed within the central 300 pc, independent of the dwarf galaxy's luminosity (Mateo et al. 1993; Gilmore et al. 2007; Strigari et al. 2008). A mass of a few times 10⁶ M_⊙ is less than UFDs are believed to have. But most measurements of dwarf-galaxy masses are derived from velocity dispersions and the assumption that dwarf galaxies are in equilibrium.

While equilibrium is a reasonable assumption for dwarf spheroidal galaxies, it has been suggested that observations should be obtained to look for signs of tidal stretching as a check (Battaglia et al. 2013). UFDs in particular are susceptible to errors in measurement from velocity dispersion due to their small number of bright stars, complications in measuring velocities due to the presence of binary stars, and the possibility that tidal forces could make the assumption of equilibrium invalid. Martin et al. (2008) find that UFDs are elongated and suggest tidal disruption is the "least problematic" explanation. Objects that are nearly completely disrupted or close to apogalacticon could exhibit velocity dispersions that are systematically an order of magnitude or more higher than equilibrium values due to contamination from extratidal stars (Smith et al. 2013; Blaña et al. 2015). Depending on the angle of observation, enforcing dynamical equilibrium on a dwarf spheroidal galaxy undergoing tidal disruption can either overestimate the mass when measuring dispersion along its major axis or underestimate it when measuring on a perpendicular axis (Łokas et al. 2010). As an example, the kinematics of stars in Willman 1 is so far from Gaussian that one cannot even pretend that it is in dynamical equilibrium for the purpose of computing a mass-to-light ratio (Willman et al. 2011). In addition, Triangulum II, previously thought to be the most dark-matter-dominated galaxy known (Kirby et al. 2015), has been downgraded to possibly "a star cluster or tidally stripped dwarf galaxy" because the originally measured velocity dispersion was calculated including a star that is now known to be in a binary system (Kirby et al. 2017; Buttry et al. 2021). More recently, a new ultrafaint structure named DELVE 2 was discovered whose mass-to-light ratio is sensitive to whether or not the system is undergoing tidal disruption with the Large Magellanic Cloud (LMC) and, thus, whether it is classified as a globular cluster or a UFD (Cerny et al. 2021). Clearly, the mass-to-light ratios of UFDs, thought to have the highest mass-to-light ratios of any known objects, are uncertain.

Due to their status as the most dark-matter-dominated objects in the cosmos, UFDs are popular targets for dark matter indirect detection experiments. The lack of gamma-ray signals from the centers of ultrafaint galaxies is being used to place upper limits on the properties of the as-yet undetected dark matter particles (Abdallah et al. 2020). Null results from dark matter searches in dwarf galaxies with Fermi-LAT data have provided some of the strongest constraints on the dark matter annihilation cross section (Ackermann et al. 2015); more recently, very weak excesses have been found in three UFDs (Albert et al. 2017), but one is already found to be from a background source (Li et al. 2021). But indirect detection experiments rely heavily on the estimate of the amount of dark matter above the background that they are targeting to determine the detection limits. If we find that UFDs are less massive than previously assumed, the constraints these experiments put on dark matter particles would be modified.

Using the petaFLOPS-scale MilkyWay@home volunteer supercomputer, Shelton et al. (2021) showed that in a perfect world it would be possible to determine the mass and radial profile of both the stars and the dark matter in a dwarf-galaxy progenitor that fell into the Milky Way and was ripped apart into a tidal stream, using only the density distribution of stars in the tidal stream. In this context, a perfect world means that the Milky Way potential is known and does not change with time, the orbit of the progenitor in the potential is known, and the dwarf galaxy is known to consist of a Plummer sphere distribution of stars embedded in a Plummer sphere distribution of dark matter, created so that the combination is in stable equilibrium. In this case, only the mass and Plummer radius of the two dwarf-galaxy components and the evolution time of the simulation need to be fit.

This method of determining the progenitor satellite's properties from the tidal stream it produces does not rely on the assumption of dynamical equilibrium. We use MilkyWay@home to generate a large population of simulated dwarf galaxies with varying masses and shapes. Each dwarf galaxy is placed within a static Milky Way potential and evolved for a given amount of time to create a simulated tidal stream. This simulated tidal stream is then compared with the measured distribution of stars in the actual OCS and is assigned a likelihood that a dwarf galaxy with those simulated parameters produced the observed stream. We use differential evolution to evolve the dwarf-galaxy parameters until the generated tidal stream closely matches the stellar data.

Our goal is to use MilkyWay@home to measure the mass and shape of the stars and dark matter in the progenitor galaxy that was tidally disrupted to become the OCS. In Section 2, we describe the methods we implemented to obtain the stellar data used to constrain the OCS's progenitor. In Section 3, we outline the models and methods our N-body simulator uses to generate a tidal stream from a set of input progenitor parameters. Within the same section, we also define the likelihood score, the metric by which we quantitatively measure the similarity between the simulated tidal stream and the data. In Section 4, we explain the algorithm we use on MilkyWay@home to find the progenitor parameters that best fit the data. In Section 5, we report the raw findings of our optimizations, and in Section 6, we discuss the implications of our results.

2. Stellar Data of the OCS

To optimize our dwarf progenitor's parameters, we need accurate stream data to compare against our simulations. To this end, we extract the density of stars along the OCS as well as its width using actual data from the sky. We then parse it into a binned histogram that MilkyWay@home will compare with simulations. We use data from both the Sloan Digital Sky Survey (SDSS; Blanton et al. 2017) and the Dark Energy Camera (DEC; Flaugher et al. 2015) to map the OCS. We use the same Lambda-Beta (Λ, B) coordinate system defined in Newberg et al. (2010) to follow the stream across the sky. Because the OCS does not follow a great circle across the sky and does not maintain the same distance from us as a function of Λ, we apply corrections to the unextincted magnitude in the g band (g₀) and the B coordinate. The equations for these corrected values are as follows:

$\begin{eqnarray}&&{g}_{\mathrm{corr}}={g}_{0}-0.00022{{\rm{\Lambda }}}^{2}+0.034{\rm{\Lambda }},\end{eqnarray} \tag{ 1 }$

$\begin{eqnarray}{B}_{\mathrm{corr}}=\left\{\begin{array}{cl}B+0.00628{{\rm{\Lambda }}}^{2}+0.42{\rm{\Lambda }}+5.0 & {\rm{\Lambda }}\leqslant -15\buildrel{\circ}\over{.} 0\\ B & {\rm{\Lambda }}\gt -15\buildrel{\circ}\over{.} 0\end{array}\right.,\end{eqnarray} \tag{ 2 }$

as calculated in Newberg et al. (2010). To select F-turnoff stars at the same distance as the OCS, we filter stars such that 20.7 < g_corr < 21.7. Because we removed faint stars (keeping g₀ < 22.5) so that we do not need to consider a variable reduction in completeness (Newberg et al. 2002; Weiss et al. 2018), we miss fainter OCS stars at lower Λ_OCS, where the stream is farther away. We correct for the missing data when generating the binned histogram.

2.1. SDSS Data

From the SDSS DR16 release (Ahumada et al. 2020), we selected all stars with R.A. (α) between and 123 fdg 75 and 172 fdg 0 and decl. (δ) between −25 fdg 0 and 60 fdg 0. We define our on field to be the remaining stars within 2° of B_corr = 0° and our off field to be the stars with 2 fdg 0 < ∣B_corr∣ < 4 fdg 0. The difference between these two fields in a given Λ bin is defined to be the excess in that bin. To avoid overlap between our two data sets, we also apply a Λ cut to the SDSS data, keeping only stars with Λ < 21°.

We select F-turnoff stars with a ${\left(g-i\right)}_{0}$ color between 0.12 and 0.47. We select this color range to maximize the signal-to-noise ratio between the on-field and off-field stars (see Figure 1). We also filter stars such that 17.0 < g₀ < 22.5. To remove stars in front of or behind the OCS, we enforced a distance cut of 20.7 < g_corr < 21.7, which is the same cut applied in Newberg et al. (2010).

**Figure 1.** Distribution of stars based on ${\left(g-i\right)}_{0}$ color. Blue represents stars in the on field and green reflects the off field. The strongest signal can be detected by allowing stars with $0.12\lt {\left(g-i\right)}_{0}\lt 0.47$ , where we see the largest excess in on-field star counts. We represent the upper and lower bounds of this range with red lines in this plot. Only data from the SDSS are shown in this histogram.
Download figure:
Standard image High-resolution image

The errors in the counts are assumed to follow a Poisson distribution. Thus, the error of the counts in the ith Λ bin N_i, for both the on field and the off field, is simply given by

$\begin{eqnarray}&&{\sigma }_{{N}_{i}}=\sqrt{{N}_{i}}.\end{eqnarray} \tag{ 3 }$

To account for the incompleteness in our fields at Λ_OCS < −21°, we must calculate what fraction of each Λ bin is filled. Because the incompleteness results from the g₀ = 22.5 magnitude limit, we can use Newberg et al. (2010) to exactly determine the boundary g_b(Λ_OCS):

$\begin{eqnarray}&&{g}_{b}({{\rm{\Lambda }}}_{\mathrm{OCS}})={a}_{g}{{\rm{\Lambda }}}_{\mathrm{OCS}}^{2}+{b}_{g}{{\rm{\Lambda }}}_{\mathrm{OCS}}+{c}_{g},\end{eqnarray} \tag{ 4 }$

where a_g = −0.00022, b_g = 0.034, and c_g = 22.5. As can be seen in Figure 2, there are more F-turnoff stars at higher magnitudes in both the on field and the off field. We therefore correct for the missing stars by fitting a linear model to the star counts as a function of g_corr, as described in Appendix A.

2.2. DEC Data

The DEC data was taken from Grillmair et al. (2015). Because the DEC uses the same g-band and i-band filters as the SDSS, we applied the same color and magnitude cuts as in the SDSS data, keeping stars with 21° ≤ Λ ≤ 48°. We define our on field and off field in the same way as we did for the SDSS data. However, as can be seen in Figure 3, the DEC data do not completely fill our on and off fields.

**Figure 3.** Footprint of stellar data from the SDSS and DEC in Galactic longitude (l) and Galactic latitude (b). As can be seen in the DEC data (right), the data are incomplete as much of their off field (yellow) is unobserved. There are also places where the on field (blue) is incomplete. Our ${\left(g-i\right)}_{0}$ and g₀ cuts were used to generate this footprint. The pink line shows the orbit of the OCS as determined from Newberg et al. (2010).
Download figure:
Standard image High-resolution image

To correct our fields for the lack of data, we apply Monte Carlo approximations to "fill in" the missing patches in the sky. For the ith Λ bin in the on field, we randomly populate the bin with 8192 (2¹³) test stars. For each test star, we check whether the test star is within 0 fdg 18 of a real star in that bin. We selected 0 fdg 18 as our threshold because this was slightly larger than the maximum nearest-neighbor angular distance between two stars in the DEC data. We count the number of test stars within 0 fdg 18 of a real star, p_i, and divide it by the total number of test stars, M = 8192, to get the "filled" fraction of the ith bin, k_i:

$\begin{eqnarray}&&{k}_{i}=\displaystyle \frac{{p}_{i}}{M}.\end{eqnarray} \tag{ 5 }$

Assuming a Poisson distribution for p_i, we derive the following expression for the error in k_i:

$\begin{eqnarray}&&{\sigma }_{{k}_{i}}=\displaystyle \frac{\sqrt{{p}_{i}}}{M}.\end{eqnarray} \tag{ 6 }$

By dividing the number of counts in an on-field bin N_i by the ratio k_i, we can approximate the true number of stars in that bin:

$\begin{eqnarray}&&N{{\prime} }_{i}=\displaystyle \frac{{N}_{i}}{{k}_{i}}.\end{eqnarray} \tag{ 7 }$

This means that the errors in counts in the Λ bin are given by

$\begin{eqnarray}\begin{array}{rcl}{\sigma }_{N{{\prime} }_{i}} & = & \displaystyle \frac{1}{{k}_{i}}\sqrt{{\sigma }_{{N}_{i}}^{2}+{\left(\displaystyle \frac{{N}_{i}}{{k}_{i}}\right)}^{2}{\sigma }_{{k}_{i}}^{2}}\\ & = & \displaystyle \frac{1}{{k}_{i}}\sqrt{{N}_{i}\left(1+\displaystyle \frac{{N}_{i}}{{{Mk}}_{i}}\right)}.\end{array}\end{eqnarray} \tag{ 8 }$

We apply the same process to each bin in the off field as well.

2.3. Distribution of Stream Stars along Λ_OCS

After combining the stellar data from the SDSS with those from the DEC, we look at each bin and calculate the number of stars within the on field and off field. Using the off field as a background, we subtract it from the on field and determine the excess within the stream. The excess of the ith bin (E_i) and its error ( ${\sigma }_{{E}_{i}}$ ) are calculated using the following formulae:

$\begin{eqnarray}{E}_{i}=\left\{\begin{array}{cl}{N}_{\mathrm{on},i}-{N}_{\mathrm{off},i} & {N}_{\mathrm{on},i}\geqslant {N}_{\mathrm{off},i}\\ 0 & {N}_{\mathrm{on},i}\lt {N}_{\mathrm{off},i}\end{array}\right.\end{eqnarray} \tag{ 9 }$

$\begin{eqnarray}&&{\sigma }_{{E}_{i}}=\sqrt{{\sigma }_{{N}_{\mathrm{on},i}}^{2}+{\sigma }_{{N}_{\mathrm{off},i}}^{2}}.\end{eqnarray} \tag{ 10 }$

The excess and errors of each bin are listed in Table 1 along with the number of stars in each on-field and off-field bin. We also present a histogram representing this data in Figure 4. Using our cuts, we find an excess of 5631 ± 356 F-turnoff stars in the OCS within the range of −33° ≤ Λ ≤ 48°. For the purposes of our likelihood calculation, which we will describe later, we also calculate the normalized excess star count (e_i) and its error $\left({\sigma }_{{e}_{i}}\right)$ :

$\begin{eqnarray}&&{e}_{i}=\displaystyle \frac{{E}_{i}}{{\sum }_{j}{E}_{j}}\end{eqnarray} \tag{ 11 }$

$\begin{eqnarray}&&{\sigma }_{{e}_{i}}=\displaystyle \frac{1}{{\sum }_{j}{E}_{j}}\sqrt{(1-2{e}_{i}){\sigma }_{{E}_{i}}^{2}+{e}_{i}^{2}\sum _{j}{\sigma }_{{E}_{j}}^{2}}\end{eqnarray} \tag{ 12 }$

**Figure 4.** The on field (red), off field (green), and excess (blue) of the OCS. Error bars of the excess are shown by the blue brackets.
Download figure:
Standard image High-resolution image

Table 1. The Number of F-turnoff Stars in the On Field and Off Field as a Function of Λ

Λ Center	On Field	Off Field	Excess (E)	Error (σ_E)
−31.5	1314	1136	178	40
−28.5	1315	1144	171	44
−25.5	1283	1178	105	46
−22.5	1335	1271	64	49
−19.5	1412	1262	150	52
−16.5	1542	1395	147	54
−13.5	1774	1530	244	57
−10.5	1891	1676	215	60
−7.5	1990	1893	97	62
−4.5	2286	2173	113	67
−1.5	2722	2596	126	73
1.5	2737	2567	170	73
4.5	2329	2201	128	67
7.5	1989	1822	167	62
10.5	1933	1927	6	62
13.5	1968	1821	147	62
16.5	2006	1850	156	62
19.5	2152	2004	148	64

22.5	2480	2294	186	69
25.5	2646	2269	337	70
28.5	2546	2189	357	69
31.5	2686	2356	330	71
34.5	2668	2134	534	69
37.5	2611	2163	448	69
40.5	2817	2445	372	73
43.5	2637	2491	146	72
46.5	2953	2597	356	74

Notes. We notice a small gap in the OCS at Λ = 10 fdg 5; however, our optimizations will not be able to properly resolve the gap due to the large error of its associated bin. We split this table into the SDSS data (top) and the DEC data (bottom).

Download table as: ASCII Typeset image

2.4. Estimation of Total Stellar Mass

The simulations include bodies that represent both baryons (stars) and dark matter. The baryonic simulation bodies represent all of the stellar mass, not just the mass of the turnoff stars. Therefore, to constrain the mass of the OCS, we must relate the number of F-turnoff stars in the sky to an amount of baryonic mass. To accomplish this, we search for a globular cluster whose CMD properties most closely match those of the OCS. Once we find a globular cluster with a turnoff star color that is close enough to that of the OCS, we use that globular cluster to estimate the stellar mass per F-turnoff star in the OCS. Simply multiplying this ratio by the number of F-turnoff stars we detect in the OCS will give us an approximate stellar mass. For our analysis, we look at 11 candidate globular clusters using the globular cluster data from An et al. (2008).

It should be noted that because An et al. (2008) use the dust map from Schlegel et al. (1998) to calculate their unextincted magnitudes and our stellar data calculates extinctions using Schlafly & Finkbeiner (2011), we need to correct the magnitudes from our globular clusters to adjust for overreddening. Before we perform our globular cluster analysis, we add the extinction calculated from Schlegel et al. (1998) to each star's magnitude in An et al. (2008) and then subtract the extinction derived from Schlafly & Finkbeiner (2011).

When selecting globular cluster stars for our F-turnoff color calculation, we must search over a defined magnitude range for each cluster. To select this range, we plot a color–magnitude diagram (CMD) in ${\left(g-i\right)}_{0}$ and g₀. We set the minimum (brightest) turnoff magnitude at the intersection of the subgiants with the red giant branch. We record the ${\left(g-i\right)}_{0}$ color of the intersection and determine the brightest magnitude for main-sequence stars of that color, defining that g₀ as the maximum turnoff magnitude. The ranges are listed Table 2, and the CMDs of each candidate can be seen in Figure 5.

**Figure 5.** CMDs of our 11 candidate globular clusters for our OCS color proxy. We plotted these data using TOPCAT (Taylor 2005) and globular cluster data from An et al. (2008). The red lines show the g₀ boundaries of our turnoff range, and the blue line represents the peak ${\left(g-i\right)}_{0}$ color.
Download figure:
Standard image High-resolution image

Table 2. Color Peaks and the Magnitude Range for Each of Our 11 Globular Clusters

Globular Cluster	g₀ F-turnoff Range	${\left(g-r\right)}_{0}$	${\left(g-i\right)}_{0}$
M2	19.0–21.0	0.351 ± 0.015	0.457 ± 0.028
M3	18.5–20.5	0.309 ± 0.022	0.382 ± 0.047
M5	18.0–20.0	0.326 ± 0.024	0.456 ± 0.051
M13	18.0–20.0	0.318 ± 0.035	0.425 ± 0.056
M15	18.75–20.75	0.338 ± 0.020	0.483 ± 0.033
M53	19.75–21.5	0.264 ± 0.034	0.351 ± 0.042
M92	18.0–20.0	0.262 ± 0.027	0.359 ± 0.043
NGC 4147	20.0–21.5	0.267 ± 0.038	0.353 ± 0.050
NGC 5053	19.5–21.5	0.241 ± 0.025	0.326 ± 0.041
NGC 5466	19.25–21.25	0.283 ± 0.024	0.351 ± 0.029
Palomar 5	20.5–22.25	0.361 ± 0.040	0.505 ± 0.053

Download table as: ASCII Typeset image

Using these stars, we generate two one-dimensional histograms with bins in ${\left(g-r\right)}_{0}$ and ${\left(g-i\right)}_{0}$ color, respectively. We then fit the histogram to a skew-normal distribution with a linear background. The model has the form

$\begin{eqnarray}&&f(x)={mx}+b+{{Ae}}^{-\tfrac{{\left(x-\xi \right)}^{2}}{2{\omega }^{2}}}\left(1+\mathrm{erf}\left(\gamma \left(\displaystyle \frac{x-\xi }{\omega \sqrt{2}}\right)\right)\right),\end{eqnarray} \tag{ 13 }$

where m is the linear slope, b is the linear y intercept, A is the amplitude, ξ is the "unskewed" mean, ω is the "unskewed" standard deviation, and γ is the skew parameter. In our fits, we assume that γ ≥ 0 because we expect the number of stars to drop off more slowly toward the redder end of the distribution. We fit these six parameters using a χ² best-fit method and a differential evolution algorithm. We run the algorithm 10 independent times and keep the best fit to ensure our χ² fit does not fall into a local minimum. After fitting the six parameters, we calculate the Hessian of the natural logarithm of the χ² surface around that minimum. By inverting the Hessian, we derive the covariance matrix of the fitted parameters:

$\begin{eqnarray}\begin{array}{rcl}{\left(H\mathrm{ln}{\chi }^{2}\right)}^{-1} & = & {\left(\begin{array}{ccc}\displaystyle \frac{{\partial }^{2}\mathrm{ln}{\chi }^{2}}{\partial {{x}_{1}}^{2}} & \displaystyle \frac{{\partial }^{2}\mathrm{ln}{\chi }^{2}}{\partial {x}_{1}\partial {x}_{2}} & \cdots \\ \displaystyle \frac{{\partial }^{2}\mathrm{ln}{\chi }^{2}}{\partial {x}_{2}\partial {x}_{1}} & \displaystyle \frac{{\partial }^{2}\mathrm{ln}{\chi }^{2}}{\partial {{x}_{2}}^{2}} & \cdots \\ \vdots & \vdots & \ddots \end{array}\right)}^{-1}\\ & = & \left(\begin{array}{ccc}{{\sigma }_{{x}_{1}}}^{2} & {\sigma }_{{x}_{1}{x}_{2}}^{2} & \cdots \\ {\sigma }_{{x}_{2}{x}_{1}}^{2} & {{\sigma }_{{x}_{2}}}^{2} & \cdots \\ \vdots & \vdots & \ddots \end{array}\right)\end{array}\end{eqnarray} \tag{ 14 }$

Each second derivative is calculated numerically using a finite step size, where each ith parameter is assigned a step size h_i. We approximate each second derivative using the following formula:

$\begin{eqnarray}\displaystyle \frac{{\partial }^{2}f}{\partial {x}_{i}\partial {x}_{j}}\simeq \left\{\begin{array}{cl}\displaystyle \frac{f({x}_{i}-{h}_{i},{x}_{j}-{h}_{j})-f({x}_{i}-{h}_{i},{x}_{j}+{h}_{j})-f({x}_{i}+{h}_{i},{x}_{j}-{h}_{j})+f({x}_{i}+{h}_{i},{x}_{j}+{h}_{j})}{4{h}_{i}{h}_{j}} & i\ne j\\ \displaystyle \frac{f({x}_{i}-{h}_{i})-2f({x}_{i})+f({x}_{i}+{h}_{i})}{{{h}_{i}}^{2}} & i=j.\end{array}\right.\end{eqnarray} \tag{ 15 }$

After calculating the errors of each parameter, we take these errors and set them as the new step size for each of their respective parameters. We then calculate the Hessian again, taking those errors and feeding them back into the Hessian calculation over and over until the errors converge within a significantly small enough threshold (0.0001).

Taking our six fitted parameters and their errors, we calculate the mode of the fitted distribution. While skewed normal distributions do not have an analytical solution for the mode Mo, there does exist a numerical approximation for it while γ ≥ 0:

$\begin{eqnarray}&&{Mo}\simeq \xi +\omega \left(\zeta \sqrt{\displaystyle \frac{2}{\pi }}-\left(\displaystyle \frac{4-\pi }{4}\right)\displaystyle \frac{{\left(\zeta \sqrt{\tfrac{2}{\pi }}\right)}^{3}}{1-\tfrac{2}{\pi }{\zeta }^{2}}-\displaystyle \frac{1}{2}{e}^{-\tfrac{2\pi }{\gamma }}\right),\end{eqnarray} \tag{ 16 }$

where

$\begin{eqnarray}&&\zeta =\displaystyle \frac{\gamma }{\sqrt{1+{\gamma }^{2}}}.\end{eqnarray} \tag{ 17 }$

Running our SDSS OCS data through this pipeline, we find that the excess of the OCS has a peak ${\left(g-r\right)}_{0}$ color of 0.214 ± 0.066 and a ${\left(g-i\right)}_{0}$ color of 0.286 ± 0.083. We similarly run our 11 candidate globular clusters through our algorithm to determine which has the closest color peak. Table 2 lists the color peaks for each of these globular clusters.

The globular cluster whose peak F-turnoff star color is most similar to that of the OCS is NGC 5053.

After determining which globular cluster to use, we count the total number of F-turnoff stars we see in NGC 5053 from the photometric data provided by An et al. (2008), applying the same color cuts we employed in our stream separation and selecting stars with $0.12\lt {\left(g-i\right)}_{0}\lt 0.47$ and 19.5 < g₀ <21.5 as our turnoff stars. However, because globular clusters have a high stellar density in the sky, it is difficult to resolve all the stars within the center of the cluster. Although An et al. (2008) use a DAOPHOT pipeline that is specifically designed for crowded-field stellar photometry (Stetson 1987) to parse the SDSS data, we find that there are still stars that are not resolved. To correct for the missing stars, we fit the radial distribution of NGC 5053 to that of a Plummer sphere. The 2D surface number density of a Plummer sphere is given by

$\begin{eqnarray}&&{\rm{\Sigma }}(R)=\displaystyle \frac{{N}_{\mathrm{total}}{a}^{2}}{\pi {\left({R}^{2}+{a}^{2}\right)}^{2}},\end{eqnarray} \tag{ 18 }$

where the number of stars ΔN within ΔR of radius R from the center of the cluster is represented by the formula:

$\begin{eqnarray}&&{\rm{\Delta }}N=2\pi R{\rm{\Sigma }}(R){\rm{\Delta }}R.\end{eqnarray} \tag{ 19 }$

We fit the total number of F-turnoff stars (N_total) and the angular scale radius (a) to NGC 5053 using the method of least squares, setting the radius R for a given star as the angular distance between that star and the center of the cluster, which we find to be at (α, δ) = (199 fdg 107, 17 fdg 6927). To ensure our fit is not heavily impacted by the overcrowding near the center of the globular cluster, we exclude all stars within the central core by requiring R > θ_1/2 ∼ 0 fdg 035, where θ_1/2 is the angular distance from the globular cluster's center where the stellar density drops to half of its maximum value. Figure 6 shows the radial distribution of F-turnoff stars in NGC 5053 and the fitted Plummer curve.

**Figure 6.** Radial distribution of stars in NGC 5053 that passed our color cuts. The stellar densities (red) in the outer radii of the globular cluster fit relatively well to a Plummer distribution (black). Only data points to the right of the dotted blue line contributed to the fit. We notice a sizable number of F-turnoff stars missing in the core due to high stellar density in the center.
Download figure:
Standard image High-resolution image

Correcting for missing stars near the center of the cluster, we find a total of 3932 ± 110 F-turnoff stars within NGC 5053. Taking the mass of NGC 5053 to be (5.37 ± 1.32) × 10⁴ M_⊙ (Baumgardt 2017), we calculate that each turnoff star represents a stellar mass of roughly (13.7 ± 3.4)M_⊙. A calculation of the mass per turnoff star using isochrones, which finds a similar value, is done in Appendix B. Multiplying this number by the 5631 ± 356 F-turnoff stars we calculate in the OCS gives us a total baryonic mass of (7.7 ± 2.0) × 10⁴ M_⊙ within the range −33 fdg 0 < Λ < 48 fdg 0.

2.4.1. Systematic Error from Using a Redder Globular Cluster

While the turnoff color of NGC 5053 matches that of the OCS within errors, the globular cluster does have a somewhat redder measure of turnoff color. To illustrate how a redder color would impact our estimate of the mass per turnoff star, we recalculate this quantity using all of our other globular clusters as the reference. Table 3 below shows the mass per F-turnoff star we find if we use each of these redder globular clusters as a reference.

Table 3. Mass per F-Turnoff Star Using Different Globular Clusters as a Reference

Globular Cluster	${\left(g-r\right)}_{0}$	${\left(g-i\right)}_{0}$	Total Mass (M_⊙)	Num. F Stars	Mass/Turnoff Star (M_⊙)
M2	0.351 ± 0.015	0.457 ± 0.028	(5.75 ± 2.65) × 10⁵	16,701 ± 1687	34.4 ± 15.9
M3	0.309 ± 0.022	0.382 ± 0.047	(4.68 ± 1.40) × 10⁵	15,808 ± 261	29.6 ± 8.9
M5	0.326 ± 0.024	0.456 ± 0.051	(3.89 ± 1.16) × 10⁵	12,061 ± 376	32.3 ± 9.7
M13	0.318 ± 0.024	0.425 ± 0.056	(6.05 ± 0.33) × 10⁵	17,464 ± 554	34.6 ± 2.2
M15	0.338 ± 0.020	0.483 ± 0.033	(5.13 ± 1.06) × 10⁵	6543 ± 1843	78.4 ± 27.4
M53	0.264 ± 0.034	0.351 ± 0.042	(3.48 ± 1.04) × 10⁵	10,913 ± 328	31.9 ± 9.6
M92	0.262 ± 0.027	0.359 ± 0.043	(2.75 ± 0.57) × 10⁵	11,382 ± 1107	24.2 ± 5.5
NGC 4147	0.267 ± 0.038	0.353 ± 0.050	(3.72 ± 2.82) × 10⁴	1564 ± 87	23.8 ± 18.1
NGC 5053	0.241 ± 0.025	0.326 ± 0.041	(5.37 ± 1.32) × 10⁴	3932 ± 110	13.7 ± 3.4
NGC 5466	0.283 ± 0.024	0.351 ± 0.029	(3.80 ± 2.45) × 10⁴	4425 ± 126	8.6 ± 5.5
Palomar 5	0.361 ± 0.040	0.505 ± 0.053	(5.2 ± 0.7) × 10³	528 ± 37	9.8 ± 1.5

Note. The total mass of each globular cluster, with the exception of M13, NGC 5053, and Palomar 5, came from Kimmig et al. (2015). M13's total mass was pulled from Leonard et al. (1992) and Palomar 5's mass came from Odenkirchen et al. (2002).

Download table as: ASCII Typeset image

Plotting the mass per turnoff star as a function of the peak color (Figure 7), we see that redder globular clusters typically produce a larger calculated mass per turnoff star. Fitting these points to an exponential curve of the form $\mathrm{ln}(M/{M}_{\odot })=r{\left(g-i\right)}_{0}+\mathrm{ln}(A)$ , we find r = 17.7 ± 4.6 and $\mathrm{ln}(A)=-3.96\pm 1.89$ (χ² = 14.8). This implies that using a globular cluster for the mass estimate of a stellar system with a bluer peak color overestimates its mass. Because we select a fixed (blue) color range to select our turnoff stars $(0.12\lt {\left(g-i\right)}_{0}\lt 0.47)$ , it makes sense that using a redder globular cluster would result in fewer stars falling within this color range and thus give us a larger stellar mass per turnoff star. If we extrapolate the curve to find the mass per turnoff star using a globular cluster with a color similar to the OCS, we find 3.0 ± 6.9M_⊙ per turnoff star, roughly one-quarter of the mass per turnoff star estimated using NGC 5053. It should be noted, however, that we do not use this extrapolated mass per turnoff for analyzing the OCS due to the extrapolation's poor fit and large errors. We instead use the mass per turnoff derived from NGC 5053, keeping in mind that any masses we fit using this data are overestimates.

**Figure 7.** A plot of the peak turnoff color vs. the mass per turnoff star for various globular clusters. Redder globular clusters seem to have a larger mass per F-turnoff star, following an exponential curve. The solid red curve shows the curve of best fit (χ² = 14.8), while the dashed red lines are the curves one standard deviation above and below the best fit. The curve has the form $\mathrm{ln}(M/{M}_{\odot })=r{\left(g-i\right)}_{0}+\mathrm{ln}(A)$ , where r = 17.7 ± 4.6 and $\mathrm{ln}(A)=-3.96\,\pm 1.89$ .
Download figure:
Standard image High-resolution image

2.5. Beta Dispersion as a Function of Position along the Stream

To fit the apparent width of the stream as a function of Λ, we measure the Beta dispersion of the OCS by splitting each Λ bin into 20 B bins, covering both the on and off fields. In each Λ bin, we fit the distribution of B values to a Gaussian distribution with a linear background, taking the fitted σ value as the Beta dispersion:

$\begin{eqnarray}&&f(x)={mx}+b+{{Ae}}^{-\tfrac{{\left(x-\mu \right)}^{2}}{2{\sigma }^{2}}},\end{eqnarray} \tag{ 20 }$

where m is the slope of the background, b is the y intercept, A is the amplitude, μ is the mean, and σ is the standard deviation of the Gaussian distribution.

To ensure a larger sample size, we group each consecutive set of three Λ bins into a single bin for the purposes of the Beta dispersion calculation. We employ the same Monte Carlo method of bin completion used in Section 2.2 to determine the number of stars in each Beta bin. However, we exclude any bins whose data coverage is less than half of the bin area from our fit. We use the "curve fit" algorithm from SciPy (Virtanen et al. 2020) to fit the beta dispersions and their respective errors. Figure 8 shows the fit in each set of Λ bins while Table 4 lists the numerical values of the Beta dispersion and their errors.

**Figure 8.** The distribution of stars within each set of Λ bins as a function of B. The open circles represent the number of stars in each B subbin, and the black line is the curve of best fit using Equation (20). We excluded B subbins from the fit if that subbin was less than half-filled due to the incomplete areal coverage of the DEC data.
Download figure:
Standard image High-resolution image

Table 4. Fitted Values for the Beta Dispersion and Dispersion Error in Each Λ Range

Λ Range	Beta Dispersion	Dispersion Error
[−33.0, −24.0]	1.003	0.323
[−24.0, −15.0]	1.001	0.208
[−15.0, −6.0]	1.076	0.218
[−6.0, 3.0]	0.612	0.146
[3.0, 12.0]	0.791	0.308
[12.0, 21.0]	0.947	0.257
[21.0, 30.0]	0.886	0.193
[30.0, 39.0]	2.263	0.825
[39.0, 48.0]	1.230	0.275

Download table as: ASCII Typeset image

3. N-body Simulations

For a given set of dwarf-galaxy parameters, we calculate the likelihood that those parameters produce the observed data histogram, as constructed from the values calculated in Section 2. This section summarizes the methods and algorithms that are laid out in more detail in Shelton et al. (2021).

To calculate the likelihood of a set of progenitor dwarf-galaxy parameters, we place the prescribed simulated dwarf galaxy in a static Milky Way potential and let it evolve through time, interacting with the gravitational well and its own bodies. We employ multithreading in our N-body algorithm to speed up our calculations. After running the simulation for a prescribed number of time steps, we calculate the likelihood that the simulated and observed streams result from the same progenitor.

3.1. Progenitor Creation

We represent our simulated dwarf galaxy as a collection of 40,000 bodies: half of the bodies represent visible baryonic matter, and the other half represent dark matter. Regardless of the baryonic and dark matter masses, we employ the same number of bodies, changing the mass per particle to represent the full mass of a component. We choose 40,000 bodies because it is the minimum number of particles needed to consistently generate the same stream over different random seeds (see Figure 9).

**Figure 9.** Plot of average likelihood score as a function of the number of bodies. The likelihood is a measure of how well an individual simulation matches the comparison simulation; zero likelihood is a perfect reproduction. Each point represents the likelihood score of a simulation against a comparison stream generated using the same dwarf parameters averaged over 12 random seeds. Each run calculates the likelihood using the same comparison stream, which was run using 100,000 bodies. The red dashed line is where the number of bodies equals 40,000. Because this is within the area where the likelihood curve flattens, we choose to run 40,000 bodies in all of our simulations.
Download figure:
Standard image High-resolution image

To model our progenitor dwarf galaxy, we arrange our 40,000 bodies into two nested, concentric Plummer spheres (Plummer 1911). The radial profile of a Plummer sphere is given by

$\begin{eqnarray}&&\rho (r)=\displaystyle \frac{3M}{4\pi {a}^{3}}{\left(1+\displaystyle \frac{{r}^{2}}{{a}^{2}}\right)}^{-5/2},\end{eqnarray} \tag{ 21 }$

where a is a scale radius and M is the total mass of the Plummer sphere. Each progenitor can thus be described by four parameters, the baryonic mass (M_B), the dark matter mass (M_D), the baryon scale radius (a_B), and the dark matter scale radius (a_D). However, because the properties of our dwarf galaxies have a degree of scale invariance, it is more efficient to optimize over the total baryonic component (M_B and a_B) and two ratios (ξ_R and ξ_M). These ratios are defined as follows:

$\begin{eqnarray}\begin{array}{rcl}{\xi }_{R} & = & \displaystyle \frac{{a}_{B}}{{a}_{B}+{a}_{D}}\\ {\xi }_{M} & = & \displaystyle \frac{{M}_{B}}{{M}_{B}+{M}_{D}}.\end{array}\end{eqnarray} \tag{ 22 }$

The method we use to randomly generate these progenitor dwarf galaxies is detailed in Shelton et al. (2021).

We determine the starting location of the progenitor by first specifying its present-day position and velocity, which we calculated using an orbit for the OCS determined from Newberg et al. (2010). Using its present-day position and velocity, we integrate a single-body orbit backward in time up to the evolution time τ_evolve to find the progenitor's starting position and velocity, centering the 40,000 body simulated progenitor at that point. Although the orbit remains fixed across simulations, we determined it from information different from what we will use to measure the dwarf-galaxy parameters; the orbit was fit using sky position, line-of-sight velocity, and heliocentric distance.

Our progenitor dwarf galaxy is thus completely characterized with five parameters: the evolution time τ_evolve, the baryonic Plummer radius a_B, the radius ratio ξ_R, the baryonic Plummer mass M_B, and the baryonic mass ratio ξ_M. As such, these are the parameters we wish to optimize over using MilkyWay@home.

3.2. Accelerations on Each Body

Our Milky Way potential consists of three components: a Miyamoto‒Nagai disk, a Hernquist bulge, and a logarithmic halo (Miyamoto & Nagai 1975; Hernquist 1990; Sakamoto et al. 2003). Our spherical Hernquist bulge has a scale radius of 0.7 kpc and a mass of 9.9 ×10¹⁰ M_⊙ (Law et al. 2005). The Miyamoto‒Nagai disk in our simulations has a scale radius of 6.5 kpc, a scale height of 0.26 kpc, and a mass of 3.4 ×10¹⁰ M_⊙ (Law et al. 2005). We implement a logarithmic halo with a circular velocity of 73 km s⁻¹ (74.61 kpc Gyr⁻¹) (Newberg et al. 2010) and a scale radius of 12 kpc (Law et al. 2005) to model the dark matter halo. We translate these potentials into accelerations by calculating the negative gradient of each potential ( a = − ∇ϕ). Within our simulations, we represent distances in kiloparsecs (kpc), time in gigayears (Gyr), and mass in structure masses (SM), where 1 SM = 222,288.47 M_⊙. Using these units, G = 1.

Our algorithm uses classical Newtonian gravitation to calculate the acceleration between two bodies. We employ a Barnes‒Hut tree algorithm (Barnes & Hut 1986), which has a complexity of $O(N\mathrm{log}N)$ to speed up calculations. Our N-body simulations employ a Velocity Verlet method (Verlet 1967) to calculate the new positions and velocities of each particle at the end of each time step. We use this Velocity Verlet algorithm because it is a symplectic integrator, meaning the area of its phase space is conserved as the system evolves through time. This property is especially attractive for our integrator because it implies that energy is conserved as the dwarf galaxy falls into the Milky Way, an important constraint that must be maintained throughout our N-body simulations. The calculation of our accelerations, time step, and softening length is further detailed in Shelton et al. (2021).

3.3. Phase-space Evolution Method

After establishing the Milky Way potential and creating the progenitor dwarf, we determine the starting point of our simulation by defining approximately where we expect the progenitor's core to lie in the stream at the present time and thus where we would like the simulated progenitor core to end up at the end of the simulation. For the OCS, we place this starting point at (l, b, R) = (258 fdg 0, 45 fdg 8, 21.5 kpc) with a velocity (v_x, v_y, v_z) = (−185.5, 54.7, 147.4) kpc Gyr⁻¹. This velocity is given in right-handed Galactic Cartesian coordinates, in which the X-axis is the direction from the Sun to the Galactic Center and the Y-axis is in the direction of the Sun's motion. This starting point defines the orbit of the OCS and was calculated using the orbit fit from Newberg et al. (2010). We place a single body at the starting point and evolve that body backward in time by our evolution time, τ_evolve. We replace the body with the simulated dwarf-galaxy progenitor, and the N-body integration begins.

3.4. Likelihood Calculation

After running a simulation to completion, we need to translate the resulting tidal stream into a histogram to compare with the data histogram. We transform our bodies into a Lambda−Beta (Λ, B) system that follows the OCS along its B = 0 great circle. We use the same coordinate system as derived in Newberg et al. (2010). We split the stream into a histogram that consists of 28 Λ bins subtending the range (−33°, 48°) and one Beta bin covering the range (−15°, 15°). Within each bin, we record the number of bodies and normalize the counts with respect to the total number of bodies within the histogram's range. We also calculate the Beta dispersion at each Λ where we have a measured stream width.

To compare the similarity between two histograms, we employ a metric called the likelihood score. This value represents the natural logarithm of the probability that two histograms match. As such, the likelihood score ranges from negative infinity (worst case set to −9999999.9) to 0, where a higher likelihood is indicative of a closer match. Our likelihood score consists of three components: the Earth Mover Distance (EMD) Component, the Cost Component, and the Beta Dispersion Component.

The EMD Component (Rubner et al. 2000) measures how well the shapes of two normalized histograms match each other. Given two histograms, one representing the number of stream stars as a function of Λ and the other representing the number of simulation bodies as a function of Λ, the code calculates the minimum amount of "work" necessary to deform one histogram into the other and translates that work into an EMD score. If the EMD score is greater than some maximum EMD score we define ( ${\mathrm{EMD}}_{\max }=50$ ), the code outputs the worst-case likelihood score. Otherwise, we calculate the EMD component to be

$\begin{eqnarray}&&\mathrm{ln}({{ \mathcal L }}_{\mathrm{EMD}})=300\mathrm{ln}\left(1-\frac{\mathrm{EMD}}{{\mathrm{EMD}}_{\max }}\right).\end{eqnarray} \tag{ 23 }$

The Cost Component compares the total mass within each histogram and adds a penalty commensurate to the mass difference. We utilize this component because the EMD score can only be calculated if both histograms are normalized, so the information about their masses is lost. The Cost Component is given by

$\begin{eqnarray}&&\begin{array}{l}\mathrm{ln}({{ \mathcal L }}_{\mathrm{Cost}})\\ \ =\ -\displaystyle \frac{{\left({M}_{\mathrm{sim}}{N}_{\mathrm{sim}}-{M}_{\mathrm{data}}{N}_{\mathrm{data}}\right)}^{2}}{2\left({M}_{\mathrm{data}}^{2}{N}_{\mathrm{data}}+{M}_{\mathrm{sim}}^{2}{N}_{\mathrm{sim}}\left(\tfrac{{N}_{\mathrm{sim}}}{{N}_{\mathrm{total}}}\right)\left(1-\tfrac{{N}_{\mathrm{sim}}}{{N}_{\mathrm{total}}}\right)\right)},\end{array}\end{eqnarray} \tag{ 24 }$

where M_sim is the mass per body in the simulation, M_data is the mass per turnoff star we calculated in Section 2.4, N_sim is the number of baryonic bodies within the histogram's range, N_data is the number of turnoff stars within the histogram's range (5631 stars), and N_total is the total number of baryonic bodies we ran in the simulation (20,000).

The Beta Dispersion Component compares the width of the streams as a function of Λ. For each Λ bin in the simulation, we calculate the Beta dispersion ${\sigma }_{\beta ,\mathrm{sim},i}$ and its error ${\delta }_{{\sigma }_{\beta ,\mathrm{sim},i}}$ , and then compare them to the values determined from the stellar data, σ_β,data,i and ${\delta }_{{\sigma }_{\beta ,\mathrm{data},i}}$ . The formula for the Beta Dispersion Component is given by

$\begin{eqnarray}&&\mathrm{ln}({{ \mathcal L }}_{\mathrm{Disp}})=-\displaystyle \frac{1}{2}\sum _{i}^{{N}_{\mathrm{bins}}}\displaystyle \frac{{\left({\sigma }_{\beta ,\mathrm{sim},i}-{\sigma }_{\beta ,\mathrm{data},i}\right)}^{2}}{{\delta }_{{\sigma }_{\beta ,\mathrm{sim},i}}^{2}+{\delta }_{{\sigma }_{\beta ,\mathrm{data},i}}^{2}}.\end{eqnarray} \tag{ 25 }$

The total likelihood score is the sum of each of these components:

$\begin{eqnarray}&&\mathrm{ln}({ \mathcal L })=\mathrm{ln}({{ \mathcal L }}_{\mathrm{EMD}})+\mathrm{ln}({{ \mathcal L }}_{\mathrm{Cost}})+\mathrm{ln}({{ \mathcal L }}_{\mathrm{Disp}})\end{eqnarray} \tag{ 26 }$

The details of how each component is calculated can be found in Shelton et al. (2021). Only the locations of the baryonic bodies contribute to the calculation of the likelihood score because we are only able to observe the stars in the tidal stream.

Due to the chaotic behavior of N-body systems, the slightest deviation in the progenitor parameters (or the random seed) can shift the core of our progenitor further ahead or behind in the stream, drastically affecting the likelihood. To mitigate this chaos, we check the likelihood score over a range of evolution times to "match" the cores of the simulated stream to those of the data. Once the simulation reaches 98% of the evolution time, the code calculates the likelihood at each time step and keeps the snapshot with the best likelihood. This comparison continues until the simulation time reaches 102% of the evolution time. The best likelihood in this range of forward evolution times is returned as the likelihood for the given parameter set.

4. Optimization Procedure

Exploring our parameter space requires hundreds of thousands of likelihood calculations, each requiring an N-body simulation. Our N-body simulations on average take 15 minutes to compute on a single processor; it would take several years to run this optimization on an average laptop. Therefore, computing these optimized parameters within a reasonable timeframe demands the use of a much more sophisticated computer network.

MilkyWay@home is a collection of roughly 26,000 volunteered computers connected by the Berkeley Open Infrastructure for Network Computing (BOINC), operating at 1.5 PetaFLOPS of combined computing power. MilkyWay@home uses this massive computing power in conjunction with the Time Asynchronous Optimization (TAO; Desell et al. 2009) package to optimize the parameters that define the size and shape of the progenitor dwarf galaxy. This section explains the differential evolution genetic algorithm we use to optimize our progenitor parameters and describes the parameter space we explore.

4.1. Differential Evolution Genetic Algorithm

MilkyWay@home starts an optimization by first generating a random population of 50 parameter sets, each containing the five parameters described in Section 3.1, on the main server. For each parameter set in the population, MilkyWay@home sends out a package containing an N-body executable, a Lua file containing important settings for the simulation, the aforementioned parameter set, and a comparison histogram containing the real stellar data to several of the 26,000 volunteered computers. The volunteered computer runs the N-body executable with the chosen parameters and generates a simulated tidal stream, calculates the density of bodies across the stream, and logs the information into a histogram. The newly generated histogram is compared to the data histogram, and the computer generates a likelihood score based on how well the two histograms match. This score and its associated parameter set are sent back to the main server.

After calculating the likelihood scores of the initial population, we employ a differential evolution algorithm to generate new parameter sets to test. For each member of our population x, we select three other random distinct members (a, b, and c) to act as "genetic donors" to the new parameter set y. We then generate a random integer p between 1 and n, where n is the number of elements in our parameter set (in this case 5). Also, we generate a random number r_i between 0 and 1 for each element x_i. Given these numbers, the ith element y_i of the new parameter set is thus defined as

$\begin{eqnarray}{y}_{i}=\left\{\begin{array}{cl}{a}_{i}+F({b}_{i}-{c}_{i}) & {r}_{i}\lt {CR}\vee i=p\\ {x}_{i} & \mathrm{otherwise}\end{array}\right.,\end{eqnarray} \tag{ 27 }$

where CR is the crossover rate (set to 0.9) and F is the differential weight (set to 0.8). These newly created sets are then sent to our volunteers to calculate a likelihood score. Once a parameter set returns with its likelihood score, it is compared against the rest of the population. Only the parameter sets with the highest likelihood scores are kept to act as "genetic donors for new parameter sets.

Ultimately, the population of parameter sets converges to the phase-space point with the highest likelihood and, thus, to a set of parameters that most accurately matches the stellar data. We say an optimization has converged when all members of the population are identical. MilkyWay@home can take between two to four weeks to converge.

4.2. Time-step Cut in Parameter Space

Although Shelton et al. (2021) give us a realistic physical basis for our choice of time step, there are still regions of our parameter space that could lead to unwieldy run times. In particular, these occur when our generated dwarf progenitor is extraordinarily dense. MilkyWay@home searches over a five-dimensional parameter space for the parameters with the highest likelihood (search range defined in Table 5). However, it would not be wise to explore the full 5D box as the corners of our parameter space with high mass and low radius would produce exceedingly dense progenitors that could take up to 80 days to run. Regions of parameter space that would require more than 1.5 million time steps in the simulation are excluded from our optimization. Figure 10 shows the excised region of parameter space.

**Figure 10.** Plots illustrating the fraction of our 5D parameter space searched as a function of the baryonic Plummer scale radius (a_B) and the baryonic progenitor mass (M_B). The color of a pixel shows the fraction of the phase space tested for a given (a_B, M_B). Redder regions in these plots are where a larger percentage of our parameter space is removed. Note that areas where the density of the progenitor galaxy is higher are where we lose most of our phase space. Of the total phase space, 8.92% of our parameter space is cut out.
Download figure:
Standard image High-resolution image

Table 5. The Search Ranges for Our Optimizations Against Stellar Data

τ_evolve (Gyr)	R_B (kpc)	Radius Ratio $\left({\xi }_{R}=\tfrac{{R}_{B}}{{R}_{B}+{R}_{D}}\right)$	M_B (SM)	Mass Ratio $\left({\xi }_{M}=\tfrac{{M}_{B}}{{M}_{B}+{M}_{D}}\right)$
[2.0–6.0]	[0.01–0.50]	[0.1–0.6]	[0.1–100.0]	[0.001–0.95]

Download table as: ASCII Typeset image

Placing a nonlinear cut on our phase space like this does remove the ability for MilkyWay@home to fit the densest ultracompact dwarf (UCD) galaxies. For example, simulating a dwarf galaxy like M60-UCD1, one of the densest dwarf galaxies ever measured (Strader et al. 2013), would require roughly 17 million time steps to evolve the progenitor through our gravitational well for 6.0 Gyr. Fortunately, our fits to the OCS do not suggest a UCD progenitor, but rather a dwarf galaxy with a similar radius but over 100 times less mass.

5. Results

We ran six optimizations over our five dwarf parameters using MilkyWay@home's distributed supercomputer. The raw converged results for each optimization run can be found in Table 6.

Table 6. Fitted Parameters from MilkyWay@home Optimizations

Run	τ_evolve (Gyr)	R_B (kpc)	ξ_R	M_B (SM)	ξ_M	$\mathrm{ln}({ \mathcal L })$
1	3.6354 ± 0.0004	0.2560 ± 0.0010	0.3455 ± 0.0014	1.017 ± 0.011	0.0413 ± 0.0005	−9.604550
2	3.6345 ± 0.0004	0.2327 ± 0.0005	0.2543 ± 0.0010	1.146 ± 0.007	0.0179 ± 0.0003	−8.809277
3	3.6333 ± 0.0004	0.1812 ± 0.0007	0.1828 ± 0.0009	1.223 ± 0.020	0.0126 ± 0.0003	−7.726065
4	5.3921 ± 0.0004	0.2060 ± 0.0010	0.1315 ± 0.0015	1.635 ± 0.017	0.1445 ± 0.0006	−18.033154
5	3.6334 ± 0.0003	0.1842 ± 0.0007	0.1820 ± 0.0010	1.251 ± 0.014	0.0119 ± 0.0002	−7.997704
6	5.3956 ± 0.0004	0.1987 ± 0.0004	0.1305 ± 0.0021	1.682 ± 0.009	0.1507 ± 0.0016	−18.982624

Notes. Errors were calculated using the Hessian method as described in Section 2.4. Likelihoods closer to 0 imply a better fit. Note that the primary source of error is from the navigation of the likelihood surface as measured by the differences between individual runs and not the statistical error calculated from the Hessian.

Download table as: ASCII Typeset image

From these results, we see that the differential evolution genetic algorithm we used did not consistently converge to the same parameter set. We find that for Runs 4 and 6, the algorithm reached a local maximum whose parameters suggest a lower dark matter concentration. Runs 3 and 5 have the highest likelihood scores and seem to reflect the true maximum in the likelihood surface, whereas Runs 1 and 2 got close to this solution but got failed to optimize further.

Table 7 gives the same optimized dwarf parameters as in Table 6 but measured in physical units. We also directly compare our data histogram against the optimized run with the best likelihood in Table 8.

Table 7. Radial Profile and Mass Calculated from Fitted Parameters

Run	τ_evolve (Gyr)	R_B (kpc)	R_D (kpc)	M_B (M_⊙)	M_D (M_⊙)
1	3.6354 ± 0.0004	0.2560 ± 0.0010	0.485 ± 0.004	(2.26 ± 0.02) × 10⁵	(5.24 ± 0.09) × 10⁶
2	3.6345 ± 0.0004	0.2327 ± 0.0005	0.682 ± 0.004	(2.55 ± 0.02) × 10⁵	(1.39 ± 0.02) × 10⁷
3	3.6333 ± 0.0004	0.1812 ± 0.0007	0.810 ± 0.006	(2.72 ± 0.04) × 10⁵	(2.13 ± 0.05) × 10⁷
4	5.3921 ± 0.0004	0.2060 ± 0.0010	1.361 ± 0.019	(3.63 ± 0.04) × 10⁵	(2.15 ± 0.02) × 10⁶
5	3.6334 ± 0.0003	0.1842 ± 0.0007	0.828 ± 0.006	(2.78 ± 0.03) × 10⁵	(2.30 ± 0.05) × 10⁷
6	5.3956 ± 0.0004	0.1987 ± 0.0004	1.323 ± 0.024	(3.74 ± 0.02) × 10⁵	(2.11 ± 0.03) × 10⁶

Notes. The primary source of error is from the navigation of the likelihood surface as measured by the differences between individual runs and not the statistical error calculated from the Hessian.

Download table as: ASCII Typeset image

Table 8. Histograms of Stellar Density and Beta Dispersion along the Stream

	DATA				SIMULATION (RUN 3)
Λ	Norm. Counts	Error	B Disp.	Disp. Error	Norm. Counts	Error	B Disp.	Disp. Error
−31.5	0.0316	0.0072	⋯	⋯	0.0264	0.0022	1.543	0.130
−28.5	0.0304	0.0078	1.003	0.323	0.0261	0.0021	1.220	0.104
−25.5	0.0186	0.0081	⋯	⋯	0.0284	0.0022	1.315	0.106
−22.5	0.0114	0.0089	⋯	⋯	0.0301	0.0023	1.184	0.092
−19.5	0.0266	0.0091	1.001	0.208	0.0273	0.0022	1.138	0.094
−16.5	0.0261	0.0095	⋯	⋯	0.0278	0.0022	1.068	0.087
−13.5	0.0433	0.0101	⋯	⋯	0.0300	0.0023	1.071	0.084
−10.5	0.0382	0.0105	1.076	0.218	0.0284	0.0022	0.994	0.080
−7.5	0.0172	0.0109	⋯	⋯	0.0216	0.0020	0.969	0.089
−4.5	0.0201	0.0117	⋯	⋯	0.0216	0.0020	0.924	0.085
−1.5	0.0224	0.0127	0.612	0.146	0.0223	0.0020	0.716	0.066
1.5	0.0302	0.0127	⋯	⋯	0.0207	0.0019	0.829	0.077
4.5	0.0227	0.0118	⋯	⋯	0.0190	0.0018	0.764	0.075
7.5	0.0297	0.0108	0.791	0.308	0.0209	0.0019	0.656	0.062
10.5	0.0011	0.0110	⋯	⋯	0.0190	0.0018	0.707	0.070
13.5	0.0261	0.0108	⋯	⋯	0.0220	0.0020	0.502	0.046
16.5	0.0277	0.0109	0.947	0.257	0.0262	0.0022	0.702	0.058
19.5	0.0262	0.0113	⋯	⋯	0.0287	0.0023	0.666	0.053
22.5	0.0325	0.0139	⋯	⋯	0.0388	0.0026	0.610	0.038
25.5	0.0655	0.0146	0.869	0.195	0.0546	0.0031	0.660	0.038
28.5	0.0618	0.0140	⋯	⋯	0.0679	0.0035	0.561	0.029
31.5	0.0586	0.0141	⋯	⋯	0.0745	0.0036	0.710	0.035
34.5	0.0925	0.0145	2.606	1.217	0.0784	0.0037	0.771	0.037
37.5	0.0870	0.0149	⋯	⋯	0.0800	0.0038	0.784	0.037
40.5	0.0655	0.0144	⋯	⋯	0.0598	0.0033	0.887	0.049
43.5	0.0240	0.0152	1.325	0.311	0.0527	0.0031	0.863	0.051
46.5	0.0629	0.0150	⋯	⋯	0.0468	0.0029	0.660	0.042

Notes. The left histogram was generated from the data. The right histogram shows the best simulation fit. The number of counts in each bin is normalized for the purposes of the Earth Mover Distance calculation in our likelihood score.

Download table as: ASCII Typeset image

5.1. Using Marginalized Parameter Sweeps to Estimate Error

We use the Hessian method as described in Section 2.4 to calculate the errors in each of our fitted parameters. However, to accurately calculate these errors, we need our step size to already be comparable to the errors we wish to calculate. To accomplish this, we perform a marginalized one-dimensional parameter sweep along each dwarf parameter to visualize an adequate step size. In this section, we describe the methods by which we quickly and efficiently calculate the marginalized likelihood scores for these parameter sweeps.

Calculating a marginalized likelihood score from N-body simulations is difficult for two reasons. First, the marginalization computation must be completed within the likelihood space, rather than the log(likelihood) space we represent in our version of the likelihood score. This requires us to numerically compute integrals over regions of our parameter space where the likelihood is several orders of magnitude lower than machine precision. Second, any marginalization method we implement over our likelihood space is inherently computationally expensive, requiring us to calculate a complicated four-dimensional integral over a rough likelihood surface. Calculating one likelihood score from a set of parameters requires an N-body simulation. If we were to use a crude three-point Gaussian quadrature method to calculate each integral, that would require us to run 81 N-body simulations to calculate the marginalized likelihood for one point of our parameter sweep. Assuming we use about 30 points for our parameter sweep, that brings the total number of simulations to 2430, requiring one to three months for each parameter sweep.

To address the first issue, we define a special convention for adding together likelihood scores. Given two likelihood scores $\mathrm{ln}({{ \mathcal L }}_{\gt })$ and $\mathrm{ln}({{ \mathcal L }}_{\lt })$ (where ${{ \mathcal L }}_{\gt }\gt {{ \mathcal L }}_{\lt }$ ), we define their sum $\mathrm{ln}({{ \mathcal L }}_{\mathrm{sum}})$ to follow the following equation:

$\begin{eqnarray}&&{e}^{\mathrm{ln}({{ \mathcal L }}_{\mathrm{sum}})}={e}^{\mathrm{ln}({{ \mathcal L }}_{\gt })}+{e}^{\mathrm{ln}({{ \mathcal L }}_{\lt })}.\end{eqnarray} \tag{ 28 }$

Doing some simple math, we can rewrite this equation as

$\begin{eqnarray}&&\mathrm{ln}({{ \mathcal L }}_{\mathrm{sum}})=\mathrm{ln}({{ \mathcal L }}_{\gt })+\mathrm{ln}\left(1+{e}^{\mathrm{ln}({{ \mathcal L }}_{\lt })-\mathrm{ln}({{ \mathcal L }}_{\gt })}\right).\end{eqnarray} \tag{ 29 }$

Overall, larger likelihood scores dominate smaller ones. In a double-precision computation, if the two likelihood scores differ by more than 64, the smaller likelihood score is completely ignored in the calculation. Therefore, given a list of likelihoods to add, it is most prudent to add together the smaller likelihood scores first.

To calculate the marginalized likelihood score $\mathrm{ln}(\widetilde{{ \mathcal L }})$ , we use a Monte Carlo integration method using N random points in our parameter space of volume V. To determine the minimum number of random points we need for our calculation, we need to know how N affects the error in $\mathrm{ln}(\widetilde{{ \mathcal L }})$ . Given a list of likelihoods $\{{ \mathcal L }\}$ , the marginalized likelihood $\widetilde{{ \mathcal L }}$ calculated using Monte Carlo is given by the following equation:

$\begin{eqnarray}&&\widetilde{{ \mathcal L }}=\displaystyle \frac{V}{N}\sum _{i}{{ \mathcal L }}_{i}=V{\mu }_{{ \mathcal L }},\end{eqnarray} \tag{ 30 }$

where ${\mu }_{{ \mathcal L }}$ is the average of ${ \mathcal L }$ . From this, it is clear that the main source of error in this equation comes from the calculation of ${\mu }_{{ \mathcal L }}$ , which is simply the error in the mean:

$\begin{eqnarray}&&{\delta }_{\widetilde{{ \mathcal L }}}=V\displaystyle \frac{{\sigma }_{{ \mathcal L }}}{\sqrt{N}},\end{eqnarray} \tag{ 31 }$

where ${\sigma }_{{ \mathcal L }}$ is the standard deviation of the population where ${ \mathcal L }$ was pulled from. Performing basic error propagation, we find the error in the marginalized likelihood score to be

$\begin{eqnarray}&&{\delta }_{\mathrm{ln}(\widetilde{{ \mathcal L }})}=\displaystyle \frac{{\delta }_{\widetilde{{ \mathcal L }}}}{\widetilde{{ \mathcal L }}}=\displaystyle \frac{{\sigma }_{{ \mathcal L }}}{{\mu }_{{ \mathcal L }}\sqrt{N}}.\end{eqnarray} \tag{ 32 }$

It can then be shown that

$\begin{eqnarray}&&{\delta }_{\mathrm{ln}(\widetilde{{ \mathcal L }})}=\displaystyle \frac{1}{\sqrt{N-1}}\sqrt{N\displaystyle \frac{\sum {{ \mathcal L }}_{i}^{2}}{{\left(\sum {{ \mathcal L }}_{i}\right)}^{2}}-1}.\end{eqnarray} \tag{ 33 }$

Equation (33) is what we use in Figure 11 to determine the errors in our parameter sweeps (shaded in blue). However, given this equation, we can generate a rough estimate for what this error looks like on average. We can reorder our list $\{{ \mathcal L }\}$ in descending order, where ${{ \mathcal L }}_{0}$ is the highest likelihood and ${{ \mathcal L }}_{1}$ is the second-highest likelihood. Because we anticipate that each pair of adjacent entries in the reordered list differs by tens of orders of magnitude, we can make the following approximations:

$\begin{eqnarray}&&\sum _{i}{{ \mathcal L }}_{i}\approx {{ \mathcal L }}_{0}+{{ \mathcal L }}_{1}={{ \mathcal L }}_{0}\left(1+k\right),0\lt k\leqslant 1,\end{eqnarray} \tag{ 34 }$

$\begin{eqnarray}&&\sum _{i}{{ \mathcal L }}_{i}^{2}\approx {{ \mathcal L }}_{0}^{2}+{{ \mathcal L }}_{1}^{2}={{ \mathcal L }}_{0}^{2}\left(1+{k}^{2}\right),0\lt k\leqslant 1,\end{eqnarray} \tag{ 35 }$

where k = p₁/p₀. We can make this approximation because we expect only the largest few likelihoods in our list to impact the summations. Substituting these quantities into Equation (33) gives us

$\begin{eqnarray}\begin{array}{rcl}{\delta }_{\widetilde{{ \mathcal L }}} & \approx & \displaystyle \frac{1}{\sqrt{N-1}}\sqrt{N\displaystyle \frac{\left(1+{k}^{2}\right)}{{\left(1+k\right)}^{2}}-1}\\ & = & \sqrt{\displaystyle \frac{1+{k}^{2}-2k/(N-1)}{{\left(1+k\right)}^{2}}}.\end{array}\end{eqnarray} \tag{ 36 }$

Taking the limit where k approaches 0, we find that ${\delta }_{\widetilde{{ \mathcal L }}}$ takes the form

$\begin{eqnarray}&&{\delta }_{\widetilde{{ \mathcal L }}}\approx \sqrt{1-\displaystyle \frac{2k}{N-1}}\approx 1.\end{eqnarray} \tag{ 37 }$

Surprisingly, we find that the effect of N on ${\delta }_{\widetilde{{ \mathcal L }}}$ is incredibly muted in our likelihood space. In fact, looking at the parameter sweeps in Figure 11, we find that the errors associated with each point are extremely close to 1. It is therefore unreasonable to use a bound in ${\delta }_{\widetilde{{ \mathcal L }}}$ to select a minimum value for N.

**Figure 11.** One-dimensional marginalized parameter sweeps over each dwarf parameter. The blue line shows the negative of the likelihood calculated at each dwarf parameter value while the red line indicates the best-fitted value for that parameter (from Run 3). The blue shaded area shows the error in our marginalized likelihood score.
Download figure:
Standard image High-resolution image

Instead, we should set N to be the minimum number of measurements by which the unbiased value of σ_p can be most efficiently estimated to a set percentage. To determine σ_p to within 1%, the unbiased variance ${\sigma }_{p}^{2}$ must be determined to within 2%. Also, the relationship between ${\sigma }_{p}^{2}$ and the sample variance ${s}_{p}^{2}$ can be estimated using Bessel's correction:

$\begin{eqnarray}&&\displaystyle \frac{{\sigma }_{p}^{2}}{{s}_{p}^{2}}=\displaystyle \frac{N}{N-1}.\end{eqnarray} \tag{ 38 }$

Therefore, to know ${\sigma }_{p}^{2}$ to within 2%, we measure the likelihoods from 50 random points for each Monte Carlo approximation to calculate our parameter sweeps.

The likelihood surfaces we observe are not smooth, but we still see an overall dip near the optimal parameters. A Gaussian peak in the likelihood surface translates to a parabola in the log(likelihood) parameter sweeps. Using the apparent width of these peaks, we determine the appropriate step size to use when calculating the Hessian for the purposes of error analysis.

5.2. Full Result

After finding the converged values for each of our optimizations, we reran the simulations using each best-fit parameter set to check that the generated simulated tidal stream matched the data, as shown in Figures 12 and 13. In all optimizations, we see a general trend where the B dispersion is relatively well fit for Λ < 10° but poorly fit for Λ > 10°, especially near the core. This is likely due to the larger errors in the B dispersion associated with the region near the core in the DEC data at higher Λ_OCS. Most of the optimizations (Runs 2, 3, and 5) converged to similar tidal debris that closely resembles the OCS. Runs 1, 4, and 6 had a poorer likelihood score compared to the other optimizations, and we see that the distribution of bodies near its core is wider across Λ_OCS than the data. Due to the generally poor fit of these runs, we exclude them from our analysis of the OCS progenitor.

**Figure 12.** End-state histograms showing normalized body counts over Λ_OCS.
Download figure:
Standard image High-resolution image

**Figure 13.** End-state histograms showing B dispersion over Λ_OCS.
Download figure:
Standard image High-resolution image

It should be noted that none of these optimizations reproduce the gap (σ = 1.06) in the tidal stream at Λ = 10 fdg 5. This is not only because our data can barely resolve this gap, but also because such a gap would only appear as a result of substructures within the Milky Way's dark matter halo (Koposov et al. 2019), which is not accounted for in our current halo potential.

Taking the average of the three best-fitted parameters (Runs 2, 3, and 5), we estimate the following physical properties of the OCS progenitor dwarf galaxy:

$\begin{eqnarray*}\begin{array}{rcl}{\tau }_{\mathrm{evolve}} & = & 3.6337\pm {0.0004}^{(\mathrm{stdev}.)}\pm {0.0004}^{(\mathrm{Hess}.)}\mathrm{Gyr}\\ {R}_{B} & = & 0.1994\pm {0.0167}^{(\mathrm{stdev}.)}\pm {0.0006}^{(\mathrm{Hess}.)}\mathrm{kpc}\\ {R}_{D} & = & 0.773\pm {0.046}^{(\mathrm{stdev}.)}\pm {0.005}^{(\mathrm{Hess}.)}\mathrm{kpc}\\ {M}_{B} & = & \left(2.68\pm {0.07}^{(\mathrm{stdev}.)}\pm {0.03}^{(\mathrm{Hess}.)}\right)\times {10}^{5}{M}_{\odot }\\ {M}_{D} & = & \left(1.94\pm {0.28}^{(\mathrm{stdev}.)}\pm {0.04}^{(\mathrm{Hess}.)}\right)\times {10}^{7}{M}_{\odot }\\ {M}_{\mathrm{total}} & = & \left(1.97\pm {0.28}^{(\mathrm{stdev}.)}\pm {0.04}^{(\mathrm{Hess}.)}\right)\times {10}^{7}{M}_{\odot }\end{array}\end{eqnarray*}$

where the first error is the standard deviation of the mean and the second error is the propagated error from each individual optimization's Hessian error. We see that our largest source of error comes from the optimization process rather than the curvature of the likelihood surface. So, we only propagate those errors. From these parameters, we calculate a fitted mass-to-light ratio of γ = 73.5 ± 10.6.

Note that these results and errors are derived under the assumption of a perfect Milky Way potential, orbit, and functional form of the dwarf galaxy and do not include systematic errors due to these assumptions.

6. Discussion

6.1. An Extremely Low-mass Diffuse Progenitor

Our estimated mass for the OCS's progenitor is not consistent with the masses of UFDs measured from the dispersion of the line-of-sight velocities, which assume spherical symmetry and virial equilibrium. To illustrate this point more clearly, let us use these parameters to calculate the total mass within 300 pc of the progenitor's center. Within this radius, we calculate (1.6 ± 0.1) ×10⁵ M_⊙ of baryonic matter and (9.2 ± 2.2) × 10⁵ M_⊙ of dark matter. In total, we find a mass of (1.1 ± 0.2) × 10⁶ M_⊙ within 300 pc of the progenitor's center, roughly one order of magnitude smaller than what is claimed possible in Strigari et al. (2008). However, these results are only a factor of ∼2 smaller than the mass estimated in Newberg et al. (2010), who estimated the total OCS progenitor mass from N-body simulations to be closer to 2.5 × 10⁶ M_⊙. We note that this mass calculation was done under the assumption that the dark matter followed a Plummer profile. However, we note that 300 pc is very close to the half-light radius of the best-fit dwarf progenitor, and Shelton et al. (2021) showed that our algorithm was most sensitive to the mass within the half-light radius.

6.2. The Location of the Remnant Progenitor

We use our fitted simulations to estimate the current location of the OCS's progenitor. Note that there is a peak in the star counts of both the data and the stream simulation around Λ ∼ 35° in Figure 12. We know this point of high stellar density must be the location of the progenitor remnant because there is no other reason for stars to pile up at this position, so far from apogalacticon. In this section, we will determine the position of the progenitor remnant and show that it is no longer gravitationally bound.

For each optimization, we translate the positions of all the bodies into spherical coordinates (r, θ, ϕ), where the origin is the Galactic Center and the z-axis is perpendicular to the Galactic Plane. We bin the baryonic (stellar) bodies in the polar (θ) and azimuthal (ϕ) angles and select the bin with the highest density of bodies. Using the bodies in the selected bin, we calculate the average Galactocentric distance (r) and its standard deviation. To improve the calculation, we remove all bodies with distances more than 2.5 standard deviations away from the mean and recalculate the average, a process known as sigma clipping. We perform this sigma clipping 50 times for the sake of being thorough. After calculating the 3D point in space with the highest density of baryonic bodies, we translate it into Galactic and equatorial coordinates. The progenitor position of each run is shown in Table 9.

Table 9. The Current Sky Position of the Progenitor in Both Galactic and Equatorial Coordinates Calculated from Each Optimization

Run	l (°)	b (°)	α (°)	δ (°)
2	264.0	44.4	165.7	−10.1
3	270.4	40.5	167.7	−15.9
5	260.4	46.0	164.6	−7.3

Download table as: ASCII Typeset image

Our simulations predict that the remnant progenitor core is expected to be at around (l, b) = ((264.9 ± 2.9)°, ((43.6 ±2.8)°) or (α, δ) = ((166.0 ± 0.9)°, (−11.1 ± 2.5)°). This is close to the sky position where Grillmair et al. (2015) detected the OCS's possible missing progenitor, (α, δ) ≈ (167°,−14°).

We next check whether or not the simulated remnant progenitor is still gravitationally bound. For each optimization, we calculate the total kinetic energy of the remnant and compare it to its gravitational potential energy. We select bodies that are within 0.5 kpc of the progenitor's center because that distance is roughly the cylindrical radius of our simulated stream (see Figure 14). We transform our bodies into the center-of-momentum reference frame and add together the kinetic energies $({E}_{K}=\tfrac{1}{2}{\sum }_{i}^{N}{m}_{i}{{v}_{i}}^{2})$ of all the bodies. Similarly, we add together the gravitational energies from each pair of bodies $\left({E}_{P}={\sum }_{i}^{N}{\sum }_{i\lt j}^{N}\tfrac{{{Gm}}_{i}{m}_{j}}{| {r}_{i}-{r}_{j}| }\right)$ . We record the energies we calculate for each optimization in Table 10.

**Figure 14.** Close-up view of simulated progenitor remnant (Run 3). The remnant has been fully disrupted and the progenitor's original structure cannot be easily observed.
Download figure:
Standard image High-resolution image

Table 10. The Kinetic and Gravitational Potential Energies of the Simulated Remnant Progenitor in Each Optimization

Run	E_K (SM kpc² Gyr⁻²)	E_P (SM kpc² Gyr⁻²)
2	5.977	−0.096
3	7.587	−0.092
5	5.751	−0.093

Note. Note that the kinetic energy is much larger than the gravitational potential energy, indicating that the progenitor is no longer gravitationally bound.

Download table as: ASCII Typeset image

We find that the simulated remnant progenitor has a kinetic energy of (6.4 ± 0.6) SM kpc² Gyr⁻² and a gravitational potential energy of −(0.094 ± 0.001) SM kpc² Gyr⁻² and is thus gravitationally unbound. Looking at the stream directly in Figure 14 makes this readily apparent.

6.3. The Distribution of Dark Matter along the Stream

The tidal debris we observe in our simulations reveals some interesting aspects of the distribution of baryonic and dark matter in the stellar stream. We analyze the distribution of matter within the tails of the simulated OCS. Figure 15 shows that the tails of the OCS are thicker than the central region nearer to the progenitor remnant. There are two possible contributions to this thickening: either the tails are naturally fanning out at apogalacticon due to the symplectic (phase-space area-preserving) property of gravitational systems, or material with a higher velocity dispersion in the progenitor is being tidally stripped first, populating the regions of the stream farther from the core.

**Figure 15.** Tidal stream produced by Run 3 viewed from different angles. We see that the tidal tails of the stream are thicker near apogalacticon. The green star is the Galactic Center (GC), and the cyan star is the location of the OCS's progenitor dwarf galaxy.
Download figure:
Standard image High-resolution image

We can test this by looking at the simulated tidal stream generated using Run 3, our best run, when the core is at apogalacticon. If a high-velocity dispersion has a stronger impact on the thickness of the tails than the fanning at apogalacticon, then the tails should maintain their thickness even when they are not at apogalacticon. However, as can be clearly seen in Figure 16, while the core is at apogalacticon, it bears a similar thickness to the tails in the fully evolved stream. Also, the tails of the stream in the core‒apogalacticon state share a similar thickness to the core in the final state. This indicates that the thickening of the tails in the final state is mostly caused by the natural fanning that occurs at apogalacticon.

**Figure 16.** The final state stream (left) compared to the stream while its progenitor is at apogalacticon (right). The thickness of the stream at apogalacticon is comparable in both images regardless of where the progenitor is.
Download figure:
Standard image High-resolution image

In addition to being thicker than the core, the tails also possess a higher mass-to-light ratio. We see this in Figure 17, which maps out the density of baryons and dark matter along Λ_OCS. While the core of the OCS has a relatively high dark matter density, it is still not as high as in its tails, where the mass-to-light ratio jumps to as high as 142 in Run 3. Due to its high dark matter concentration and low baryonic contamination, the tails of the OCS might serve as a better candidate for indirect dark matter detection than the central core.

6.4. The Effect of the LMC on the OCS

When creating our data histogram, we excluded the southern tail of the OCS because our current N-body simulations do not account for the gravitational influence of the LMC. While the OCS's northern tail is not greatly influenced by the presence of the LMC, its southern tail exhibits peculiar perturbations in its stellar velocities, which cannot be explained by a static Milky Way potential or a low-mass LMC (Erkal et al. 2019). While this paper does not fit data from the southern tail, we do plan on including the tail and the gravitational effects of the LMC in a future MilkyWay@home paper.

6.5. Sources of Systematic Error

While we measured the OCS progenitor properties without the assumption of dynamical equilibrium, our measurement still relies on a few key presuppositions. First, we assume the Milky Way has a static, axisymmetric Galactic potential for the entirety of the simulation. There currently exists no mechanism in our N-body simulations for our progenitor to perturb the Milky Way to give rise to second-order effects. We do not include any time dependence in our Milky Way potential. We do not take into account other gravitating systems, which could interact with the OCS, such as the LMC or other Milky Way satellites. Adding the LMC to our simulations would not only further perturb the stream but also induce a reflex motion in the entirety of the Milky Way, forcing the dwarf-galaxy progenitor to fall into an accelerating potential. It has also been recently discovered that most of the mass in the inner disk of the Milky Way is located in its fast rotating bar (Portail et al. 2017), which is also not included in our Galactic model.

Second, we fix the orbit of our simulated OCS throughout each of our optimizations to the values determined in Newberg et al. (2010). Because the orbit is fit to the northern (leading) tail of the OCS, which should be at lower energy than the orbit of the dwarf galaxy itself, we expect the orbit is not exact. We have not explored how the orbit affects the fit properties of the dwarf-galaxy progenitor. However, we are only using the stellar distributions along and across the stream, which are not thought to be strongly affected by small differences in the orbit, to constrain the stream. Additionally, the stream created in our best-fit simulation is similar to the stream observed, which gives us some confidence that our orbit and progenitor properties are reasonable.

Third, we assume that the OCS progenitor has the form of a two-component Plummer sphere. We used a Plummer profile for our progenitor because it is a simple model with an analytic energy distribution function. However, it is probably not the best (and certainly not a perfect) model, particularly for the dark matter halo. We are unsure whether using a different profile, such as a Hernquist or Navarro–Frenk–White profile, would significantly affect the optimization results.

Finally, there is the measurement of baryonic mass in the OCS. As discussed in Section 2.4.1, using the redder NGC 5053 to estimate the mass of the bluer OCS overestimates the mass per turnoff star. From our analysis, we find that the stellar mass could be reduced by a factor of 4 due to inaccuracies in our calculation of the stellar mass per turnoff. This would affect the mass-to-light ratio but would not significantly change the total mass within 300 pc of the center of the progenitor.

7. Conclusion and Future Research

We made the first estimate of the mass and radial profile of the stars and dark matter in the dwarf-galaxy progenitor of a tidal stream. To do this, we developed a procedure for characterizing the stellar distribution along and across the OCS using turnoff stars in the OCS observed in the SDSS and the DEC. We used turnoff stars in NGC 5053 to approximate the stellar mass of the OCS. Using this information for a real observed tidal stream as input to MilkyWay@home, which uses the N-body optimization method developed in Shelton et al. (2021), we calculate the following properties for the dwarf-galaxy progenitor of the OCS:

$\begin{eqnarray*}\begin{array}{rcl}{\tau }_{\mathrm{evolve}} & = & (3.6337\pm 0.0004)\mathrm{Gyr}\\ {R}_{B} & = & (0.20\pm 0.02)\mathrm{kpc}\\ {R}_{D} & = & (0.77\pm 0.05)\mathrm{kpc}\\ {M}_{B} & = & \left(2.68\pm 0.07\right)\times {10}^{5}{M}_{\odot }\\ {M}_{D} & = & \left(1.9\pm 0.3\right)\times {10}^{7}{M}_{\odot }\\ {M}_{\mathrm{total}} & = & \left(2.0\pm 0.3\right)\times {10}^{7}{M}_{\odot }.\end{array}\end{eqnarray*}$

From these numbers, we calculated a mass-to-light ratio of γ = 73.5 ± 10.6. The implied (1.1 ± 0.2) × 10⁶ M_⊙ within 300 pc of the progenitor's center is one order of magnitude smaller than the presumed minimum mass of UFDs.

Our simulations of the OCS's tidal debris show an unbounded and heavily disrupted progenitor remnant at a current sky position of around (l, b) = ((264.9 ± 2.9)°, ((43.6 ± 2.8)°), or (α, δ) = ((166.0 ± 0.9)°, ( − 11.1 ± 2.5)°). Further studies of the simulated tidal stream suggest that most of the mass of the OCS (especially its dark matter) may reside in its tails, making them optimal candidates for indirect dark matter detection experiments.

While we provide a best-fit measurement of the OCS's mass and radial profile, it should be noted that this is only a preliminary fit. There still exist a significant number of variables that need to be addressed before we can definitively measure these stream properties. The most important of these is the gravitational influence of the LMC. Erkal et al. (2019) used perturbations in the southern tail of the OCS to place the mass of the LMC at around ${1.38}_{-0.24}^{+0.27}\times {10}^{11}{M}_{\odot }$ , roughly 10% of the Milky Way's mass. It is thus necessary for us to include the gravitational contributions of the LMC in order to properly simulate the infall of the OCS's progenitor.

We also wish to implement a more rigorous method by which to define the orbit of the OCS. In our optimizations, we set a constant orbit using the stellar data from Newberg et al. (2010), assuming the tidal stream itself is representative of the progenitor's orbit. The northern leading tail has a lower energy than the progenitor dwarf galaxy and thus is expected to trace out a different orbital path. In future work, we plan to fit the orbit simultaneously with the progenitor's mass and radial profile. This will also require us to verify that it is indeed possible to simultaneously fit the five dwarf-galaxy parameters and five parameters describing its orbit from fitting simulations to tidal debris.

In future papers, we also plan on evaluating the possible sources of error intrinsic to the N-body simulations themselves as well as the simplifying assumptions we have in this paper. We will explore how different Galactic models representing the Milky Way shift the fitted dwarf parameters and whether different assumed dwarf-galaxy radial profiles affect the best-fit dark matter mass.

This work was supported by NSF grant AST19-08653; the NASA/NY Space Grant; contributions made by the Marvin Clan, Babette Josephs, and Manit Limlamai; and the 2015 Crowd Funding Campaign to support Milky Way research.

Funding for the SDSS and SDSS-II has been provided by the Alfred P. Sloan Foundation, the Participating Institutions, the National Science Foundation, the US Department of Energy, the National Aeronautics and Space Administration, the Japanese Monbukagakusho, the Max Planck Society, and the Higher Education Funding Council for England. The SDSS website is http://www.sdss.org/.

The SDSS is managed by the Astrophysical Research Consortium for the Participating Institutions. The Participating Institutions are the American Museum of Natural History, Astrophysical Institute Potsdam, University of Basel, University of Cambridge, Case Western Reserve University, University of Chicago, Drexel University, Fermilab, the Institute for Advanced Study, the Japan Participation Group, Johns Hopkins University, the Joint Institute for Nuclear Astrophysics, the Kavli Institute for Particle Astrophysics and Cosmology, the Korean Scientist Group, the Chinese Academy of Sciences (LAMOST), Los Alamos National Laboratory, the Max-Planck-Institute for Astronomy (MPIA), the Max-Planck-Institute for Astrophysics (MPA), New Mexico State University, Ohio State University, University of Pittsburgh, University of Portsmouth, Princeton University, the United States Naval Observatory, and the University of Washington.

Funding for the Sloan Digital Sky Survey IV has been provided by the Alfred P. Sloan Foundation, the U.S. Department of Energy Office of Science, and the Participating Institutions.

SDSS-IV acknowledges support and resources from the Center for High Performance Computing at the University of Utah. The SDSS website is www.sdss.org. SDSS-IV is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Brazilian Participation Group, the Carnegie Institution for Science, Carnegie Mellon University, Center for Astrophysics—Harvard & Smithsonian, the Chilean Participation Group, the French Participation Group, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU)/ University of Tokyo, the Korean Participation Group, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), National Astronomical Observatories of China, New Mexico State University, New York University, University of Notre Dame, Observatário Nacional / MCTI, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University.

This project used data obtained with the Dark Energy Camera (DECam), which was constructed by the Dark Energy Survey (DES) collaboration. Funding for the DES Projects has been provided by the DOE and NSF(USA), MISE(Spain), STFC(UK), HEFCE(UK), NCSA(UIUC), KICP(U. Chicago), CCAPP(Ohio State), MIFPA(Texas A&M), CNPQ, FAPERJ, FINEP (Brazil), MINECO(Spain), DFG(Germany) and the collaborating institutions in the Dark Energy Survey, which are Argonne Lab, UC Santa Cruz, University of Cambridge, CIEMATMadrid, University of Chicago, University College London, DES-Brazil Consortium, University of Edinburgh, ETH Zurich, Fermilab, University of Illinois, ICE (IEECCSIC), IFAE Barcelona, Lawrence Berkeley Lab, LMU Munchen and the associated Excellence Cluster Universe, University of Michigan, NOAO, University of Nottingham, Ohio State University, University of Pennsylvania, University of Portsmouth, SLAC National Lab, Stanford University, University of Sussex, and Texas A&M University.

This research has made use of the NASA/IPAC Infrared Science Archive, which is funded by the National Aeronautics and Space Administration and operated by the California Institute of Technology.

Facilities: CTIO:Blanco (DECam) - , IRSA - , SDSS. -

Appendix A: Incompleteness Corrections for SDSS Data

Using stars in the range −21° < Λ_OCS < −7° and 10° < Λ_OCS < 21° with 20.7 < g_corr < 21.7, we fit the stellar density of F-turnoff stars s(g_corr) to a linear model:

$\begin{eqnarray}&&s({g}_{\mathrm{corr}})={m}_{s}{g}_{\mathrm{corr}}+{b}_{s}.\end{eqnarray} \tag{ A1 }$

We exclude stars with −7° < Λ_OCS < 10° to avoid contamination from the Sagittarius Stream. As seen in Figure 2, the line that fits the on field is different from that of the off field, so we must fit them separately. Binning over 20 g_corr bins ([g₁, g₂,...,g₂₀]), we calculate the number of stars that fall within each bin, differentiating between the on field and off field. After normalizing these star counts, we use the "curve fit" algorithm from SciPy (Virtanen et al. 2020) to calculate the slope (m_on/off) and intercept (b_on/off) and their respective errors (δ m_on/off, δ b_on/off) for the on field and off field:

$\begin{eqnarray}\begin{array}{rcl}{m}_{\mathrm{on}}\pm \delta {m}_{\mathrm{on}} & = & (8.16\pm 1.62)\times {10}^{-3},\\ {m}_{\mathrm{off}}\pm \delta {m}_{\mathrm{off}} & = & (4.91\pm 1.68)\times {10}^{-3},\end{array}\end{eqnarray} \tag{ A2 }$

$\begin{eqnarray}\begin{array}{rcl}{b}_{\mathrm{on}}\pm \delta {b}_{\mathrm{on}} & = & -0.123\pm 0.034,\\ {b}_{\mathrm{off}}\pm \delta {b}_{\mathrm{off}} & = & -0.054\pm 0.036.\end{array}\end{eqnarray} \tag{ A3 }$

For each ith Λ_OCS bin in the on field subtending the Λ_OCS range ${{\rm{\Lambda }}}_{\min ,i}$ to ${{\rm{\Lambda }}}_{\max ,i}$ , we assume each completed bin in (Λ, g_corr) space has the form of a trapezoidal prism bounded by the fitted planes described in Equations (A2)and (A3). This makes the total volume of such a bin equal to

$\begin{eqnarray}&&{V}_{i}=(21.4{m}_{s}+{b}_{s})({{\rm{\Lambda }}}_{\max ,i}-{{\rm{\Lambda }}}_{\min ,i}).\end{eqnarray} \tag{ A4 }$

To calculate the volume of the bin that is actually filled, we slice this volume using the curve described in Equation (4). The infinitesimal volume dv of a trapezoidal cross-section with width dΛ is given by

$\begin{eqnarray}&&{dv}=\displaystyle \frac{1}{2}({m}_{s}({g}_{b}({\rm{\Lambda }})+20.7)+2{b}_{s})({g}_{b}({\rm{\Lambda }})-20.7)d{\rm{\Lambda }}.\end{eqnarray} \tag{ A5 }$

Substituting in g_b(Λ), we find that the filled volume of the ith bin is

$\begin{eqnarray}\begin{array}{rcl}{v}_{i} & = & \displaystyle \frac{{m}_{s}{a}_{g}^{2}}{10}\left({{\rm{\Lambda }}}_{\max ,i}^{5}-{{\rm{\Lambda }}}_{\min ,i}^{5}\right)\\ & & +\displaystyle \frac{{m}_{s}{a}_{g}{b}_{g}}{4}\left({{\rm{\Lambda }}}_{\max ,i}^{4}-{{\rm{\Lambda }}}_{\min ,i}^{4}\right)\\ & & +\displaystyle \frac{1}{3}\left({m}_{s}{a}_{g}{c}_{g}+\displaystyle \frac{{m}_{s}{b}_{g}^{2}}{2}+{b}_{s}{a}_{g}\right)\left({{\rm{\Lambda }}}_{\max ,i}^{3}-{{\rm{\Lambda }}}_{\min ,i}^{3}\right)\\ & & +\displaystyle \frac{1}{2}({m}_{s}{b}_{g}{c}_{g}+{b}_{s}{b}_{g})\left({{\rm{\Lambda }}}_{\max ,i}^{2}-{{\rm{\Lambda }}}_{\min ,i}^{2}\right)\\ & & +\left({m}_{s}\displaystyle \frac{{c}_{g}^{2}-{20.7}^{2}}{2}+{b}_{s}({c}_{g}-20.7)\right)\left({{\rm{\Lambda }}}_{\max ,i}-{{\rm{\Lambda }}}_{\min ,i}\right).\end{array}\end{eqnarray} \tag{ A6 }$

Dividing v_i by V_i gives us the filled ratio k_i of the bin:

$\begin{eqnarray}&&{k}_{i}=\displaystyle \frac{{v}_{i}}{{V}_{i}}.\end{eqnarray} \tag{ A7 }$

Performing error propagation, we get the error ${\delta }_{{k}_{i}}$ :

$\begin{eqnarray}\begin{array}{rcl}{\delta }_{{k}_{i}}^{2} & = & \displaystyle \frac{1}{{V}_{i}^{4}}\left(\left(\displaystyle \frac{{a}_{g}^{2}}{10}\left({{\rm{\Lambda }}}_{\max ,i}^{5}-{{\rm{\Lambda }}}_{\min ,i}^{5}\right)\right.\right.\\ & & +\displaystyle \frac{{a}_{g}{b}_{g}}{4}\left({{\rm{\Lambda }}}_{\max ,i}^{4}-{{\rm{\Lambda }}}_{\min ,i}^{4}\right)+\displaystyle \frac{1}{3}\left({a}_{g}{c}_{g}\right.\\ & & \left.+\displaystyle \frac{{b}_{g}^{2}}{2}\right)\left({{\rm{\Lambda }}}_{\max ,i}^{3}-{{\rm{\Lambda }}}_{\min ,i}^{3}\right)\\ & & +\displaystyle \frac{{b}_{g}{c}_{g}}{2}\left({{\rm{\Lambda }}}_{\max ,i}^{2}-{{\rm{\Lambda }}}_{\min ,i}^{2}\right)+\displaystyle \frac{{c}_{g}^{2}-{20.7}^{2}}{2}\\ & & \left.\times \left({{\rm{\Lambda }}}_{\max ,i}-{{\rm{\Lambda }}}_{\min ,i}\right)\right){V}_{i}\\ & & {\left.-21.4\left({{\rm{\Lambda }}}_{\max ,i}-{{\rm{\Lambda }}}_{\min ,i}\right){v}_{i}\right)}^{2}{\delta }_{m}^{2}\\ & & +\displaystyle \frac{1}{{V}_{i}^{4}}\left(\left(\displaystyle \frac{{a}_{g}}{3}\left({{\rm{\Lambda }}}_{\max ,i}^{3}-{{\rm{\Lambda }}}_{\min ,i}^{3}\right)+\displaystyle \frac{{b}_{g}}{2}\left({{\rm{\Lambda }}}_{\max ,i}^{2}-{{\rm{\Lambda }}}_{\min ,i}^{2}\right)\right.\right.\\ & & \left.+({c}_{g}-20.7)\left({{\rm{\Lambda }}}_{\max ,i}-{{\rm{\Lambda }}}_{\min ,i}\right)\right){V}_{i}\\ & & -{\left.\left({{\rm{\Lambda }}}_{\max ,i}-{{\rm{\Lambda }}}_{\min ,i}\right){v}_{i}\right)}^{2}{\delta }_{s}^{2}\end{array}\end{eqnarray} \tag{ A8 }$

After calculating the filled ratio, we approximate the corrected star count ( ${N}_{i}^{{\prime} }$ ) and errors ( ${\sigma }_{{N}_{i}^{{\prime} }}$ ) using the following formulas:

$\begin{eqnarray}&&{N}_{i}^{{\prime} }=\displaystyle \frac{{N}_{i}}{{k}_{i}},\end{eqnarray} \tag{ A9 }$

$\begin{eqnarray}&&{\sigma }_{{N}_{i}^{{\prime} }}=\displaystyle \frac{1}{{k}_{i}}\sqrt{{N}_{i}+{\left(\displaystyle \frac{{N}_{i}}{{k}_{i}}\right)}^{2}{\delta }_{{k}_{i}}^{2}}.\end{eqnarray} \tag{ A10 }$

This algorithm is repeated for each bin in the off field as well.

Appendix B: Calculating Mass per Turnoff Star Using Isochrones

As a sanity check for our previous calculation, we recalculate the baryonic mass per F-turnoff star using theoretical isochrones of NGC 5053. To perform this calculation, we first need to know three things about NGC 5053: its age, its metallicity ([Fe/H]), and its alpha abundance ([α/Fe]). From the literature, we find NGC 5053's age to be 12.5 ± 2.0 Gyr (Arellano Ferro et al. 2010), its [Fe/H] to be −2.27 dex (Harris 1996, 2010 edition), and its [α/Fe] to be 0.2 (Tang et al. 2018). We use isochrone data from the Dartmouth Stellar Evolution Database (DSED; Dotter et al. 2008), selecting the isochrones that best fit the globular cluster's age and chemical abundances. For our calculations, we use the isochrone with [Fe/H] = −2.49 dex as it was the closest metallicity. However, we will later demonstrate that this deviation does not greatly impact our result or errors by redoing the calculation assuming [Fe/H] = −1.98 dex.

The initial mass function (IMF) we implement comes from Kroupa (2001) and has the form

$\begin{eqnarray}\varepsilon (m)=\left\{\begin{array}{ll}{{Am}}^{-{\alpha }_{0}} & 0.01\leqslant m\lt 0.08\\ A{\left(0.08\right)}^{{\alpha }_{1}-{\alpha }_{0}}{m}^{-{\alpha }_{1}} & 0.08\leqslant m\lt 0.5\\ A{\left(0.5\right)}^{{\alpha }_{2}-{\alpha }_{1}}{\left(0.08\right)}^{{\alpha }_{1}-{\alpha }_{0}}{m}^{-{\alpha }_{2}} & 0.5\leqslant m\lt 1.0\\ A{\left(1.0\right)}^{{\alpha }_{3}-{\alpha }_{2}}{\left(0.5\right)}^{{\alpha }_{2}-{\alpha }_{1}}{\left(0.08\right)}^{{\alpha }_{1}-{\alpha }_{0}}{m}^{-{\alpha }_{3}} & m\gt 1.0,\end{array}\right.\end{eqnarray} \tag{ B1 }$

where m is the mass of the initial progenitor star in solar masses, A is the normalization constant, α₀ = 0.3 ± 0.4, α₁ = 1.3 ± 0.3, α₂ = 2.3 ± 0.1, and α₃ = 2.3 ± 0.2. The uncertainties in these powers are one-standard-deviation errors.

The formula we use to calculate the stellar mass per turnoff star is fairly straightforward. We take the total stellar mass within the cluster and divide it by the number of F-turnoff stars we find in our isochrone:

$\begin{eqnarray}&&{m}_{\mathrm{FT}}=\displaystyle \frac{{M}_{\mathrm{stars}}}{{N}_{\mathrm{FT}}}=\displaystyle \frac{\int m\varepsilon (m){dm}}{\int \varepsilon (m){dm}}.\end{eqnarray} \tag{ B2 }$

Note that because the IMF is in both our numerator and denominator, the normalization constant A cancels out of our formula, making the normalization we select arbitrary. The easiest quantity to calculate is N_FT, the number of F-turnoff stars in our isochrone. We say a star is an F-turnoff star if its ${\left(g-i\right)}_{0}$ color falls between 0.12 and 0.47, and its g₀ magnitude within the g₀ F-turnoff range of the isochrone, using the same convention developed in Section 2.4 to define the range.

We define a function we call the F-check function (F_C(m)), which outputs 1 if the input initial mass m falls within our F-turnoff range as a result of our isochrone model and 0 otherwise:

$\begin{eqnarray}{F}_{C}(m)=\left\{\begin{array}{cl}1 & {\rm{m}}\ \mathrm{is}\ {\rm{F}}-\mathrm{turnoff}\ \mathrm{star}\\ 0 & \mathrm{otherwise}.\end{array}\right.\end{eqnarray} \tag{ B3 }$

Given this function and a list of initial masses from the isochrone model ([m₁, m₂,...,m_N]), the formula for counting the number of F-turnoff stars becomes straightforward:

$\begin{eqnarray}\begin{array}{rcl}{N}_{\mathrm{FT}} & = & \displaystyle \int {F}_{C}(m)\varepsilon (m){dm}\\ & \simeq & \sum _{i=1}^{N}{F}_{C}({m}_{i})\varepsilon ({m}_{i}){\rm{\Delta }}{m}_{i},\end{array}\end{eqnarray} \tag{ B4 }$

where Δm_i is the width of the i^th mass bin defined below:

$\begin{eqnarray}{\rm{\Delta }}{m}_{i}=\left\{\begin{array}{ll}\,{m}_{2}-{m}_{1} & i=1\\ {m}_{N}-{m}_{N-1} & i=N\\ \displaystyle \frac{{m}_{i+1}-{m}_{i-1}}{2} & \mathrm{otherwise}.\end{array}\right.\end{eqnarray} \tag{ B5 }$

Calculating the total stellar mass, on the other hand, is a bit more complicated. For stars whose initial mass is larger than the largest mass in the isochrone model (m > m_N), their current mass is only but a small fraction of their original mass. This is because such stars have turned into white dwarfs, neutron stars, or black holes, ejecting most of their original mass through their planetary nebulae or supernovae. This ejected mass does not necessarily disappear from the globular cluster but could be accelerated with enough energy to push it past the cluster's escape velocity. We know the mass of NGC 5053 is (5.37 ± 1.32) × 10⁴ M_⊙, and from our previous fit, we find the scale Plummer radius of NGC 5053 to be 11.7 ± 0.5 pc. Performing a simple back-of-the-envelope calculation of the cluster's highest escape velocity $\left({v}_{e,\max }=\sqrt{\tfrac{2{GM}}{a}}\right)$ gives an escape velocity of ∼6.4 km s⁻¹. Because planetary nebulae expand from their center stars with speeds between 20 and 40 km s⁻¹ (Schönberner et al. 2014), and the shocks of supernovae reach speeds of several thousand km s⁻¹ (Hovey et al. 2015, 2016), it is safe to assume that all ejecta of stellar transitions is not gravitationally bound by the cluster and can effectively be ignored in the mass calculation.

Under these assumptions, the total stellar mass can be calculated using the following formula:

$\begin{eqnarray}\begin{array}{rcl}{M}_{\mathrm{stars}} & = & {\displaystyle \int }_{0.01}^{{m}_{N}+\tfrac{{\rm{\Delta }}{m}_{N}}{2}}m\varepsilon (m){dm}\\ & & +{\displaystyle \int }_{{m}_{N}+\tfrac{{\rm{\Delta }}{m}_{N}}{2}}^{100.0}{M}_{R}(m,[{Fe}/H])\varepsilon (m){dm},\end{array}\end{eqnarray} \tag{ B6 }$

where M_R(m, [Fe/H]) is the remnant mass function, a function that outputs the stellar remnant mass of a star given its initial progenitor mass. We use the models fitted in Cummings et al. (2018) to calculate the remnant mass of white dwarfs and the models from Fryer et al. (2012) to account for the masses in neutron stars and black holes. We cut off our mass calculation at an initial mass of 100 solar masses as the remnant masses past that point become more uncertain due to mass loss and pair-instability supernovae, which leave behind no remnant (Fryer et al. 2012). Combining these models, we find the mass of the remnant in solar masses is given by the following formula:

$\begin{eqnarray}{M}_{R}(m,[\mathrm{Fe}/{\rm{H}}])=\left\{\begin{array}{cl}(0.080\pm 0.016)m+(0.489\pm 0.030) & m\lt 2.85\\ (0.187\pm 0.061)m+(0.184\pm 0.199) & 2.85\leqslant m\lt 3.60\\ (0.107\pm 0.016)m+(0.471\pm 0.077) & 3.60\leqslant m\lt 9.0+0.9[\mathrm{Fe}/{\rm{H}}]\\ 1.36 & 9.0+0.9[\mathrm{Fe}/{\rm{H}}]\leqslant m\lt 11.0\\ 1.1+0.2{e}^{(m-11.0)/4.0}-(2.0+{10}^{[\mathrm{Fe}/{\rm{H}}]}){e}^{0.4(m-26.0)} & 11.0\leqslant m\lt 30.0\\ {\rm{\Theta }}(m,[\mathrm{Fe}/{\rm{H}}]) & 30.0\leqslant m\lt 50.0\\ \max (1.8+0.04(90-m),{\rm{\Theta }}(m,[\mathrm{Fe}/{\rm{H}}])) & 50.0\leqslant m\lt 90.0\\ \max (1.8+{\mathrm{log}}_{10}(m-89),{\rm{\Theta }}(m,[\mathrm{Fe}/{\rm{H}}])) & m\gt 90.0,\end{array}\right.\end{eqnarray} \tag{ B7 }$

where

$\begin{eqnarray}&&\begin{array}{l}{\rm{\Theta }}(m,[\mathrm{Fe}/{\rm{H}}])=\min (33.35+(4.75\\ \ \ +\ 1.25\times {10}^{[\mathrm{Fe}/{\rm{H}}]})(m-34),\\ \ \ m-{10}^{[\mathrm{Fe}/{\rm{H}}]/2}(1.3m-18.35)).\end{array}\end{eqnarray} \tag{ B8 }$

To propagate the errors in this calculation, we recompute the stellar mass per turnoff star several times, changing the parameters with errors by their respective error. We have 11 quantities with errors in this calculation: the age, the four powers in the IMF, and the six parameters from the white dwarf remnant formula derived by Cummings et al. (2018). For each parameter, we either add its error, subtract its error, or leave the parameter unchanged. We calculate the mass per turnoff star for each permutation (3¹¹ total) and treat each one as a separate data point. We then take the average value and calculate the standard deviation of all the points as the error. Using this method, we find that the stellar mass per F-turnoff star we expect to measure in NGC 5053 is 13.5 ± 2.7M_⊙ per F-turnoff star. Using an isochrone with [Fe/H] = –1.98 dex, we get an answer of 14.9 ± 3.5M_⊙ per turnoff star. Both of these numbers are exceptionally close to the value we measure in Section 2.4 and well within the expected errors.

Estimate of the Mass and Radial Profile of the Orphan–Chenab Stream's Dwarf-galaxy Progenitor Using MilkyWay@home

Article metrics

Author affiliations

ORCID iDs

Dates

Abstract

1. Introduction

2. Stellar Data of the OCS

2.1. SDSS Data

2.2. DEC Data

2.3. Distribution of Stream Stars along Λ_OCS

2.4. Estimation of Total Stellar Mass

2.4.1. Systematic Error from Using a Redder Globular Cluster

2.5. Beta Dispersion as a Function of Position along the Stream

3. N-body Simulations

3.1. Progenitor Creation

3.2. Accelerations on Each Body

3.3. Phase-space Evolution Method

3.4. Likelihood Calculation

4. Optimization Procedure

4.1. Differential Evolution Genetic Algorithm

4.2. Time-step Cut in Parameter Space

5. Results

5.1. Using Marginalized Parameter Sweeps to Estimate Error

5.2. Full Result

6. Discussion

6.1. An Extremely Low-mass Diffuse Progenitor

6.2. The Location of the Remnant Progenitor

6.3. The Distribution of Dark Matter along the Stream

6.4. The Effect of the LMC on the OCS

6.5. Sources of Systematic Error

7. Conclusion and Future Research

Appendix A: Incompleteness Corrections for SDSS Data

Appendix B: Calculating Mass per Turnoff Star Using Isochrones

Estimate of the Mass and Radial Profile of the Orphan–Chenab Stream's Dwarf-galaxy Progenitor Using MilkyWay@home

Article metrics

Share this article

Author affiliations

ORCID iDs

Dates

Abstract

1. Introduction

2. Stellar Data of the OCS

2.1. SDSS Data

2.2. DEC Data

2.3. Distribution of Stream Stars along ΛOCS

2.4. Estimation of Total Stellar Mass

2.4.1. Systematic Error from Using a Redder Globular Cluster

2.5. Beta Dispersion as a Function of Position along the Stream

3. N-body Simulations

3.1. Progenitor Creation

3.2. Accelerations on Each Body

3.3. Phase-space Evolution Method

3.4. Likelihood Calculation

4. Optimization Procedure

4.1. Differential Evolution Genetic Algorithm

4.2. Time-step Cut in Parameter Space

5. Results

5.1. Using Marginalized Parameter Sweeps to Estimate Error

5.2. Full Result

6. Discussion

6.1. An Extremely Low-mass Diffuse Progenitor

6.2. The Location of the Remnant Progenitor

6.3. The Distribution of Dark Matter along the Stream

6.4. The Effect of the LMC on the OCS

6.5. Sources of Systematic Error

7. Conclusion and Future Research

Appendix A: Incompleteness Corrections for SDSS Data

Appendix B: Calculating Mass per Turnoff Star Using Isochrones

2.3. Distribution of Stream Stars along Λ_OCS