This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.

The Milky Way's Shell Structure Reveals the Time of a Radial Collision

, , , and

Published 2020 October 20 © 2020. The American Astronomical Society. All rights reserved.
, , Citation Thomas Donlon II et al 2020 ApJ 902 119 DOI 10.3847/1538-4357/abb5f6

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/902/2/119

Abstract

We identify shell structures in the Milky Way for the first time. We find two shells in the Virgo Overdensity region and two shells in the Hercules Aquila Cloud region using Sloan Digital Sky Survey, Gaia, and LAMOST data. These shell stars are a subset of the substructure previously identified as the Virgo Radial Merger (VRM). Timing arguments for these shells indicate that their progenitor dwarf galaxy passed through the Galactic center 2.7 ± 0.2 Gyr ago. Based on the time of collision, it is also possible that the VRM is related to the phenomenon that created phase-space spirals in the vertical motion of the disk and/or the Splash and could have caused a burst of star formation in the inner disk. We analyze phase mixing in a collection of radial merger N-body simulations and find that shell structure similar to that observed in Milky Way data disappears by 5 Gyr after collision with the Galactic center. The method used to calculate the merger time of the VRM was able to reliably recover the correct merger times for these simulations. Previous work supports the idea that the VRM and the Gaia Sausage/Gaia–Enceladus Merger are the same. However, the Gaia Sausage is widely believed to be 8–11 Gyr old. The disparate ages could be reconciled if the larger age is associated with an infall time when the progenitor crossed the virial radius; we do not constrain the time at which the progenitor became bound to the Milky Way. Alternatively, the Gaia Sausage could be younger than previously thought.

Export citation and abstract BibTeX RIS

1. Introduction

Overdensities in the Milky Way's stellar halo provide information about the shape of the halo's gravitational potential and indicate that the outer portions of the Galaxy are not in equilibrium (Ivezić et al. 2012). One such overdense feature in the halo is the Virgo Overdensity (VOD), which was originally identified by Vivas et al. (2001) as an overdensity of RR Lyrae stars (RRLs) in the Virgo constellation. The VOD contains a wealth of stellar substructure, including the Sagittarius Stream (Ibata et al. 2001), the Virgo Stellar Stream (Duffau et al. 2006), the Parallel Stream (Sohn et al. 2016; Weiss et al. 2018b), the Perpendicular Stream (Weiss et al. 2018b), associated moving groups (Duffau et al. 2014; Vivas et al. 2016), and other minor structures, such as the Cocytos Stream (Grillmair 2009; Donlon et al. 2019). A thorough history and background of the VOD can be found in the introduction of Donlon et al. (2019).

The Hercules Aquila Cloud (HAC; Belokurov et al. 2007) is another overdensity in the Milky Way's stellar halo. A common origin for the VOD and HAC was proposed by Simion et al. (2019), who noted that the VOD and the southern portion of the HAC were on opposite sides of the Galaxy and had similar kinematics. Simion et al. (2019) claimed that the VOD and HAC are connected to the Gaia Sausage, a structure in galactocentric velocity space that is characterized by a wide dispersion in radial velocity and a narrow dispersion in rotational velocity (Belokurov et al. 2018). This signature velocity structure is thought to have been caused by a major merger in the Milky Way's early history. The two overdensities would then be the result of a massive, ancient, and highly radial merger event that has since been dubbed the Gaia Sausage Merger (GSM; Simion et al. 2019), also known as the Gaia–Enceladus Merger (Helmi et al. 2018). We choose to refer to the merger event as the GSM for the remainder of this work, though it should be acknowledged that the merger event was independently discovered by multiple groups. While Helmi et al. (2018) characterized the Gaia–Enceladus Merger as a retrograde halo structure, Belokurov et al. (2019) classified the retrograde portion of the material in the halo as the "Sequoia," not belonging to the Sausage.

The GSM is thought to have occurred between 8 and 11 Gyr ago (Belokurov et al. 2018; Helmi et al. 2018; Simion et al. 2019). The main argument for this age of the GSM is that the merger is responsible for the creation of the thick disk, which is dated by an end of star formation in thick disk stars between 8 and 11 Gyr ago. If the GSM is indeed responsible for the puffing up of the thick disk, then the time that the thick disk was quenched would correspond to the time at which the GSM heated the disk. This timeline suggests a quiescent Milky Way, where our Galaxy experienced a few massive mergers early on in its history but was mostly collision-free until recent times.

Around the same time that the VOD was connected to the GSM, Donlon et al. (2019) showed that a single orbit passed through the two largest moving groups identified by Duffau et al. (2014) in the VOD. This orbit was highly radial, with an apogalacticon of 26 kpc and a perigalacticon of 0.3 kpc. Donlon et al. (2019) evolved an N-body simulation of a single Sagittarius-sized dwarf galaxy along this radial orbit for 2 Gyr and found that it simultaneously fit material previously attributed to the Perpendicular Stream, the Parallel Stream, the Virgo Stellar Stream, all of the associated moving groups, and some material previously thought to belong to the Sagittarius Stream. Thus, a single radial structure explained the majority of the substructure in the VOD. Donlon et al. (2019) named the radial merger event that created the VOD the Virgo Radial Merger (VRM), which is characterized as being responsible for a collection of stars in the VOD on radial orbits with a wide range of energies. In this work, we use the term "VRM" to refer to the merger event between the progenitor of the VOD and the Milky Way.

The VRM has been connected with other halo substructure besides the VOD. Li et al. (2016) identified the Eridanus–Phoenix Overdensity (EPO) in the south Galactic halo and noted that the VOD, HAC, and EPO all lie on a single polar orbit. Donlon et al. (2019) found that a simulated VRM left debris in the regions of the VOD, HAC, and EPO and proposed that a single radial merger could be responsible for all three overdensities. In that case, the overdensities would not share a single polar orbital plane but would lie along the three "spokes" of a single trefoil structure with perigalacticons within a kpc of the Galactic center.

Figure 5 of Donlon et al. (2019) shows that the N-body simulation of the VRM left material in the local solar region that was nearly identical to the characteristic shape of the Gaia Sausage. Additionally, the VRM left debris in the VOD and HAC, which is the same material that Simion et al. (2019) claimed to be GSM debris. The halo is thought to be composed primarily of debris from a single radial merger event (Deason et al. 2013; Belokurov et al. 2018; Deason et al. 2019). If this is the case, it is possible that the VRM and the GSM are the same, and that the single merger event is responsible for the majority of mass in the stellar halo. The counterargument comes from the timeline; Donlon et al. (2019) used simulations to show that the VRM could recreate the observed debris in the local solar neighborhood, the VOD, and the HAC if it occurred just 2 Gyr ago. This is 6–9 Gyr after the hypothesized time of the GSM. In this work, we show that the coherent structures associated with the VRM could not have survived for 8–11 Gyr. Thus, if the VRM and GSM are the same, the latter must have occurred more recently than originally thought.

Motivated by the disparity between the proposed ages of the VRM and the GSM, we seek a new method to identify the age of the VRM using shell substructure. Shells are common in elliptical galaxies and widely thought to be the artifacts of major radial merger events (Hernquist & Quinn 1988; Sanderson & Helmi 2013). They are named for their appearance as thin, extended "umbrella"-like groups of stars at a uniform galactocentric radius. Shells arise at the turning points in the orbits of stars in the debris field of a radial merger event, and the stars in the shells should therefore have near-zero galactocentric radial velocity. We are not able to measure the velocities of material in these shells outside of the Milky Way, which would help confirm this interpretation. However, such a full kinematic survey of shell stars is possible within the Milky Way.

In this work, we identify shell substructure in the Milky Way for the first time, and we argue that these shells are indeed associated with the VRM and therefore a radial merger event. Through analysis of N-body simulations of radial mergers, we develop a metric to describe how radial mergers evolve and the timescales over which this occurs. We find that phase mixing places an upper limit of 5 Gyr on locating shell substructure in this radial merger; after that, the shells cannot be isolated in any of the simulations. Rewinding the particles in the VRM's shell substructure back to the time when the progenitor fell through the center of the Milky Way allows us to calculate an infall time of 2.7 ± 0.2 Gyr ago. This result is similar to the previously hypothesized age of the VRM from Donlon et al. (2019).

2. Data

We construct two sets of observational data in the Milky Way halo: one for RRLs and the other for blue horizontal branch stars (BHBs). Each data set contains full 6D phase-space information for each star. This 6D information is calculated from distances derived from the presumed absolute magnitudes for the standard candles, proper motions obtained from the Gaia Data Release 2 (DR2; Gaia Collaboration et al. 2016, 2018), and radial velocities determined from spectra obtained from the Sloan Digital Sky Survey Data Release 14 (SDSS DR14; Blanton et al. 2017; Abolfathi et al. 2018) or the LAMOST Experiment for Galactic Understanding and Exploration (LEGUE; Deng et al. 2012). We utilized the method outlined in Johnson & Soderblom (1987) to calculate 3D galactocentric velocities from this information.

Many recent analyses of the stellar halo were performed in the local solar region using only Gaia data. By utilizing SDSS, we restrict ourselves to a much smaller region of the sky (notably missing the EPO) but gain more accurate radial velocities at larger distances than would be possible using Gaia data alone. Additionally, using SDSS as well as Gaia allows us to utilize BHBs as tracers; selecting BHBs and calculating their distances requires spectroscopy and accurate ugriz photometry.

The RRL data set consists of objects classified as RRLs in Gaia DR2 plus stars classified as RRLs in the Liu et al. (2020) LEGUE catalog. We follow the process outlined in Iorio & Belokurov (2019) in order to obtain the most RRLs possible and ensure clean data. This process consists of taking stars from the Gaia DR2 vari_rrlyrae and vari_classifier_result tables combined with data from the general gaia_source table based on the source_id of each star. The apparent magnitudes of the RRLs in the variable star tables were calculated by modeling the light curves with a truncated Fourier series in order to determine the intensity-averaged mean magnitudes (Neeley et al. 2019; Clementini et al. 2019). The apparent magnitudes were then used to calculate the distances to each star, where the absolute magnitudes of all RRLs were assumed to be 0.63 in the Gaia G band (Muraveva et al. 2018).

Following the procedure used in Iorio & Belokurov (2019), we restrict our data set based on two selection criteria; we enforce that astrometric_excess_noise < 0.25 and phot_bp_rp_excess_factor < 1.5. The former, astrometric_excess_noise, describes the disagreement between observations of a source and the Gaia astrometric model and is likely insignificant for values below 2 (Lindegren et al. 2012). The latter, phot_bp_rp_excess_factor, is an estimation of background and contamination issues affecting the Gaia BP and RP photometry (Riello et al. 2018). Both of these criteria are found in the main gaia_source table. Iorio & Belokurov (2019) also enforced a constraint that the reddening must be low (E(B − V) < 0.8). We do not enforce this cut, as we did not find any portion of our two fields to have an average reddening E(B − V) > 0.9 due to the fact that our fields are not positioned near the disk. We then matched our Gaia sample to all stars with spectra in SDSS DR14 by using the TOPCAT software (Taylor 2005) to perform an on-sky match with a maximum tolerance of 1'' separation.

In order to maximize the number of available RRLs, we also include stars that were identified as RRLs by Liu et al. (2020) using data from the LEGUE and SEGUE surveys. In the case of a star being identified as an RRL in both the Gaia data and the Liu et al. (2020) catalog, we opted to use the Gaia data. The Liu et al. (2020) catalog adds 3244 unique RRLs to our data set. Figure 1 shows our final RRL data set, containing 6023 stars.

Figure 1.

Figure 1. On-sky plot of our full 6D data set. Red points represent RRLs, and blue points represent BHBs. The extents of the VOD (solid line) and HAC (dashed line) fields are indicated. There are 6023 RRLs and 5743 BHBs in this figure, for a total of 11,766 stars between the two data sets. There are 1553 stars in the VOD region and 1480 stars in the HAC region.

Standard image High-resolution image

We select BHBs for our second data set from SDSS DR14 photometry and spectroscopy. We use extinction-corrected color cuts of −0.25 < (g − r)0 < 0 and 0.8 < (u − g)0 < 1.5 to identify BHBs (Yanny et al. 2000). The color cuts select a specific temperature range of BHBs and eliminate white dwarfs and QSOs. We also employ a surface gravity cut of 0 < log gWBG < 3.5 in order to eliminate blue straggler contamination (Newberg et al. 2009). The distance to each star was calculated using the absolute magnitude relation for BHBs (Deason et al. 2011):

Equation (1)

We matched these data to Gaia DR2 proper motions in the same way that we matched the RRL data set. Note that the color selection we used for BHBs overlaps with the possible range of colors of RRLs, so several stars classified as RRLs in Gaia also satisfied the BHB color cut. If a star was found in both data sets, the RRL data were retained, as the distance estimate for an RRL variable star (∼4% distance error at a heliocentric distance of 20 kpc) is more accurate than the photometric distance estimation for a BHB (∼10% distance error). After removing the duplicate stars, our BHB data set contained 5743 stars, for a total of 11,766 stars between the two data sets.

Figure 1 shows the VOD and HAC regions, where we expect to find primarily VRM debris, in relation to the SDSS footprint. The two regions are selected based on visible overdensities in RRL and BHB populations in the north Galactic cap and previous literature. We define the VOD region to be 175° < R.A. < 210° and −10° < decl. < 20°, similar to the extent of the region given by Vivas et al. (2016) but extending to slightly higher decl. The HAC canonically exists in two parts: the HAC in the north (b > 0°) and the HAC in the south (b < 0°). Our data are limited to the SDSS footprint in the north Galactic cap, so we will only be considering the HAC region in the north. We define the extent of the HAC in the north to be 20° < l < 75° and 30° < b < 55°. This is similar to the HAC region defined by Martin et al. (2018). In order to minimize thick disk contamination, we chose to only include data above b = 30°, whereas Martin et al. (2018) included data above b = 20°. From this point forward, we will refer to the north portion of the HAC as the "HAC region."

There are 908 RRLs (8% of total RRLs in the sample) and 645 BHBs (5% of sample BHBs) in the VOD region. There is a similar number of stars in the HAC region, with 675 RRLs (7% of sample RRLs) and 805 BHBs (7% of sample BHBs). This gives us 1553 stars in total in our VOD data set and 1480 stars in total in our HAC data set.

We derive our 6D phase-space information for each star from standard candle estimates of distances, position on the sky, SDSS or LEGUE radial velocity, and Gaia proper motions. Errors in radial velocity and Gaia proper motions are given in the survey data. We assume negligible errors in position on the sky (σα  = σδ  = 0). Distance errors in RRLs are calculated using an absolute magnitude error of ±0.08 in the Gaia G band (Muraveva et al. 2018), and distance errors in BHBs as measured using SDSS photometry were approximated as 10% of the distance value (Martin et al. 2018). We calculate errors in galactocentric radius r, galactocentric radial velocity vr , and angular momentum Lz with standard error propagation techniques. The average errors over both data sets are σr  = ±0.6 kpc, ${\sigma }_{{v}_{r}}=\pm 21$ km s−1, and ${\sigma }_{{L}_{z}}=\pm 222$ kpc km s−1.

3. Identifying Shells in the Milky Way

Radial collisions result in "shells" on opposite sides of the host galaxy's center at the location of each apogalacticon (Hernquist & Quinn 1988). In the idealized case, where the host potential is spherical and the progenitor's orbit is exactly radial, the corresponding shells will be spherically symmetric, or "umbrella-shaped." In galaxies such as the Milky Way, the disk is enough to break spherical symmetry. It is also widely believed that dark halos are aspherical. The shells that arise in systems such as these are, in general, less sharply defined and no longer lie at surfaces of constant galactocentric radius (Hernquist & Quinn 1988; Sanderson & Helmi 2013). Caustic structures that form in asymmetric potentials or from progenitors with nonzero angular momentum will typically have thicker "blurred" shells but nevertheless share similar kinematic properties with their spherical counterparts (Sanderson & Helmi 2013). In the Milky Way, we expect to identify shells that are not spherically symmetric because the Milky Way's potential is not spherical, as is obvious from the presence of a significant disk component.

In this work, we use the terminology laid out in Sanderson & Helmi (2013) regarding shell structure. The terms are defined from a top-down perspective. "Radial merger" describes a merger event with low angular momentum that will create shell structure. "Caustic structure" describes the entirety of the material with similar energy values in a radial merger that will cause a particular shell. A caustic structure is not only limited to material in a shell but also includes material that is falling from a shell back toward the host galaxy or moving away from the host galaxy to form a shell. "Caustic surface" refers to the rvr phase-space surface along which material in a caustic structure lies. "Shell," or "shell (sub)structure," refers to the portion of the caustic surface with radial velocities near zero where stars bunch up to form the characteristic "umbrella" in position space.

One way to identify shell substructure is to plot the $r-{v}_{r}$ phase-space distribution of stars, where r and vr are the galactocentric radius and radial velocity, respectively. In these coordinates, shells appear as parabolic or "candy corn"-shaped structures (Hernquist & Quinn 1988; Sanderson & Helmi 2013). However, phase mixing on the order of even a few Gyr wraps the merger debris enough that the shells begin to overlap in phase space. Based on the velocity errors in our data sets, we do not have the necessary resolution to identify caustic substructure in the Milky Way using a phase-space density approach.

We will instead identify caustic structure in stars with near-zero radial velocity, vr , as measured from the Galactic center. By selecting stars in a small vr range around zero, we preferentially select stars at the surface of shells and remove stars in other structures at the same distances, as well as eliminate the interior portions of the radial merger debris, where the stars are not currently in shells. This technique allows us to avoid fitting models to our velocity data, which have larger errors than the position data. In a histogram of the radial position from the Galactic center, r, of merger stars, peaks arise at the surface of each shell (Hernquist & Quinn 1988). This works in a spherically symmetric potential but would be problematic in an axisymmetric potential, as any particular shell will not necessarily be located at the same r everywhere in an axisymmetric potential (Hernquist & Quinn 1988; Sanderson & Helmi 2013). However, by restricting our analysis to a small region of the sky, any particular shell in that region will be located within a small range of galactocentric radii, which allows us to recover the density peak. Since shells arise where the member stars are bunched up at apogalacticon, the vr of stars in a shell will be near zero.

Other stellar structures with small vr may be located at the same distances as shells in the sky, notably structures on nearly circular orbits. Shells must have small angular momentum because they are radial structures. In an axisymmetric potential, we typically only consider Lz angular momentum, as Lx , Ly , and L are not true integrals of the motion. By cutting the data so that $| {L}_{z}| \lt 500$ kpc km s−1, we eliminate structures on nonradial orbits while retaining shell structure. In the VOD region, this eliminates Sagittarius Stream member stars from our data at higher distances. It should be noted that further angular momentum cuts may be required in data sets in other parts of the sky, as objects on highly polar, nonradial orbits will still have small Lz but are not in shells.

3.1. Developing a Shell Model

We utilize the analytical phase-space density model for a caustic structure derived in Sanderson & Helmi (2013),

Equation (2)

where rs is the distance of the shell from the Galactic center, κ describes the curvature of the caustic surface, δr is the characteristic width of the shell, and Ωs is the solid angle spanned by the shell. Sanderson & Helmi (2013) also included a term for the radial velocity of the debris at the surface of the shell in their model, vs . This term is directly proportional to the total angular momentum of the progenitor galaxy. The total angular momentum of the VRM debris was shown to be very small by Donlon et al. (2019), so we ignore the vs term in our analysis. It should be noted that we expect our shells to be relatively thick due to the axisymmetric potential of the Milky Way. The curvature of the caustic surface approximately depends on the strength of the underlying potential,

Equation (3)

where $g({\boldsymbol{r}})$ is the gravitational force toward the Galactic center at some position ${\boldsymbol{r}}$ with respect to the center of the Galaxy.

In order to isolate shell stars, we wish to describe the density of stars with some $| {v}_{r}| \lt {v}_{c}$ in a caustic surface, where vc is chosen to select as large and pure a sample of shell stars as possible. This selection will allow us to determine the corresponding shape of the shells in our radial density histograms. We integrate over our velocity range (−vc  < vr  < vc ) to find our density function:

Equation (4)

We do not require a complete computation of this expression, as we simply wish to determine its approximate form. Assuming that vc is small, we approximate the integral as a rectangle of width 2vc and height f(r,0):

Equation (5)

Evaluating this expression yields

Equation (6)

which is a Gaussian distribution, where the overall amplitude of the density peak depends linearly on our value of vc . Thus, it is reasonable to approximate the radial density of stars in a caustic structure as a Gaussian for sufficiently small values of vc .

Figure 2 shows a comparison of this Gaussian approximation and the original model. Note the slight difference in the radial location of the maximum of each model, and that the Gaussian distribution has a smaller maximum than the original model. This means that we have fewer stars available when looking at shells compared to entire caustic surfaces; the trade-off is that shells are substantially easier to isolate.

Figure 2.

Figure 2. Selection of shell structure from a caustic structure. In the left panel, we compare the radial density of the caustic model from Sanderson et al. (2017; blue) and a Gaussian approximation to the portion of the caustic that is in a shell, as selected using $| {v}_{r}| \lt 10$ km s−1 (orange). In this example, our model parameters are rs  = 20 kpc, δr  = 1 kpc, and κ = 4.74 × 10−4 kpc s2 km−2. Smaller values of κ (a steeper phase-space curve) result in larger amplitudes in the corresponding Gaussian distributions. The center of the Gaussian is slightly offset to larger values of r compared to the peak of the caustic distribution. Note that the Gaussian approximation has a smaller amplitude than the caustic model. This is due to removal of stars near apogalacticon in the caustic structure but with velocities outside of $| {v}_{r}| \lt 10$ km s−1. In the right panel, we show the vr r phase-space distribution for the caustic model given in the left panel. The subset of the phase space used in the Gaussian approximation lies between the red lines. The maximum value of the approximation is less than the maximum value of the caustic model because stars at similar r in phase space are excluded from the approximation by the vr cut.

Standard image High-resolution image

3.2. Fitting the Model to Observed Data

The requirement for the approximation performed in Section 3.1 requires that $\kappa {v}_{r}^{2}\ll | {r}_{s}-r| $. We determine that κ = 4.74 × 10−4 kpc s2 km−2 after evaluating Equation (3) for the model potential we use for the Milky Way in this work (see Section 4.1) at a distance of 25 kpc from the Galactic center and 25/$\sqrt{2}$ kpc above the plane of the disk. This is roughly the location of the VOD. A reasonable shell width, δr , is on the order of 1 kpc (Sanderson & Helmi 2013), which corresponds to a typical separation of $| {r}_{s}-r| \lt $ 0.5 kpc. Solving for the requirement of the approximation, we find that $| {v}_{r}| \,\lt 30$ km s−1. Due to the size of the velocity errors in our data set (${\sigma }_{{v}_{r}}=\pm 21$ km s−1), we require that $| {v}_{r}| \lt 10$ km s−1 in order to reduce contamination from material that is not actually in shells.

Figure 3 shows histograms of galactocentric radius, r, for the observed data, including all of the sample stars in both the VOD and HAC fields with $| {v}_{r}| \lt 10$ km s−1 and $| {L}_{z}| \lt 500\,\mathrm{kpc}$ km s−1. We acknowledge that our radial velocity cut likely removes many stars that are actually in shells from the data sets due to our large radial velocity errors; the entire range of velocity in the selected data is about the same as the 1σ velocity error. Cutting in Lz is less problematic, as our error in angular momentum is less than a quarter of our cut range. The candidate shell star data can be found in Tables 1 and 2.

Figure 3.

Figure 3. Gaussian mixture model fits for the data in the VOD (top row) and HAC (bottom row) regions using the EM algorithm. The first three columns show the one-, two-, and three-component Gaussian models for both regions plotted over a histogram of the data (dashed lines). The data were not binned when computing the EM algorithm. A KDE of the data is shown with solid black lines, calculated by placing a Gaussian at the distance of each star with a standard deviation equal to the uncertainty in the distance value for that star. The KDE is scaled to match the normalization of the histogram. The right column shows the BIC, AIC, and AICc values for models containing up to five components. Stars with r > 30 kpc are not shown here, as they were not considered in the EM algorithm fits.

Standard image High-resolution image

Table 1. Candidate Shell Stars in the VOD Region

TypeIDR.A.Decl. dhel vGSR μα μδ rgal vr Lz
  (deg)(deg)(kpc)(km s−1)(mas yr−1)(mas yr−1)(kpc)(km s−1)(kpc km s−1)
RRL2740181.592.07.4317−1.140.049.533400
RRL3694880517013152256186.04−1.125.57−15−0.170.0125.57−9−235
RRL3681530762224477696187.88−3.2714.14−18−0.49−0.4514.78−841
RRL3697444371970657792188.252.116.41170.03−0.417.057287
RRL2744188.252.115.65140.1−0.2916.21−395
RRL3931754693600423296188.4212.8155.15−1−0.17−0.0455.174−320
RRL3902831765353826176188.718.8316.5180.15−0.5617.522384
RRL3932634096743777536189.7614.6719.19−17−0.440.0620.24−1−64
RRL2681190.16−6.818.3−2−0.17−0.1425.177372
RRL3696258475664594688190.190.8428.932−0.38−0.3828.427−64
RRL3705706686457241728191.474.6915.4313−0.14−0.6116.04313
RRL3689968964211832448191.940.2313.95−4−0.12−0.4614.38−8163
RRL2828192.347.7536.36−27−0.280.3836.77−227
BHB3704495986715838208193.413.168.622−0.48−0.9910.33−2268
RRL2864194.319.2716.96−450.03−0.0924.68−2−135
BHB3691105756155452672195.981.49.69−41−1.33−0.8910.69−6137
RRL3617196.9612.5619.829−0.150.0523.511−6
RRL3366197.82−0.4815.96−360.80.0613.539245
RRL3686287623187964800199.19−1.8119.9790.310.118.70136
RRL3687531827969154560199.420.2213.21140.490.0612.91−358
RRL3565199.589.3314.78−440.231.3713.713−126
RRL3332200.42−2.6319.3737−0.360.3113.930171
RRL3488202.115.8619.3546−0.620.3724.38−5147
BHB3713136670641272704202.694.2111.06461.770.6311.14−6−106
RRL3397203.720.597.1984−4.492.029.67−7320
RRL3637352243286265856203.95−2.6217.9814−0.02−0.3816.265118
BHB3726199074937460736205.2810.0912.69280.24−0.8912.56−3217
BHB3662317754306076160206.67−0.0126.47−30.10.224.04−379
RRL3463206.784.6720.6350.13−0.0825.36−128
BHB3658276744131704832207.17−2.3115.39−1−0.49−0.6913.64135
BHB3657800170264903680208.75−2.4917.28−10−0.41−0.3515.09−3−89
BHB3618850490542779776209.04−8.239.35−74−0.221.838.44−6−464
RRL3619212294292093952209.55−8.1826.8900.180.5123.533312
RRL4003210.84−10.296.27−443.47−2.6110.282−201
RRL3959211.16−1.7915.45−260.19−0.220.35−3
BHB3661068369794896768212.081.3112.2420.09−0.110.77−318
BHB3660278572553385216212.610.9612.8515−0.41−1.3511.16−5183
RRL3981213.71−2.7528.9828−0.340.2434.114−296
RRL4000214.21−8.054.7944−2.39−0.0511.06−7479
BHB3673720686318314240215.597.7941.3190.22−0.0938.10235
BHB3649557165951306752217.41−1.7721.04−23−0.44−0.0817.56−10−255

Note. All velocities and proper motions have had the solar reflex motion removed. The ID assigned by Liu et al. (2020) is provided when a star came from their catalog. Otherwise, the 19-digit Gaia source_id is given as an identifier.

Download table as:  ASCIITypeset image

Table 2. Candidate Shell Stars in the HAC Region

TypeIDR.A.Decl. dhel vGSR μα μδ rgal vr Lz
  (deg)(deg)(kpc)(km s−1)(mas yr−1)(mas yr−1)(kpc)(km s−1)(kpc km s−1)
RRL1196273946718241024237.5815.869.85−23−1.85−0.538.15266
RRL4711238.2126.9125.722−0.05−0.0620.435−26
BHB1191634488685212800239.1813.4727.78−70.190.123.44−7253
RRL4454989329251784064239.479.5911.2983−2.03−3.368.19−7−315
BHB1383512732452176896240.1641.9245.42−8−1.05−0.2644.02−1385
BHB1379462651307214208240.5538.7726.7−11−0.110.0325.38−8174
BHB1386004779852419328240.7943.7210.53−7−0.00.3811.626103
RRL1379489073945967744241.3539.088.82−290.290.719.9−1128
BHB4453428950452824192243.029.9912.971014.640.219.241−373
BHB4453487774325208448243.1210.5923.67−7−0.26−0.1418.94−6−233
BHB1302909807058760064243.5324.812.850−6.46−3.7310.93827
BHB1200510674256549120243.917.858.02−450.970.836.97−843
RRL1305272142150939392245.1227.0526.08−5−0.41−0.2823.08−9−411
RRL1305220465104424960245.2626.948.4−122.581.48.145−260
BHB4465226504059767424245.2816.389.3326−2.16−1.587.38−556
BHB1201084756765042944245.3219.2924.57−1−0.45−0.1720.68−1−289
BHB4459959259247977600246.4512.338.3350.39−0.126.33−8−52
RRL4996247.027.495.28−333.28−5.285.51−472
RRL1297123558398503680247.2720.1523.5810−0.22−0.3219.752−497
RRL4993247.47.27.63−1634.19−6.073.06−10−21
RRL1330968484805660800248.3236.828.82−0.24−0.126.831−150
RRL1304493443103794304248.8227.1726.291−0.050.0423.15362
RRL1325177902522801664248.9934.0130.76−2−0.130.1128.344351
BHB4459171734045577600249.3811.3225.4960.21−0.0620.441−23
RRL1312598012032706688249.4731.3314.41−33−0.340.3912.84−9399
RRL1355750824060585088251.0339.8821.610−0.45−0.0620.36287
BHB1313324106319470080252.6132.2714.058−0.44−0.5512.56−10−218
BHB4449230843258872064253.3113.3714.2740.860.4110.198149
RRL4564859299966696704253.5121.2420.029−0.32−0.3216.32−0−390
BHB4447791578241500032253.6111.5826.67−9−0.58−0.0621.45−5−446
BHB4447770897975217792253.6411.288.3250−5.08−2.275.86−589
BHB4561332650781267200253.9219.359.7125−3.84−1.677.61−1040
BHB4448291173134284288254.4712.094.7388−5.24−0.155.54952
BHB1306870660259421568255.427.7611.4150−0.79−0.889.8310−381
RRL1313544901997725824255.6232.6625.063−0.1−0.2522.5−5−496
BHB1308320327684015360255.7727.6818.58−5−0.420.2415.87277
BHB4571771203718705536257.5323.9317.41−190.140.2314.2−9272
RRL5563259.4942.1929.32−330.31−0.1427.82−4−351
BHB1348471331235718656264.1143.6445.97−2−0.490.0344.24092

Note. All velocities and proper motions have had the solar reflex motion removed. The ID assigned by Liu et al. (2020) is provided when a star came from their catalog. Otherwise, the 19-digit Gaia source_id is given as an identifier.

Download table as:  ASCIITypeset image

In Figure 3, Gaussian mixture models are fit to the cut candidate shell data using the scikit-learn sklearn.mixture.GaussianMixture implementation (Pedregosa et al. 2011) of the expectation-maximization (EM) algorithm (Dempster et al. 1977). A kernel density estimation (KDE) of the data is shown in Figure 3 to provide a smooth approximation of the underlying density of the sample that is not dependent on binning and is a better representation of what the fitting algorithm "sees." The radial density of each Gaussian is modeled as

Equation (7)

where the values of a, b, and c are determined by the fitting algorithm.

Since the true number of Gaussian components in our data is not clear, we use the EM algorithm to fit Gaussian mixture models with up to five components to the data. Each quality of each fit was then assessed with its associated Bayesian information criterion (BIC; Schwarz 1978) and Akaike information criterion (AIC; Akaike 1974). These information criteria help prevent over- or underfitting the data. By adding another Gaussian component to a Gaussian mixture model, one will generally find that the model is a better fit to the data. The BIC and AIC weigh the improvement in the likelihood of the fit against a penalty for adding additional parameters. The model where the information criteria are minimized is then the most statistically significant result.

Both the BIC and AIC suffer from the assumption that the number of data points is much larger than the number of parameters in the model. The small number of data points in our sample impacts the validity of the BIC and AIC values, particularly when the number of Gaussians is large. The corrected AIC (AICc; Hurvich & Tsai 1989) is an adjustment of the AIC for small numbers of data points. We find that this value is a better indicator of the quality of a model than the AIC. The BIC is less affected by the small number of data points, and we find that the BIC agrees with the AICc for models with fewer than five Gaussian components.

The EM algorithm preferentially fit Gaussians to individual stars at large distances instead of the interior structures with many stars. Finding a shell with only one star in it is not reasonable, so we omitted data points with r ≥ 30 kpc while fitting the Gaussian mixture models.

Comparing the information criteria of the data for Gaussian mixture models with varying numbers of components, we find that both the VOD and HAC data sets are best modeled by only two Gaussians. This corresponds to four statistically significant shell structures in total. Our Gaussian mixture model fit to the candidate shell star data suggests the existence of shells in the VOD region at r = 13.8 and 24.5 kpc and the HAC region at r = 8.8 and 21.6 kpc. These results show that the shells in the HAC region are closer to the center of the Galaxy than the shells in the VOD region. This supports the idea that the shells are all formed from debris of the same merger event; one expects shells to arise at different distances on opposite sides of the Galaxy, since the material in one shell must have a different energy than the material in the other shells.

In the case of a smooth halo with no substructure, we would expect to see a single Gaussian distribution. If enough cuts are made to the data, the single Gaussian distribution could be quite noisy due to small number statistics. The EM algorithm produces a best-fit single Gaussian at a distance of 15 kpc in the VOD region and 16 kpc in the HAC region. While the information criteria of the data are minimized for a two-component model, we wish to strongly rule out the possibility that the data come from a single Gaussian. To this end, we use two tests: the Anderson–Darling test (Anderson & Darling 1952; Stephens 1974) and Hartigan's dip test (Hartigan & Hartigan 1985). Both of these tests are designed to evaluate the likelihood that a given distribution is derived from a single normal distribution. The Anderson–Darling test yielded p < 0.01 for both regions, meaning that we can confidently reject the null hypothesis that either region is composed of data derived from a single Gaussian. The Hartigan's dip test produced p = 0.16 for the VOD region data and p = 0.15 for the HAC region, providing an ∼85% chance that either region is derived from more than one Gaussian distribution. Between these two statistics, we claim that the shell candidate stars make up substructure in the halo and are not simply a smooth halo background.

Figure 4 shows histograms of the candidate shell stars against the best fits from the EM algorithm. The middle row of Figure 4 shows the data after velocity cuts have been applied to select only shell stars, split into separate distributions of RRL and BHBs. Although BHBs make up a much smaller percentage of the sample in the VOD region than in the HAC region, the distributions for different types of stars in each region appear to be similarly distributed in galactocentric radius. Slight differences in the distributions of the different types of stars may be due to distance errors, which are about the size of a bin width beyond 20 kpc for BHBs and around half a bin width for RRLs. Curiously, the shell at ∼25 kpc in the VOD region appears to be composed almost entirely of RRLs. A comparison of the candidate shell stars in both regions shows that the shells in the VOD region do not lie at the same distance as the shells in the HAC region. We note that BHBs have distance errors of around 10%. At 20 kpc from the Galactic center, a 10% error can move the stars over in the histogram by as much as an entire bin. The slight differences in the distances in the peaks of the RRL and BHB distributions in the HAC region may be due to the errors in distance or simply small number statistics.

Figure 4.

Figure 4. Top: histograms of the number of candidate shell stars in the VOD (left) and HAC (right) regions as a function of galactocentric radius (solid gray). The data have been cut to satisfy $| {v}_{r}| \lt 10$ km s−1 and $| {L}_{z}| \lt 500\,\mathrm{kpc}$ km s−1. Plotted on top of these data with dashed lines are our best Gaussian mixture model fits for shells in the data. The left panel shows the two fit (binned) Gaussians with parameters (a, b, c) = (7.2 stars kpc−1, 13.8 kpc, 3.2 kpc) and (8.4 stars kpc−1, 24.5 kpc, 0.7 kpc). The right panel shows the two fit Gaussians with parameters (a, b, c) = (6.2 stars kpc−1, 8.8 kpc, 2.4 kpc) and (3.8 stars kpc−1, 21.6 kpc, 3.93 kpc). A KDE of the data is shown with solid black lines, calculated as in Figure 3. Middle: histograms of the data separated into RRL (red) and BHB (blue) populations. Bottom: on the right, the RRL distribution for the VOD region (dashed white bars) is compared to the RRL data from the Vivas et al. (2016) catalog after identical velocity cuts. On the left, the data for the two regions are superimposed.

Standard image High-resolution image

Some stars in the data sets that were identified as RRLs also satisfied the constraints for BHBs (Section 2). We chose to take the RRL distance data for these stars, as they are probably more accurate than the corresponding BHB distance calculation. However, we want to identify how our analysis changes if these stars are actually BHBs and not RRLs. There were a total of 1994 stars in our data (16.9% of all stars) that were identified as both RRLs and BHBs. Out of these stars, 371 (3.2% of all stars) had a difference between the RRL and BHB distance calculations of greater than 0.1 kpc. For these 371 stars, the mean difference between the calculated distances from the Galactic center is 3.9 kpc, and the standard deviation in the distance differences is 3.4 kpc.

We then recalculated which of these stars are located in shells if we use the BHB distances instead of the RRL distances. In the VOD region, two stars in the 18–20 kpc bin and one star in the 14–16 kpc bin are removed. The VOD region gains one star in the 14–16 kpc bin and two stars in the 12–14 kpc bin. In the HAC region, one star is removed from the 24–26 kpc bin and two stars are removed from the 26–28 kpc bins. The HAC region gains one star in the 20–22 kpc bin and two stars in the 18–20 kpc bin. If we use the BHB distances for the stars in this work that are identified as both types of star, the interior shell of the VOD and the exterior shell of the HAC become more pronounced. Since it seems more likely that stars identified as variable are RRLs rather than BHBs, we choose to adopt the RRL distance values, even though they make the shells slightly less obvious.

Donlon et al. (2019) showed an excess of material at 40 kpc from the Sun in the VOD, and Fardal et al. (2019) showed an "Outer Virgo Overdensity" at ∼75 kpc from the Sun. The data from Vivas et al. (2016) in the bottom right panel of Figure 4 show stars with small vr at ∼40 kpc from the Galactic center that extend out to larger distances. These structures may be outer shells formed from the VRM in the VOD region, but we are not able to analyze them in this work due to the lack of 6D information for those stars. The Lz and vr errors are large at these distances, which inhibits our ability to properly locate distant shell substructure in observed data. These distant shells are not incompatible with our current models, as the VRM N-body simulation from Donlon et al. (2019) placed merger material at 40–70 kpc from the Sun in the VOD region, and shells are seen out to 60 kpc from the Galactic center in the simulations described later in this work. We predict that simulations of a progenitor falling in from the virial radius would also form shells at large r values from material in its trailing tidal tail.

3.3. Radial Velocity Errors of RRL Variable Stars

When considering stellar velocities, it is important that we recognize that RRLs are pulsating variable stars; light from these stars will have Doppler shifts due to both translational motion and the time-dependent expansion and contraction of the surface of the star. These pulsational velocities can be as large as 120 km s−1 (see Figure 2 of Sesar 2012, Figure 5 of Duffau et al. 2014, and Figure 2 of Vivas et al. 2016). The radial velocities of the RRLs in our catalog that are not from the Liu et al. (2020) catalog are derived from SDSS stack spectra. Since RRLs are typically targeted for SDSS spectra as potential BHB or quasar candidates based on color, SDSS radial velocities are not calculated with pulsating variable stars in mind. This means that our measured velocities, based only on the Doppler shift, could be wildly different than the actual velocities of the stars.

Vivas et al. (2016) created a data set of RRLs in the VOD in a way that cuts out stars with wildly erroneous radial velocity measurements. In this data set, the majority of RRLs had radial velocity uncertainties below 10 km s−1. We matched this data set with Gaia data in order to obtain 3D velocities and then cut it so that $| {v}_{r}| \lt 10$ km s−1 and $| {L}_{z}| \lt 500\,\mathrm{kpc}$ km s−1 in order to isolate shell structure. This is compared with our RRL data in the VOD region in the bottom left panel of Figure 4. In this figure, the Vivas et al. (2016) data imply the existence of shells in the VOD region at the same distances as the two statistically significant shells that were identified in RRLs and BHBs.

One might be concerned that large radial velocity errors would confuse the selection of shell stars using galactocentric radial velocities. However, the velocity errors just from measurement are already quite large, of the order of the random velocity errors due to pulsation of the RRLs. Many stars that are actually in the shell are not selected due to random error or RRL pulsation, reducing our shell signal. Stars that are actually up to 120 km s−1 away from vr  = 0 could be erroneously included in the sample. However, on average, the radial velocity measured from stack spectra will not be 120 km s−1 off from the actual radial velocity measurement, since the pulsational velocity is only that large for a small portion of the RRL oscillation period, and stack spectra usually include measurements from several different epochs. In practice, the maximum difference between a stack spectrum with multiple measurement epochs and the actual radial velocity is closer to 50 km s−1, and the actual error is usually even smaller (see Figure 3 of Vivas et al. 2016). The right panel of Figure 2 shows that within the same caustic structure, stars that actually have galactocentric radial velocities of vr  = ±50 km s−1 would only be ∼2 kpc closer than the actual shell structure, which is within the systematic distance error for stars in our data set beyond 20 kpc from the Sun.

The radial velocities of RRLs from the Liu et al. (2020) catalog are calculated by fitting the observed velocity as a function of phase to an empirical pulsating velocity template curve. It is expected that this will reduce the error in radial velocity due to pulsational velocity compared to calculations of radial velocity from stack spectra. In order to test the differences between radial velocities that are calculated using stack spectra and velocities that are calculated by fitting an observed velocity to an empirical velocity curve, we looked at the sample of 369 RRLs that had measured spectral radial velocities in both the Gaia RRL and Liu et al. (2020) catalogs. The radial velocities of both catalogs are generally consistent; the mean difference between the two catalogs' radial velocities is 21 km s−1, and the standard deviation of the differences is 15 km s−1. The mean difference between the catalogs' radial velocities is roughly the same size as the upper limit for the estimated error in the radial velocity from either catalog. This suggests that a sizable fraction of the differences between the two catalogs' radial velocities is due to measurement uncertainties and not necessarily to pulsational velocities at the surface of a star. While radial velocities of RRLs derived from stack spectra are likely less accurate than radial velocities derived from empirical velocity curves, the typical departure of the radial velocity value derived from stack spectra from the "correct" value is in practice much smaller than the 120 km s−1 upper limit for the magnitude of the pulsational velocity. We maintain that radial velocities derived from stack spectra for RRLs are fairly good estimates for the actual radial velocities of those stars, although they should be used with caution.

Especially in the VOD and HAC regions of the sky, RRLs that are not associated with any VRM caustic structure and also have low Lz angular momentum are relatively uncommon. This explains why we are able to recover the shells even though many RRLs could have large radial velocity errors. The similarity between the BHB and RRL data and the positions of shells from RRLs in Vivas et al. (2016) data in the VOD support the conclusion that the RRL radial velocity errors that are derived from SDSS spectra do not significantly impact the outcomes of this work.

It should be noted that using the radial velocity values from the Liu et al. (2020) catalog in place of the radial velocity values from the Gaia/SDSS catalog did not substantially change the histogram of stars (Figure 4) that satisfy the galactocentric radial velocity and angular momentum cuts. This is consistent with our analysis that the radial velocities of RRLs derived from stack spectra that we use in this work are typically not very far off from the "correct" radial velocity measurements, and that the errors in the measured radial velocity values are often just as large as the difference between the stack spectra and velocity curve fit radial velocity measurements. This may not be the case for all data sets, as the quality of the radial velocities derived from stack spectra depends on several factors, such as the number of observations, the quality of the spectra, and the times at which the observations are made.

4. Timing the VRM: Phase-mixing Constraints

4.1. Radial Merger Simulations

In order to explore phase mixing in radial mergers, we create a suite of 120 N-body simulations of radial mergers. These simulations are all initialized with a single Plummer profile progenitor dwarf galaxy (Plummer 1911). A third of these simulations use a dynamical mass of 109 M and a scale radius of 3 kpc, which is consistent with measurements of typical dwarf galaxies such as the Fornax dSph (Penarrubia et al. 2008). These parameters were previously used as likely parameters for the progenitor of the VRM by Donlon et al. (2019), who claimed that an N-body simulation with this mass and scale radius more accurately recovered the kinematics of the VRM than simulations with a mass of 107 M and a scale radius of 0.4 kpc. In order to explore the effect of progenitor mass on radial merger phase mixing, we also perform simulations with a dynamical mass of 108 M and a scale radius of 1 kpc, as well as simulations with a dynamical mass of 1010 M and a scale radius of 10 kpc. The progenitors in these simulations are given zero initial velocity, so that the initial energy of the dwarf galaxy is determined solely by its initial distance from the Galactic center, and the merger is guaranteed to be nearly radial.

Since we are starting the dwarf galaxy at rest in an axisymmetric potential, there are only two parameters required to create the full range of substructure for a particular dwarf model. These are the initial inclination angle (i) from the Galactic disk as viewed from the Galactic center and the initial distance from the Galactic center (r0). The inclination angle is defined so that i = 0° in the disk and i = 90° when the dwarf galaxy is positioned along the Galactic Z-axis. The values we explored for i range from 0° to 90° in increments of 10°, and the values we explored for r0 are 20, 30, 45, and 60 kpc. The observed VRM debris has an Lz distribution centered on zero (Donlon et al. 2019), so we did not run simulations with nonzero angular momenta. We used a value of Y = 0 for the initial position of the dwarf galaxy, as the azimuthal angle of impact does not change the simulation due to the axisymmetric symmetry of the Galaxy.

The simulations were run on the MilkyWay@home N-body software (Shelton 2018) in a static gravitational potential consisting of a Hernquist bulge, Miyamoto–Nagai disk, and logarithmic halo using potential parameters from Orphan Stream Model 5 in Newberg et al. (2010). Each simulation contained 20,000 bodies and was integrated forward in time for 10 Gyr. For the rest of this work, we will refer to each simulation solely based on its progenitor's mass, initial inclination angle, and initial distance from the Galactic center. When necessary, the results of the simulation are also identified by a time step.

Note that it is likely that the progenitor of the VRM was originally located at or near the Milky Way's virial radius and then fell into the Milky Way along a decaying orbit. As the orbit decayed, each apogalacticon of the dwarf galaxy's orbit would be located closer to the Galactic center than the previous apogalacticon. Eventually, the dwarf galaxy collided with the Milky Way; this occurred after the final apogalacticon on the dwarf's orbit, which determines the final orbital energy of the progenitor dwarf galaxy. After the dwarf galaxy passed through the Milky Way, its constituent stars became bound solely to the Milky Way, and their individual energies were locked in. We need only to constrain the distance of the final apogalacticon pass of the progenitor of the VRM in order to characterize the present-day kinematics of the structure, which corresponds to r0.

4.2. Causticality: A Metric for Phase Mixing

Simulated radial mergers experience phase mixing of the dwarf galaxy debris over large timescales. Figure 5 shows how small variations in the initial energies of stars in a radial merger will cause the material in shells to segregate based on energy. This phase-wrapping causes more shells to form as the age of the radial merger increases. Phase mixing is also caused by slight tangential velocities of stars in each shell due to variations in angular momentum. This causes shells to grow in surface area over time. Eventually, debris from a merger reaches a phase-mixed equilibrium within the Galaxy.

Figure 5.

Figure 5. Galactocentric Z vs. X for six time steps of an N-body simulation of a radial merger in a model Milky Way potential. The six panels show a progression of time steps spanning 1.25 Gyr. The initial parameters for this simulation are i = 30°, r0 = 30 kpc, and a mass of 109 M (see Section 4.1). The black dots show a random sample of one-fourth of the bodies in the simulation. The red cross marks the location of the Galactic center. We follow two particles, a red circle and a cyan diamond, throughout time; the relative magnitudes and directions of the velocities of these particles are shown with arrows. At the beginning of the simulation, the red particle is at a lower initial energy than the cyan particle. In the next panel, they are both falling inward toward the Galactic center. By the top right panel, the red particle has reached apogalacticon and begun falling inward, while the cyan particle is moving outward toward apogalacticon (in a shell). In the bottom left panel, one sees that the red particle is located in a shell on the right, while the cyan particle is now beginning its second infall. By this point, there are already two shells formed in the simulation. In the bottom panels, the two particles are located in different shells. The period of oscillation for the cyan particle is longer than that of the red particle, since the cyan particle had a larger initial energy. Note how two shells are located above the Galactic plane, corresponding to our HAC and VOD regions, and that this simulation predicts a third location for shells below the Galactic plane, which could correspond to the EPO. This simulation demonstrates how shells form and explains how the VOD, HAC, and EPO could be related through a single "trefoil" structure. Not all particles oscillate between all three locations (for example, the red and cyan model particles do not move into the shell located at negative Z values). The shells corresponding to the VOD and HAC are both above the plane due to the influence of the disk; in a spherically symmetric potential, two sets of shells would be diametrically opposed.

Standard image High-resolution image

Figure 6 shows the simulation of a radial merger with initial inclination angle i = 30°, r0 = 30 kpc, and a mass of 109 M. The data for this simulation were cut to mimic our data shown in Figure 4. This simulation shows that over time, the strong peaks demonstrating shells phase mix into a smooth distribution. When a merger has phase mixed completely, shells will no longer be present (that is, the number of caustic surfaces eventually approaches infinity, and it is impossible to detect any one shell in particular).

Figure 6.

Figure 6. Radial density histograms for the radial merger simulation with i = 30°, r0 = 30 kpc, and a mass of 109 M. The plotted data include only particles with $| {v}_{r}| \lt 10$ km s−1, $| {L}_{z}| \lt 500\,\mathrm{kpc}$ km s−1 and located within a 50° by 50° region of the sky in order to mimic the observed data. Each panel shows the data at a fixed time in the simulation; the times range from 0.5 to 10 Gyr. Note the strong peak at 0.5 Gyr. By 1.5 Gyr, there are two peaks, corresponding to two shells. After 3 Gyr, the number of shells has grown large enough that individual peaks are no longer apparent. The value of the causticality decreases over time as the peaks become less distinct. By 5 Gyr, the simulation is fairly phase mixed, with a small causticality.

Standard image High-resolution image

We seek an objective way to describe the amount that a radial merger has phase mixed from its initial state. To this end, we develop a numerical metric for a histogram h, which we call the "causticality" C of that histogram. With n bins in h and Ni counts in the ith bin of h, we define causticality as

Equation (8)

Causticality consists of a ratio of the sum of the squared differences between each bin and its previous neighbor and the sum of the squared sums of each bin and its previous neighbor. The denominator in Equation (8) imposes that C = C(m · h) for any constant m > 0, which ensures that the causticality only depends on the shape of the input histogram and not the total number of data points in the histogram.

Causticality can take on all values between zero (completely mixed) and unity (completely unmixed). Consider a distribution with particles in only one bin or that has undergone no phase mixing; the measured causticality of this system would be equal to unity. This is also the case for a system where every other bin is empty. On the other end, a uniform distribution has a causticality of zero. Here C is larger for distributions with sharp peaks than distributions with gradual, smooth peaks; Figure 6 shows how phase mixing lowers the measured value of the causticality over time in a simulation of a radial merger.

In practice, the measured causticality of our systems never actually becomes zero. This is because the debris of radial mergers relaxes into a smooth Gaussian distribution, not a uniform distribution. Additionally, the invariance of causticality with respect to the number of particles assumes that a constant bin size is being used. Typically, bin size decreases as more objects are added to a histogram, which can alter the measured value of causticality. Thus, it is important that the bin sizes for the observed and simulated data are identical.

If observable shells are present in our data, we would expect to see a few strong, sharp peaks. As radial mergers age, these few thin shells become wider and more numerous until the overall radial distribution approaches a smooth distribution. We therefore expect the causticality to decrease over time for each simulation of a radial merger until the merger is completely phase mixed, at which point the causticality assumes some small constant value.

Appendix A contains an analysis of the uncertainty in causticality, as well as the effects of bin size on measured causticality values. We determine that 2 kpc bin sizes are best for these data.

This is the first time that causticality has been introduced, and we recognize that it is beneficial to perform a similar analysis of phase mixing using a widely studied statistic. In Appendix B, we use the Kullback–Leibler divergence (KLD; Kullback & Leibler 1951) instead of causticality to evaluate how phase mixed a distribution is. We find that it is more difficult to get an estimate of the merger time of the VRM from measurements of the KLD, but we are still able to determine that the VRM must have occurred within the last 5 Gyr. One advantage of causticality over the KLD is that causticality does not require comparison to some assumed baseline distribution; while the KLD as it is applied in this work only approaches zero when the distribution is uniform, the causticality rapidly approaches zero as a distribution becomes smooth, regardless of its shape.

4.3. Constraining the VRM Merger Time

Figure 7 shows plots of the causticality over time for all 120 radial merger simulations with varying inclination angle, radius of initial infall, and mass. Evaluating the causticality for our observed data, we find that the value of the causticality is 0.20 ± 0.09 for the VOD region and 0.12 ± 0.06 for the HAC region. Any background halo stars or observational errors in these data sets will decrease the measured values of the causticality in the observed data, so these calculated values of the causticality of the observed data are lower limits. Background stars or observational errors will make the observed data appear to be older than they actually are, since the simulated data do not suffer from these issues.

Figure 7.

Figure 7. Values of causticality vs. time for all 120 simulated radial mergers. The causticality was calculated for each time step of a radial histogram of all stars in the simulation with $| {v}_{r}| \lt 10$ km s−1, $| {L}_{z}| \lt 500\,\mathrm{kpc}$ km s−1, and a roughly 50° by 50° portion of the sky, which was manually selected to contain the majority of the shell(s) for each simulation. The causticality lines have been interpolated in order for trends to be more visible. Dashed horizontal lines give the values of causticality for the observed data in the VOD (0.20) and HAC (0.12) regions. Observational errors will move the measured causticality of the observed data downward, making the observed data appear older than they actually are. Each column contains data for a given progenitor dwarf galaxy mass, given in the top left corner of the upper panel. On average, a smaller progenitor mass means that a radial merger will take longer to phase mix. The top panels show mean causticality for simulations with a given initial inclination angle, averaged over the four initial distances. Inclination angle does not appear to have a monotonic effect on how long shells take to phase mix; increasing the inclination angle could increase or decrease the mixing time. The bottom panels show values of the causticality for simulations with a given initial distance. Mean values of the causticality are shown with thick solid lines and averaged over all inclination angles. Individual simulations are shown with thin dashed lines and colored by their initial distance. Simulations with larger initial distances take longer to phase mix, on average, than simulations with smaller initial distances.

Standard image High-resolution image

Figure 7 shows that simulations with larger initial distances take longer to phase mix, on average, than simulations with smaller initial distances. This means that the approximate time it took the VRM debris to phase mix to its current state depends on what the distance of the final apogalacticon (r0) of the VRM progenitor's orbit was. In order to get a good estimate of the age of the VRM from this data, we need to determine a likely value of r0 for the progenitor of the VRM.

Assume that the progenitor dwarf galaxy remains largely intact prior to reaching the final apogalacticon. The individual stars in the dwarf galaxy have a range of energies that are limited by the fact that they are gravitationally bound together. When the dwarf galaxy is tidally disrupted as it passes through the Galactic center, the individual stars have a range of energies near the energy per mass of the original dwarf galaxy and will therefore populate orbits with a range of apogalacticons that are near the final apogalacticon of the dwarf galaxy. For example, Figure 6 shows that the debris of a simulated radial merger with r0 = 30 kpc primarily populates the halo at distances between 20 and 40 kpc.

If the radial merger event has enough material to dominate the stellar halo, which is true for the VRM, then we expect to see a stellar break in the halo in the range of distances that the radial merger populates. Various works place the stellar breaks at a range of 18–28 kpc (Watkins et al. 2009; Deason et al. 2011; Sesar et al. 2011; Pila-Díez et al. 2015; Xue et al. 2015; Deason et al. 2018). This range in the stellar break radius may be due to measurements in different regions of the sky; certain portions of the Galaxy, such as the VOD and the HAC, will have proportionally more VRM debris than regions without visible overdensities. Measurements of the stellar break radius in the overdense regions will produce values similar to r0 for the VRM progenitor; measurements in other areas of the sky will produce values of the stellar break that are smaller than r0 for the VRM progenitor due to the majority of VRM material in that region not being near apogalacticon and therefore populating regions with smaller galactocentric radii. A measurement of the stellar break over the entire halo would then be an average of the actual r0 value for the VRM progenitor and other regions of the sky and expected to yield a value somewhat smaller than the actual value of r0 for the VRM progenitor.

These values for the stellar break suggest that the value of r0 for the VRM must be between 20 and 30 kpc in order to generate a stellar break near those distances. This value of r0 for the VRM is supported by Villalobos & Helmi (2008), who showed that simulations of dwarf galaxies falling into the Milky Way from distances near the virial radius tend to have a final apogalacticon around r0 ∼ 20 kpc from the Galactic center before colliding with the Galaxy.

In Figure 7, we find that simulations with r0 = 20 kpc have similar causticalities to the observed data between 1 and 2 Gyr after the simulation is started. The simulations with r0 = 30 kpc have causticalities that are similar to the observed data between 3 and 5 Gyr. Since we expect the progenitor of the VRM to have a value of r0 between 20 and 30 kpc, the time at which the VRM has values of causticality that are similar to the observed data is expected to be within the last 5 Gyr.

This is again consistent with Villalobos & Helmi (2008), who stated that shell structure is present for only ∼2 Gyr after dwarf galaxies collide with the Milky Way in their simulations. Further, we were unable to locate identifiable shell structure beyond 5 Gyr after collision in our simulations with r0 < 60 kpc. These points suggest that after 5 Gyr, VRM-like mergers have phase mixed beyond what we see in the observed data, and that the VRM cannot have happened that long ago, as we still see shell substructure.

Note that as the mass of the progenitor dwarf galaxy increases, less time is required to phase mix the debris to observed levels. For progenitors with a mass of 1010 M, this mixing time is very consistent and does not appear to strongly depend on initial inclination angle or distance. No simulations with this mass had a value of causticality similar to the observed data after 5 Gyr.

For simulations with masses of 109 and 1010 M, the time to phase mix increases monotonically with initial distance. However, simulations with a mass of 108 M and r0 = 60 kpc appeared to phase mix more quickly than simulations with identical mass and smaller initial distances. This is due to the distance cut that we made on our simulations when calculating causticality. Since the observed data did not extend beyond 50 kpc, we cut the simulated data to only include objects within 50 kpc. The 108 M, r0 = 60 kpc simulations place the majority of their material beyond 50 kpc from the Galactic center, so the measured causticalities only include the small number of objects that remain within 50 kpc. This deflates the measured value of causticality. However, a structure that places almost all of its debris beyond 50 kpc cannot possibly be responsible for shells within 30 kpc of the Galactic center, so simulations with 108 M and r0 = 60 kpc are not viable models for the VRM. Data from the 108 M and r0 = 60 kpc simulations are omitted from Figure 7 to avoid potentially misleading the reader.

5. Timing Radial Mergers in Simulations

It is difficult to determine the orbit of a radial merger's progenitor through the usual methods used to fit orbits to tidal streams because there is no visible progenitor and no coherent motion of member stars on the sky. Without an orbit, we cannot properly simulate the progenitor of the VRM in order to explore how the merger event evolves over time. Donlon et al. (2019) provided a possible orbit for the progenitor of the VRM based on the motion of moving groups in the VOD, but they did not determine whether that orbit properly constrained the infall of the VRM progenitor or its debris in other regions. Here we derive a new method for determining the merger time of a radial merger without knowing its orbit.

Stars with similar energies in a radial merger event will oscillate back and forth with similar periods, producing shells wherever the stars turn around on their orbits. This period is analytically calculable for spherically symmetric systems. However, in an axisymmetric potential, the period for each group of stars will depend on the initial energy of the stars and the inclination of the material with respect to the Galactic disk. The initial energies of the stars and their motions also depend on the details of the Milky Way potential. We do not know the initial orbital parameters of the progenitor of the VRM for certain, so it is not possible to derive an analytic expression for the position of a caustic surface in this case. We therefore take a numerical approach using the data that we have available to us.

With nonradial mergers, the progenitor typically survives as a bound object for many orbits while material is tidally stripped away from the progenitor. It might, therefore, be necessary to account for dynamical friction, which could substantially change the orbit of the progenitor. On the other hand, with a radial merger, the system becomes unbound after its plunge through the Galactic center, and dynamical friction is not a factor in the orbits of the merger debris. For this reason, we do not consider dynamical friction in our models.

5.1. Modeling Oscillations of Shells

Stars in a shell will possess similar energies and will have moved similarly throughout the lifetime of the radial merger. Thus, we can expect a single star in some shell to oscillate with the same period as the rest of the stars in the shell. This means that we can model caustic oscillation in the Galactic potential as a free oscillation of a single model mass at the location of a shell. At any given point in time, we expect to see multiple shells at different radii and positions on the sky for a given radial merger. By assigning model particles with zero initial velocity to each shell and then tracing these particles' motions back in time, we can compare the relative positions of each caustic surface throughout time. At some time in the past, the particles on all of the shells must have been at the same place, bound to the dwarf galaxy. The difference in each shell's distance from one another can then be used to pinpoint the time that a radial merger was still self-bound.

We begin by placing a model particle at each shell found by the BIC fitting algorithm for some radial merger (either simulated or observed data). We must be able to allow motion from one side of the Galaxy to the other, so we arbitrarily select the shells on one side of the Galaxy to be located at positive r and the shells on the other side to be located at negative r. Particles will then oscillate between positive and negative r values. Using Galpy version 1.4.1 (Bovy 2015http://github.com/jobovy/galpy), we calculate the forces on each model particle at each time step in order to integrate the particles backward in time for 10 Gyr. By integrating each model particle back in time, we expect to find a point in time where each caustic structure is located at the same point in space. Additionally, if each caustic structure is moving in the same direction (has the same signs of dr/dt) at that point, we claim that we have found the point in time where the progenitor of the radial merger was still self-bound. This point in time will be the "merger time" of the radial merger.

5.2. Constraining Likely Merger Times

In order to measure the overall difference in the location of shells, we introduce δ(t) as a metric to measure the total distance between model particles,

Equation (9)

Here we use vector subtraction in order to ensure that the model particles are spatially close to one another and not simply at similar galactocentric distances. A small δ(t) means the caustic structures are clumped together, while a high δ(t) corresponds to a large separation of the shells. Times with a small δ(t) are more likely to be times when the progenitor was still bound.

As the progenitor of the radial merger collides with the host galaxy, all of the caustic structures are initially falling inward and then transition to moving outward from the host galaxy. We count the number of first derivatives of the model particles that are positive (dr/dt > 0) in order to determine the number of caustic structures moving in the same direction. The passage of the dwarf galaxy through the Galactic center would cause the number of positive first derivatives to go directly from zero to the total number of shells. If a minimum of δ(t) corresponds to a time where the number of model particles with dr/dt > 0 is either zero or the number of shells in the data, then that is a likely time that the dwarf galaxy passed through the Galactic center and was tidally disrupted. This is especially true if the number of model particles with dr/dt > 0 goes directly from zero to the number of shells or vice versa at that time, as this corresponds to the progenitor passing through the Galaxy.

5.3. Recovering Merger Times from Radial Merger Simulations

We tested the method outlined in Sections 5.1 and 5.2 by using it to measure merger times for the series of simulations of radial mergers in a Milky Way–like galaxy described in Section 4.1. These simulations had a known evolve time, which allowed us to verify whether or not the method was able to recover the correct time of collision for each different simulation.

We created data sets similar to the ones used for the analysis of the VRM debris by cutting our simulation data in vr and Lz identically to the way they were cut in the observed data. Additionally, we only looked at data out to r = 50 kpc, in order to ensure that our method could recover the correct infall time despite our limitations in distance in the observed data sets. The majority (>75%) of the simulation data were typically found in this distance range.

Next, we cut the simulated data based on sky position. Our observed data were cut to only two regions of the sky, the VOD and HAC regions. In order to emulate this, we looked at the radial merger simulations in R.A. and decl. and selected the data from two regions that were chosen to be similar in size to the VOD and HAC regions, have roughly the same positions in the sky (within ∼20°), and contain an overdensity of the simulated bodies. The regions had to be chosen by hand, since the simulations did not all place shell structure in the same places on the sky. We had to cut out regions around the shell overdensities because the Galactic potential was not spherically symmetric, so the shells are not at a constant radius; by selecting a limited region of the sky, the shells were at an approximately constant radius over the selected data. We made these cuts in the 40 radial merger simulations with a mass of 109 M for a range of evolve times between 1 and 5 Gyr. The correct merger time was calculated for each simulation as when the center of mass of the dwarf galaxy passed through the Galactic center. Only the simulations with a mass of 109 M were used, since this mass is similar to the predicted mass of the VRM (see Section 7.1) and still has a wide range of possible phase-mixing times compared to more massive merger events (Figure 7). Using only a third of the simulations also allowed us to cut down on the number of simulations that had to be analyzed manually, while still retaining an idea of the capabilities and limitations of the method.

The next step was to locate the shells in the data. First, we used an EM algorithm to fit a Gaussian mixture model to a histogram of the data and optimized the number of Gaussians using a BIC method, similar to the process outlined in Section 3.2. However, the EM algorithm often preferred to fit Gaussian components to outlier peaks with small star counts if the interior shells were located close to one another. As it was not possible to supervise every fit in order to avoid this behavior, we chose to use a more robust algorithm utilizing a residual sum of squares as the goodness of fit.

This fitting algorithm minimizes a binned residual sum of squares,

Equation (10)

where m is the number of bins, f(r) is the fit Gaussian mixture model, ri and ri+1 describe the bounds of the ith bin, and ni,obs is the value of the ith bin of the histogram of observed data. The observed data are compared to the integral of the fit model over each bin instead of the value of the model at the center of each bin; this will constrain the fit model to the number of stars in the data, as well as improve the estimate of the width of each shell. We also use a normalization constant

Equation (11)

where N is the total number of data points in the histogram and k is the total number of parameters being fit. We used the differential evolution algorithm (Storn & Price 1997) to minimize the residual sum of squares of our data, which provides the optimal fit for our data. Specifically, we use the implementation of the differential evolution algorithm from the scikit-learn python package (Pedregosa et al. 2011).

We employ the BIC in order to determine how many shells are statistically significant. For this case, the BIC is given by

Equation (12)

where N and k are the same as in Equation (11). As before, the BIC prevents overfitting our observed data. We fit up to three Gaussians to both regions individually and compare the corresponding BICs; the fit with the lowest BIC was taken as the most statistically significant result. The positions of the peaks of the best-fit Gaussian mixture model were taken to be the distances of the shells.

We then used the model particle rewinding method on the adjusted simulation data to recover the most likely merger time for each simulation. We placed model particles in the centers of the selected regions at the distances of the shells. These model particles were allowed to move freely in our Milky Way potential, resulting in oscillations in their distance from the Galactic center. Tracing these oscillations throughout this period identified times when the model particles were all close to one another, corresponding to a likely time where the progenitor of the merger was still coherent. The best-fit merger time from the simulation was then selected from the results and compared to the correct merger time.

Figure 8 shows the result of our method on one such simulated radial merger. The galactocentric distances of the model particles are provided in order to show how δ(t) depends on the positions of the model particles. It is clear that the method is able to recover the time at which the oscillating model particles line up, as this is, on average, within a few tenths of a Gyr of the actual merger time of the simulation.

Figure 8.

Figure 8. Top: plot of total separation, or δ(t), over 5 Gyr for the oscillating caustic surface model outlined in Section 5.2. This figure uses a simulation with initial inclination angle i = 30°, initial distance r0 = 30 kpc, a mass of 109 M, an evolve time of 4 Gyr (solid red line), and a merger time, defined as the time since the center of mass of the dwarf galaxy passed through the Galactic center, of 3.9 Gyr (dashed red line). The distance of each shell from the Galactic center is shown with colored solid lines, where positive r values correspond to the model particle being in the VOD region, and negative r values correspond to the model particle being in the HAC region. In order to be considered for the best merger time, minima were required to have a value of δ(t) at least two standard deviations below the mean of the overall δ(t) distribution. Bottom: plot of the number of caustic surfaces with dr/dt > 0 over the same time period as the top panel. Times where δ(t) is at a local minimum and at least two standard deviations below the mean are marked with dashed vertical lines. Local minima of δ(t) that coincide with times where the number of model particles with dr/dt > 0 is zero or four likely indicate the time of the merger. Note that although there are three local minima of δ(t), the only one that lines up with a spot where the number of model particles with dr/dt > 0 is zero or four is 3.7 Gyr. This also corresponds to the shells all lining up in their oscillations in the top panel.

Standard image High-resolution image

In order to examine the dependence of the method on the shape of the Galactic potential, we also rewound the model particles in the Galpy built-in potential MWPotential2014 (Bovy 2015). This uses a spherical power-law potential for the central bulge, a less massive Miyamoto–Nagai disk than the Newberg et al. (2010) model, and a double power-law spherical halo potential. The two model potentials are compared in Figure 9. Calculating merger times with this second potential helps us determine whether or not an "incorrect" model potential impacts the results, as the radial merger simulations were only run in the Newberg et al. (2010) model potential. This helps us understand the limitations of our method, as the Milky Way's actual gravitational potential will not be truly identical to any model potential that we choose.

Figure 9.

Figure 9. Two model potentials that are used in this work. The model potential fit using the orbit of the Orphan Stream in Newberg et al. (2010) is plotted as a solid line, while the Galpy built-in MWPotential2014 is plotted as a dashed line. The circular speed curves in the plane of the disk are shown in black, while the circular speed curves along the axis of symmetry (Z-axis, perpendicular to the disk) are plotted in red. These potentials are similar, as they are both fit to the Milky Way, but have different shapes.

Standard image High-resolution image

Table 3 and Figure 10 show the results of our method on simulated data using both the "correct" and "incorrect" potentials. It is clear that the method is able to recover the infall times of the simulated radial mergers for a range of initial distances and evolve times. Including simulations that recovered two or more shells in the data, the standard deviation of the differences between the calculated and correct values is 0.52 Gyr for the "correct" Newberg et al. (2010) model potential and 0.53 for the "incorrect" MWPotential2014 model. The standard deviation of the differences decreases to 0.38 Gyr in the Newberg et al. (2010) model and 0.39 Gyr in the MWPotential2014 model if only simulations with three or more shells are considered, and it drops even further to 0.20 Gyr in the Newberg et al. (2010) model and 0.12 in the MWPotential2014 model if only simulations with four shells are considered. There are four shells identified in our observed data, so we estimate the error in infall time to be the larger of the two standard deviations of the differences: σt  = ±0.2 Gyr.

Figure 10.

Figure 10. Recovered vs. actual merger time for all results given in Table 3. The left panel shows merger times recovered with the Newberg et al. (2010) potential, and the middle panel shows merger times recovered with the MWPotential2014 potential. The right panel shows a comparison of the recovered merger times for each simulation using the two model potentials. The closer a result is to the solid black line, the more accurate the result. Red squares denote simulations with two identified shells, cyan diamonds correspond to three shells, and blue circles correspond to four shells. As merger time increases, it becomes more difficult to accurately recover the time of merger of a simulation. Simulations with more shells are more accurate. Considering all simulations, the standard deviation of the differences between the calculated and correct values is 0.52 Gyr for the Newberg et al. (2010) model potential and 0.53 for the MWPotential2014 model. The standard deviation of the differences drops to 0.20 Gyr for the Newberg et al. (2010) model potential and 0.12 Gyr for the MWPotential2014 model if only simulations where the algorithm recovered four shells are considered. The mean difference in recovered merger times between the two model potentials over all simulations is 0.42 Gyr.

Standard image High-resolution image

Table 3. Recovered and Actual Merger Times for Our Series of Radial Merger Simulations with a Mass of 109 M

SimulationNewberg2010 MWPot.2014 SimulationNewberg2010 MWPot.2014
i r0 te tm tr No. Sh. tr No. Sh. i r0 te tm tr No. Sh. tr No. Sh.
(deg)(kpc)(Gyr)(Gyr)(Gyr) (Gyr) (deg)(kpc)(Gyr)(Gyr)(Gyr) (Gyr) 
0201.00.81.231.3350201.00.71.320.92
0202.01.82.322.9250202.01.72.021.92
0203.02.8......2.1250203.02.72.922.82
0204.03.81.92......50204.03.74.524.13
0205.04.82.721.9250205.04.72.424.82
0301.0...............50301.00.91.040.83
0302.0...............50302.01.91.931.84
0303.0...............50303.02.92.942.52
0304.0...............50304.03.92.523.82
0305.0...............50305.04.94.72......
0451.00.91.420.8250451.00.91.020.82
0452.01.91.931.7450452.01.91.941.62
0453.02.91.332.5350453.02.92.923.03
0454.03.93.933.9450454.03.94.224.04
0455.04.94.932.8350455.04.93.03......
0601.0...............50601.0...............
0602.0...............50602.01.91.822.12
0603.0...............50603.02.92.832.53
0604.0...............50604.03.92.52......
0605.0...............50605.04.94.724.02
10201.00.70.941.1460201.00.72.02......
10202.0...............60202.01.72.02......
10203.02.73.122.9260203.02.73.422.12
10204.03.73.124.1160204.03.73.822.82
10205.04.74.924.7260205.04.74.72......
10301.00.90.920.9260301.00.91.020.92
10302.01.91.841.6460302.01.9......1.62
10303.02.91.73......60303.02.9......3.14
10304.0...............60304.03.93.924.62
10305.04.9......3.4260305.04.94.53......
10451.00.7......1.1360451.00.91.12......
10452.01.71.821.9260452.01.92.131.73
10453.02.72.742.9460453.02.93.132.63
10454.03.7......3.7460454.03.93.322.72
10455.04.74.934.7360455.04.92.722.32
10601.00.71.231.6360601.00.91.22......
10602.0...............60602.0...............
10603.0...............60603.02.92.833.02
10604.03.73.743.6460604.03.93.93......
10605.04.73.533.7360605.04.94.934.73
20201.00.70.940.8470201.00.50.820.52
20202.01.7......2.0270202.01.5......1.82
20203.02.72.922.7270203.02.53.82......
20204.03.72.022.3270204.03.54.123.92
20205.04.74.82......70205.04.54.723.82
20301.00.81.020.8270301.00.71.120.92
20302.01.82.821.7370302.01.72.131.84
20303.02.82.722.7270303.02.73.733.04
20304.03.83.422.3270304.03.73.933.93
20305.04.84.932.4270305.04.74.92......
20452.0...............70451.0...............
20452.01.81.822.0270452.0...............
20453.02.82.742.6470453.0...............
20454.03.83.743.7470454.03.92.52......
20455.04.84.834.6470455.0...............
20601.00.80.821.2270601.0...............
20602.00.80.82......70602.0...............
20603.02.82.632.9370603.0...............
20604.03.83.633.7370604.0...............
20605.04.84.634.5370605.0...............
30201.00.70.74......80201.0...............
30202.01.73.121.8280202.0...............
30203.02.7......4.5280203.0...............
30204.03.72.222.5280204.0...............
30205.04.74.82......80205.0...............
30301.00.91.021.0280301.00.90.820.72
30302.01.91.731.5380302.01.91.831.53
30303.02.93.133.3380303.02.92.733.24
30304.03.93.74......80304.03.93.343.84
30305.04.94.744.5480305.04.94.244.54
30451.00.90.921.2280451.00.41.131.52
30452.01.91.932.1280452.01.41.421.72
30453.02.92.942.6480453.02.43.023.02
30454.03.93.943.4380454.03.43.424.02
30455.04.94.844.7480455.04.45.024.32
30601.00.81.421.2380601.00.91.02......
30602.01.81.74......80602.01.91.932.22
30603.02.82.842.4380603.02.92.923.02
30604.03.83.643.6380604.03.93.93......
30605.04.84.444.6280605.04.94.534.62
40201.0...............90201.0...............
40202.0...............90202.0...............
40203.0...............90203.0...............
40204.0...............90204.0...............
40205.0...............90205.0...............
40301.00.71.221.0290301.0...............
40302.01.7......2.8390302.0...............
40303.02.72.923.2390303.0...............
40304.03.73.634.3390304.0...............
40305.03.73.63......90305.0...............
40451.00.8......0.9290451.00.7......0.72
40452.01.81.922.2390452.01.71.721.92
40453.02.83.033.1390453.02.72.632.73
40454.03.84.034.0390454.03.74.343.64
40455.04.84.334.6390455.04.73.124.54
40601.00.81.022.9390601.0...............
40602.01.8......2.1290602.0...............
40603.02.8......3.0290603.0...............
40604.03.84.323.3290604.0...............
40605.04.84.834.7390605.0...............

Download table as:  ASCIITypeset images: 1 2

This value is only the approximate error contributed by the recovery method. Other factors, such as using an incorrect shape of the Galaxy potential, might increase the actual error in our calculated value. However, the two model potentials used in this work did not seem to have a large impact on our method. This leads us to believe that any reasonably shaped model potential would likely produce similar results.

Some combinations of initial angles and distances of the simulated dwarf galaxy progenitors resulted in little to no shell structure. This is why, for example, there are no simulations with i = 0°, r0 = 30 kpc, and a mass of 109 M in Table 3, as this simulation did not produce identifiable shell substructure. The simulations with inclination angle i = 30° appeared similar in R.A. and decl. to the simulations of individual components of the VRM debris in Donlon et al. (2019). It is possible that this inclination angle may be close to the correct value of the inclination angle of the progenitor of the VRM.

We only tested simulated evolve times up to 5 Gyr, as that is near the age of radial mergers at which shells stop being easily located. A more thorough suite of tests would include tests exploring the impacts of different progenitor profiles. Even without this exhaustive testing, we believe that our method is able to determine the correct time of collision of the VRM progenitor and the Milky Way, as the method shows clear success in recovering the correct values for a variety of radial merger simulations.

6. Timing the VRM Revisited: Shell Oscillations

Figure 11 shows the results of our oscillating model particle method on the observed data. The top two panels show the value of δ(t) and the number of particles with dr/dt > 0 for the Milky Way potential from Newberg et al. (2010) that was used to generate the simulations in Section 4. A minimum was considered significant if it was at least two standard deviations below the mean of δ(t). Nine significant minima were identified, of which only two corresponded to a time where the number of model particles with dr/dt > 0 was either zero or four. These minimima are for merger times of 2.7 and 8.2 Gyr. It is unlikely that the 8.2 Gyr merger time is realistic due to the phase-mixing constraints provided in Section 4.

Figure 11.

Figure 11. Plots of total separation δ(t) over 10 Gyr for the oscillating caustic surface model outlined in Section 5.2 for the observed data. Below each plot of δ(t) vs. time is a plot of the number of caustic surfaces with dr/dt > 0 over the same time period. The top two panels are for the Orphan Stream Model 5 potential, and the bottom two panels are for the MWPotential2014. Times where δ(t) is at a significant local minimum are marked with dashed vertical lines. A minimum is considered to be significant if it is at least two standard deviations below the mean value of δ(t). In the case of the Orphan Stream Model 5 potential, this corresponds to values less than δ(t) = 67 kpc, and for MWPotential2014, it corresponds to values less than 61 kpc. Times marked with dashed lines that coincide with times where the number of model particles with dr/dt > 0 = 0 or 4 are marked with solid red vertical lines and likely merger times. Corresponding values for each local minima are given in Table 4. We find that 2.7 Gyr is a likely merger time in both models, so we conclude that 2.7 Gyr ago, the progenitor of the VRM collided with the Galactic center.

Standard image High-resolution image

The bottom two panels of Figure 11 show the same values but calculated with the MWPotential2014 model. If we instead use MWPotential2014 to approximate the actual Milky Way, we find 10 significant minima, of which only one corresponds to a time when all of the particles are moving in the same direction. This time is 2.7 Gyr ago, which is consistent with the first minimum of the Orphan Stream Model 5 potential. The 8.2 Gyr merger time is not recovered with the second potential. This fact and the phase-mixing constraints on the merger time of the VRM suggest that the 8.2 Gyr merger time is a false positive. Table 4 shows the positions and velocities of the model particles at local minima for these calculations. From these results, we propose that the progenitor of the VRM collided with the Galactic center 2.7 ± 0.2 Gyr ago.

Table 4. Times and Values of Local Minima of δ(t) and Number of Model Particles with dr/dt > 0 at Each Time in Figure 11

Orphan Stream Model 5 MWPotential2014
Time δ(t)No. dr/dt > 0Time δ(t)No. dr/dt > 0
(Gyr)(kpc) (Gyr)(kpc) 
0.25110.2541
0.56520.5582
2.46212.4473
2.7 30 4 2.7 58 0
3.15812.9303
5.35215.5391
5.66028.0602
8.2 49 0 8.3451
8.53338.9601
   8.9572

Note. A minimum is significant if it is at least two standard deviations from the mean of δ(t). Minima are given for both Milky Way potentials used in this work. Local minima with dr/dt > 0 = 0 or 4 correspond to likely merger times and are given in bold. Both model potentials suggest 2.7 Gyr as a likely merger time. The 8.2 Gyr merger time in the Orphan Stream Model 5 potential is not a likely merger time, as it was not recovered in the second potential and is outside of the phase-mixing constraints for the merger time of the VRM.

Download table as:  ASCIITypeset image

A merger time of 2.7 Gyr ago is consistent with the conclusions from Section 4: phase-mixing constraints on the VRM debris lead one to believe that the progenitor of the VRM collided with the Galactic center between 1 and 5 Gyr ago. A merger time of 2.7 Gyr ago corresponds with the progenitor of the VRM having a final apogalacticon before collision between 20 and 30 kpc and agrees with our previous analysis.

The motion of particles in the halo strongly depends on the shape of the Galactic potential. It is not clear at the moment what shape the Galactic halo potential actually takes. Further complicating the Galactic potential, the Large Magellanic Cloud (LMC) was recently found to be approximately 10% of the total mass of the Milky Way (Erkal et al. 2019) and therefore has a large impact on halo substructure. The LMC is then expected to produce a substantial time-variable torque on the VRM debris throughout its evolution, which would generate precession of the structure and, if anything, accelerate phase mixing. However, the similar collision times we recover from two different potentials are a good sign that the shape of the potential is not a dominant factor.

7. Relationship with Previously Discovered Substructure

7.1. The Argument for an Ancient Gaia Sausage

The concept of the "ancient last major merger" of the Milky Way has occupied the literature on Galactic structure for decades (Kuijken & Gilmore 1989; Gilmore et al. 2002). Gilmore et al. (2002) identified a group of disk stars rotating slower than expected in the local solar neighborhood and claimed that this group was evidence of the last major merger, which had occurred 10–12 Gyr ago. Further, Deason et al. (2013) claimed that a massive merger approximately 10 Gyr ago would explain the break in the density profile of the Milky Way halo at 30 kpc. The theory suggests that in the time since this massive merger event, there has been a period of Galactic quiescence, up until the Sagittarius dSph and Magellanic Cloud mergers that are currently underway.

The last major merger is thought to have formed the Milky Way's thick disk, which has stellar ages older than 10 Gyr. One theory for the creation of a thick disk is that the Milky Way's protodisk was heated by a collision with a large dwarf galaxy merger (Quinn & Goodman 1986; Velazquez & White 1999). This would have caused the existing disk stars to be kicked up on orbits with larger vertical action. Gas in the disk would radiatively cool, so stars formed after this merger event could remain in a cold, younger thin disk (effectively "quenching" the thick disk).

With the release of Gaia DR2, evidence suggested that the last major merger was finally identified. The GSM (Simion et al. 2019) and the Gaia–Enceladus Merger (Helmi et al. 2018) were independently discovered and are interpreted to be the same "last major merger of the Milky Way." The age of this merger event was reported to correspond to the age of the thick disk: between 8 and 11 Gyr. Helmi et al. (2018) found that the youngest age of stars in this merger was 8–10 Gyr, which is consistent with this massive merger indeed having created the thick disk. This agrees with the results of Gallart et al. (2019), who claimed that star formation in the Milky Way transitioned from thick to thin disk star formation around 9.5 Gyr ago, corresponding to the GSM. Gallart et al. (2019) also claimed that both the Gaia Sausage progenitor and the Milky Way's in situ halo had finished the bulk of their star formation by 10 Gyr ago. Belokurov et al. (2019) claimed that star formation in the inner halo ended around the same era, and, assuming that the cause for the quenching of star formation was the merger event itself, this is additional evidence for an ancient merger.

The presumed merger age determines a preferred mass of the ancient merger progenitor. Comparison of the metallicity of the RRLs in the Gaia Sausage to the RRLs in the LMC suggests that the progenitor of the Gaia Sausage had a similar mass as the LMC did 10 Gyr ago (Belokurov et al. 2017; Zinn et al. 2020). Mackereth et al. (2019) analyzed the chemical and orbital properties of the Gaia Sausage in comparison to EAGLE simulations and determined that it likely had a mass of at least 109 M and was accreted around 9 Gyr ago.

This large mass suggests that this merger is the main component of the stellar halo; Deason et al. (2019) showed that the total stellar mass of the halo is on the order of 109 M. Simulations suggested that most of the stars on radial orbits in the halo came from massive satellites approximately 6–10 Gyr ago, and that these stars would dominate the halo between 10 and 30 kpc of the Galactic center (Belokurov et al. 2018). The inner halo is indeed dominated by stars on highly eccentric orbits (Iorio & Belokurov 2019). This is all consistent with the halo being primarily formed from a single merger event.

For all of these reasons, the present view of the Gaia Sausage is that it is an ancient merger event with a stellar mass between 108 and 109 M that is responsible for the formation of the thick disk and makes up the majority of the stellar mass of the halo. We will now attempt to reconcile this viewpoint with the merger time of the VRM progenitor that was calculated in this work.

7.2. The Argument for a Young Gaia Sausage

The Gaia Sausage is characterized by a structure in velocity space with high dispersion in vr and small rotational velocities in the local solar neighborhood (Belokurov et al. 2018). It has been shown that the VRM debris looks very similar to the Gaia Sausage in the local solar neighborhood, so much so that the two structures are likely identical (Donlon et al. 2019). If the VRM debris and Gaia Sausage are identical, then how do we explain the gap between the 2.7 Gyr ago infall time of the progenitor of the VRM and the 8–10 Gyr age of the Gaia Sausage?

In Section 4, we developed an argument that the VRM debris must populate the stellar halo at the distance ranges in which we identify shells. Since the VRM debris dominates the stellar halo, the debris must also be located near and inside the stellar break at ∼20 kpc. To satisfy these conditions, the progenitor of the VRM must have had a final apogalacticon before collision between 20 and 30 kpc. The measured values of the causticality in simulations with r0 = 20 and 30 kpc in Figure 7 match the measured causticality values for the observed data around 1 and 5 Gyr, respectively. This suggests that the merger time of the VRM is within the last 5 Gyr. Additionally, by 5 Gyr after collision, our method's ability to recover the merger time of a radial merger dropped substantially (Figure 10). Finally, this work recovered a merger time of only 2.7 Gyr for the VRM progenitor. All of this suggests that the progenitor of the VRM collided with the Milky Way more recently than 8–11 Gyr ago.

If the dynamical mass of the VRM is actually closer to 1010 M, then even large initial distances constrain its merger time to be within the last 5 Gyr (Figure 7). The GSM is expected to dominate the halo and would then have a dynamical mass of around 109–1010 M. This suggests two possible scenarios: if the GSM is indeed ancient, it either cannot dominate the material in the halo or cannot be responsible for the relatively unmixed stellar overdensities in the halo. In this case, the VRM and GSM cannot be identical events, as the VRM debris populates shells in the VOD and HAC. If the VRM and GSM are not the same, then the Milky Way halo is composed of more than one major merger event. On the other hand, if the GSM is responsible for the VOD and HAC and dominates the halo, then it is unlikely that the merger event occurred more than 5 Gyr ago. If this is the case, the VRM and GSM probably describe the same merger event.

Villalobos & Helmi (2008) ran simulations of dwarf galaxy mergers that collided with a model Galactic disk after falling in from near the virial radius. After an orbital decay over a period of ∼1.5 Gyr, the dwarf galaxy finally collides with the Galaxy from a distance of ∼20 kpc from the Galactic center. Thick disks are shown to form approximately 0.25 Gyr after these collisions. The work states that thick disk and satellite disruption reach an equilibrium ∼2 Gyr after the collision occurs, and that major shell structure is only visible for these 2 Gyr. If the thick disk was formed in this way 8–11 Gyr ago, then shells in the resulting merger debris would not be visible at the present day.

In this work, we use the term "merger time" to describe when the progenitor of the VRM's center of mass passes through the Galactic center. This is a collision between the dwarf galaxy and the Milky Way. If the progenitor for the Gaia Sausage crossed the Milky Way's virial radius sometime around 10 Gyr ago ("infall time"), it could have spent the last 8 Gyr in a prolonged orbit about the Galaxy while it shed its initial energy until finally colliding with the Galaxy 2.7 Gyr ago. This would have resulted in the progenitor of the Gaia Sausage making many passes through the Milky Way over its lifetime, which could cause tidal shocks in both structures. We speculate that these tidal shocks from perigalacticon passes of the Gaia Sausage progenitor could potentially kick up thick disk stars from the Milky Way protodisk through large-scale tidal torques, as well as generate star formation through the collapse of gas clouds in the disk. This could explain the findings that the star formation history of the thick disk oscillates with a period of ∼3 Gyr (Gallart et al. 2019), which is a reasonable time between passes for a distant halo object with a highly eccentric orbit.

We did not include dynamical friction in our models of the VRM in this work, because it does not play a role during and after the disruption of the satellite as it passes through the center of the Milky Way. However, dynamical friction would allow the VRM progenitor to shed energy as it orbits and falls inward from the virial radius. A reduction in energy is required for a dwarf galaxy formed outside or near the Milky Way's virial radius to eventually become substantially more bound to the Milky Way and would cause the VRM progenitor to slow down from earlier nonradial passes of the Milky Way before its final radial collision.

An initial tidal shock from a close encounter with the Milky Way could be responsible for the quenching of star formation in the Gaia Sausage 8–11 Gyr ago (the stellar "age" of the progenitor). However, according to Brown et al. (2014), many dwarf galaxies in the Local Group stopped star formation around the same time 12 Gyr ago during reionization of the universe, so alternatively, it could be that the Gaia Sausage had already stopped active star formation even before it became gravitationally bound to the Milky Way. This could also explain why the star formation in both the Milky Way in situ halo and the Gaia Sausage progenitor appear to slow down around 12 Gyr ago (Gallart et al. 2019), even before the proposed accretion time of the progenitor of the Gaia Sausage around 8–11 Gyr ago. If the accretion event were the primary underlying cause of the drop in star formation rates in both structures, as Gallart et al. (2019) claimed, then it is not expected for star formation to begin slowing down before the accretion event occurs. It is possible that the previous metallicity arguments for the age of the Gaia Sausage were not dating the time of the collision between the dwarf progenitor and the Milky Way but instead were measuring other properties of the infall event. Note, however, that our method for determining merger time only measures the time since the collision between the progenitor and the Milky Way and does not constrain when the progenitor became bound to the Milky Way, star formation was quenched in the Milky Way or the progenitor dwarf galaxy, or the thick disk was created.

If the Gaia Sausage did not collide with the Milky Way 8–11 Gyr ago, then what caused the thick disk? Amarante et al. (2019), Beraldo e Silva et al. (2020), and Clarke et al. (2019) all showed that it is possible to form a thick disk in Milky Way analogs in isolation, provided that there is a period of lumpy disk activity early in the Galaxy's life. Additionally, Ma et al. (2017) showed a cosmological simulation of a Milky Way–mass galaxy forming a bifurcated disk without a collision. These systems are able to generate both the kinematic and chemical properties of the thick and thin disks without merger events. The thick disk may form itself without perturbation from a satellite, in which case, there is no need for the Gaia Sausage to be the perturber of the thick disk. Additionally, Rodriguez Wimberly et al. (2019) claimed that it is unlikely that star formation in a thick disk would be quenched by the infall of a large dwarf galaxy. If either of these situations is the case, then the age of the GSM is not required to line up with the time of the quenching of star formation in the thick disk.

7.3. Additional Possibly Related Substructure

The EPO is a recently discovered overdensity with a lower surface brightness than either the VOD or the HAC. It has been suggested that the EPO is associated with the VOD and HAC and that they share a common origin (Li et al. 2016; Donlon et al. 2019). The results of this work are consistent with this idea; Figure 5 shows ejecta in a direction 120° from the two regions containing the majority of the shell substructure, as viewed from the Galactic center. This work suggests a single "trefoil" structure connecting the three overdensities, instead of a model where material oscillates only between the HAC in the south and the VOD.

We note that the VOD appears to visually extend to higher declinations, tracing the Perpendicular Stream (Weiss et al. 2018b). It is possible that the Perpendicular Stream is associated with one or more shells or that it traces a long radial tail of infalling or outgoing material. It may be beneficial to look for shell structure in the upper portion of the Perpendicular Stream in the future in order to further constrain the behavior of the progenitor of the VRM. A study of the southern portion of the HAC would also be beneficial in exploring the full extent of the VRM debris, as well as continuing the search for shells in other regions of the Milky Way.

The phase-space spiral in the disk is a structure in zvz phase space that has been characterized as ongoing phase mixing in the local solar neighborhood from a recent perturbation in the disk (Li & Shen 2020). The disk is known to not be in equilibrium (Widrow et al. 2012; Carlin et al. 2013; Williams et al. 2013; Yanny & Gardner 2013; Xu et al. 2015), and the vertical disk disequilibrium presents locally as the phase-space spiral (Li & Shen 2020). The phenomenon that caused the phase-space spiral occurred at least 500 Myr ago, and it is suggested that the structure could only exist in the Milky Way for ∼4 Gyr (Li & Shen 2020). Our merger time for the progenitor of the VRM lies within these constraints, and we suggest that it is possible that the cause of the phase-space spiral is indeed the VRM. Perhaps the VRM "wobbled" the Milky Way's central bulge in a way that would propagate waves in the disk and generate vertical motion in disk stars. Further exploration is required to determine the relationship between the VRM and the phase-space spiral, if any exists.

The "Splash" (Belokurov et al. 2019) is another substructure of the Galactic disk in the local solar region. The Splash is characterized by a large population of metal-rich stars on highly radial orbits in the inner halo. While the formation origin of stars in the Splash is not yet known, it has been hypothesized that the Splash consists of stars thrown out of the Milky Way's (proto)disk from the GSM 9.5 Gyr ago (Belokurov et al. 2019). Since the VRM is a radial merger, it is likely that metal-rich material in the Galactic center would be disrupted from the initial impact of the progenitor of the VRM with the Galactic center, which could explain the high-metallicity material in the Splash. Amarante et al. (2019) showed that the Splash could instead have been created through another disk process, such as a lumpy disk early in the Milky Way's formation history. Either way, there is no need for an ancient infall event to have created this substructure.

Snaith et al. (2014) showed an oscillation in star formation of the Milky Way inner disk from 6 Gyr ago until approximately 2 Gyr ago. There is a strong peak in star formation around 2.5 Gyr ago. It is possible that this oscillation in the star formation rate could be due to tidal shocks from the passes of the VRM progenitor as it fell in from the virial radius, and that the burst of star formation 2.5 Gyr ago was due to the infall of gas from the VRM and perturbations in the disk due to the collision event. Mor et al. (2019) also find a strong peak in star formation around 2.7 Gyr ago that may correspond to the VRM. It is also possible that the increase in star formation around 2.5 Gyr ago could be caused at least in part by an impact of the Sagittarius dSph progenitor with the disk; Ruiz-Lara et al. (2020) attributed a peak in the star formation history of the local solar region around 2.0 Gyr ago to such an event. It should be noted that Mor et al. (2019) do not find these peaks in star formation associated with the passages of the Sgr dwarf.

8. Conclusions

The VRM is a recently discovered radial merger in the Galactic halo (Donlon et al. 2019). It is known that radial mergers cause shell substructure (Hernquist & Quinn 1988; Sanderson & Helmi 2013). In this work, we successfully locate these shells and use them to determine an elapsed time of 2.7 ± 0.2 Gyr since the progenitor passed through the center of the Milky Way.

In order to achieve this result, we used Gaia, SDSS, and LAMOST data to build two 6D phase-space data sets: one in the VOD region and another in the HAC region. These regions were chosen due to their high concentration of VRM debris. Each data set contains both RRLs and BHBs in order to maximize the amount of available data. In order to isolate shell substructure, these data were limited to $| {v}_{r}| \lt 10$ km s−1 and $| {L}_{z}| \lt 500\,\mathrm{kpc}$ km s−1. These cuts only allow stars that are on radial orbits and at apogalacticon, where the shells form.

The surface of a shell has a small galactocentric radial velocity, and the density as a function of galactocentric radius has a Gaussian distribution. We fit shell models to the data using a Gaussian mixture model with BIC and AICc to determine the most likely number of shells in our data set in each region. We were able to identify four statistically significant shells in our data: two in the VOD region and two in the HAC region. A simulation of the VRM shows that it can connect the VOD and HAC with the EPO as a single trefoil structure in the halo.

In order to populate the stellar halo inside the stellar break and produce shells at the observed distances, the progenitor of the VRM had to have a final apogalacticon before collision between 20 and 30 kpc. We utilize N-body simulations to analyze phase mixing in radial mergers. Using causticality as a measurement of the phase mixing a radial merger has gone through, we find that radial mergers with r0 between 20 and 30 kpc have similar measured causticality as the observed data earlier than 5 Gyr after collision. Identifiable shell structures are not seen in these simulations after 5 Gyr at the distances where shells are observed in the Milky Way.

Simulations of radial mergers with large masses (∼1010 M) become phased-mixed to the levels of the VOD and HAC within 5 Gyr. If the GSM is indeed ancient, it cannot either dominate the material in the halo or be responsible for the relatively unmixed halo substructure. In this case, the VRM and GSM cannot be identical events, as the VRM was discovered in VOD debris, which is populated with shell structure. If the VRM and GSM are not the same, then the Milky Way halo is composed of more than one major merger event. On the other hand, if the GSM is responsible for the VOD and HAC and dominates the halo, then it is unlikely that the merger event occurred more than 5 Gyr ago. If this is the case, the VRM and GSM probably describe the same merger event.

Caustic surfaces contain stars that oscillate back and forth from one side of the Galaxy to the other. We modeled the four caustic surfaces as point particles and rewound their orbits in a model Milky Way potential. By measuring the total separation of these model particles over a reverse integration of 10 Gyr and requiring that all of the particles be moving in the same direction, we were able to recover the infall times for a collection of simulated radial mergers. Using this technique on the observed data, we determined that the caustic surfaces making up the VRM were all located in the same place with similar velocities approximately 2.7 Gyr ago. This corresponds to a collision between the progenitor of the VRM and the Galactic center 2.7 ± 0.2 Gyr ago.

Based on this age, it is possible that the VRM is responsible for the Splash, the phase-space spiral in the disk, and possibly a burst of star formation in the inner disk through perturbation of the Galactic center during the infall collision.

Although we believe that the VRM and GSM represent the same radial merger event, both our phase-mixing argument and our calculated collision time show that observed levels of phase mixing are too recent to have been caused by a collision 8–11 Gyr ago. This result produces tension between our merger time of the VRM and the published age of the GSM. However, we do not constrain the time at which the progenitor became gravitationally bound to the Milky Way. This apparent conflict could be resolved if the GSM is younger than previously thought or if its published age is closer to the time of gravitational capture than the time of collision with the Milky Way.

We wish to express our deep gratitude to Yang Huang, from Yunnan University, who provided us with a copy of the catalog (Liu et al. 2020) of 5290 RRL spectra of the LAMOST Experiment for Galactic Understanding and Exploration (LEGUE; Deng et al. 2012) and the Sloan Extension for Galactic Understanding and Exploration (SEGUE; Yanny et al. 2009) surveys before it was publicly available. We would like to thank the anonymous referee and statistical editor for their insightful comments that improved this work. This work was supported by NSF grant AST 19-08653; contributions made by Marvin Clan, Babette Josephs, and Manit Limlamai; and the 2015 Crowd Funding Campaign to Support Milky Way Research. L.M.W. was supported by a Discovery Grant with the Natural Sciences and Engineering Research Council of Canada. This project was developed in part at the 2019 Santa Barbara Gaia Sprint, hosted by the Kavli Institute for Theoretical Physics at the University of California, Santa Barbara. This research was supported in part at KITP by the Heising-Simons Foundation and the National Science Foundation under grant No. NSF PHY-1748958. This work has made use of data from the Sloan Digital Sky Survey. Funding for the Sloan Digital Sky Survey IV has been provided by the Alfred P. Sloan Foundation, the U.S. Department of Energy Office of Science, and the Participating Institutions. The SDSS-IV acknowledges support and resources from the Center for High-Performance Computing at the University of Utah. The SDSS website is www.sdss.org. This work also made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC; https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement. This work made use of data from the LAMOST survey. The Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST)) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

Appendix A: Uncertainty in Causticality

Here we derive the uncertainty in causticality as defined in Equation (8). Assuming a Poisson error for each particular Ni  = Nj ,

Equation (A1)

and the partial derivative of the causticality with respect to each Nj can be written as

Equation (A2)

Note that if all bins are multiplied by a factor of m, the uncertainty in causticality becomes ${\sigma }_{C}/\sqrt{m}$. Thus, while the value of causticality remains unchanged when the overall number of data points is increased, the corresponding uncertainty in the causticality decreases. Note that the first and final bins are omitted in the calculation of the uncertainty in causticality. This is because Nj−1 is ill-defined for the first bin, and Nj+1 is likewise ill-defined for the final bin. This rarely impacts the data in practice, as the first and final bins are typically zero.

It is beneficial to get an idea of the effect of bin size on our measurements of causticality. Figure 12 shows causticality for a single radial merger simulation and the observed data, calculated with different histogram bin sizes. Also shown in this figure are the 1σ uncertainties of the causticality for the observed data. We only explore bin sizes ≥2 kpc, since the uncertainty in the distances of the observed data is ∼2 kpc at larger distances. For larger bin widths, the width of a bin is much larger than the expected width of a shell (∼2–3 kpc), and information is lost regarding the shape of the shell structure. We support 2 kpc as a bin width, as it is large enough to eliminate many errors due to observational uncertainties but small enough to resolve shell structure.

Figure 12.

Figure 12. Causticality vs. time for the radial merger simulation with i = 40°, r0 = 30 kpc, and a mass of 109 M. The color of each line denotes the bin size used to calculate causticality. Thick solid lines are causticality for the simulated data, while thin solid and dashed horizontal lines are the measured causticality of the VOD and HAC region data, respectively. The shape of the causticality curve does not change much with bin size, but the location of the curve moves up and to the left as bin size increases. The left group of error bars provides the Poisson errors in the measured causticality for the VOD region, the center group provides Poisson errors for the HAC region, and the right group provides the Poisson errors for the simulated data at the time where each error bar is located. The Poisson errors in the simulated data are much smaller than the observed data, as there are more objects in the simulated data.

Standard image High-resolution image

Note that the general shape of the causticality curve is similar regardless of bin size. A smaller bin size tends to push the equilibrium causticality value closer to zero, and the overall position of the curve is moved to the right as bin size decreases. Curves with larger bin sizes tend to "bottom out" and reach apparent equilibrium more quickly, as smaller shell structures are lost in wide bins.

Figure 12 also shows the 1σ error bars in the causticality values for each region for each bin size. The uncertainties in the observed data (left and center error bars) are fairly large due to small number statistics. The uncertainties in the simulated data are much smaller, as the simulated data have many more objects than the observed data. The merger time estimates from both observed data sets are consistent with the 2.7 Gyr ago result in this work for this single simulation. The uncertainties in the data correspond to an uncertainty in merger time of around ±1 Gyr. The measured causticality values for the observed data also vary substantially for different bin sizes, which is likely due to the small number of objects in the histograms of the observed data.

Appendix B: KLD as a Metric for Phase Mixing

In Section 4, we use causticality as a metric for measuring the amount of phase mixing in simulations of radial mergers. Causticality is introduced in this publication and has not been widely studied. For this reason, we repeat the analysis of the phase-mixing constraints on the VRM using the KLD (Kullback & Leibler 1951) of the distribution. The KLD is a canonical, widely studied statistic that measures how different one distribution is from another and is defined as

Equation (B1)

for two discrete probability distributions, P(r) and Q(r). Each term in the KLD is defined to be equal to zero when P(r) = 0 or Q(r) = 0, in order to avoid singularities in the natural logarithm.

In this case, P(r) is the observed histogram of shell stars as a function of galactocentric radius, and Q(r) is a uniform distribution equal to the mean of P(r). The KLD is then a measure of how much information is gained by using P(r) instead of a uniform distribution, or in other words, the KLD is a measurement of how sharp the peaks in P(r) are. The value of the KLD will be larger when the peaks in P(r) are large and small for a relatively flat distribution of P(r). A more phase-mixed distribution will have smaller peaks than a distribution that has undergone less phase mixing and therefore will have a smaller value of the KLD. The KLD is analagous to causticality in this respect, as smaller values of both quantities correspond to measurements of more phase-mixed distributions. We similarly expect the KLD to decrease over time for each simulation of a radial merger until the merger is completely phase mixed.

Figure 13 shows the measured values of the KLD over time for our radial merger simulations with a mass of 109 M, similar to Figure 7. As in Section 4, we are primarily interested in the simulations r0 = 20 and 30 kpc in Figure 13.

Figure 13.

Figure 13. Values of the KLD vs. time for all 40 simulated radial mergers with a mass of 109 M. The KLD was calculated for each time step for a radial histogram of all stars in the simulation, cut identically as in Figure 7. Dashed horizontal lines give the values of the KLD for the observed data in the VOD (1.32) and HAC (1.19) regions. Observational errors will move the measured KLD of the observed data downward, making the observed data appear older than they actually are. Left: mean KLD for simulations with a given initial inclination angle, averaged over the four initial distances. The inclination angle does not appear to have a monotonic effect on how long shells take to phase mix; increasing the inclination angle could increase or decrease the mixing time. Right: values of the KLD for simulations with a given initial distance. Mean values of the KLD are shown with thick solid lines and averaged over all inclination angles. Individual simulations are shown with thin dashed lines and colored by their initial distance. Simulations with larger initial distances take longer to phase mix, on average, than simulations with smaller initial distances.

Standard image High-resolution image

One notices that the KLD of each simulation "bottoms out" at a lower bound and remains fairly constant for the rest of the simulation. The KLD does not bottom out at zero for our radial merger simulations because it assumes that a mixed distribution is uniform, while histograms of relaxed radial merger simulations tend toward smooth Gaussian-like distributions. In theory, one could use a Gaussian distribution centered on r0 as the denominator for the KLD in order to get a more accurate measurement of how phase mixed the distribution is. However, this requires knowledge of the phase-mixed distribution, and it may turn out that a Gaussian is not actually a good fit to the phase-mixed distribution. Simulations with smaller values of r0 appear to bottom out more quickly than simulations with larger values of r0. We anticipate that once a simulation approaches this lower bound, its overall structure does not change much, and it has effectively reached equilibrium. When a merger event reaches equilibrium, shells are no longer identifiable. Since we see shell structure in the observed data, the KLD of the VRM cannot have bottomed out yet.

In Section 4, we argued that the value of r0 for the VRM is likely between 20 and 30 kpc. The radial merger simulations with these values of r0 all bottom out before 5 Gyr. Since the VRM cannot have bottomed out yet, we claim that the VRM merger time occurred within the last 5 Gyr.

We seek to evaluate whether our choice of bin size has an adverse effect on our measurements of the KLD. In order to do this, we analytically calculate the value of the KLD for a continuous Gaussian distribution in our model and then compare this value to the KLD for a binned approximation of a Gaussian in our model. We do this over a variety of Gaussian shapes and bin sizes.

Over a pair of continuous probability distributions, the KLD becomes

Equation (B2)

In our case, our probability distributions are defined over r = [0, l]. Elsewhere, we take P(r) to be zero. So the KLD becomes

Equation (B3)

where we have used

Equation (B4)

in order to satisfy the normalization requirement

Equation (B5)

Our model takes P(r) to be a normalized Gaussian distribution:

Equation (B6)

The tails of P(r) quickly approach zero as one moves away from its peak. This makes our KLD

Equation (B7)

as long as the Gaussian distribution is essentially within the discrete boundaries. Evaluating this integral leaves us with

Equation (B8)

which only depends on the width of the Gaussian distribution.

This value changes slightly for N Gaussian distributions that do not overlap, which is a fair approximation of the simulated data. This change is easily computed; for N Gaussians, the new P(r) is given as

Equation (B9)

where Pi (r) is the probability distribution for the ith Gaussian, and the coefficients Ci satisfy

Equation (B10)

The integral in Equation (B7) then splits into

Equation (B11)

Since the Gaussians do not overlap,

Equation (B12)

because the integrand is required to disappear for all regions where P(r) does not overlap with Pi (r). If this was not the case, singularities would arise in the natural logarithm. This reduces to

Equation (B13)

The first integral is equivalent to the value given in Equation (B8). The second integral is simply a normalized Gaussian integral, which is equal to unity. For N Gaussian distributions with variances ci , one obtains

Equation (B14)

Figure 14 shows the ratio of the KLD computed for a single binned Gaussian to the analytical KLD as a function of bin size. The bin size used in this work, 2 kpc, is marked with a dashed red line. For Gaussian models with values of c ≥ 1 kpc, the binned KLD is equal to the analytical KLD to within a few percent when 2 kpc bins are used. As the variance of the Gaussian drops below 1 kpc, our binned KLD begins to underapproximate the actual value of the KLD by 10%–20% per Gaussian when using 2 kpc bin widths. A typical shell width in the radial merger simulations is between 0.8 and 1.2 kpc, so we conclude that our bin size is appropriate for the data.

Figure 14.

Figure 14. Ratio of a binned KLD of a single Gaussian to the analytical (correct) value of the KLD with respect to bin size of the binned distribution. We mark the bin size used in this work, 2 kpc, with a dashed red line. For Gaussian widths c ≥ 1 kpc, a 2 kpc bin size approximates the KLD within a few percent of the correct value. For smaller values of c, we begin to underestimate the value of the KLD. Typical shell widths in our simulations are ≥0.8 kpc, so we feel that our bin size is appropriate for the data. The variable periodicity in the values of the KLD has to do with changes in how many bins are used to approximate each Gaussian, which depends on bin size.

Standard image High-resolution image

While decreasing the bin width used in this work would provide a more accurate estimation of the KLD, we chose a bin width of 2 kpc, as our average distance error was ±0.5 kpc. Decreasing the size of the bins in the observed data would increase noise in the data and decrease the quality of our fits to the observed data. We expect a bin width of 2 kpc to produce KLD estimates within a few percent of the actual values (Figure 14). We also elected to keep the bin sizes of the simulated data equal to the bin sizes of the observed data in order to allow for easy comparison.

Note that if we are actually underestimating the values of the KLD calculated in Figure 13, then the distributions should be moved upward. This will increase the time since collision at which the distribution appears similar to the observed data. However, since we show that the KLD is only off by a few percent per Gaussian component, the effect of this error on the estimated time since collision from phase mixing will be small (something like 10%). This change in the KLD for the simulations with r0 = 20 and 30 kpc corresponds to a change of <1 Gyr in the upper constraint of the VRM merger time. Since we do not use phase-mixing arguments to calculate the precise merger time of the VRM, this is still consistent with the conclusions of this work. Note that the addition of background halo stars in the simulations used in Figure 13 would decrease the measured KLD and partially cancel out the increase in the measured KLD due to binning.

A description of the simulations is provided in Section 4.1. A range of inclination angles (i), evolve times (te ), and initial distances (r0) were tested, and the results of the method described in Section 5 are listed above. The actual merger time calculated by when the dwarf galaxy's center of mass passes through the Galactic center is tm , and the recovered merger time calculated with our method is tr . These values are also shown in Figure 10. We show values calculated by both the Newberg et al. (2010) model potential and the MWPotential2014 model. Simulations that did not generate shell structure and time steps where the method was unable to recover a single merger time are given as blank rows. Of all trials, 74.5% recovered a merger time in at least one of the model potentials. The majority of instances where merger times were not recovered were due to a lack of shell structure in the simulated data.

Please wait… references are loading.
10.3847/1538-4357/abb5f6