Revealing the Structure and Internal Rotation of the Sagittarius Dwarf Spheroidal Galaxy with Gaia and Machine Learning

, , , , , and

Published 2021 February 26 © 2021. The American Astronomical Society. All rights reserved.
, , Citation Andrés del Pino et al 2021 ApJ 908 244 DOI 10.3847/1538-4357/abd5bf

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/908/2/244

Abstract

We present a detailed study of the internal structure and kinematics of the core of the Sagittarius dwarf spheroidal galaxy (Sgr). Using machine-learning techniques, we have combined the information provided by 3300 RR Lyrae stars, more than 2000 spectroscopically observed stars, and the Gaia second data release to derive the full phase space, i.e., 3D positions and kinematics, of more than 1.2 × 105 member stars in the core of the galaxy. Our results show that Sgr has a bar structure ∼2.5 kpc long, and that tidal tails emerge from its tips to form what it is known as the Sgr stream. The main body of the galaxy, strongly sheared by tidal forces, is a triaxial (almost prolate) ellipsoid with its longest principal axis of inertia inclined 43° ± 6° with respect to the plane of the sky and axis ratios of 1:0.67:0.60. Its external regions are expanding mainly along its longest principal axis, yet the galaxy conserves an inner core of about 500×330×300 pc that shows no net expansion and is rotating at vrot = 4.13 ± 0.16 km s−1. The internal angular momentum of the galaxy forms an angle θ = 18° ± 6° with respect to its orbital angular momentum, meaning that Sgr is in an inclined prograde orbit around the Milky Way. We compared our results with predictions from N-body models with spherical, pressure-supported progenitors and a model whose progenitor is a flattened rotating disk. Only the rotating model, based on preexisting simulations aimed at reproducing the line-of-sight velocity gradients observed in Sgr, was able to reproduce the observed properties in the core of the galaxy.

Export citation and abstract BibTeX RIS

1. Introduction

The dwarf spheroidal galaxy of Sagittarius (Sgr) is the nearest and most prominent example of ongoing galactic cannibalism in the entire sky (Ibata et al. 1994). The majority of its stars have been stripped from the main body due to tidal forces from the Milky Way (MW; Niederste-Ostholt et al. 2010) and span a >360° great circle on the sky forming what is known as the Sgr stream (Majewski et al. 2003; Law & Majewski 2016). This structure is one of the best tracers of the MW's potential well that we have. In turn, Sgr is thought to shake the MW, inducing vertical displacements, kinematic distortions, and star formation in the MW disk (Purcell et al. 2011; Antoja et al. 2018; Laporte et al. 2018; Ruiz-Lara et al. 2020). The stars of the Sgr system show a wide dispersion in age and composition despite the current lack of gas, indicating a gas-rich progenitor galaxy heavily modified by the process of infall (as reviewed in Law & Majewski 2016). Tidal forces also put their imprint on the elongated structure of the galaxy in the present day (Ibata et al. 1997), but the dwarf's structure still offers encoded clues to the primeval condition of its progenitor.

While the low Galactic latitude, large spatial extension (${r}_{h}=342^{\prime} \pm 12^{\prime} $; McConnachie 2012), and significant extinction toward Sgr place obstacles in the way of its study, each new survey covering the Sgr footprint provides us with a clearer view of its stellar content and kinematic properties. Even so, numerous features of the Sgr system remain to be understood. There have been several attempts at modeling and reproducing the stream (Law & Majewski 2010; Gibbons et al. 2014; Dierickx & Loeb 2017; Fardal et al. 2019; Cunningham et al. 2020). While all are generally consistent with the shape of the stream, many struggle to reproduce its kinematics and the bifurcation of the stream now observed in both the leading and trailing arms (Belokurov et al. 2006; Koposov et al. 2012; Ramos et al. 2020). This feature, as well as other kinematic characteristics, could be naturally explained if the Sgr progenitor was an inclined rotating disk in infall into the MW potential well (Peñarrubia et al. 2010). Then, Sgr's morphology and kinematics would have been eroded after several close passages through the MW halo, resulting in its current spheroidal shape and stirred kinematics, although some residual rotation signal is expected to remain in the core of the galaxy (Mayer 2010, and references therein).

Line-of-sight velocities (vlos) have been derived for several thousand of Sgr's giant stars (Ibata et al. 1997; Giuffrida et al. 2010; Peñarrubia et al. 2011; Frinchaboy et al. 2012; McDonald et al. 2012), a significant effort that has allowed comparisons with N-body model predictions. The core of Sgr is being disrupted by the MW's tidal forces, but both rotating and pressure-supported N-body models have reproduced well the observed vlos profiles (Łokas et al. 2010; Frinchaboy et al. 2012), leaving the rotation as an open question. The shape of Sgr is also under discussion. Rotating models normally result in a prolate spheroid with tidally induced bars, while pressure-supported models result in more spherical shapes. Different observation techniques, on the other hand, have led to mixed results between prolate (Ibata et al. 1997) and triaxial (Ferguson & Strigari 2020) spheroids.

Recently, the Gaia second data release (DR2; Gaia Collaboration et al. 2018a) has given us access to the tangential component of the stellar velocities of its stars, or proper motions (PMs). This has increased current interest in Sgr, as shown in particular by two papers that appeared while we were working on this one (Ferguson & Strigari 2020; Vasiliev & Belokurov 2020). The former uses RR Lyrae variable stars to analyze the spheroidal shape of the core of the galaxy, while the latter combines DR2 with spectroscopic vlos available in the literature to try to find the best N-body reproduction of the observed features of the Sgr core (though not the stream).

In this paper, we revisit the galaxy from a different perspective. We did not try to fit an N-body model to the data and learn from it. Instead, we used machine-learning (ML) techniques to combine three different data sets and predict the full phase space (3D positions and velocities) of each individual star in the core of the galaxy. This allowed us to construct the full phase space of the core of the galaxy without using any physical prior or constraint, study its internal dynamics in detail, and compare the results with physically motivated N-body models.

The paper is organized as follows. In Section 2, we present the data and preliminary membership selection. Section 3 explains the ML models, their predictions for distance and vlos, and the final selection of member stars. Section 4 shows the color–magnitude diagram (CMD) of the selected member stars. Section 5 shows how the selected member stars are arranged in the PM–parallax space and the bulk properties of the galaxy as it orbits the MW. Section 6 introduces a comoving coordinate system centered in Sgr's center of mass (COM). Section 7 analyzes the internal dynamics of Sgr in 6D using different projections and coordinate systems. In Section 8, a comparison with three N-body models is shown. Our results are discussed in Section 9. Finally, a summary and the main conclusions of the paper are presented in Section 10.

2. First Data Processing

We downloaded all of the stars from the Gaia gaia_source table in a circular region of 5fdg7 radius around the Sgr central coordinates (1 half-light radius). Only stars with available colors and astrometric solutions were considered. We imposed parallax ϖ and PM cuts in order to select stars compatible with the bulk and internal dynamical properties of Sgr. In particular, we selected stars compatible within 3σ with the Sgr bulk dynamic properties reported by Gaia Collaboration et al. (2018b) and their respective 3σ errors and with the dispersion expected in the PMs resulting from the projected real intrinsic velocity dispersion observed in vlos of its stars (11.4 km s−1; McConnachie 2012). This selection was made by taking into account Gaia systematic errors calculated from Equations (16) and (18) from Lindegren et al. (2018). We also accounted for errors being 1.1 times larger than the ones listed in the gaia_source table, following the DR2 verification papers' findings that the uncertainties may be underestimated by this factor (Arenou et al. 2018; Lindegren et al. 2018). Once the data were downloaded, we multiplied all statistical astrometric errors by this factor prior to any further analysis.

We proceeded to impose various quality cuts intended to screen out bad measurements. First, we removed sources with bad astrometric fits following Equation (C.1) in Lindegren et al. (2018) and the GAIA-C3-TN-LU-LL-124 5 technical note: defining the renormalized unit weight error (RUWE) ≡ (astrometric_chi2_al/astrometric_n_good_obs_al − 5)1/2/u0(Gmag,GBPmagGBRmag), we require $\mathrm{RUWE}\,\lt 1.2\times \max (1,\exp (-0.2(G-19.5)))$ and RUWE < 1.4. We used the u0(Gmag, GBPmagGBRmag) reference value available on the ESA Gaia DR2 Known Issues page. 6 This selection removed approximately 2% of the stars from the original list, leaving a sample of 2,877,172 stars. Figure 1 shows the distribution in the sky of the data fulfilling our quality cuts.

Figure 1.

Figure 1. Stellar surface density for all stars passing our first quality and astrometric cuts. Coordinates are in degrees in the celestial sphere. Stripes are related to the Gaia scanning pattern. At the center of the image, M54 is clearly visible, whereas MW stars populate mostly the upper right regions.

Standard image High-resolution image

We corrected the photometry for distance modulus and reddening. A distance modulus of (mM)0 = 17.10 ± 0.15 (26 ± 2 kpc) was adopted for Sgr (McConnachie 2012). The reddening correction was applied individually over each star by cross-correlating its position with the dust maps of Schlafly & Finkbeiner (2011) and using coefficients from Evans et al. (2018) for the conversion to the Gaia bands.

We next consider a set of nested criteria for our membership selection based on PM, parallax, and position in the CMD. We start by selecting stars compatible within 3σ with a set of PARSEC isochrones (Bressan et al. 2012), with ages from 5 to 13.5 Gyr and metallicities spanning [Fe/H] = −0.4 (McConnachie 2012) to −2.5.

The selected stars are then observed in the PM–parallax space, where member stars are expected to cluster around the bulk PMs and parallax values of Sgr. A Gaussian model is fitted to the data in this space. Stars are scored based on their logarithmic likelihood of belonging to the Gaussian distribution. Stars whose score, , is lower than the median of all scores' distribution minus n times the sigma of that distribution are rejected. This is the logical expression,

Equation (1)

that stars must meet. The value that maximized the selection of Sgr stars while keeping the MW's pollution low was n = 2 (see Appendix B).

The fitting of the model is performed iteratively until self-convergence, refining the centroid of the distribution while rejecting stars not compatible with it. As a first guess for the parameters of the Gaussian during the first iteration, we used the PMs and parallax provided in Table C.2 of Gaia Collaboration et al. (2018b).

The aforementioned procedure guarantees that stars not compatible with the bulk PMs or parallax of Sgr are rejected, yet some MW stars may lie inside the distribution of selected stars in the PM–parallax space and thus be selected as possible members. Most of these contaminants, however, will show PMs and parallaxes that are not consistent with those from its surrounding neighbor stars, as they follow different trends in such quantities across the sky. In order to screen out most of these contaminants, we divided the sky into a grid of square cells of 1° on a side and applied the procedure described in the paragraph above to each of the cells individually, rejecting stars not fulfilling our logarithmic-likelihood condition. After convergence is reached in all cells, we introduce a shift in the grid position and repeat the procedure in the displaced cells until convergence. The shifts are introduced randomly in steps of 0fdg5 in the west and north directions independently, $\left[{\rm{\Delta }}(x),{\rm{\Delta }}(y)\right]$. After all shifts have been covered (four in total), the procedure starts again from $\left[{\rm{\Delta }}(x),{\rm{\Delta }}(y)\right]=(0,0)$. The whole procedure converges when no further stars are rejected from any cell on any shift after three consecutive iterations.

3. Full Phase Space: ML Predictions

The full phase space for all stars is required in order to fully understand the internal structure and kinematics of the galaxy. Therefore, distances, D, and line-of-sight velocities, vlos, must be derived and combined with PMs and sky coordinates. However, and despite its high astrometric performance, Gaia parallax precision does not allow one to derive distances for Sgr stars. Nor can the vlos be studied through Gaia in detail; the tip of the Sgr red giant branch (RGB) is located at G ∼ 14, 1 mag below the nominal limiting magnitude with Gaia vlos measurements, which results in only a handful of Sgr stars with vlos scattered along the sky.

To overcome this, we have made use of ML models to derive such quantities for each individual star in our sample. Generally speaking, a model, f( x ), is trained over a sample on the dependence of a variable ytrue to a set of independent variables, x , to predict such a variable, ypred, from a different sample of x .

3.1. Training Sets

We assembled two different catalogs that will be used later to train prediction models to determine D and vlos for all stars from the gaia_source table. The vlos training table consists of 2427 stars with observed line-of-sight velocities, vlos,true, either from gaia_source, the APOGEE survey, or one of the following publicly available catalogs: Ibata et al. (1997), Giuffrida et al. (2010), Frinchaboy et al. (2012), and McDonald et al. (2012). Common stars between the catalogs show consistent measurements in most cases. For stars with less than 10 km s−1 of standard deviation between catalogs, we adopted the error-weighted average of their vlos, true as the final value. Stars with larger standard deviation values were rejected.

We composed the distance training table from more than 4000 RR Lyrae stars around the Sgr center from the SOS and VariClassifier catalogs published by DPAC making use of Gaia DR2 (Holl et al. 2018; Clementini et al. 2019; Rimoldini et al. 2019). Contaminants were removed following the method described in Rimoldini et al. (2019). Their observed distances, Dtrue, are derived from their pulsation modes, therefore making them independent from the astrometric properties of the star. We estimated an average error of 1.16 kpc in Dtrue from M54 member stars that accounts for both observational and calibration effects (see Appendix C.1.1). Details on how the joint SOS and VariClassifier catalog was assembled and the computation of the RR Lyrae distances are available in Mateu et al. (2020).

Undesired systematic bias effects can occur when the training sample does not fully cover the entire space defined by x from which later predictions will be made. To avoid such problems, we initially considered a wider PM and ϖ range, as well as a wider region in the sky for the training lists than for the main catalog. Specifically, we selected all stars within 7fdg7 of the central coordinates of Sgr with PMs and ϖ compatible with Sgr by ±1 mas yr−1 and ±0.5 plus 3σ mas, respectively. We then cleaned the two catalogs of contaminants and poorly measured stars following the exact same criteria as for the gaia_source data, ensuring that the three catalogs are subsamples representative of the same set of stars. Up to 3422 RR Lyrae and 2310 stars remained in the distance and line-of-sight velocity training sets, respectively, after the first membership selection. As a last step before fitting the models to the training lists, we standardized all variables in x by subtracting their averages and dividing them by their standard deviation.

3.2. Predicting Velocities and Distances

The two training sets were used to create models that predict distances and line-of-sight velocities on the gaia_source table.

To obtain D, we used a k-neighbors regressor (KNR) model. The KNR models predict ypred by local interpolation of the ytrue of the k nearest neighbors in the parameter space defined by x in the training set. This interpolation is, in our case, weighted by the inverse of the euclidean distance to the target star in the parameter space defined by x . The KNR models are highly scalable and perform fast on large and well-distributed samples. The procedure is as follows.

The KNR model is fitted to the training distances, ytrue, using sky coordinates and a set of astrometric magnitudes as x . The number of neighbors, k, and best combination of independent variables in x are determined through a grid search within a nested cross-validation (NCV) of the training set minimizing (a) the variance in the differences between the predicted and the true values for the distance (generalization error) and (b) possible artificial gradients along the sky introduced by the model (see Appendix C). The validated model is then used to predict distances on the gaia_source based on the same independent variables.

Choosing a KNR over other models has the advantage that the model regularizes (flattens) ytrue when k > 1 is used, avoiding overfitting the data. The relatively large errors observed in our distances (1.16 kpc) and the large FWHM of their distribution (3.69 kpc) compared to previous works (Hamanowicz et al. 2016) indicate that regularization and averaging of the training sample are required prior to deriving distances for the main table (for more information about the errors in the training list and the KNR performance, see Appendix C.1).

The procedure to derive vlos is similar, but due to the sparse nature of the data (their distribution in the sky is far from uniform) and their nonlinearity, we decided to use a stacked regressor (SR; Wolpert 1992; Breiman 1996). The SR models take advantage of different learners by stacking their output through a final regressor that computes the final prediction. They perform much slower than each of the estimators separately, but their results are at least as good as the best of the stacked estimators while improving the results in most cases. We combined six learners, three nonlinear (extra trees, support vector machine with a radial basis function, and Gaussian process) and three linear (lasso, support vector machine with a linear kernel, and elastic net), and used a neural network to combine their outputs. The SR was trained through an out-of-fold scheme: in k = 5 successive iterations, k-1 folds are used to fit the first layer of regressors to make predictions on the remaining subset. The predictions are then stacked and provided as input to the final metaregressor. The optimal metaparameters of the six learners inside our SR model, as well as the best combination of independent variables in x , are optimized through a grid search over the metaparameter space within an NCV. The resulting model was able to reproduce nonlinear features while showing stability against high-order oscillations in areas where no information is available, such as the outer regions of the galaxy, where the data density is much lower. (For more information about the SR architecture and performance, see Appendix C.2.)

The procedure to derive ypred and its associated error is done in a Monte Carlo fashion, considering and propagating the errors from all independent and dependent variables. A total of 106 realizations were performed, randomly sampling each variable from a normal distribution centered on its corresponding nominal value. The predicted values using the nominal values of the independent variables are assumed to be the nominal values of the final result. The quadrature addition of the standard deviation of all realizations plus the square root of the generalization error derived from the NCV test is adopted as the total error.

Stars in the main catalog and the two training sets are finally selected based on their full velocity space, i.e., 3D velocities, through a 3D Gaussian using the same rejection criteria adopted in Section 2. No rejection was imposed on the spatial coordinates. The whole procedure is repeated until no more stars are rejected and the solution converges. The final lists contain 3305, 1992, and 120,168 stars for the RR Lyrae, the vlos, and the final clean sample, respectively.

Values for k and the best combination of independent variables, x , vary a bit from one iteration to another for the KNR. A larger k produces more stable results but at the cost of flattening the results too much, artificially removing gradients from the data. Our criteria seem to be a good compromise, minimizing such changes in distance gradients and keeping low deviations from the true values. The best results were obtained using k between 10 and 30, and x = (x, y, μx , μy , ϖ), where x, y, μx , and μy are the orthographic projections of the usual celestial coordinates and PMs (see Appendix C.1.2). The optimal metaparameters of the SR model, as well as the best x , also vary between iterations yet produce fully compatible results. See Appendix C for more information.

3.3. Final Distances and Line-of-sight Velocities

After convergence, we evaluated the performance of both models by comparing ytrue and ypred from the NCV tests on each corresponding final training list. The NCV procedure uses a series of train, validation, and test set splits to effectively optimize and test the model over different data sets, thus allowing an independent assessment of the models' ability to predict on new data.

Figure 2 shows the predicted distances, D, for our final clean sample and the residuals from the NCV test, DtrueDpred, on the coordinate system (Λ, B), defined along the stream (Law & Majewski 2010). Here Λ almost coincides with the optical major axis of the system (as projected on the sky) and B with the minor one (slightly offset with respect to the center of the galaxy). A linear fit to the training set, ytrue, has been added to shown the general tendency: the western regions of the galaxy (negative Λ) are, on average, closer to the Sun than the eastern parts (positive Λ). A less obvious trend can also be seen along B, with negative B closer to us.

Figure 2.

Figure 2. Predicted distances for our final sample of stars on the stream coordinate system, (Λ, B), shown in the top and bottom panel, respectively (Law & Majewski 2010). Contours show stellar density for the predicted distances, ypred, on the clean sample. Points show the location of RR Lyrae stars selected as Sgr members in our training set. The average distance measured on the training sample is marked by a black horizontal line. Residuals measured from the NCV on the training list are shown as DtrueDpred along both coordinates. Red dashed lines show a linear fit to the training data and residuals. The parameters of the fits are shown in the top right part of each plot. For clarity, only the error bars of the residuals are shown. The typical error for the RR Lyrae star distances is ∼1.16 kpc (see Appendix C.1.1).

Standard image High-resolution image

The agreement between the predicted and true distances is very good, with both distributions showing the same average behavior and a constant variance between Dtrue and Dpred along both axes. Some departures from the linear model (red dashed line) can be easily spotted in our predicted distances for the clean sample (contours), especially along Λ. These departures are also present on the training sample and point to a real twist in the distance along the line of sight. The KNR does a good job regularizing the distances, with a line-of-sight thickness FWHM ∼ 2.2 kpc for horizontal-branch (HB) stars around the instability strip. This is closer to the 2.42 kpc found by the OGLE team (Hamanowicz et al. 2016) and our real FWHM estimation of 2.48 kpc than the FWHM measured directly over our RR Lyrae sample (3.69 kpc). These are not worrisome differences for our purposes of analyzing the general distance trends in Sgr, further taking into account that a number of factors could be affecting the measured thickness along the line of sight, such as the number of stars used, metallicity calibration, etc. See Appendix C.1.1 for more information.

Figure 3 shows the predicted and true vlos for the clean final sample and the training sample, respectively. Again, some departure from the linear fit can be observed in both samples, especially in the very central regions (∣Λ∣ < 1°) and the most external ones (∣Λ∣ > 4°). These changes roughly coincide with the points at which changes in distance are also present, indicating the presence of tidal tails with a slightly different kinematics than the core of the galaxy.

Figure 3.

Figure 3. Same as Figure 2 for vlos. Error bars show the error in the measured vlos.

Standard image High-resolution image

Interestingly, the standard deviation of the differences between the predicted vlos and the real one was found to be σ(vlos) = 11.7 km s−1, just above the internal velocity dispersion of the galaxy. This, combined with the small observational errors in the training sample and the fact that the spatial distribution of this quantity along the sky is very similar to the one from intrinsic dispersion, suggests that the model fails to reproduce the internal vlos dispersion of Sgr but reproduces well the general trends in the galaxy. This comes as no surprise, since the model has been regularized to avoid overfitting, which also limits its capacity to fit random velocities. We take that into account during our subsequent Monte Carlo experiments by allowing the stars to have a vlos in a range that includes this dispersion. For distances, differences were of the order of the error in distance for individual RR Lyraes. In this case, the largest differences were observed for the most extreme cases, i.e., stars with the largest or smallest distances in the training sample, indicating that it is a systematic effect caused by the regularization of the predicted distances, rather than random errors. More information can be found in Appendix C.2.

3.4. Possible MW Contaminants

Despite all of our efforts to clean up our sample from nonmember stars, some MW contaminants may have passed through our screening process. To assess this contamination rate, we compared our member star list with an equivalent sample from the Besancon 2016 model of the Galaxy using the Gaia Object Generator (GOG) model (Luri et al. 2014). We simulated DR2 errors in the GOG model and performed the exact same selections as with real data (see Appendix A for further information). This resulted in ∼7500 GOG stars passing through our membership selection method, which indicates a ∼ 6% of possible contamination from MW stars in our final sample. Figure 4 shows the final distribution in the sky of the stars selected as members and rejected for the gaia_source table and GOG model.

Figure 4.

Figure 4. Stellar surface density of Sgr selected members (top left), possible MW stars polluting the sample inferred from the GOG model (top right), rejected contaminants (bottom left), and rejected contaminants from the GOG model (bottom right). Each row of panels shares the same color shading. The color shading for the rejected stars has been divided by 20 with respect to the top panels. Stripes crossing the data on a diagonal are due to Gaia systematic errors.

Standard image High-resolution image

The stars from the GOG model that have passed our selection criteria are stars whose phase space, color, and magnitude are indistinguishable from Sgr member stars, not only in bulk but by sectors along the sky. Their contribution to our final measurements is well below the 1σ level of uncertainty of these; thus, we do not expect any critical impact from their possible presence in our results. It is worth noting that some probable Sgr members are rejected (overdensity in the center area of the bottom left panel in Figure 4). These are mostly stars with large astrometric and photometric errors. Their distribution in the sky follows the stellar density profile of Sgr, and their PMs randomly scatter in the vicinity of the cluster of members in Figure 6. These stars are expected to have a null contribution to the bulk dynamical properties of the galaxy given their small number and the fact that their astrometric properties are indistinguishable from those of Sgr in bulk and their surrounding neighbor stars within 1° (see Section 2).

4. Color–Magnitude Diagram

The distance-calibrated and reddening-corrected CMD of Sgr is shown in Figure 5. A prominent RGB climbs the CMD with its tip at G0 ∼ −3.2. Together with a conspicuous HB, visible at G0 ∼ 0.5, this shows the presence of mixed stellar populations with ages spanning intermediate to old and metallicities spanning [Fe/H] = −0.5 to <−2, consistent with our understanding of the Sgr populations. The MW stars are distributed mostly in the range $0\lt {({G}_{\mathrm{BP}}-{G}_{\mathrm{RP}})}_{0}\lt 2$, forming two vertical plumes.

Figure 5.

Figure 5. Calibrated CMD of Sgr. Black dots are Sgr selected members. Stars screened out as contaminants are shown in gray. The RR Lyrae and spectroscopically observed stars selected as Sgr members are shown in blue and red, respectively. Two isochrones from the PARSEC stellar evolution library have been drawn over the CMD: [Fe/H] = −3.3, 13.5 Gyr (orange) and [Fe/H] = −0.4, 5 Gyr (light blue).

Standard image High-resolution image

Stars from both training samples are also included in the CMD. Member RR Lyrae stars from the distance training sample (blue dots) were calibrated using their true distance (ytrue). Stars from the vlos training sample (red dots), on the other hand, were calibrated using their predicted distances (ypred). The agreement between the three samples is very good, e.g., the location of the tip of the RGB coincides for the main and the vlos sample, and the RR Lyrae stars lie in the HB, as expected. The dispersion observed for the RR Lyrae stars in G0 is due to their brightness variability and the different reddening maps used to calibrate their distance. These errors are averaged out by the KNR model in the main sample.

5. Sagittarius Bulk Dynamics

5.1. Sagittarius in the Sky

The final selection of member stars is shown in the PM–parallax space in Figure 6. The set of stars has its center at (μα, μδ , ϖ) = ( −2.6924 ± 0.0006, −1.3713 ± 0.0005, 0.0176 ± 0.0003), with some correlation between parallax and PMs, which makes the simultaneous fitting of the three variables necessary in order to maximize the member/nonmember ratio. Our algorithm does a good job of rejecting MW stars (∼2.7 × 106) while keeping a high fraction of Sgr stars in the sample (∼1.4 × 105).

Figure 6.

Figure 6. Distribution of stars in the PM–parallax space. Member stars (1.2 × 105) are shown by black dots. Gray dots show stars ruled out as members (2.74 × 106). A red point with error bars (not visible due to their small size) shows the location of the COM.

Standard image High-resolution image

Our membership selection method iteratively finds the centroid of all stars within 2σ of a 6D Gaussian fitted to their galactocentric Cartesian phase space (X, Y, Z, vX , vY , vZ ) as the error-weighted average of such coordinates. The resulting centroid is assumed to be the COM of Sgr, and its coordinates are then transformed to sky coordinates (R.A., decl., D, μα, μδ , vlos). Such a transformation is done assuming the distance of the Sun from the Galactic center and the circular velocity of the local standard of rest (LSR) to be R0 = 8.29 ± 0.16 kpc and V0 = 239 ± 5 km s−1, respectively (McMillan 2011). The solar peculiar velocity with respect to the LSR was taken from the estimates of Schönrich et al. (2010): (Upec, Vpec, Wpec) = (11.10, 12.24, 7.25) km s−1 with uncertainties of (1.23, 2.05, 0.62) km s−1.

The M54 stars could be shifting the centroid of Sgr toward the center of the globular cluster. In order to study this influence, we recalculated the COM for our table upon removal of all stars around M54 up to its tidal radius. We assumed rt = rc 10c , where c = 2.04 and rc = 0farcm09 are the concentration and core radius of the cluster (Harris 1996, 2010 edition). This provided a list of 1439 stars, from which we selected 352 very likely members of M54 based on their position on the CMD.

In Table 1, we list the sky coordinates and velocities for the COM of all member stars from the last iteration of our selection method. The first row shows the COM for the whole sample, including M54 stars. The second row shows the COM after removing M54 stars within its tidal radius. The third row shows the COM for M54 based on our 352 stars selected. The uncertainties listed here were obtained from a Monte Carlo scheme that propagates all observational uncertainties and their correlations, including those for the Sun and possible systematic errors (see Appendix D).

Table 1. Sgr's COM Astrometric Properties in Sky Coordinates

  α δ D μα μδ vlos
 (deg)(deg)(kpc)(mas yr−1)(mas yr−1)(km s−1)
Sag with M54283.909 ± 0.016−30.599 ± 0.00725.933 ± 0.006−2.7049 ± 0.0027−1.3648 ± 0.0025142.71 ± 0.08
    (−2.73 ± 0.05)(−1.45 ± 0.05) 
Sag without M54283.945 ± 0.019−30.647 ± 0.00925.951 ± 0.006−2.6808 ± 0.0026−1.3344 ± 0.0025141.61 ± 0.09
    (−2.71 ± 0.05)(−1.42 ± 0.05) 
M54283.781 ± 0.009−30.468 ± 0.00525.90 ± 0.06−2.722 ± 0.035−1.367 ± 0.033142.7 ± 0.4
    (−2.75 ± 0.06)(−1.46 ± 0.06) 

Note. The first row shows values for the galaxy including M54 stars. The second row shows values after removing M54 stars. The third row shows values for M54. Values in parentheses are corrected from large-scale systematic errors (see Appendix D.2).

Download table as:  ASCIITypeset image

The presence of M54 stars seems to have a negligible influence on the derived COM properties of Sgr, except for its coordinates in the sky, (α, δ). Such a difference is not surprising; M54 stars are highly concentrated on the sky compared to the rest of the Sgr stars and therefore have a high weight when deriving the Sgr centroid in the sky.

The large difference in coordinates between Sgr and M54 (${\rm{\Delta }}\alpha =16\buildrel{\,\prime}\over{.} 9\pm 2\buildrel{\,\prime}\over{.} 2,{\rm{\Delta }}\delta =10\buildrel{\,\prime}\over{.} 8\pm 0\buildrel{\,\prime}\over{.} 6$) is also expected given the different areas considered. The systematics known to be affecting our sample produce a nonuniform surface stellar density along the observed region, thus affecting the centroid. If we focus on the rest of the properties, Sgr and M54 seem to be indistinguishable within the errors. Furthermore, all M54 properties are compatible with those derived for Sgr after removing M54 stars, indicating that both stellar populations share position and dynamics. We will, therefore, consider M54 as part of Sgr regarding astrometry in any respect.

We also studied the effect of large-scale systematics in the PMs of Sgr using quasars (see Appendix D.2). A total of 217 quasars were used to derive the median zero-points in PMs in the field of Sgr. We obtained $\tilde{{\mu }_{\alpha \star }}=0.03\pm 0.05$ and $\tilde{{\mu }_{\delta }}\,=0.09\pm 0.05\ \mathrm{mas}\ {\mathrm{yr}}^{-1}$, which indicate that Sgr's motion may be slower toward the north direction. The corrected PMs of Sgr and M54 are listed in Table 1 in parentheses.

5.2. Sagittarius around the MW

Table 2 lists the galactocentric positions and velocities for Sgr. The core of Sgr is at a distance of 18.11 ± 0.15 kpc from the Galactic center, almost at the opposite side from the Sun location and 6.383 ± 0.007 kpc below the Galactic plane.

Table 2. Same as Table 1 but for Galactocentric Coordinates

  X Y Z vX vY vZ vR vT
 (kpc)(kpc)(kpc)(km s−1)(km s−1)(km s−1)(km s−1)(km s−1)
Sgr with M5416.73 ± 0.162.4289 ± 0.0028−6.383 ± 0.007234.9 ± 1.3−14.2 ± 0.7203.6 ± 0.7143.6 ± 1.7276.0 ± 1.2
    (236.3 ± 2.1)(−45 ± 8)(202 ± 6)(140.6 ± 2.3)(280 ± 6)
Sgr without M5416.74 ± 0.162.4157 ± 0.0035−6.408 ± 0.008233.2 ± 1.2−10.0 ± 0.7202.8 ± 0.7142.6 ± 1.6274.3 ± 1.1
    (234.6 ± 2.1)(−40 ± 8)(201 ± 6)(139.7 ± 2.3)(278 ± 6)
M5416.71 ± 0.172.462 ± 0.006−6.308 ± 0.015235.1 ± 1.7−15 ± 4205 ± 4144.1 ± 1.8277 ± 4
    (236.5 ± 2.4)(−45 ± 9)(203 ± 7)(141.0 ± 2.4)(282 ± 7)

Download table as:  ASCIITypeset image

Currently, Sgr is on its way to cross the Galactic plane with a net velocity of 311.1 ± 1.1 km s−1 (313 ± 4 if we correct from large-scale systematics) with respect to the Galactic center. Most of its velocity is tangential, VT = 276.0 ± 1.2 km s−1 (280 ± 6 km s−1), while Sgr currently moves away from the Galactic center with a radial velocity of 143.6 ± 1.7 km s−1 (140 ± 2). In general, its motion is well aligned with the Sgr stream, which lies close to the XZ plane of the MW (misaligned by ∼ 15°). The velocity and position of Sgr indicate that it must have passed through its orbit pericenter very recently, and it is now receding with respect to the MW center.

6. Comoving Reference Frame

6.1. Polar Coordinates

The position of any point in Sgr can be uniquely determined by its R.A. and decl. on the sky, (α, δ), and its distance, D. To simplify our analysis and have a better interpretation of our results in terms of internal kinematics, we introduced a new reference system centered on the galaxy COM with coordinates (α0, δ0, D0). Angular coordinates (ρ, ϕ) are defined on the celestial sphere, where ρ is the angular distance between the points (α, δ) and (α0, δ0) and ϕ is the position angle (PA) of the point (α, δ) with respect to (α0, δ0). The angle ϕ is measured counterclockwise starting from the axis that runs in the direction of decreasing R.A. at constant decl. δ0 (see Figure 1 from van der Marel et al. 2002). Equations (1)–(3) from van der Marel & Cioni (2001) allow (ρ, ϕ) to be calculated from any (α, δ):

Equation (2)

The velocity vector at any position (ρ, ϕ, D) can be decomposed into three orthogonal components:

Equation (3)

where v1 is vlos and v2 and v3 are the radial and tangential components of the velocity with respect to the COM in the plane of the sky. These components can be derived in kilometers per second from any (μα*, μδ ) as

Equation (4)

where Γ is the rotation angle of (v2, v3) to the frame on the sky, which is determined by

Equation (5)

6.2. COM Motion

The COM motion must be subtracted prior to the analysis of internal kinematics. Its contribution to the velocity field can be derived as

Equation (6)

where vlos,0 is the systemic vlos of the galaxy, vt is the velocity of the COM projected on the sky, and θt is the angle that defines the direction of vt defined using the same criterion as ϕ:

Equation (7)

We adopted all values listed in Table 1 in order to derive the COM contribution to v1, v2, and v3.

6.3. Transformation to Cartesian Coordinates

The Cartesian coordinates (x, y, z) can be derived from (ρ, ϕ, D) as

Equation (8)

while their respective differentials (vx , vy , vz ) can be obtained from (v1, v2, v3) as

Equation (9)

7. Sagittarius Internal Dynamics

The Sgr is moving outward from the MW center and away from the Sun. In order to study its internal dynamical properties, we subtracted the COM motion from the main sample. This is equivalent to observing the galaxy at rest, i.e., removing the solar motion, as well as the Sgr motion around the MW.

7.1. Sky Projections

We derived (v1, v2, v3) for all of the stars in our sample. As expected after the subtraction of the COM motion, the median of the line-of-sight component, $\tilde{{v}_{1}}=0.1\pm 0.1\ \mathrm{km}\ {{\rm{s}}}^{-1}$, 7 is compatible with zero. Figure 7 shows the distribution of v2 and v3 for all of the stars in the sample. The median radial component of the stellar velocities in the sky, $\tilde{{v}_{2}}\,=-1.1\pm 0.2\ \mathrm{km}\ {{\rm{s}}}^{-1}$, indicates that the galaxy is contracting, while its tangential component, $\tilde{{v}_{3}}=2.9\pm 0.2\ \mathrm{km}\ {{\rm{s}}}^{-1}$, indicates that Sgr is rotating counterclockwise as projected on the celestial sphere. The medians of the Monte Carlo realizations of the same quantities were closer to zero but still significant in rotation: 8 0.1 ± 0.1, −0.15 ± 0.17, and 1.71 ± 0.16 km s−1 for $\tilde{{v}_{1}}$, $\tilde{{v}_{2}}$, and $\tilde{{v}_{3}}$, respectively. The distribution of v2 and v3 extends toward positive and negative values of v2 and v3, respectively. This is likely to be produced by the influence of tidal tails in the kinematics of the galaxy. Indeed, stars with higher values of v2 and lower values of v3 are located at larger distances along the stream. Stars with v2 > 0 km s−1 and v2 < −50 km s−1 were located at a median absolute $\tilde{{\rm{\Lambda }}}=2\buildrel{\circ}\over{.} 65$, whereas the rest are closer to the Sgr center: median absolute stream longitude $\tilde{{\rm{\Lambda }}}=1\buildrel{\circ}\over{.} 96$.

Figure 7.

Figure 7. Error-weighted distribution of the radial, v2, and tangential, v3, components of the velocity projected in the sky. Contours and color indicate concentration. The median of the sample is indicated by a red dot. The error bars of the median are not visible due to their small size.

Standard image High-resolution image

Velocity maps provide an immediate qualitative view of the dynamics of Sgr. Figure 8 shows the stellar density, as well as the error-weighted average of each velocity component, (vx , vy , vz ), for stars within Voronoi cells of approximately 1000 stars defined on their sky positions. Voronoi tessellation ensures a constant number of stars per unit of resolution ("pixel"), thus providing a constant signal-to-noise ratio (Cappellari & Copin 2003).

Figure 8.

Figure 8. Stellar density and kinematics of Sgr projected on the sky plane upon COM subtraction. Stellar density is shown by black contours. From left to right, the color maps indicate the error-weighted average velocity of vx , vy , and vz inside Voronoi cells of ∼1000 stars. The Voronoi cells are the same in the three panels. A black arrow indicates the sky-projected bulk velocity of the COM. Coordinates were derived from Equation (8) and represent physical distances in kiloparsecs. Velocities were obtained from Equation (9) and are the Cartesian components of the velocities projected over the axes. A grid aligned with the stream coordinates system (Law & Majewski 2010) is also plotted.

Standard image High-resolution image

The stellar density projected in the sky has an elliptical profile whose concentration peaks at the position of M54. Results also show a clear velocity gradient (>10 km s−1) in vz to first order comparable with the line-of-sight velocity, indicating rotation about the projected optical minor axis of Sgr. This gradient is almost parallel to the bulk tangential velocity of Sgr and the direction of the stream (along Λ).

Counterclockwise rotation in the plane of the sky is also clearly visible in vx and vy , where velocity gradients are inclined with respect to the axes due to the PA of Sgr in the sky (102fdg4). Southern regions of the galaxy are moving west (right), while northern regions are moving east (left). Here vy shows the same pattern, with eastern regions moving south and western ones moving north. Interesting features can be noticed in the vx and vy maps, such as the apparent contraction of the core, the two negative bands (vx < 0) coming out toward positive values of Λ (east direction), or the region with negative vy around (x, y) = (1.5, 1) kpc. These are the consequences of the rotation of the galaxy, the inclination of its main body with respect to the plane of the sky, and the presence of tidal tails moving outward from its core. It is also worth noticing an area at (x, y) ∼ (−2, 0) kpc with unexpected positive values of vy and negative vx . This is a region showing unusual small values of μδ compared to the average caused by Gaia systematic errors, resulting in clockwise rotation (see Appendix D.2). Stripes with the same inclination can be seen in Figures 1 and 4 and are related to the scanning pattern of the Gaia satellite. This band would probably show inverted velocities if it were not affected by errors. Systematics also affect the stellar density profiles, although they tend to average out when considering large areas, not changing the general view of the galaxy.

7.2. 3D Projections

7.2.1. Sky Projections

The full phase space allows us to represent Sgr in all dimensions (6D), providing a better understanding of the structure and dynamics of the galaxy. Figure 9 shows the projected stellar density and velocities for all stars in our sample in the coordinate system defined by Equations (8) and (9). In this representation, the galaxy is located inside a cube whose sides are the planes xy, xz, and zy. The xy plane is aligned with the sky, with the x-axis pointing to the west and the y-axis to the north (same as in Figure 8). In planes xz and zy, the galaxy is projected along the line of sight, with positive values of z pointing to the Sun. The observer is therefore located at (0, 0, ).

Figure 9.

Figure 9. Projected stellar density and kinematics of Sgr on the sky on the zx, xy, and zy planes derived from Equations (8) and (9). Stellar density is shown by black contours and colors. The orange arrow indicates the projected bulk velocity of the COM. The blue arrow shows the projection of the internal angular momentum, L. Errors of these quantities are shown by thin lines on both sides of the arrows. The projected tangential velocities upon subtraction of the COM velocity are shown by the black arrows. The direction toward the MW center is indicated by a dashed white line. Velocities measured in the Voronoi cells are all in the same scale. The scale of these, L, and the bulk motion of the galaxy is the same for all panels but different from one quantity to another in order to accommodate all arrows in the same plot.

Standard image High-resolution image

Agglomerative Voronoi tessellation was applied to the data, creating cells of approximately 1000 stars for which the median values of the star velocities were calculated. These median velocities are represented by black arrows and qualitatively show how different parts of the galaxy move with respect to the COM. We also have calculated the 3D internal angular momentum of Sgr, L = I ω, where I is the inertia tensor and ω is the angular speed, for stars in the center of Sgr within 1 kpc radius. It is shown by a blue arrow.

The main body of Sgr is inclined with respect to the plane of the sky, being closer to the Sun for positive values of x and y. The data also suggest the presence of a bar-like structure more than 2 kpc long, from the tips of which depart the tidal tails that form the stream. This bar-and-tails configuration confers on Sgr the "S" shape observed in the xz plane, which also projects on the xy plane. We should note that the tails depart the main body at distances of 1.5 kpc (∼3° in the sky), which is much closer to the center than the half-light radius found in the literature for Sgr (5fdg7; McConnachie 2012). Studies analyzing the properties of the core of the galaxy should be aware of the effects of such tails.

Rotation is most obvious on the xz plane (counterclockwise), with some rotation signal also visible on the xy plane (counterclockwise). The stripe showing unusually high μδ is clearly visible in the xy plane as arrows pointing in the northeast direction (opposite from their neighbors). The least obvious rotation is seen in the zy plane, with most of the stars located at positive values of y moving toward positive values of z (right) and stars with negative y moving toward negative values of z (left). The apparent expansion observed in this plane is a projection effect; stars at positive z are moving toward positive z, while stars at negative z are moving toward negative z (see xz panel). The angular momentum of the galaxy, projected in each plane by a blue arrow, reflects these general rotation patterns.

Combined, all dimensions depict an inclined system that is closer to us on the west and north. Closer areas are moving in our direction, while the furthest are moving even further away. Of course, this is in the COM frame, as the whole galaxy is moving away from the Sun at vlos = 142.7 ± 1.7 km s−1, much larger than the ∼20 km s−1 gradient discussed here.

7.2.2. Galactocentric Projections

The most important external factor affecting Sgr morphology and kinematics is the MW potential. Tidal forces stretch the body of the galaxy, accelerating its stars depending on their relative distance to the MW center. Figure 10 shows Sgr projected on the galactocentric frame, (X, Y, Z), with the observer located at a infinite distance above the MW plane in (16.73, 2.44, ) kpc. The XY plane is almost aligned to the orbital plane of Sgr.

Figure 10.

Figure 10. Projected stellar density and kinematics of Sgr on galactocentric coordinates. Markers and their scale coincide with those from Figure 9.

Standard image High-resolution image

The largest component of the bulk velocity for Sgr is found in the XZ plane, which is also the plane where the largest velocity gradient is observed. This suggests that tidal forces are greatly affecting Sgr kinematics, shearing its body and stripping its stars from outer regions. On the other hand, some internal rotation is apparent in the innermost parts of the body, especially in the XZ and ZY planes.

7.2.3. Principal Axis Projections

We can better understand Sgr's inner structure and kinematics by aligning the system with its principal axes of inertia, $(x^{\prime} ,y^{\prime} ,z^{\prime} )$. We calculated $(x^{\prime} ,y^{\prime} ,z^{\prime} )$ within a 3D sphere of a given radius r3D, as well as the Euler angles necessary to align Sgr from its orientation in the sky to these axes, α, β, and γ, defined as zyx rotation.

The ratio between axes changes depending on the considered sphere radius, as does the orientation. We derived the direction of the principal axes of inertia of Sgr and the ratio of the FWHM of the stellar density along them for stars within spheres of radii 0.25 kpc ≤ r3D ≤ 3 kpc (notice that no star is located farther than ∼2.8 kpc in our data). This is shown in Figure 11.

Figure 11.

Figure 11. Principal axis ratios and their Euler angles with respect to the plane of the sky. These are computed for stars within spheres of radii 3D r3D > 0.25 kpc.

Standard image High-resolution image

The Sgr is a triaxial system at all radii considered in our sample. However, the ratio between its intermediate and shortest axes, given by $y^{\prime} $ and $z^{\prime} $, is never larger than 1.14, with an average of 1.08, indicating that Sgr is almost prolate within 3 kpc. The inclination of the longest axis, measured by β, remains almost constant up to distances of 0.75 kpc ∼ r3D, as expected, since the bar dominates the stellar distribution at small radii. The angle α measures the inclination of the sky-projected major axis of the galaxy from the x-axis (east direction) in the counterclockwise direction, i.e., α = 270° − PA. Its variation within just 1 kpc, ∼60°, is due to the transition from the core of the galaxy to the tidal tails that elongate the shape of the galaxy along the stream direction. Here α stabilizes at around r3D = 2.5 kpc at 167fdg5 (PA = 102fdg4). Lastly, γ measures the pitch angle, i.e., the inclination in the north–south direction after the β rotation has been applied. The largest variation in γ occurs at 0.5 kpc ∼ r3D ≲ 0.75 kpc, radii for which the galaxy changes from having its north regions closer to us to the opposite. Given the evident change in morphology for radii larger than 1 kpc, we decided to adopt this value to calculate the nominal orientation of the Sgr principal axes. Using such a radius, the principal axes of Sgr, $(x^{\prime} ,y^{\prime} ,z^{\prime} )$, form net angles of (43°, 2°, 46°) with respect to the plane of the sky and (2°, 18°, 72°) with respect to the orbital plane of Sgr around the MW. We should emphasize, though, that these angles vary depending on the considered galactocentric radius, r3D.

Figure 12 shows the projected stellar density and kinematics over the principal axes of inertia of Sgr. The Euler angles necessary to align Sgr from its orientation in the sky to its principal axes of inertia are α = 177° ± 3°, β = −40° ± 2°, and γ = 16° ± 2°, measured at r3D = 1 kpc. These rotations align Sgr's longest principal axis $x^{\prime} $ with the x-axis, the intermediate $y^{\prime} $ with the y-axis, and the shortest axis $z^{\prime} $ with the z-axis, with (x, y, z) defined by Equation (8).

Figure 12.

Figure 12. Stellar density and kinematics of Sgr projected over the planes defined by principal axes of inertia. The axes $x^{\prime} $, $y^{\prime} $, and $z^{\prime} $ are the longest, intermediate, and shortest axis, respectively. The principal axes were computed within a sphere of radius r3D = 1 kpc. Markers coincide with those of Figure 9.

Standard image High-resolution image

In this projection, the galaxy is aligned with the bar, clearly visible in the $x^{\prime} $$y^{\prime} $ and $x^{\prime} $$z^{\prime} $ planes. The almost spherical view of Sgr from its longest axis, i.e., projected over its intermediate and shortest axes, shows the little length difference found between the intermediate and shortest axis ($z^{\prime} $$y^{\prime} $ plane). Indeed, the ratio between the FWHM of the stellar distribution along the three principal axes inside the 1 kpc radius sphere is 1:0.67:0.60, not far from prolate.

To further investigate the nature and strength of the bar, we measured the bar mode A2 from the positions of the stars projected onto the $x^{\prime} $$y^{\prime} $ plane (along the shortest axis). In general, the amplitude of the mth Fourier mode of the discrete distribution of stars in the galaxy is calculated as ${A}_{m}=(1/N)\left|{{\rm{\Sigma }}}_{j\,=\,1}^{N}\exp ({im}{\phi }_{j})\right|$, where ϕj are the particle phases in cylindrical coordinates and N is the total number of stars (Sellwood & Athanassoula 1986; Debattista & Sellwood 2000). Values larger than 0.2 of the m = 2 term of this expansion indicate the presence of a strong bar. With A2 = 0.36, our data suggest that the elongated shape of Sgr is indeed a strong bar. We should also notice that we obtained A2 = 0.3 measured directly on the positions in the sky, also larger than 0.2.

The Sgr rotates mainly around its intermediate principal axis, $y^{\prime} $. This is clearly visible from the direction of the angular momentum, L, almost parallel to $y^{\prime} $, and the projected velocity field on the $x^{\prime} $$z^{\prime} $ plane. Counterclockwise rotation can be observed in the $x^{\prime} $$y^{\prime} $, causing the angular momentum, L, to point toward positive values of $z^{\prime} $. The galaxy also shows very little clockwise rotation in its inner regions about its major axis, $x^{\prime} $ ($z^{\prime} $$y^{\prime} $ plane). The expansion of the outer regions of Sgr is most visible in the $x^{\prime} $$y^{\prime} $ plane, where areas dominated by the arms are being pulled away from the main body of the galaxy.

We computed the absolute net velocity along each one of the principal axes of Sgr as

with i being each of the principal axes $x^{\prime} $, $y^{\prime} $, or $z^{\prime} $. These velocities as a function of the absolute value of the radial distance along their respective axis ∣i∣ are shown in Figure 13. The rotation velocity around the same axes, computed over the planes defined by tuples of the two other principal axes (as shown in Figure 12), are shown in Figure 14. In this case, the rotation around ∣i∣ is rotation measured on the (j, k) plane with (ijk), as defined by a right-handed Cartesian coordinate system. The galactocentric radius is defined as ${r}_{{jk}}=\sqrt{{j}^{2}+{k}^{2}}$. In both figures, histograms in the right panels show the total velocity distribution (black), as well as the velocity distribution near the center of the galaxy (red). Stars in the center were selected based on their distance to the COM along each given axis using radii of 0.5, 0.33, and 0.3 kpc for the longest, intermediate, and shortest principal axis, respectively (following the axis ratios of the system).

Figure 13.

Figure 13. Velocity of the stars projected over the principal axes of inertia of Sgr. From top to bottom, the panels show velocities along the longest, intermediate, and shortest principal axis, respectively. Negative values of distance along each axis and their corresponding velocities are inverted, showing the absolute value along the axis (see text). The color map shows the error-weighted distribution of the stars. Black vertical lines mark distances along the axes of 0.5, 0.33, and 0.30 kpc, respectively. An error-weighted linear fit to the data is represented by the red dashed line, while its coefficients are shown in the top right part of each panel. A dark red curve shows the boxcar error-weighted average of the distribution along each axis with steps of 0.025 kpc and a window of 0.1 kpc. The shaded area around the curve represent the error of the average. The right panels show the distribution collapsed along each axis. The gray histogram shows all of the stars, whereas the red one shows the stars located within the radius marked by the vertical lines in the left panels measured from the center.

Standard image High-resolution image
Figure 14.

Figure 14. Rotation velocity of the stars projected over the planes defined by the principal planes of inertia. From top to bottom, the panels show rotation about the longest, intermediate, and shortest principal axis, respectively. Black vertical lines mark the projected maximum radial distance of the stars within a 3D ellipsoid with axes of 0.5, 0.33, and 0.30 over the planes defined by y'–z', z'–x', and x'–y', respectively. The rest of the markers coincide with those from Figure 13.

Standard image High-resolution image

The outer regions of Sgr are clearly affected by tidal forces. This can be observed as distortions in all of the rotation curves in the outer regions of the galaxy (see, for example, ${v}_{\mathrm{rot},x^{\prime} }$ in Figure 14, which shows positive rotation of the core up to distances ${r}_{y^{\prime} x^{\prime} }\sim 0.75\,\mathrm{kpc}$). Figure 13 shows the expansion of the galaxy along its longest principal axis, $x^{\prime} $, while the galaxy remains close to stable along $y^{\prime} $ and shows apparent contraction along $z^{\prime} $ for distances larger than ∼0.25 kpc. However, this apparent contraction seems to be the projection of the tidal tails' kinematics along $| z^{\prime} | $ (see plane x'–z' in Figure 26), rather than a real contraction of the core of the galaxy. We confirmed these findings, calculating the variation of the radial components of the velocity as a function of distance along each axis through finite differences. Similar results were obtained in all axes with expansion starting at different distances. We also found that the inner core of the galaxy shows almost no net expansion or contraction within an ellipsoid of axes (0.5, 0.33, 0.3) kpc. This ellipsoid is rotating mainly about its intermediate principal axis, $y^{\prime} $, at an average ${v}_{\mathrm{rot},y^{\prime} }2.46\pm 0.22\ \mathrm{km}\ {{\rm{s}}}^{-1}$. Residual rotation can also be observed about the shortest (${v}_{\mathrm{rot},z^{\prime} }2.11\pm 0.22\ \mathrm{km}\ {{\rm{s}}}^{-1}$) and, to a lesser extent, longest (${v}_{\mathrm{rot},x^{\prime} }1.38\pm 0.22\ \mathrm{km}\ {{\rm{s}}}^{-1}$) axes, indicating the presence of precession and nutation due to the MW's tidal torques. We aligned this inner ellipsoid to its angular velocity vector and measured a maximum rotation velocity of vrot =4.13 ± 0.16 km s−1 within this ellipsoid. It is important to notice that the orientation and strength of this vector vary with radius as shearing tidal forces change the shape and kinematics of the galaxy.

The 3D renderings of Sgr aligned to its principal axes of inertia on a galactocentric frame can be found in Appendix F. An interactive version of this rendering can be found at https://www.stsci.edu/~marel/hstpromo/Sagittarius_GC.html.

8.  N-body Counterpart

The rotation observed in Sgr seems to be a combination of intrinsic residual rotation and rotation induced by tidal torque from the MW potential well. In an attempt to reproduce the observational results and discern between these two factors, we ran a suite of N-body simulations of an interaction between an Sgr-like dwarf and a bigger galaxy with properties similar to the MW. Both galaxies initially contained two components: a spherical dark matter halo and an exponential disk. The structural parameters of the Sgr progenitor were similar to those in Łokas et al. (2010); its dark matter halo had a Navarro–Frenk–White (NFW; Navarro et al. 1997) profile with nominal virial mass Mh = 1.6 × 1010 M and concentration c = 15. In order to mimic the tidal stripping of the halo prior to the simulated period, we introduced a smooth cutoff to its density distribution at a radius of 20 kpc with a width of 10 kpc. The exponential disk had a mass Md = 3.2 × 108 M, scale length Rd = 2.3 kpc, and thickness zd = 0.3Rd = 0.69 kpc. The central values of the radial and vertical velocity dispersions of the disk were chosen to be equal, σz,0 = σR,0 = 19.5 km s−1. The resulting model was stable against bar formation in isolation. The MW model was the same as in Łokas (2019); its dark matter halo had an NFW profile with virial mass MH = 1012 M and concentration c = 25, while the exponential disk had a mass MD = 4.5 × 1010 M, scale length RD = 3 kpc, and thickness zD = 0.42 kpc.

The N-body realizations of the galaxies were initialized using the procedures described in Widrow & Dubinski (2005) and Widrow et al. (2008), with each component containing 106 particles. The progenitor of Sgr was placed at an apocenter of the orbit with apo- and pericenter distances of 58 and 19 kpc, respectively. The initial configuration was similar to the one shown in Figure 2 of Łokas et al. (2010), except that the dwarf's disk was rotated a certain angle around the X-axis of the simulation box. The evolution of the galaxies was followed for 2 Gyr with the GIZMO code (Hopkins 2015), an extension of the widely used GADGET-2 (Springel et al. 2001; Springel 2005), saving outputs every 0.005 Gyr. The adopted softening scales were epsilond = 0.02 and epsilonh = 0.06 kpc for the disk and halo of Sgr and epsilonD = 0.2 and epsilonH = 2 kpc for the disk and halo of the MW. Configurations similar to the present state of Sgr are reached soon after the second pericenter passage, around 1.22–1.24 Gyr after the simulation begins.

We completed this set with two pressure-supported models with three pericenter passages instead of two. The Law & Majewski (2010, hereafter LM10) and Fardal et al. (2019, hereafter F19) models, both focused on reproducing the stream's properties.

Figure 15 shows the best candidate of our originally rotating N-body simulations in galactocentric coordinates, which should be compared with Figure 10. This model is rotated by 10° in the opposite direction to the one shown in Figure 2 of Łokas et al. (2010). The similarity with the observations is remarkable.

  • 1.  
    The geometry of the N-body is almost identical to observations.
  • 2.  
    The direction of the internal angular momentum, L, coincides within the errors.
  • 3.  
    The bulk motion of the N-body around the MW is also remarkably accurate.

Figure 15.

Figure 15. Projected stellar density and kinematics of our best N-body candidate on galactocentric coordinates (rotating model), which should be compared with Figure 10. Markers and their scale coincide with those from Figure 9.

Standard image High-resolution image

We computed the principal axes of the simulated dwarf using the same criteria as for the observations. Figure 16 shows the expansion or contraction of the simulated dwarf along its principal axes, while Figure 17 shows the rotation measured over the planes defined by such axes. These figures should be compared with Figures 13 and 14, respectively. A resemblance between the model and the observations is obvious, with both showing the same general trends as well as similar differences between the inner and more external regions. The two tested spherical models, on the other hand, resulted in flat velocity profiles in the central regions of the galaxy (see Appendix E).

Figure 16.

Figure 16. Same as Figure 14 but for the originally rotating N-body simulation.

Standard image High-resolution image
Figure 17.

Figure 17. Same as Figure 13 but for the originally rotating N-body simulation.

Standard image High-resolution image

Figure 18 shows a comparison between the internal kinematics of the observed galaxy and three N-body models tested in this work along their principal axes.

Figure 18.

Figure 18. Internal kinematics of Sgr and three N-body models measured along the longest, intermediate, and shortest principal axes of inertia ($x^{\prime} $, $y^{\prime} $, $z^{\prime} $). Solid curves show the boxcar median value of the velocity along the axis with steps of 0.1 kpc and a window of 0.2 kpc. Dashed curves show the measured velocity dispersion profile using the same boxcar parameters. The black curve shows the observations using the Gaia DR2 and ML model predictions (DR2+ML), and the blue curve shows our best candidate N-body model with a rotating progenitor, based on Łokas et al. (2010; rotating model). The dark red curve shows the LM10 N-body model, while the orange curve represents the F19 N-body model, both initially spherical and pressure-supported. Errors are shown by the shaded areas around the curves.

Standard image High-resolution image

In general, the rotating model better reproduces the observed velocity profiles. Only the expansion along the intermediate axis, $y^{\prime} $, seems to be better reproduced by pressure-supported models (see vy'$y^{\prime} $ panel). Yet the velocity profiles predicted by the rotating model are too steep compared to the data (vz'$x^{\prime} $, vx'$z^{\prime} $). The seemingly high velocity dispersion observed in vy' and, to a lesser extent, vx' is likely to be caused by the large astrometric errors in the PMs. While their absolute value provides little insight, their shapes give an idea of the velocity anisotropy measured along each axis.

The shape of the core with the observed bar was also better reproduced by the rotating N-body model, whereas the others resulted in remnants that are too spherical and do not have a bar. The two pressure-supported models were aimed at reproducing the properties of the stream; therefore, we do not expect them to fit the observed properties of the core. Yet some correlations in the general kinematic trends can be observed, such the expansion of the galaxy or the tidally induced rotation in vz'$x^{\prime} $. The computed rotation and radial velocity profiles along each axis for these models can be found in Appendix E.

9. Discussion

Our knowledge about Sgr is heavily based on theory. Most previous works have tried to reproduce Sgr observables with N-body models and studied the full phase space in the latter. New available observational data help to better constrain those models, providing more accurate information about the galaxy physical properties. Yet Sgr has proved to be more complicated than what we can fully understand. Many factors are likely affecting its evolution, like the shape of the MW potential well, the influence of the Magellanic System (Erkal et al. 2019), or possible close passages with other systems (Bonaca et al. 2020), to give some examples. While we can learn about all of these mechanisms from N-body models, if properly simulated, the complexity of such simulation forces us to adopt simplifications, normally resulting in an incomplete picture of the evolution of the system.

In this work, we took a different approach. Instead of learning from the full phase space of an N-body model that reproduces the observations, we have used ML to predict the full phase space directly on the observations. Using ML has the advantage that no explicit physical priors are imposed, removing the risk of introducing biases due to our lack of knowledge over a system. Our models, with minimal user input, have predicted the 3D positions and velocities of more than 1.2 × 105 member stars in the core of the galaxy, which allows us to study the internal dynamics of the galaxy directly over the data and later compare the results to different N-body models. The results shown in this paper are fully compatible with those obtained for our training samples, where only one of the variables has been modeled (distance for the vlos catalog and vlos for the distance catalog). We tested the robustness of the results against possible statistical and systematic errors, different models, LSR properties, and COM position and velocity.

9.1. Geometry and Orientation

Our results allow us to discern between at least two components: a main body with a spheroid shape and the tidal tails.

Previous studies have shown that Sgr has an elongated body with some depth along the line of sight. Using red clump stars, Ibata et al. (1997) found that its core is consistent with a prolate spheroid with ratios 3:1:1 for its major, intermediate, and minor axes. More recently, Ferguson & Strigari (2020) used RR Lyrae stars from Gaia DR2 and the OGLE collaboration to find that the Sgr body is likely to have a triaxial geometry with ratios 1:0.76:0.43. Our results sit somewhere in between these two works, with a triaxial geometry (1:0.67:0.60) but closer to prolate than what Ferguson & Strigari (2020) suggested. In terms of depth, our derived FWHM of 2.48 kpc agrees with that from the OGLE collaboration (Hamanowicz et al. 2016), with an FWHM of 2.42 kpc (see Appendix C.1.1).

Our derived inclination of the Sgr main body also differs from the Ferguson & Strigari (2020) results. They found the longest principal axis of the galaxy to be almost parallel to the plane of the sky ($-{4.9}_{-18.8}^{+17.5\circ} $), whereas our results suggest an inclination of 43° ± 6° for the longest and 2° ± 7° for the intermediate axes in the inner regions of the galaxy, r < 1 kpc. It is important to notice that the overall inclination of the system depends on the considered radius. The inclination flattens as we consider larger radii due to the galaxy's elongation along its orbital path but remains as large as ∼35° up to distances of 2.75 kpc. This means that the major principal axis is inclined ∼30° with respect to the MW potential direction (see Figure 10). Stars closer to the MW center are subject to stronger tidal forces and thus get stripped earlier from the Sgr body while gaining angular velocity along their orbit. This mechanism is responsible for the observed "S" shape, a feature whose orientation is tightly constrained by the position of the galaxy relative to the Galactic center and the direction of motion. While tidal tails also occur in the N-body with spherical profiles, only ellipsoids forming certain angles (30° ≲ α ≲ 60°) will feature this characteristic "S" profile. In turn, models that well reproduce the Sgr stream properties show this shape regardless of the shape and internal kinematics of the progenitor. The fact that both the ML and N-body models made qualitatively consistent predictions about this feature strengthens the results obtained in the present work.

Another intriguing aspect is the bar observed in the galaxy. Such a bar was also reproduced by our rotating N-body models at the first pericenter passage on its orbit around the MW, after t = 0.46 Gyr from the start of the simulation. As described in detail by Łokas et al. (2014) and Gajda et al. (2017), the formation of a bar is a characteristic intermediate stage in the process of the transformation of disky dwarfs into dwarf spheroidals. This prolate shape is maintained and again enhanced right after the second pericenter passage, which occurs at 1.2 Gyr. The orientation predicted by the ML models is reached soon after, at around 1.22–1.24 Gyr. Arguably, a third passage would help to better reproduce the observed properties of Sgr, since it could naturally explain some of the stream's farthest features and would reduce the rotation signal of the remnant. However, the present model would result in a remnant that is too spherical and almost nonrotating if it survives the third pericenter, or, if it is less strongly bound, it will disrupt soon after the second pericenter. A model that survived a third pericenter while keeping some internal rotation would require a new selection of structural parameters of the dwarf, addition of the gas, star formation, etc., which would involve a computationally prohibitive parameter survey. We thus decided to stop the simulation after the second passage. In any case, we emphasize that in order to form a strongly elongated shape similar to the one seen in the Gaia data, the dwarf should transform into a bar at some stage of its evolution, and for this to occur, it is necessary that the progenitor has the form of a stellar disk.

It is worth mentioning that the 3D shape derived by our ML model (an elongated shape with tidal tails) falls in with those predicted by different N-body models whose main goal was to reproduce the properties of the core of the galaxy (see, for example, Łokas et al. 2010; Vasiliev & Belokurov 2020).

9.2. Internal Kinematics of the Core

Several works have previously analyzed Sgr's structural and kinematic properties, looking for signs of the ongoing destruction of the galaxy. Aiming to reproduce the stream properties, Peñarrubia et al. (2010) proposed a scenario in which the Sgr progenitor is a late-type rotating disk galaxy inclined 20° with its orbital plane. Łokas et al. (2010) focused on reproducing the properties of the core of the galaxy from a similar starting point. Both works presaged a rotating core remnant for Sgr, which later works were unable to confirm. Of special relevance to this discussion are Peñarrubia et al. (2011) and Frinchaboy et al. (2012). Both used samples of giant stars with vlos, concluding that the Sgr kinematics were well reproduced by those of a pressure-supported system, i.e., nonrotating. Interestingly, a preliminary version of the data presented by Frinchaboy et al. (2012) was used in Łokas et al. (2010) as a comparison with the rotating remnant N-body galaxy. The Gaia mission has brought new possibilities of studying Sgr in detail; in the recent contribution of Vasiliev & Belokurov (2020), the authors did not find any significant rotation signal from their best-fitting N-body model. It is worth noting that some residual rotation (<4 km s−1) was found by Frinchaboy et al. (2012), although they concluded that it could be due to other factors rather than intrinsic rotation from Sgr's progenitor.

Many aspects are contributing to the observed velocity field, making this a difficult issue. Only having access to the full phase-space information allows us to untangle the most relevant ones.

  • 1.  
    The contribution of the COM motion (or bulk motion) to the observed velocity field in the sky. Here Sgr is moving along its projected optical major axis toward negative values of λ with a tangential velocity of 254.8 ± −2.8 km s−1 projected on the sky. This tangential motion projects over the line of sight as a positive gradient in the same direction of movement, in this case 4.54 km s−1 deg–1, assuming a constant distance of D = 25.97 kpc.
  • 2.  
    The Sgr geometry and inclination with respect to the celestial plane. Triaxial system Sgr has its longest principal axis inclined 43° with respect to the celestial plane. That translates to an average of ∼1.15 kpc distance difference between the areas closer to us and the farthest ones along the surveyed area in this work. This gradient in distance reflects on the projected tangential components of the velocity (PMs), with stars that are farther away apparently moving slower than closer ones. In the particular case of Sgr bulk PMs, the difference in distance projects as ∼16.7 km s−1 in tangential velocities.
  • 3.  
    The internal kinematics of Sgr. This includes both residual kinematics from the Sgr progenitor and perturbations of these from an external potential, such as that induced by tidal forces from the MW or the Large Magellanic Cloud.

In this paper, we have derived the full phase space for more than 1.2 × 105 member stars, which allows us to simultaneously account for all of these effects and study the Sgr internal dynamics with unprecedented detail. We have shown that the Sgr core rotates in the plane of the sky at 2.9 ± 0.2 km s−1 counterclockwise. It does so while also rotating perpendicular to the plane of the sky, with its east side moving toward the Sun (upon subtraction of the systemic motion of the galaxy).

It is clear that the internal dynamics of Sgr is linked to its orbital history and interaction with the MW. All tested N-body models show similar velocity gradients in the outer regions of the galaxy, caused by the tidal forces from the MW. However, we found substantial differences in the inner parts of Sgr. Our results show that despite being ripped apart by tidal forces, the galaxy conserves an inner region that is not expanding and is rotating at vrot = 4.13 ± 0.16 km s−1.

The spherical N-body models tested in this work failed to reproduce these characteristics, resulting in flat velocity profiles in the central regions of the galaxy (plots of these models are shown in Appendix E). This was expected, since spherical systems are not subject to torque forces. A flattened system, on the other hand, changes the part of the galaxy that is closer to the MW as it orbits around it, suffering torques that would induce velocity gradients. This results in velocity gradients along axes other than the original axis of rotation, ${v}_{z}^{\prime} $. As the system evolves and stretches along its orbit, the shape and kinematics of the system change in such way that most of the original rotation is conserved about the intermediate principal axis ${v}_{y}^{\prime} $, while some rotation is also transmitted to the other axes. This can be seen as a negative gradient in ${v}_{\mathrm{rot},x^{\prime} }$ along ${r}_{y^{\prime} z^{\prime} }$ or a positive gradient in ${v}_{\mathrm{rot},z^{\prime} }$ along ${r}_{x^{\prime} y^{\prime} }$ in Figure 14 that were well reproduced by the rotating model but not the spherical models.

The flattening of Sgr would imply some initial rotation of the system, which would also contribute to the observed gradients. However, the detected rotation velocity is still 1 km s−1 below the predicted value of our best rotating N-body model candidate (3.5 ± 0.4 km s−1). As commented in the previous section, a possible explanation for the excess of rotation in the rotating model would be that the simulated galaxy suffered just two pericenter passages. During each passage, the progenitor is stirred and perturbed, heating its kinematics and removing rotation. The Sgr is likely to have suffered more than two pericenter passages; only models with three or more passages are able to reproduce the stream's full extension (Dierickx & Loeb 2017). Furthermore, three main star formation events have been observed in the MW's star formation history at the approximate times when such passages occurred (Ruiz-Lara et al. 2020). However, even with just two passages, the rotating model does a good job reproducing the closer parts of the stream (see Figure 5 in Łokas et al. 2010). Future high-resolution N-body models aimed at reproducing both the stream and the core with three or more passages may provide some insight.

9.3. Orbital and Internal Angular Momentum

Our data allow us to derive the 3D vector for the internal angular momentum of the galaxy. For stars within ${r}_{3{\rm{D}}}=1\,{\rm{kpc}}$, this vector forms an angle θ = 18° ± 6° with respect to the orbital angular momentum of Sgr, i.e. the normal vector to the orbital plane of Sgr, meaning that Sgr is in an inclined prograde orbit around the MW. The slight inclination of the axis of rotation with respect to the orbital plane could aid in the explanation of the bifurcation of the stream (Belokurov et al. 2006; Koposov et al. 2012), as different parts of Sgr get stripped with slightly different angles as the dwarf moves along its orbit. This possibility was investigated by Peñarrubia et al. (2010), who found that a retrograde orbit, θ = −20°, was the best inclination that reproduced the observed bifurcation in the south arm. The model was later shown to be unsuccessful in reproducing the internal kinematics of Sgr (Peñarrubia et al. 2011). On the other hand, while our rotating model, prograde and initially inclined 10° with respect to its orbital plane, well reproduces Sgr's internal dynamics, it did not create two clearly distinct arms in the stream but rather a single wider one. Higher initial inclinations were also tested, although they showed a poorer match between the observed and simulated angular momentum vectors.

The 3D positions reveal still more differences between observations and retrograde models, such as the observed bar. Dwarfs in prograde orbits tend to form tidally induced bars after pericenter passages. This is not the case for retrograde systems, whose stellar content remains disky along the entire orbit (Łokas et al. 2015).

9.4. Survivability of the Core

Very recently, Vasiliev & Belokurov (2020) presented N-body models that formed an elongated structure that resembles a bar despite not having initial rotation. The authors argued that the lower concentration used in their models allowed tidal forces to perturb the system to form a bar-like structure. While the morphology could be reproduced in such a way, the authors found that the resulting dwarf would be tidally disrupted in its totality. Our results, on the other hand, show that Sgr conserves a small inner core that is not expanding, supporting a scenario in which the inner regions of the galaxy might survive this last pericenter passage. This is also the case for the rotating model, which was the best at reproducing the configuration predicted by the ML models and whose inner core remains tidally bound at least until the next pericenter passage (see Figure 6 of Łokas et al. 2010). Therefore, we do not confirm the complete disruption of the Sgr after its recent pericenter passage.

Differences in the evolution of our N-body model from that of Vasiliev & Belokurov (2020) are to be expected given the differences in how models were compared to observations; models in Vasiliev & Belokurov (2020) were fitted directly to Sgr's observables projected on the sky, whereas in this work, a small set of models were compared to 6D predictions from ML. We also note the qualitative, "by-eye" nature of the model selection in both cases, which may weight the various observables differently in each case. The fact that in our model, Sgr's inner regions remain tidally bound after this pericenter passage might indicate that some adjustments should be made to the Vasiliev & Belokurov (2020) model's central initial conditions. But, given our own observational errors and model degeneracies, we do not make any strong claim about this particular aspect of Sgr evolution.

10. Conclusions

In this work, we have presented the first comprehensive study of Sgr using ML. We have derived the full phase space, i.e., 3D positions and velocities, for more than 1.2 × 105 stars in the core, carefully considering and propagating all known sources of errors through extensive Monte Carlo experiments. We have also tested different data sets, models, and methods, all providing consistent results, and compared these to different N-body models. We have presented 3D projections of the galaxy in different projections depicting a complex system. From a first look at the ML predictions, the following can be observed.

  • 1.  
    The Sgr has a bar ∼2.5 kpc long that is inclined 43° ± 6° with respect to the plane of the sky.
  • 2.  
    Tidal tails depart from the tips of the bar, conferring an "S" shape on Sgr along its orbital path.
  • 3.  
    The Sgr is rotating and expanding along the stream in its outer regions.

The Sgr is a highly perturbed system. Tidal forces exerted by the MW are shearing its body and stripping its stars from distances as close as ∼1 kpc from the center of the galaxy. However, a closer inspection of the data shows that Sgr conserves a triaxial elliptical core that is not expanding. This region measures approximately 500×330×300 pc and is rotating at an average velocity of vrot = 4.13 ± 0.16 km s−1, mainly about its intermediate principal axis of inertia (2.46 ± 0.22 km s−1), with some nutation and precession indicated by the presence of residual rotation about its shortest and longest principal axes (2.11 ± 0.22 and 1.38 ± 0.22 km s−1, respectively). The inclination of the inner body and its rotation indicate that Sgr is in an inclined prograde orbit, with its internal angular momentum forming an angle θ = 18° ± 6°. This slight inclination of the orbit with respect to the internal angular momentum could aid in explaining some of the features observed in the stream.

We compared our results against the predictions of three N-body models: a rotating, flattened disk in an inclined prograde orbit with two pericenter passages and two spherical, pressure-supported models with three pericenter passages. Only the rotating model was able to qualitatively reproduce the observed velocity gradients. We notice, though, that the velocity gradients predicted by the rotating model are too steep (3.5 ± 0.4 km s−1 along the intermediate axis), which means that the initial structure of the progenitor and/or properties of the orbit require further adjustments.

The two spherical models tested (LM10 and F19) show a general dynamical behavior that is similar to the observations in the outskirts of the galaxy but fail to reproduce any of the internal velocity gradients in Sgr. It is important to remark that these models were aimed at reproducing the stream properties, not the core of the galaxy.

Our results suggest that the Sgr progenitor was a flattened system to some extent and that it had some internal rotation. Investigating this scenario in detail will require a large theoretical and computational effort, as many factors are affecting the last 2–3 Gyr of Sgr evolution as it orbits the MW. Other potentials, such as those from other dSphs or the Large Magellanic Cloud, may have also played an important role.

The authors thank the anonymous referee for comments that have helped to improve this paper. A.d.P. also thanks S. Bertran de Lis for her support and help during the realization of this project. This project benefited from support for Hubble Space Telescope archival proposal 15633 through a grant from STScI, which is operated by AURA, Inc., under NASA contract NAS 5-26555. This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC; https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement. Topcat (Taylor et al. 2005) was used during the preliminary inspection of the data, and astropy aided in the coordinate transformations (Astropy Collaboration et al. 2013; Price-Whelan et al. 2018). This project is part of the High-resolution Space Telescope PROper MOtion (HSTPROMO) Collaboration, 9 a set of projects aimed at improving our dynamical understanding of stars, clusters, and galaxies in the nearby universe through measurement and interpretation of PMs from HST, Gaia, and other space observatories. We thank the collaboration members for the sharing of their ideas and software.

Appendix A: Estimating Contaminants

We used the Besancon 2016 model of the Galaxy using the GOG model (Luri et al. 2014) in order to estimate how many MW contaminants are polluting our sample. To do so, we subjected stars from the model on the same region of the sky to the same selection procedure as for the observed data described in Section 2. This requires both samples to be comparable in star numbers and observational errors. The GOG model does not include all observational effects found in the Gaia DR2 data. Specifically, the GOG model does not include the global parallax zero-point or any of the systematic errors known to be affecting DR2 data. In addition to that, the GOG model accounts for nominal end-of-mission (EoM) errors, which are estimated to be at least four times smaller than the statistical errors present in DR2. These observational effects must be simulated on the GOG model prior to any comparison with real Gaia DR2 data. We selected the stars and performed such error simulations as follows.

We first download the same region of the sky covered by the real data with wider selection cuts. Systematic errors of 0.66 mas yr−1 in the PMs and 0.43 mas yr−1 in parallax, estimated for length scales smaller than 1°, are added in quadrature to our search limits. We also relaxed our error selection criteria to 12σ. Lastly, we shifted our parallax search to compensate for a global zero-point of ϖ0 = − 0.03 mas yr−1 (Lindegren et al. 2018). These criteria result in a wider selection than the one performed in the real data, ensuring that the number of contaminants is not underestimated.

The DR2 errors and zero-points must be included in the resulting GOG model list prior to any comparison to real data. We simulated the errors by randomly shifting the photometric magnitudes, PMs, and parallaxes of every synthetic star according to the corresponding errors observed in real stars at the same position in the CMDs. These shifts were introduced over GOG stars without the simulated EoM errors. The new simulated errors are adopted as statistical errors for the GOG stars. For further information about the method, we refer the reader to del Pino et al. (2015). A parallax zero-point of ϖ = − 0.055 ± 0.024 mas yr−1, derived from our QSO list, was added to the synthetic stars (see Appendix D.2). After simulating errors and zero-points, DR2 and GOG data should be comparable one to one.

We then proceed to impose the exact same selection and quality cuts as with the real data. First, only synthetic stars with errors compatible within 3σ with the PMs and parallax of Sgr are selected. The GOG model does not account for poor astrometric measurements due to crowding or more complex observational factors. In the case of real stars, those with bad astrometry are removed using Equation (C.1) in L18 and RUWE < 1.4. The fraction of real stars that were screened out by these equations was less than 2%. We randomly removed the same fraction of stars from the GOG model that had similar colors and magnitudes to the rejected ones in the real data.

Synthetic and real stars are then simultaneously selected in the CMD, PMs, and parallaxes following the procedure explained in Section 2. Only real stars are used to fit the Gaussian mixture model (GMM) and define the logarithmic-likelihood condition. The same criteria were applied when dividing the sky into cells. The GOG stars that passed our cleaning procedures, as well as the rejected ones, are shown in Figure 4.

Appendix B: Choosing the Best Selection Parameters

The parameter n that we use on Equation (1) affects our membership selection and may impact our results. The use of a larger value will include more Sgr stars in our sample, yet it will also allow many contaminants to be selected as well. On the other hand, a very conservative value will reject most contaminants, but it will also reject real member stars, reducing our signal-to-noise ratio.

A number of factors affect the position of the stars in the PM–parallax space, altering the shape and concentration of its distribution. In order to account for most of them, we used the full covariance matrix to whiten the data, 10 taking into account their statistical errors, possible systematic errors given by Equations (16) and (18) from Lindegren et al. (2018), and the correlation present between these quantities. The whitening of the data strongly relies on how well errors and correlations are accounted for in the gaia_source table and will impact our membership selection.

The GOG model allows us to determine the optimal value for n and whether we should whiten the data by maximizing the accuracy of our classification. As our classification is positive for stars selected as members of Sgr and negative for stars selected as contaminants, we can define stars as follows.

  • 1.  
    True-positive (TP) stars: Sgr members classified as members.
  • 2.  
    True-negative (TN) stars: not Sgr members but classified as contaminants.
  • 3.  
    False-positive (FN) stars: Sgr members classified as contaminants.
  • 4.  
    False-positive (FP) stars: not Sgr members but classified as members.

While accurate calculations of the total numbers of stars classified as TP or FN is not feasible from the observations, the GOG model provides an estimation for TN and FP. By assuming $\left|\mathrm{TN}\right|=\left|\mathrm{Rejected}\ \mathrm{GOG}\right|$ and $\left|\mathrm{FN}\right|=\left|\mathrm{Selected}\ \mathrm{GOG}\right|$, we can estimate $\left|\mathrm{TP}\right|$ as $\sim \left|\mathrm{Selected}\ \mathrm{DR}2\right|-\left|\mathrm{Selected}\ \mathrm{GOG}\right|$. Hence, the accuracy of our classification can be defined as

Equation (B1)

We calculated the accuracy for the original and whitened data using n = [1.5, 2.0, 2.5, 3.0] in Equation (1) for the bulk and the cell selection in the PM–parallax space. Covariance was left as a free parameter when fitting the GMM to the original data. The result that maximized the signal-to-noise ratio was with n = 2 and no data whitening, although all results provided qualitatively compatible results. These findings indicate that the statistical error values provided in DR2 do not represent well enough the actual errors affecting PMs and parallaxes.

Appendix C: Distances and Line-of-sight Velocities: Models and Caveats

We tested a variety of suitable models to predict distances and vlos. These listed a epsilon-support vector machine regression (SVR) with linear and nonlinear kernels, a KNR, a random forest regression (RFR), an extra trees regression, a Gaussian process, an elastic net, a lasso, and a multilayer perceptron regression (MLP). For each model, f(x), we derived ypred through a nested k × l cross-validation scheme and calculated the residuals as ${y}_{\mathrm{res}}={y}_{\mathrm{true}}-{y}_{\mathrm{pred}}$. The experiment was performed methodically, sampling all possible combinations of features and columns and providing astrometric information. Specifically, we tried all 64 combinations of features listed from (x, y) to the most complete one with (x, y, pmx, pmy, parallax, pmra_pmdec_corr, parallax_pmdec_corr, parallax_pmra_corr). For each combination of features, we covered a wide range of parameters that tune the fitting in each model. For example, in the case of the MLP, we tried all possible combinations of one, two, or three hidden layers, each one with a number of perceptrons ranging from 5 to 305 in steps of 10, totaling ∼1.8 × 106 combinations. For the SVR, we tested a total of ∼2.3 × 105 different combinations, covering epsilon and $z^{\prime} $ values over four different kernels: linear, second-degree polynomial, third-degree polynomial, and Gaussian radial basis function (Rbf). The best model architecture was chosen and tested through a nested k × l cross-validation. The NCVs use an outer loop (k-fold) and an inner loop (l-fold) to effectively split the data into independent train/validation/test sets. This allows the use of all instances to optimize and test the model. Specifically, we used a leave-one-out configuration for both NCVs' loops for the KNR and a k = 10, l = 5 for the SR. The performance and generalization error for each model were obtained by studying yres derived through the NCV in the training lists.

Four quantities were used to determine the goodness of the model: the correlation between ytrue and ypred, the sum of the squared residuals, the dispersion of the residuals, and the gradients along the sky, if any, of the residuals. A linear regression was fitted to each output, using ytrue as an independent variable and ypred as a dependent one, in order to derive the correlation between both. Possible artificial gradients along the sky were studied by fitting a surface to $\left|{y}_{\mathrm{res}}\right|$ using $(\left|x\right|,\left|y\right|)$ as independent variables, where all quantities had been previously averaged over square cells of 0fdg5 × 0fdg5 in order to avoid overrepresentation in the most dense areas.

Except for the approximately 10% worst configurations, all tested models yielded qualitatively compatible results. In fact, no distinction could be made between the ∼15% best configurations of a given model. Only models fitted to at least three features are able to reproduce the dispersion of stars along the line of sight; this ability is increased, in general, by the use of more features. Models fitted to just (x, y) were used as benchmarks for the average distribution of stars along the sky. The different importance of the observational errors made it necessary to take a different approach for each training sample.

C.1. Distances

C.1.1. Errors in the Training Sample and Estimation of Real FWHM

The RR Lyrae stars in Sgr have large observational errors due to their relatively faint magnitudes. This affects both the independent variables and the distances inferred from their pulsation modes. We have kept close control of the independent variable errors by propagating the statistical errors provided in the Gaia gaia_source table and through our small systematic error simulations (see Appendix D.2).

We have measured the dispersion caused by errors affecting our derived distances by selecting RR Lyrae stars likely to be members of the globular cluster M54 and measuring the dispersion of their location along the line of sight. The selection was based on their position in the sky and their distance. We first selected all RR Lyrae stars within the M54 tidal radius, rt , and then selected them in distance using a GMM whose number of components was chosen through a Dirichlet Bayesian process. We repeated the experiment, ranging the initial guess of the number of components in distance from two to eight, always obtaining as a result two components, or groups. We assumed that the narrow component was formed by M54 stars and obtained a dispersion of 0.946 kpc along the line of sight. Assuming M54 sphericity and a tidal radius of ∼73 kpc (using c = 2.04, rc = 0.09; Harris 1996, 2010 edition), we found that 0.944 kpc of artificial dispersion is needed in order to explain the observed dispersion in distances along the line of sight.

The M54 stars provide only an estimation of the distance error due to Gaia photometric limitations. Other sources of error, such metallicity spread or reddening correction, are not likely to affect such a homogeneous stellar population in a very small region in the sky as M54. Therefore, these quantities should be interpreted as a minimum level of error for the entire Sgr.

We calculated that a dispersion of 0.2 dex in metallicity (McConnachie 2012) would cause a dispersion in distance of ∼0.65 kpc in our RR Lyrae list. Errors in the reddening correction do not have a critical impact on the results, adding a mere ∼0.02 kpc of dispersion when considering 15% of error. Thus, assuming that metallicity dispersion and reddening errors caused a dispersion of 0.67 kpc, the total artificial dispersion in the distance amounts to 1.16 kpc. The subtraction of this value from the FWHM measured in the central 4° would yield a real FWHM of 2.48 kpc, rather than the measured 3.69 kpc. This value is close to the one derived by the OGLE team of 2.42 kpc (Hamanowicz et al. 2016) and encourages us to adopt 2.48 kpc as the real FWHM of the distribution of RR Lyrae stars along the line of sight on its central regions.

We took these results into account when scoring the models used to predict distances on the real data, favoring those models producing a distance dispersion of 2.48 kpc in the central regions of the galaxy for HB stars at the instability strip in the CMD. This ensures that the comparison is made between similar stellar populations.

C.1.2. Different Models

As shown in Appendix C.1.1, almost one-third of the observed dispersion along the line of sight (1.16 kpc) could be caused by observational errors. The ideal model will regularize the fitting to the training list, averaging several stars, avoiding overfitting the errors, and providing the most accurate distance possible. Hence, the dispersion of the residuals from the cross-validation is not indicative of how well the model performs when predicting distances, and we decided to use the Spearman correlation coefficient, the dispersion in the predicted distances in the central regions of the galaxy, and the existence of artificial gradients in the predicted distances with respect to the training sample.

The SVR with linear or polynomial kernels and the KNR models tend to produce better results using fewer features, while the SVR with Rbf kernel, MLP, and RFR benefit from including extra information, such correlations between features. This behavior is somewhat expected, as SVR and KNR models are simpler and may not obtain useful information from the correlation features, which show a periodic behavior in the sky along the covered area. In all cases, best results were obtained by fitting at least four features (columns).

The best model overall was SVR with the Rbf kernel fitting all features, although just marginally. This model is computationally expensive and was very closely followed by the KNR, which is the fastest of the seven tested models. We thus decided to use the KNR until less than 100 stars were rejected during an iteration in our code. Then we shifted to the SVR model with the Rbf kernel until convergence. Our final test results obtained with this method were the same within the errors as if only the KNR model was used; thus, we decided to keep the KNR results for their simplicity and faster performance.

The number of neighbors, k, and the best combination of independent variables, x , were determined on every iteration through the NCV test. Figure 19 shows three examples of results when using k = 1, 10, and 100 on the final training list. Low values of k produce slightly steeper predictions versus ground truth (closer to a one-to-one relation) but at the cost of having too high variance. Given our prior knowledge of the training list's errors, we favored configurations producing results with a central distance dispersion around 2.48 kpc measured for HB stars. In other words, we favored bias versus variance in order to prevent the artificial dispersion observed in the training list from propagating to our predictions. Notice that, as mentioned before, the error limitations in the training list prevented us from using the one-to-one relation as an indication of how well the model is performing. In fact, the KNR with the best one-to-one relation between Dpred and Dtrue, k = 1, was the worst model in any of the performed comparisons.

Figure 19.

Figure 19. Examples of distance predictions using the KRN model with different numbers of neighbors: k = 1, 10, and 100 (top to bottom). For each configuration, the left panels show predictions, Dpred, compared to the ground truth, Dtrue. Errors in Dtrue are shown by error bars. The red dashed line shows a linear fit to the data. The parameters of such a fit are shown in the top right part of the panel. The second and third panels show the spatial variation of the residuals, DtrueDpred, in the (Λ, B) coordinate system. Blue curves show the boxcar average of the distribution along each axis with steps of 1fdg5 and a window of 0fdg75. Blue shaded areas show the dispersion of the residuals measured as the standard deviation. The histograms in the right panels show the integrated distribution of the residuals. Their average, μ, and standard deviation, σ, are also shown.

Standard image High-resolution image

Depending on the iteration of the membership selection, the best results were obtained using k between 10 and 30 and x = (x, y, μx , μy , ϖ). Larger values of k tend to produce overly flattened results without much improving the variance. The KNR that produced the best results in the final iteration was with k = 15, although values ranging from k = 9 to 32 provided consistent results. Results removing ϖ or μy were also reasonably consistent, indicating that some correlation between these two variables might exist.

In Figure 20, individual conditional expectation (ICE) curves show the dependence between the KNR predictions and each feature in x . The position on the sky, (x, y), and PM along the west direction, μx , have the largest impact on the prediction. On average, there is also a nonmonotonic dependence of the prediction on parallax, ϖ, which is not intuitive. This dependence may stem from the complex relationship of the random and systematic errors in Gaia parallaxes and PM. In any case, the dependence is weaker than for the first three features, and models not using ϖ or μy produced similar results to our standard model.

Figure 20.

Figure 20. The ICE curves for the KNR model. Thick lines show the average dependence between the model predictions and each used feature in the training list x . Thin curves show 20% of all computed combinations.

Standard image High-resolution image

C.2. Line-of-sight Velocities

The bright stars in the vlos training sample show relatively small errors (<2 km s−1) compared to the observed dispersion. This mitigates the risk of overfitting and allows us to use the sum of the squared residuals as indicative of how well the model is performing. On the other hand, the stars in this training sample are sparsely distributed along the sky, which poses a challenge to the models that have to interpolate the gaps in a sensible way.

We tested models with nonlinear kernels, such as second- and third-degree polynomial or radial basis functions, with mixed results. While in some cases, nonlinear kernels provided better predictions in the training set, they tended to produce overly large or low velocities for extreme values of x in the main sample. This behavior can be controlled by means of regularization, $z^{\prime} $, but introduces a large dependence of the result on this and other metaparameters of the fitting. When using linear kernels, on the other hand, SVRs are highly insensitive to the use of larger $z^{\prime} $ and thus to overfitting. The choice of a linear kernel is thus justified in the outer parts of the galaxy by the simplicity of the model and the readability of its results, although one should be aware of its limitations in reproducing the nonlinear features observed in the training set.

To overcome this, we decided to combine our best-performing linear and nonlinear learners into an SR. In an SR, a model is trained over the prediction of several learners through cross-validation of their results. They generally outperform the best of the individual learners but are computationally expensive to run. Our SR model used a nonlinear artificial neural network, a MLP, to combine the predictions of six regressors, three nonlinear (extra trees, support vector machine with a radial basis function, and Gaussian process) and three linear (lasso, support vector machine with a linear kernel, and an elastic net). We chose the best configuration for each of the learners by maximizing ${R}^{2}={(1-\sum {({y}_{\mathrm{true}}-{y}_{\mathrm{pred}})}^{2}/\sum (({y}_{\mathrm{true}}-\left\langle {y}_{\mathrm{true}}\right\rangle )}^{2})$ in a grid covering different values of their fitting metaparameters through an NCV. The experiment showed that the model provides robust predictions against mid-to-small metaparameter variations. Only shifting all metaparameters toward a specific behavior, e.g., to have more regularization or large variations on the architecture of the final MLP metaregressor, had a nonnegligible impact on the final predictions. This was somewhat expected, as the metaregressor leverages the input of the six independent models by choosing the best possible prediction, hence making it more stable against spurious bad predictions.

Figure 21 shows an example for three different MLP architectures in the metaregressor. The use of more neurons and extra hidden layers can help to better predict velocities. However, adding more layers or neurons can result in an overfitted model, producing predictions with needlessly large variances. The selected best architecture for the MLP varies slightly depending on the combination of the input models' metaparameters, and we found that the best 25% of our models produced indistinguishable results within the errors. Our final model in the last iteration had an architecture of (5, 95, 85, 1) and uses a ReLU activation function and an L-BFGS solver. The ICE curves showing the dependence of the prediction on each feature in x are shown in Figure 22.

Figure 21.

Figure 21. Same as Figure 19 but for the SR. Three different configurations are shown, with metaparameter architectures of one hidden layer and 15 neurons (15), two hidden layers and 105 neurons per layer (105, 105), and three layers of 105 neurons each (105, 105, 105). The shaded area around the one-to-one line in the left panels shows the internal velocity dispersion of the galaxy, 11.4 km s−1 (McConnachie 2012).

Standard image High-resolution image
Figure 22.

Figure 22. Same as Figure 20 but for the SR model.

Standard image High-resolution image

The resulting model was able to reproduce nonlinear features while showing stability against high-order oscillations on areas where no information is available. However, although the model produced some dispersion in the predicted velocities, cross-validation and final tests showed that the reproduced dispersion is likely to be driven by random differences with the actual velocities rather than an accurate prediction of naturally dispersed values. This manifests as the inability of the model to reproduce the extreme values of the velocity in the left panels of Figure 21, failing to achieve a one-to-one relation. This was the also the case for all individual regressors, regardless of their degree of complexity and nonlinearity. These findings were totally expected, as velocity stochasticity cannot be reproduced star by star by nonphysically motivated models such the ones used here. Furthermore, despite having much better errors than the distance training list, the line-of-sight velocity errors in PMs are still high star by star (∼0.12 mas yr−1), causing random variance in the residuals. This does not pose a problem to our analysis, where our aim is to study the general velocity trends in the galaxy, averaging stars in Voronoi cells containing ∼1000 stars. Our NCV tests show that the SR does not produce any undesired spatial bias that could otherwise affect our results.

C.3. Caveats

We have conscientiously selected, configured, and trained our models in order to minimize possible biases and errors in their predictions. The models presented in this work were able to make accurate predictions corroborated by our extensive cross-validation tests. The results obtained in the gaia_source table shown in this paper are fully compatible with those obtained separately for our training samples, where only one of the variables has been modeled (distance for the vlos catalog and vlos for the distance catalog). Furthermore, results obtained by imposing a constant D on the vlos training list or a constant vlos on the distance training list also provided compatible similar results, indicating that the ML models used are able to make good predictions based on the training list they used to learn. Yet the ML models are limited by the quality of both the target data, ypred, and the features used to train the model and make predictions on new data, x . The observational errors affecting these quantities negatively affect the models' prediction abilities. We believe that, in terms of the internal dynamics and structure of Sgr, no meaningful star-by-star prediction can be made with our models given the current limitations of the data. Furthermore, training data are likely to be affected by systematic errors. A model that well reproduced a feature, ytrue, in the training list affected by systematics would be propagating those systematics to its final predictions, ypred.

We have performed extensive tests in order to find and analyze systematics in the values of ytrue in our training lists, finding the data to be highly homogeneous and of similar quality. We have also propagated all sources of errors we are aware of to our final results and averaged these within Voronoi cells of ∼1000 stars, allowing statistically meaningful comparisons between cells. Lastly, we have inspected "by-eye" the predictions from our best candidate ML models, which all provided qualitatively consistent results between them, with our training lists and previous studies in Sgr. Thus, our decisions on the plausibility of our solutions are ultimately based on our previous knowledge of Sgr and comparisons to physically motivated N-body models as a final check.

Appendix D: Astrometric Zero-points and Errors

A proper treatment of errors is crucial to this analysis. Two main sources of error are known to be affecting our measurements: standard and systematic errors.

D.1. Standard Errors

Standard uncertainties, or statistical errors, accompany each magnitude in the Gaia DR2 catalog. These measure the precision rather than the accuracy and are known to be underestimated by a factor of 1.10 (Arenou et al. 2018; Lindegren et al. 2018). We propagated with Monte Carlo all statistical errors through all coordinate transformations performed. These include errors in the coordinates of the stars and the determination of the COM, σ(α*), σ(δ), $\sigma ({\alpha }_{0}^{* })$, and σ(δ0), as well as errors in the PMs, σ(μα*), σ(μδ ), σ(μα*0), and σ(μδ0). Errors from external quantities such the distance to the system, σ(D0), or its vlos, σ(vlos), are also considered and properly propagated.

The procedure for deriving distances and line-of-sight velocities is also performed in a Monte Carlo fashion, considering and propagating the errors from all independent and dependent variables. A total of 106 realizations are performed, randomly sampling each independent and dependent variable from a normal distribution centered in its corresponding nominal value. The predicted values using the nominal values of the independent variables are assumed to be the nominal values of the final result. The quadrature addition of the standard deviation of all realizations plus the typical standard deviation from the cross-validation test is adopted as the total error.

D.2. Systematic Errors and Zero-points

Systematics are known to be affecting DR2 data, artificially shifting the astrometric zero-points. They are yet not well understood, but it appears that their effects can be differentiated into two components. The first one consists of large-scale systematics with a scale length of θ = 20°. With such a large scale length, the main effect these systematics will have in our data is to modify the bulk parallax and PMs of the galaxy. This will, in principle, have negligible effects on the detection of rotation.

We have studied any possible large-scale systematics effects on the bulk PMs and rotation using quasars. We found 278 AllWISE AGN sources (Secrest et al. 2015) that cross-match with Gaia DR2 sources within our field. We cleaned this sample of spurious sources by applying the procedure explained in Section 2, i.e., rejecting sources not compatible with the centroid of the distribution up to 3σ. The median of their PMs was added to the PMs of the stars in our sample in Table 1 and the subsequent calculations.

The small-scale systematics, on the other hand, introduce a variation in the zero-point of the parallax and PMs with a regular pattern of scales of ∼1° likely related to the Gaia scanning law. This banding pattern should average out for sufficiently large objects on the sky, such as Sgr. We have run tests to quantify how much these small-scale systematics may be affecting our measurements. Figure 23 shows the observed small variations in the astrometric zero-points caused by the small-scale systematics present in Gaia after removing most of the Sgr star. Trying to correct for these systematics could introduce an extra source of error. We therefore decided to calculate and propagate the variance introduced by these patterns in the final solution.

Figure 23.

Figure 23. Small systematic errors affecting the parallax and PMs along the observed region. These manifest as position-dependent variations in the astrometric zero-points. Notice that some Sgr stars are polluting these samples, seen as an area of lower PM R.A. and excess of PM decl.

Standard image High-resolution image

Following the same prescription adopted by van der Marel et al. (2019), we create a mock systematic pattern with an rms of 35 μas yr−1 (consistent with Gaia Collaboration et al. 2018b). We then shift the origin in sky coordinates (x, y) by random amounts between [−0fdg5, 0fdg5) and rotate the pattern between [−π, π). This is done separately for μα* and μδ 106 times, recalculating the COM PM and the rotation signal about it in each iteration. From all realizations, we obtained a standard deviation of 0.0006 mas yr−1 for both σ(μα) and σ(μδ ) from the distribution of the COM PMs and 4 × 10−6 mas yr−1 for σ(μr ) and σ(μρ ). This supposes a variation of σ(vW ) = 0.8 and σ(vN ) = 0.7 km s−1 and σ(v3) =0.0005 and σ(v2) = 0.001 km s−1. These errors are included in all error intervals provided in this paper.

Appendix E: Kinematic Properties of the Spherical N-body Models

Spherical models fail to reproduce the internal velocity gradients we observe in the data. Figures 24 and 25 show the expansion and rotation signals measured over the principal axes of the simulated Sgr of the Fardal et al. (2019) and Law & Majewski (2010) models, respectively. These figures should be compared with Figures 13 and 14.

Figure 24.

Figure 24. Same as Figures 13 and 14 but for the model in Fardal et al. (2019). The model has a core that is more extended than the observations, as it can be seen by the kinematic influence of the tidal tails for distances larger than ∼1.5 kpc.

Standard image High-resolution image
Figure 25.

Figure 25. Same as Figures 13 and 14 but for the model in Law & Majewski (2010). The model has a core that is too spherical, and that transition to the stream too close from the center. This was also observed in Vasiliev & Belokurov (2020).

Standard image High-resolution image

Appendix F: 3D Projections of the Galaxy

Figure 26 shows a 3D rendering of the galaxy as seen from each one of its principal axes. In these projections, the axes of the Cartesian grid are aligned with the galactocentric axes. These are the same axes as in Figure 10; the Galactic plane is contained in XY, and the Sun is located at (X, Y, Z) = (−8.29, 0, 0) kpc.

Figure 26.

Figure 26. The 3D renderings of the stellar density and kinematics of Sgr. The Cartesian grid is aligned with the galactocentric coordinates. The points of view are for an observer located at 5.2 kpc from the Sgr COM along the longest, intermediate, and shortest principal axes of inertia of Sgr, respectively. Black cones mark the error-weighted average motion of the stars within 3D Voronoi cells of ∼500 stars each. The rest of the markers coincide with those used in Figure 9. An interactive version of this figure can be found in the online journal and at https://www.stsci.edu/∼marel/hstpromo/Sagittarius_GC.html. Note that the interactive graphic appears in full when viewed with the current versions of the Chrome and Firefox web browsers. The stellar density may appear to be missing when viewed with the current Safari browser, which is due to a lack of webP support.(An interactive version of this figure is available.)

Start interaction
Standard image High-resolution image Figure data file

Footnotes

Please wait… references are loading.
10.3847/1538-4357/abd5bf