The following article is Open access

Monochromatic Globular Clusters as a Critical Test of Formation Models for the Dark Matter–deficient Galaxies NGC 1052-DF2 and NGC 1052-DF4

, , , , , , , , , , and

Published 2022 November 15 © 2022. The Author(s). Published by the American Astronomical Society.
, , Citation Pieter van Dokkum et al 2022 ApJL 940 L9 DOI 10.3847/2041-8213/ac94d6

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

2041-8205/940/1/L9

Abstract

It was recently proposed that the dark matter–deficient ultradiffuse galaxies DF2 and DF4 in the NGC 1052 group could be the products of a "bullet dwarf" collision between two gas-rich progenitor galaxies. In this model, DF2 and DF4 formed at the same time in the immediate aftermath of the collision, and a strong prediction is that their globular clusters should have nearly identical stellar populations. Here we test this prediction by measuring accurate V606I814 colors from deep HST/ACS imaging. We find that the clusters are extremely homogeneous. The mean color difference between the globular clusters in DF2 and DF4 is ΔDF2−DF4 = −0.003 ± 0.005 mag, and the observed scatter for the combined sample of 18 clusters with M606 < −8.6 in both galaxies is σobs = 0.015 ± 0.002 mag. After accounting for observational uncertainties and stochastic cluster-to-cluster variation in the number of red giants, the remaining scatter is ${\sigma }_{\mathrm{intr}}={0.008}_{-0.006}^{+0.005}$ mag. Both the color difference and the scatter are an order of magnitude smaller than in other dwarf galaxies, and we infer that the bullet scenario passes an important test that could have falsified it. No other formation models have predicted this extreme uniformity of the globular clusters in the two galaxies. We find that the galaxies themselves are slightly redder than the clusters, consistent with a previously measured metallicity difference. Numerical simulations have shown that such differences are expected in the bullet scenario, as the galaxies continued to self-enrich after the formation of the globular clusters.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

NGC 1052-DF2 (or DF2) and NGC 1052-DF4 (DF4) share two unusual properties that set them apart from all other known galaxies. First, their globular clusters are, on average, a factor of ≈4 brighter and a factor of ≈2 larger than canonical values (van Dokkum et al. 2018a, 2019; Ma et al. 2020; Shen et al. 2021a). Furthermore, their velocity dispersions are consistent with their stellar mass alone and much smaller than expected from a normal dark matter halo (van Dokkum et al. 2018b; Wasserman et al. 2018; Danieli et al. 2019; Emsellem et al. 2019).

Initially, the main question was whether the dynamical masses and distances were measured correctly (see, e.g., Martin et al. 2018; Laporte et al. 2019; Trujillo et al. 2019), but as the anomalous properties of the galaxies were gradually confirmed (and corroborated with independent evidence; see Dutta Chowdhury et al. 2019; Keim et al. 2022), the focus shifted to the question of how they were formed. Proposals include assembly out of tidally removed gas (Fensch et al. 2019), stripping of dark matter by close encounters with NGC 1052 (Ogiya 2018; Carleton et al. 2019; Nusser 2020; Moreno et al. 2022; Ogiya et al. 2022) or NGC 1035 (Montes et al. 2020), jet- or outflow-induced star formation, like Minkowski's object (van Breugel et al. 1985; Natarajan et al. 1998), and extreme feedback in low-mass halos (Trujillo-Gomez et al. 2022).

Silk (2019) suggested that DF2 could be the result of a "mini bullet cluster" event, where the dark matter and the baryons became separated in a nearly head-on encounter between two gas-rich progenitor galaxies. Such a collision produces two dark matter–dominated remnants with a globular cluster-rich, dark matter–free object in between them that formed from the shocked gas. This scenario was further explored with simulations in Shin et al. (2020) and Lee et al. (2021), who showed that collisions between an unbound object and a satellite could explain many of the observed properties of the galaxies and occur with some regularity in cosmological simulations. The main issue with this model—as with many of the alternative explanations listed above—is the presence of two dark matter–deficient galaxies, seemingly requiring lightning to strike twice in the same group.

Recently, we suggested that a single bullet dwarf collision may have produced both DF2 and DF4 (van Dokkum et al. 2022). This joint formation is consistent with the striking similarities between the two galaxies, their radial velocities and line-of-sight distances, the emergence of multiple clumps in at least some bullet collision simulations (Shin et al. 2020), and with the discovery that DF2 and DF4 are part of a remarkable trail of ≈10 galaxies in the group.

As noted in van Dokkum et al. (2022), this formation model for DF2 and DF4 is falsifiable, as it makes a very specific prediction for their globular clusters. Both galaxies formed out of the gas that was left behind by the progenitor galaxies at the same time. This gas mixed efficiently during the collision and should have a uniform metallicity. While overall star formation likely lasted for ≳500 Myr (Shin et al. 2020; Lee et al. 2021), the globular clusters formed almost instantaneously before further enrichment could substantially change the abundance of the gas (Lee et al. 2021). As a result, the stellar populations of all the globular clusters, in both galaxies, should be extremely similar. 10 This is not what is typically observed in dwarf galaxies. In a comprehensive study of the colors of globular clusters in Virgo dwarf galaxies, Peng et al. (2006) found that there is substantial cluster-to-cluster and galaxy-to-galaxy scatter.

Previous measurements of the colors of the globular clusters in DF2 and DF4 do in fact suggest differences between them, in potential conflict with the bullet model. Tables 1 and 2 in Shen et al. (2021a) imply a mean V606I814 globular cluster color of 0.37 ± 0.01 AB mag in DF2 and 0.41 ± 0.01 in DF4. This difference seems small, but it corresponds to ≈0.3 dex in age or ≈0.4 dex in metallicity. Furthermore, the cluster-to-cluster scatter is nonzero, with σ = 0.04 ± 0.01 mag for both galaxies. There is also a hint that the galaxies themselves might have different V606I814 colors: Cohen et al. (2018) find 0.40 for the diffuse light in DF2 and 0.32 for DF4, albeit with an uncertainty of 0.1 for both.

Table 1. Photometry of Globular Clusters

ID M606 V606I814 ± Meas± Phot± Stoch
Main Sample
DF2-39−9.400.3630.0010.0070.008
DF2-59−8.970.3890.0040.0080.010
DF2-71−9.120.3960.0070.0100.009
DF2-73−10.240.3650.0020.0070.007
DF2-77−9.760.3560.0060.0090.007
DF2-85−9.350.3880.0100.0120.008
DF2-91−9.290.3770.0050.0080.009
DF2-92−9.610.3820.0050.0090.007
DF2-93−8.890.3490.0110.0130.010
DF2-98−8.970.3740.0030.0080.010
DF2-101−8.880.3650.0030.0080.010
DF4-1092−9.120.3770.0020.0070.008
DF4-1403−8.960.3840.0040.0080.009
DF4-3515−8.670.3740.0050.0090.010
DF4-4045−9.690.3750.0060.0090.007
DF4-5675−9.290.3600.0040.0080.008
DF4-6129−9.130.4110.0040.0080.008
DF4-6571−8.930.3810.0040.0080.009
Faint Sample
DF2-2764−8.020.4170.0260.0270.016
DF2-3555−7.700.3910.0170.0180.017
DF2-4061−8.020.3780.0200.0210.016
DF4-1755−7.920.4290.0150.0170.016
DF4-3278−8.170.3240.0260.0270.013
DF4-3602−8.570.4270.0210.0220.011
DF4-4176−7.250.3790.0700.0710.023
DF4-4366−7.400.4290.0180.0190.020

Note. Magnitudes are on the AB system. IDs are from Shen et al. (2021a). Uncertainties on the colors are ±1σ, with ± meas the measurement uncertainty, ± phot the measurement uncertainty combined with the flat-fielding uncertainty, and ± stoch the stochastic uncertainty due to variations in the number of red giants.

Download table as:  ASCIITypeset image

These measurements were performed using standard techniques (such as Source Extractor; Bertin & Arnouts 1996) and are largely based on single-orbit HST/ACS images that were reduced with the default STScI flat fields. Here we remeasure the colors of the clusters and the diffuse light in both DF2 and DF4 using custom techniques and well-calibrated, much deeper data, as a stringent test of the bullet dwarf model. Where needed, we use a distance of D = 21.7 Mpc for DF2 and D = 20.0 Mpc for DF4 (Danieli et al. 2020; Shen et al. 2021b; Z. Shen et al., in preparation).

2. Data

We make use of deep HST/ACS data that were obtained in programs GO-15695 and GO-15851. The aim of these programs was to measure distances to DF4 (GO-15695) and DF2 (GO-15851) from the tip of the red giant branch (TRGB). For DF4, the exposure times were seven and three orbits in I814 and V606, respectively. In Danieli et al. (2020), these were combined with the 1+1 orbits that had been obtained in GO-14644 (see Cohen et al. 2018), and the location of the TRGB was measured from these 8+4 orbit depth data. The DF2 observations of GO-15851 were deeper, at 19 orbits in I814 and 19 orbits in V606. Shen et al. (2021b) used these data to measure the TRGB distance of DF2, again after adding the GO-14644 data for a total exposure of 20+20 orbits.

In this study we redrizzle the ACS data, with several changes. First, we discard the 1+1 orbits that were obtained for DF2 and DF4 in GO-14644. The orientation of these observations was very different from the more recent data, which means the point-spread function (PSF) and charge transfer efficiency corrections are different, and there is no need to maximize depth: The fluxes of the globular clusters are typically ∼30 e s−1, which means that Poisson errors are ≲0.5% even in a single orbit.

More importantly, we apply a flat-fielding correction to the flc files prior to drizzling. The ACS flat fields were last updated in 2006, and they no longer provide optimal corrections. As described in the instrument science report 11 ACS 2017-09, ultradeep stacks of Frontier Fields images show that there are systematic flat-fielding residuals at the level of 1%. This effect is also described in ISR ACS 2020-08. The residuals correlate with the local thickness of the CCD, and as the correlation between quantum efficiency and thickness reverses at ≈700 nm, the residuals in V606 and I814 are spatially anticorrelated. As a result, flat-fielding errors in the V606I814 color reach 2%, even though they are only half that in each filter individually.

The ACS team at STScI provided us with preliminary correction flats, and we applied these to the flc files. The flc files were aligned with each other using tweakreg. The tweakreg algorithm is sensitive to cosmic rays; we addressed this by running the code on versions of the data where cosmic rays were removed (with L.A.Cosmic van Dokkum 2001). I814 images of DF2 and DF4 are shown in Figure 1. Visually comparing the default and flat-field-corrected images, there is a clear improvement in the flatness of the sky background, particularly for DF2. From the remaining variation in the background, we estimate that residual flat-fielding errors are 0.5% ± 0.2%. Assuming that the errors are no longer (anti)correlated, the uncertainty in the color due to spatially varying flat-fielding uncertainties is then 0.7% ± 0.3% in our images.

Figure 1.

Figure 1. Top panels: images of DF2 and DF4 in the HST/ACS I814 filter, after applying flat-field corrections to the flc files and drizzling. The images span 80'' × 80''. Bottom panels: photometry procedure for two of the globular clusters, indicated with yellow boxes above. From left to right, we show the original I814 image (with a size of 7'' × 7''), the final object mask, the PSF-convolved King (1962) model fit with the best-fitting background plane, and the cleaned image. The cleaned image is the original image with the masked areas filled in by the King model. Colors and total magnitudes are measured from aperture photometry on the cleaned V606 and I814 images.

Standard image High-resolution image

3. Measurements of Colors and Magnitudes

3.1. Sample

At present, 24 globular clusters have been spectroscopically confirmed, 12 in DF2 and 12 in DF4. These are listed in Tables 1 and 2 of Shen et al. (2021a); the sources of the spectra are van Dokkum et al. (2018a, 2019), Emsellem et al. (2019), and Shen et al. (2021a). The sample is likely complete for MV ≲ −8.5. Several clusters are not included in our analysis. DF2-80 is superposed on a faint spectroscopically identified blue background galaxy (Shen et al. 2021a) with no possibility of disentangling the two objects; DF4-97 falls outside of the coverage of the GO-15695 data, and DF4-926 is projected onto a bright background merger (see van Dokkum et al. 2019). This leaves 11 confirmed clusters in DF2 and 10 in DF4.

Three of the confirmed DF4 clusters are fainter than MV = −8. As we show later the errors increase sharply at fainter magnitudes, and our quantitative analysis of the V606I814 scatter focuses on the 11 + 7 = 18 objects that have MV < −8.6. The faint clusters are used to investigate trends with magnitude; to this end, we use an augmented sample in Section 4 that includes five faint objects without a spectrum. These candidate clusters satisfy the Shen et al. (2021a) color and size criteria and are located within 45'' of the center of DF2 or DF4. The sample is listed in Table 1.

3.2. Cleaned Globular Cluster Images

Even with flat-fielding errors largely eliminated, it is not straightforward to obtain photometry with the desired accuracy. An investigation into the effects of changes in aperture sizes and background subtraction shows seemingly random variation in the colors at the 1%–2% level for individual clusters. These variations can be traced to the cumulative effects of contaminating objects, mostly red giants in the main bodies of the galaxies, on both the flux measurements and the estimation of the background.

In shallow data, this is an unavoidable source of noise, but owing to the depth of the I814 images, we can mitigate these effects. We use the following procedure for each cluster. We begin by creating a mask for the brightest contaminating objects in a 7'' × 7'' region centered on the cluster. A simple lower limit of 0.03 e s−1 pix−1 is used, excluding the central 1farcs0 × 1farcs0 to avoid masking the cluster itself. Next, an approximate model for the cluster is generated by fitting a PSF-convolved 2D modified King profile with α = 2 (Elson 1999; Peng et al. 2002) to the image, using galfit (Peng et al. 2002). The bright object mask is used, the background is modeled as a constant, and for the PSF, we choose a nonsaturated star in the image.

With a model for the cluster in hand an improved mask can be created by identifying pixels that deviate significantly from the model. Specifically, we create a residual image R = (IM)/M0.02, with I the image, M the model, and M0.02 the model with all pixels ≤0.02 e s−1 pix−1 set to 0.02. After median-filtering R by a 3 × 3 pixel filter, all pixels >0.15 are flagged and added to the initial mask. This process masks objects whose median flux in a 3 × 3 pixel aperture deviates by more than 15% from the model, as well as objects away from the central regions whose flux exceeds a fixed threshold of ≳0.003 e s−1 (with the precise threshold depending on the background level). The chosen values are a compromise between masking as many contaminants as possible and leaving sufficient data for reliable flux and background measurements. The masks for two representative clusters, DF2-71 and DF4-4045, are shown in Figure 1.

The fit to the globular cluster is repeated with this new mask, now modeling the background as a plane with gradients in x and y to properly account for the diffuse light of DF2 and DF4. The models for DF2-71 and DF4-4045 are shown in the third column of Figure 1. Finally, a cleaned image is created by replacing all masked pixels in the original image by the corresponding pixels in the model (fourth column of Figure 1).

3.3. Aperture Photometry

Fluxes are measured directly from the cleaned images using simple aperture photometry. Colors are measured in 0farcs5 diameter apertures and the total V606 magnitudes in 1farcs5 diameter apertures. The background is determined from an annulus with an inner diameter of 1farcs5 and an outer diameter of 3farcs0. As noted above, the results depend on the precise choice of these parameters when applied to the original images, but we find that they are insensitive to them when applied to the cleaned images.

Errors on the fluxes were determined empirically by repeating the aperture photometry in a grid of 6 × 6 blank positions within the 7'' × 7'' cleaned images. This procedure ensures that the local environment of each globular cluster is taken into account. As can be seen in the mask images of Figure 1, the central region (where the globular cluster is) is typically not masked as faint stars do not exceed the thresholds there. This leads to a bias that can be quantified and corrected for in the grid photometry. At each grid position, we replaced the pixels within ±0farcs25 of that position with those from the original image. The grid photometry now closely resembles the globular cluster photometry. The mean flux from the 36 grid positions is subtracted from the globular cluster measurements, and the scatter among them is taken as the uncertainty. Both values were determined with the biweight estimator (Beers et al. 1990) as it is insensitive to outliers.

Aperture corrections are taken from Bohlin (2016); as the half-light radii of the globular clusters are typically 0farcs05 (van Dokkum et al. 2018a), these point-source corrections are adequate for obtaining accurate colors. Fluxes are converted to AB magnitudes using the zero-points in the headers of the images and corrected for Galactic extinction using the Schlafly & Finkbeiner (2011) maps. The corrections are −0.060 mag, −0.062 mag in V606 for DF2, DF4, and −0.037 mag, −0.038 mag in I814. Finally, total magnitudes are converted to absolute magnitudes.

4. Observed Color Variation

The distribution of the DF2 and DF4 globular clusters in the color–magnitude plane is shown in the top panel of Figure 2. Error bars are a combination of the measurement uncertainty and the 0.007 mag flat-fielding uncertainty. For reference, the grayscale background shows the parameterized distribution of globular clusters in low-luminosity galaxies in the Virgo cluster. This is a combination of the two-component decomposition of the color distribution for the faintest galaxies 12 in Peng et al. (2006) and the Gaussian fit to the globular cluster luminosity function of the faintest galaxies 13 in Jordán et al. (2007).

Figure 2.

Figure 2. Top panel: DF2/DF4 globular clusters in the color–magnitude plane. Solid symbols are spectroscopically confirmed, and light-colored symbols have uncertainties <0.015 mag. The parameterized distribution for globular clusters in low-luminosity Virgo galaxies is shown in gray (see text). Bottom-left panel: mean color of globular clusters with uncertainties <0.015, compared to Virgo galaxies from Peng et al. (2006). There is no significant systematic color difference between DF2 and DF4. Bottom-right panel: observed and intrinsic scatter in the colors, again compared to Virgo galaxies. The clusters in DF2 and DF4 are extremely homogeneous. The orange line is the minimum possible scatter, arising from stochastic fluctuations in the number of red giants.

Standard image High-resolution image

The globular clusters in DF2 and DF4 are much brighter than those in Virgo dwarfs, as has been discussed extensively in earlier papers (see van Dokkum et al. 2018a; Trujillo et al. 2019; Shen et al. 2021a), and they are also somewhat bluer. There is no evidence for a systematic trend with magnitude when the full sample of confirmed and candidate clusters is considered. The photometric uncertainties increase sharply at fainter magnitudes, and in the following, we only consider the 18 clusters with errors <0.015. This sample corresponds to the full sample of clusters with M606 < −8.6.

The mean color of the DF2 and DF4 clusters is nearly identical: 〈V606I814〉 = 0.374 ± 0.004 for DF2 and 〈V606I814〉 = 0.377 ± 0.003 for DF4, 14 as determined with the biweight estimator (Beers et al. 1990). The mean color difference is ΔDF2−DF4 = −0.003 ± 0.005. The mean colors are compared to those of globular clusters in Virgo galaxies 15 in Figure 2 (lower left). The clusters in DF2 and DF4 are bluer than those in Virgo galaxies with similar NGC. 16 Furthermore, in the Virgo sample with NGC < 20 the median color difference between any two data points is 0.036, an order of magnitude larger than the difference between DF2 and DF4.

The cluster-to-cluster scatter is also very small and within the errors is the same for the two galaxies: σobs = 0.015 ± 0.003 for DF2 and σobs = 0.010 ± 0.003 for DF4. The observed scatter in the combined sample of 18 bright globular clusters in DF2 and DF4 is σobs = 0.015 ± 0.002 (all determined with the biweight estimator; the rms is also 0.015). The intrinsic scatter ${\sigma }_{\mathrm{intr}}$ can be determined by constructing the likelihood function,

Equation (1)

with ci the colors of the individual clusters, μ the mean, and ${\sigma }_{\mathrm{eff}}^{2}={\sigma }_{\mathrm{intr}}^{2}+{e}_{i}^{2}$. We find an intrinsic scatter of ${\sigma }_{\mathrm{intr}}={0.012}_{-0.003}^{+0.004}$. As shown in Figure 2 (lower right), the typical scatter in Virgo galaxies is σ ≈ 0.1 mag, and there are no galaxies with σ < 0.05 mag. We conclude that the globular clusters in DF2 and DF4 form a remarkably homogeneous population, as predicted by the bullet dwarf collision model. This level of homogeneity is not observed in normal dwarf galaxies.

5. Interpretation of the Variation

The color variation is small but not zero. As noted above, observational uncertainties explain some of the observed scatter. This is shown in the top-left panel of Figure 3, where the error bar for each data point is split into several distinct contributions. Measurement uncertainties are shown in black and the total photometric uncertainties, which include the 0.007 ± 0.003 mag flat-fielding errors, are shown in dark gray.

Figure 3.

Figure 3. Top-left panel: zoom-in on color–magnitude measurements with errors <0.015, corresponding to all clusters with M606 < −8.6. Error bars are the measurement uncertainty (black), the combined measurement and flat-fielding uncertainty (dark gray), and the combined measurement, flat-fielding, and stochastic sampling uncertainty (light gray). Dotted orange lines show the expected ±1σ variation due to stochastic sampling alone. Solid lines include the other uncertainties. Top-right panel: likelihood for the intrinsic dispersion, for the photometric errors (broken line), and for the total error, including the effect of stochastic sampling (solid line). Bottom: the effects of stochastic sampling of the isochrone. Intrinsic cluster-to-cluster color differences of 0.01–0.02 mag arise from random variations in the number of red giants. Two examples of identical clusters with different random seeds are shown on the left (see the text for details).

Standard image High-resolution image

The intrinsic scatter is so small that the effects of stochastic sampling of the isochrone need to be taken into account. Red giants contribute significantly to the integrated light, and the Poisson variation in their number causes some clusters to be redder and brighter than others. The same effect causes the well-known pixel-to-pixel surface brightness fluctuations in galaxy images (Tonry & Schneider 1988; Greco et al. 2021). We quantify this effect by generating artificial globular clusters with the ArtPop code (Greco & Danieli 2021) and measuring their integrated colors. The clusters have $\mathrm{log}(\mathrm{age})=9.9$, −1.8 ≤ [Fe/H] ≤ −0.8, and cover a factor of 20 in mass (parameterized by $\mathrm{log}({n}_{\mathrm{stars}})$, which ranges from 5 to 6.3 with steps of 0.2). At each mass, 500 clusters are generated with different random seeds, and the rms scatter in the V606I814 colors is measured.

The results are shown in the bottom-right panel of Figure 3. The cluster-to-cluster V606I814 scatter is ∼1% in the relevant luminosity range, of the same order as the intrinsic scatter in the DF2/DF4 globular clusters, with a modest dependence on metallicity. Two example clusters that illustrate the effect are shown at left (see also Figure 6 in Greco & Danieli 2021). A polynomial fit to the luminosity-dependent variation has the form

Equation (2)

and is shown by the black line in Figure 3.

Light-gray error bars in the top panel of Figure 3 show the effect of including this uncertainty for each cluster. The broken orange lines show where 68% of the points are expected to fall due to the stochastic sampling effect alone. Solid orange lines include the measurement error; for a normal distribution entirely defined by these lines, 12 out of 18 would fall within them. They encompass 10, only slightly fewer. Using the total errors (that is, the combination of the measurement error, the flat-fielding error, and the stochastic variation) in the likelihood analysis gives the "stellar population scatter", ${\sigma }_{\mathrm{sp}}={0.008}_{-0.006}^{+0.005}$. This scatter is not significantly different from zero. The likelihood function, marginalized over μ, is shown by the solid line in Figure 3.

We use the flexible stellar population synthesis (FSPS) framework (Conroy et al. 2009) with the MIST isochrones (Choi et al. 2016) to determine the (limits on) variation in age and metallicity that is implied by the stellar population scatter. For ages near 8 Gyr and metallicities near [Fe/H] = −1.3, we find Δ(V606I814) = 0.10Δ[Fe/H] and Δ(V606I814) = 0.0064Δ(age) to good approximation, with age in Gyr. The observed stellar population scatter σsp implies ${\sigma }_{[\mathrm{Fe}/{\rm{H}}]}={0.08}_{-0.06}^{+0.05}$ if there is no variation in age or ${\sigma }_{\mathrm{age}}={1.3}_{-1.0}^{+0.8}$ Gyr if there is no variation in metallicity. In the simulations of Lee et al. (2021), the globular clusters can have a spread of up to ≈0.1 dex in metallicity and ≈150 Myr in age, and we conclude that the bullet hypothesis cannot be ruled out.

6. Colors of the Diffuse Light

The bullet model also makes predictions for the global colors of DF2 and DF4, although these are more model dependent than the predictions for the globular clusters (Shin et al. 2020; Lee et al. 2021). The galaxies are predicted to have higher metallicities than the globular clusters as they form stars over a longer time period (see Lee et al. 2021). Therefore, while the colors of DF2 and DF4 should be very similar to each other, they are predicted to be redder than those of the globular clusters.

We measure the average colors of the galaxies in the following way. An object mask is created by comparing the summed V606 + I814 image to a binned and median-filtered version of itself. A first model for the galaxy is made by median-filtering the V606 and I814 images, not taking masked pixels into account. This model is subtracted from the data, and the object mask is optimized with a lower threshold, taking care not to include giants in the mask. Then, the median-filtering is repeated to create a final model in each filter. There is a background gradient in all images, and at this stage, a surface is fitted to the background and subtracted. The rms of this background model (that is, 68% of the gradient that is removed) is taken as the uncertainty in each filter. Finally, the I814 model is divided by the V606 model, multiplied by the object mask, and a flux threshold is applied so that the faint outskirts are excluded.

These color images are shown in Figure 4 and compared to the mean color of the globular clusters. The variations within each galaxy are caused by stochastic variation in the number of red giants (surface brightness fluctuations). We measure the mean colors of the galaxies to be ${({V}_{606}-{I}_{814})}_{\mathrm{DF}2}=0.420\pm 0.024$ and ${({V}_{606}-{I}_{814})}_{\mathrm{DF}4}=0.436\pm 0.075$, consistent with the Cohen et al. (2018) measurements with smaller uncertainties. We infer that the colors of the galaxies are identical within the errors, as predicted by the bullet model.

Figure 4.

Figure 4. Color images of the galaxies, created by dividing binned and median-filtered I814 and V606 images of the galaxies. Numerical values are simply the ratios of the observed fluxes in e s−1. The mean value for the globular clusters is indicated with the circle. Color variations within each galaxy are caused by surface brightness fluctuations. The average color of DF4 is slightly redder than that of DF2 but the difference is not significant: ${({V}_{606}-{I}_{814})}_{\mathrm{DF}2}=0.420\pm 0.024$ and ${({V}_{606}-{I}_{814})}_{\mathrm{DF}4}=0.436\pm 0.075$. The galaxies are redder than the globular clusters.

Standard image High-resolution image

The galaxies are redder than the luminous globular clusters. If the actual color of both galaxies is V606I814 = 0.375 and the errors are Gaussian, the probability that we measure ${({V}_{606}-{I}_{814})}_{\mathrm{DF}2}\geqslant 0.420$ and ${({V}_{606}-{I}_{814})}_{\mathrm{DF}4}\geqslant 0.436$ is <1%. The color difference between the galaxies and the clusters of 0.05 mag is qualitatively consistent with the metallicity difference predicted by the hydrodynamical model of Lee et al. (2021). Using the relation in Section 5, we find Δ[Fe/H] ≈ 0.5, somewhat larger than the ≈0.2 predicted by Lee et al. (2021) but in good agreement with the spectroscopically determined value of Δ[Fe/H] = 0.56 ± 0.15 of Fensch et al. (2019).

7. Discussion

The central result of this paper is that the bright globular clusters in DF2 and DF4 have extremely similar colors. The observed scatter is ≈0.015 mag, and this can be explained by a combination of measurement uncertainties (≈0.01) and stochastic variations in the number of red giants (≈0.01). The remaining scatter among the 18 luminous clusters in DF2 and DF4 is ${\sigma }_{\mathrm{ssp}}={0.008}_{-0.006}^{+0.005}$, that is, not significantly different from zero. The diffuse light is redder in both galaxies, with again no significant difference between DF2 and DF4. These results are expected in the bullet dwarf scenario (see Silk 2019; Shin et al. 2020; Lee et al. 2021; van Dokkum et al. 2022), and we conclude that this model survives an important falsification test. Returning to the formation models listed in Section 1, no other published explanation for the lack of dark matter in DF2 and DF4 also "naturally" produces the extreme uniformity of their globular cluster populations.

The globular clusters in DF2 and DF4 are different from those in Virgo galaxies; as detailed in Section 4, they are brighter, bluer, and have a much smaller scatter. DF2 and DF4 are ultradiffuse galaxies (UDGs; van Dokkum et al. 2015) and at least two other UDGs also have very homogeneous globular cluster populations, NGC 5846-UDG1 (Müller et al. 2021; Danieli et al. 2022) and DGSAT i (Janssens et al. 2022). In other respects they are different; NGC 5846-UDG1 has a canonical globular cluster luminosity function, and DGSAT i's clusters are significantly redder than those in DF2/DF4. A small scatter indicates synchronized formation in a dense, homogeneous medium, and it may arise generically in any formation scenario that results in the extreme (factor of 109) density contrast between the globular clusters and the galaxy light that is seen in UDGs (see, e.g., van Dokkum et al. 2018a; Trujillo-Gomez et al. 2021). Specifically, intense feedback from the formation of the globular clusters may have caused the galaxies to expand and turn into UDGs (Trujillo-Gomez et al. 2021; Danieli et al. 2022). Further deep HST or JWST studies of globular clusters in UDGs in various environments are needed to investigate this further.

Our analysis focuses on the brightest clusters as these have the smallest uncertainties. Shen et al. (2021a) showed that the globular cluster luminosity function in DF2 and DF4 can be modeled as a combination of a bright peak of overluminous clusters plus a "normal" luminosity function with a normalization and mean that are typical for the galaxies' luminosities. In this context, it is interesting that the clusters with M606 > −8.6 are somewhat redder than the brighter ones, with 〈V606I814〉 = 0.400 ± 0.011. Perhaps the faint clusters formed together with the diffuse light, with a higher mean metallicity than the brighter ones. 17 The observed scatter is also higher for the faint clusters at σ606–814 = 0.034 ± 0.008, although this can be explained by a combination of measurement errors (0.028) and stochastic sampling (0.016). Extremely deep spectroscopy may shed more might on these questions.

Further tests of the bullet model are possible. The most straightforward next step is probably obtaining radial velocities and line-of-sight distances of other galaxies along the trail, as the bullet model predicts that these follow a regular sequence (with some contamination from unrelated objects). Dynamical mass measurements of other trail galaxies will also be highly constraining and interesting in a broader context as bullet dwarf events can, in principle, constrain the self-interaction cross section of dark matter. Modeling of the bullet cluster has provided an upper limit (Randall et al. 2008), but as self-interacting dark matter was introduced to explain the "cored" dark matter density profiles of low-mass galaxies (Spergel & Steinhardt 2000), it is important to measure the cross section on those scales (Tulin & Yu 2018).

We thank Yotam Cohen and the rest of the STScI ACS team for their help with the data reduction. Support from HST grants GO-14644, GO-15695, and GO-15851 is gratefully acknowledged. S.D. is supported by NASA through Hubble Fellowship grant HST-HF2-51454.001-A.

Footnotes

  • 10  

    We note here that simulations have so far only focused on comparing the properties of clusters across a single resulting galaxy fragment and not yet across several galaxies.

  • 11  
  • 12  

    Specifically, we averaged the fit values for the MB = −16.6 and MB = −15.7 bins in Table 4 of Peng et al. (2006). The gz colors were converted using V606I814 = 0.445(gz) + 0.061.

  • 13  

    We averaged the g-band fit values for the MB = −16.6, MB = −16.4, and MB = −15.7 bins in Table 3 of Jordán et al. (2007) and converted to V606 using V606 = g − 0.40.

  • 14  

    The error bars do not include an uncertainty of ≈1% in the absolute calibration of the ACS filters, as this systematic error affects all colors and magnitudes by the same amount.

  • 15  

    The Virgo data are the single-component fits in Table 3 of Peng et al. (2006).

  • 16  

    The color difference partly reflects an age difference: The DF2 clusters have ages of 9 ± 2 Gyr (van Dokkum et al. 2018a; Fensch et al. 2019) whereas the typical ages of metal-poor globular clusters are similar to those in the Milky Way (see Strader et al. 2005).

  • 17  

    Lee et al. (2021) predict the opposite trend, with the brighter clusters having a higher metallicity.

Please wait… references are loading.
10.3847/2041-8213/ac94d6