This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.

Precise Mass Determination of SPT-CL J2106-5844, the Most Massive Cluster at z > 1

, , , , , , , and

Published 2019 December 11 © 2019. The American Astronomical Society. All rights reserved.
, , Citation Jinhyub Kim et al 2019 ApJ 887 76 DOI 10.3847/1538-4357/ab521e

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/887/1/76

Abstract

We present a detailed high-resolution weak-lensing study of SPT-CL J2106-5844 at z = 1.132, claimed to be the most massive system discovered at z > 1 in the South Pole Telescope Sunyaev–Zel'dovich survey. Based on the deep imaging data from the Advanced Camera for Surveys and Wide Field Camera 3 on board the Hubble Space Telescope, we find that the cluster mass distribution is asymmetric, composed of a main clump and a subclump ∼640 kpc west thereof. The central clump is further resolved into two smaller northwestern and southeastern substructures separated by ∼150 kpc. We show that this rather complex mass distribution is more consistent with the cluster galaxy distribution than a unimodal distribution as previously presented. The northwestern substructure coincides with the brightest cluster galaxy and the X-ray peak while the southeastern one agrees with the location of the peak in number density. These morphological features and the comparison with the X-ray emission suggest that the cluster might be a merging system. We estimate the virial mass of the cluster to be ${M}_{200c}=({10.4}_{-3.0}^{+3.3}\pm 1.0)\times $ ${10}^{14}\,{M}_{\odot }$, where the second error bar is the systematic uncertainty. Our result confirms that the cluster SPT-CL J2106-5844 is indeed the most massive cluster at z > 1 known to date. We demonstrate the robustness of this mass estimate by performing a number of tests with different assumptions on the centroids, mass–concentration relations, and sample variance.

Export citation and abstract BibTeX RIS

1. Introduction

Careful studies of galaxy clusters, the largest gravitationally bound structures, play a pivotal role in understanding the formation and evolution of large-scale structure in the universe. The cluster mass function, the number of galaxy clusters at a given mass interval, enables us to probe the growth rate of the largest halos and constrain cosmological parameters (e.g., Allen et al. 2011). Since the amplitude of the matter power spectrum probed by the cluster mass function at a fixed redshift is a degenerate function of the matter density Ωm and normalization σ8, it is necessary to combine the cluster mass functions over a wide range of redshift in order to break the degeneracy. Therefore, there have been constant efforts to enlarge the sample of galaxy clusters at high redshift.

The efforts to find new high-z clusters are happening at various wavelengths, which include the X-ray (e.g., Fassbender et al. 2011; Mehrtens et al. 2012), optical/IR (e.g., Muzzin et al. 2009; Adami et al. 2010), and millimeter (e.g., Marriage et al. 2011; Bleem et al. 2015) surveys. Among these, the millimeter surveys utilizing the Sunyaev–Zel'dovich (SZ) effect are particularly efficient in detecting massive high-z (e.g., z > 1) objects because the SZ signal strength is nearly independent of the cluster redshift without being plagued by surface brightness dimming that depends on redshift as (1 + z)−4.

The galaxy cluster SPT-CL J2106-5844 (hereafter SPT2106) was detected in the South Pole Telescope (SPT) SZ survey with a high signal-to-noise ratio (18.5σ at 150 GHz; Foley et al. 2011, hereafter F11). F11 performed follow-up observations including both photometry and spectroscopy at various wavelengths. Their infrared and X-ray observations, from the Spitzer and Chandra space telescopes, respectively, show that the cluster is also rich in galaxies and intracluster medium. Both its high X-ray temperature ${T}_{{\rm{X}}}={11.0}_{-1.9}^{+2.6}\,{\rm{keV}}$ and luminosity ${L}_{{\rm{X}}}(0.5-2\,{\rm{keV}})=(13.9\pm 1.0)\times {10}^{44}\,{\rm{erg}}\,{{\rm{s}}}^{-1}$ are strong indications that the system is very massive, although the morphology of the X-ray emission suggests that the system might be undergoing a merger and thus the temperature-based mass measurement might have been somewhat overestimated. F11 confirmed 18 spectroscopic member galaxies (mostly early-type) and measured the velocity dispersion to be ${1230}_{-180}^{+270}$ ${\rm{km}}\,{{\rm{s}}}^{-1}$. This high velocity dispersion is in support of the cluster's extreme mass.

F11 derived the virial mass of SPT2106 M200c = $(12.7\pm 2.1)\times {10}^{14}{h}_{70}^{-1}\,{M}_{\odot }$, combining their SZ, X-ray, and velocity dispersion measurements. This mass is the highest among the entire z > 1 sample of the South Pole Telescope Sunyaev–Zel'dovich (SPT-SZ) survey (Reichardt et al. 2013). Follow-up studies with SZ (e.g., Williamson et al. 2011; Reichardt et al. 2013; Bleem et al. 2015) and X-ray (e.g., Amodeo et al. 2016; Bartalucci et al. 2017) observations consistently provided high masses for the system. Schrabback et al. (2018, hereafter S18) presented the first weak-lensing (WL) measurement of SPT2106 using the Advanced Camera for Surveys (ACS) in the Hubble Space Telescope (HST) data and obtained ${M}_{200c}={8.8}_{-4.6}^{+5.0}\,\times $ ${10}^{14}\,{M}_{\odot }$. The central value of this measurement is somewhat lower than the previous non-WL estimates (e.g., ∼30% smaller than the F11 value). However, because of the large (≳50%) error bars on the S18 WL result, the statistical significance of the difference is not strong.

The abundance of massive clusters at high redshift such as SPT2106 is a sensitive function of cosmological parameters. Hence, the presence of any exceptional cluster can create a non-negligible tension with the current ΛCDM paradigm. According to F11, the expected number of SPT2106-like clusters is estimated to be ∼1 (∼0.07) in the entire sky (parent 2500 deg2 survey). Currently, the largest source of error in the abundance estimation is the uncertainty in the mass estimate.

In this paper, we present a detailed WL analysis of SPT2106 using ACS and Wide Field Camera 3 (WFC3) data. Our work is different from S18 as follows. First, we measure WL signals from both ACS and WFC3 images, while S18 obtained galaxy shapes only from ACS. As will be discussed in Section 3, high-redshift galaxies are brighter in IR, which enables us to obtain more precise shapes and a higher (a factor of ∼9 in the current case) number density of sources. Second, our WL pipeline uses the "SFIT" method, whose shear calibration accuracy has been publicly validated in the most recent public data challenge, called the third GRavitational lEnsing Accuracy Testing (GREAT3; Mandelbaum et al. 2015). SFIT won eight GREAT3 branches, one of which is the analysis of simulated future space-based WL images containing various effects such as complex variation on point-spread function (PSF), multi-epoch dithering, PSF undersampling, real galaxy morphologies, etc. The S18 analysis is based on the method of Kaiser et al. (1995), which has been improved by a number of authors (e.g., Hoekstra et al. 1998; Erben et al. 2001; Schrabback et al. 2010). Given the importance and rarity of the target, it is useful to study SPT2106 with an independent pipeline. Third, we investigate WL substructures of SPT2106. Our source density is more than a factor of 9 higher than the one in S18. This increase is due to the difference in the WL pipeline, the source selection method, and the availability of the deep WFC3 imaging data. Our higher source density enables us to probe the substructure in greater detail.

This paper is organized as follows. We describe our HST data and their reduction in Section 2. In Section 3, we present our WL analysis procedure. Our mass reconstruction and virial mass estimates are shown in Section 4. We compare our WL mass estimates with other mass proxies and previous WL measurement, discuss possible sources of systematic uncertainties, and estimate the rarity of such a massive cluster in Section 5. We summarize our work and present conclusions in Section 6.

Throughout the paper, M200c (or M500c) corresponds to the mass enclosed within a radius, inside which the mean density equals 200 (500) times the critical density of the universe at the cluster redshift. We adopt the cosmology published in Planck Collaboration et al. (2016). For this cosmology the angular diameter distance to the cluster is ∼1741 Mpc, and thus the plate scale is ∼506 kpc arcmin−1 at the cluster redshift. We use the AB magnitude system corrected for the Milky Way foreground extinction and express all uncertainties at the 1σ confidence (∼68.3%) level.

2. Observations

In this study we use two HST observation programs for the galaxy cluster SPT2106. In PROP 12477 (Cycle 19, PI: W. High), the cluster was observed in 2011 October and 2012 December with the Wide Field Channel (WFC) of the ACS using the F606W and F814W filters. The F606W observation used a 2 by 2 mosaic pattern whereas the F814W filter observed the central region in a single pointing. The total integration times per pointing are 1920 s and 1916 s for F606W and F814W, respectively. PROP 13677 (Cycle 22, PI: S. Perlmutter) observed the cluster in a single pointing with the WFC3/UVIS F814W, WFC3/IR F105W, and WFC3/IR F140W from 2015 March to 2015 September with total integration times of 3491 s, 7181 s, and 8131 s, respectively. Hereafter, we refer to the region covered by ACS/F814W as the "central" region and the area outside this region as the "peripheral" region as illustrated in Figure 1. Note that WFC3/IR covers a slightly smaller region than the central region. Because object colors are available only in the central region, we use different schemes for source selection and redshift estimation for the two regions (Section 3.4).

Figure 1.

Figure 1. HST observation footprint of SPT2106. In inverted grayscale, we show the intensities of ACS/F606W, which observed the cluster in the 2 × 2 mosaic pattern and covered the ∼6 × 6 arcmin2 region. The blue box depicts the central (∼3 × 3 arcmin2) F814W pointing. Throughout the paper, we refer to the areas inside and outside this pointing as central and peripheral regions, respectively. The WFC3/IR pointings (black) lie within the central region.

Standard image High-resolution image

We start our data reduction using the FLC and FLT images provided by the STScI CALACS pipeline (Hack et al. 2003) for the ACS and WFC3/IR data, respectively. The FLC images have been corrected for charge transfer efficiency (CTE) degradation based on the algorithm of Ubeda & Anderson (2012). The CTE degradation is mainly due to radiation damage, and prior to correction it manifests itself as a long trail along the CCD parallel readout direction. Because the success of WL analysis depends on our ability to control systematics that affect galaxy shapes, it is crucial to verify that the CTE correction of the STScI pipeline is sufficiently accurate. In Jee et al. (2014), we quantified the accuracy by utilizing the shapes of cosmic rays and demonstrated that the residual error in the STScI pixel-based correction is sufficiently small, compared to the statistical noise in cluster lensing analysis, although it is shown that the imperfection of the method slightly overcompensates for the CTE degradation for very faint objects. Contrary to the ACS/WFC and WFC3/UVIS detectors, the WFC3/IR detector does not suffer from CTE degradation.

To align individual exposures, we select one of the ACS/F814W exposures as a reference image and estimate shifts for the rest of the exposures with respect to this reference image based on common astronomical sources. This explicit shift estimation is mandatory because the typical WCS information in HST image headers is inaccurate. We use the MultiDrizzle (Koekemoer et al. 2003) software in order to conduct cosmic-ray removal, sky subtraction, and image combination. We adopt the Lanczos3 drizzle kernel for ACS/WFC and WFC3/UVIS with the output pixel scale of 0farcs05 per pixel. Our previous tests have shown that for ACS/WFC this combination of drizzle parameters produces the sharpest PSF (Jee et al. 2007). Because the WFC3/IR PSF is severely undersampled, we cannot use the same Lanczos3 drizzling kernel to process the image. Instead, we use the Gaussian kernel while maintaining the same 0farcs05 pixel scale.

We detect objects with SExtractor (Bertin & Arnouts 1996) in dual-image mode (using one image for detection and the other for measurement). The detection image is created by weight-averaging all available filter images. We find sources by looking for at least 10 connected pixels above 1.5 times the sky rms. The total number of sources obtained is ∼9800 within the ∼6' × 6' region. We exclude spurious objects (e.g., diffraction spikes of bright stars, cosmic rays, star formation regions in large foreground galaxies, etc.) with visual inspection.

3. Analysis

We measure galaxy shapes in ACS/F606W and WFC3/F140W. As mentioned in Section 2, the F606W filter covers the 6' × 6' cluster field whereas the WFC3/F140W filter is used only for the observation of the central 2' × 2' region. We combine the two shape catalogs where available. We must adopt slightly different schemes for source selection and redshift estimation between the central and peripheral regions due to the difference in color availability. In Sections 3.1 and 3.2, we discuss our PSF modeling and shape measurement, respectively. Schemes for source selection and redshift estimation are described in Sections 3.3 and 3.4, respectively.

3.1. PSF Modeling

Modeling accurate PSFs is one of our supreme interests because a PSF dilutes lensing signals and induces artificial ellipticity correlations. The effect is more destructive for small, faint galaxies, which provide more efficient lensing signals than large, bright galaxies because they are on average at higher redshift. Thus, accurate PSF modeling is paramount when one studies high-redshift clusters.

In HST, it is well known that the PSF changes in time because of the so-called "focus breathing" effect and in position across the CCD chip because of the position-dependent optical aberration. Since there are only several stars available with high signal-to-noise ratio (S/N) within each pointing, it is impossible to obtain a high-quality PSF model from the image itself that represents the complicated spatial variation. However, thanks to the repeatability of the pattern in the PSF variation (Jee et al. 2007), one can obtain a high-fidelity PSF model from dense stellar fields and apply it to a WL image. This PSF library approach has been successfully applied to a number of clusters in our and other previous studies (e.g., Jee et al. 2005; Schrabback et al. 2010). In the current study, we continue to use this library-based method to model PSFs and refer readers to our previous papers (e.g., Jee & Tyson 2009) for details.

3.2. Shape Measurement

The distortion of the shape of a background source due to a foreground lens is expressed by the matrix below:

Equation (1)

where the convergence κ is the projected mass density in units of the critical surface density Σc, and the reduced shear g1(2) determines the amount of stretch along the x-axis (along the 45° direction). The critical surface density Σc is defined by

Equation (2)

where c is the speed of light and G is the gravitational constant. Dl, Ds, and Dls are the angular diameter distances to the lens, to the sources, and between the lens and the sources, respectively. The ratio of the last two distances β = Dls/Ds is often referred to as "lensing efficiency" and is estimated from the effective redshift of the source plane (Section 3.4).

Observationally, the reduced shear g must be inferred from a population of source galaxies because the amount of distortion in each galaxy image is very small compared to its intrinsic shape dispersion. Assuming that the intrinsic galaxy shape is random, we can estimate the reduced shear g by averaging ellipticities $g=\langle e\rangle $. We define the ellipticity of a galaxy image to be e = (ab)/(a + b), where a and b are the semimajor and minor axes, respectively. In practice, the relation $g=\langle e\rangle $ does not hold exactly for a number of reasons. One obvious reason is that there is no unique way to define the semimajor and minor axes of real galaxy images. Most galaxy images possess radially varying isophotes, which are also asymmetric and irregular. Therefore, one's definition of ellipticity must be only operational and we need to derive a correction factor that reconciles the difference between g and $\langle e\rangle $.

In this paper, we measure ellipticity by fitting a two-dimensional (2D) elliptical Gaussian to a galaxy image using the MPFIT package (Markwardt 2009). The elliptical Gaussian model is convolved with a model PSF computed for each object prior to fitting. Although formally the total number of free parameters is seven—two for the object centroid, two for the normalization and background level,8 two for the semimajor and minor axes, and one for the orientation angle—we fix the background level and centroid using the SExtractor output. The semimajor axis a and semiminor axis b and the orientation ϕ are converted to the galaxy ellipticities e1 and e2 as follows:

Equation (3)

Equation (4)

As mentioned above, averaging the ellipticity obtained in this way causes bias in our estimation of reduced shear g. The bias due to the difference between elliptical Gaussian and real galaxy profiles is referred to as "model bias," and the bias caused by nonlinear translation of pixel noise to ellipticity is termed "noise bias" (Melchior & Viola 2012; Refregier et al. 2012). In addition to "model bias" and "noise bias," there are many other sources of bias such as selection effect, blending, etc.

Instead of addressing individual sources of bias separately, we ran WL image simulations to determine the combined effect. Our WL image simulations are described in Jee & Tyson (2011) and Jee et al. (2013). Jee & Tyson (2011) explain in detail how we sampled galaxy images from the Hubble Ultra Deep Field (HUDF; Beckwith et al. 2006) data. Jee et al. (2013) describe how shear calibrations are derived by comparing input and output (recovered) shears. The original simulation tool was developed mainly for ground-based imaging data. For HST WL analysis, an important change must be made because the PSF size is significantly different between ground and space. Previously, when shearing galaxy images, we ignored the fact that the images had been convolved by the HST PSF, which is much smaller than those for Large Synoptic Survey Telescope and the Deep Lens Survey (Jee et al. 2013). However, this approximation is not valid in HST WL studies. Therefore, we first deconvolved the HUDF galaxy images and then applied shears. Finally, we reconvolved the shear image with the ACS PSF, following the method described in the GalSim package (Rowe et al. 2015). The multiplicative factor obtained from this simulation is 1.15 for ACS/F606W.

For WFC3/F140W images, there are some additional sources of systematics due to the detector characteristics such as undersampling, nonlinearity, interpixel capacitance, etc. Instead of including these features in our image simulation, for WFC3/F140W we chose to derive shear calibration utilizing ∼2700 common astronomical objects between ACS/WFC and WFC3/F140W in the archival data. We refer readers to Jee et al. (2017) for details. The resulting multiplicative factor for WFC3/F140W is 1.25. We combine the ACS/F606W and WFC3/F140W ellipticities if available by weight-averaging.

3.3. Source Selection

We select the background sources that satisfy both shape and photometric requirements described in Sections 3.3.1 and 3.3.2, respectively. We impose different magnitude and color cuts on ACS and WFC3 instruments. Based on these requirements a total of 3129 objects are classified as sources. The resulting source density (∼79 arcmin−2) is higher than the one in S18 by nearly a factor of 9. We discuss our evaluation of the lensing efficiency β in Section 3.4.

3.3.1. Shape Requirement

Our WL sources are selected based on the following shape criteria. First of all, ellipticities of sources (after PSF deconvolution) should have converging values. We fulfill this condition by selecting sources whose MPFIT STATUS parameter (Markwardt 2009) is unity. Second, objects should have their semiminor axis from our elliptical Gaussian fitting (after PSF correction) greater than 0.4 pixels, which efficiently excludes point sources and small galaxies whose shapes are uncertain and subject to residual PSF systematics. Third, errors on ellipticity should be less than 0.25. The objects that have ellipticity error larger than this tend to have bright neighbors, very low surface brightness, and/or very small FWHM values. Moreover, their noise bias is large, which makes their shear calibration highly uncertain. Lastly, half-light radii (before PSF deconvolution) of the objects should be larger than those of stars (Figure 2). This requirement also prevents us from including spurious features such as cosmic rays, hot pixels, etc. In spite of the above efforts, some spurious objects (e.g., diffraction spikes around bright stars, cosmic rays, fragmented parts of foreground galaxies, clipped sources at the field boundaries, etc.) still remain. We manually identify and remove these objects by visual inspection.

Figure 2.

Figure 2. Relation between magnitude and half-light radius. We show the relations obtained from the F606W and F140W data in the top and bottom panels, respectively. We identify stars using their tight size–magnitude relation (rh ∼ 0farcs07 and rh ∼ 0farcs13 for F606W and F140W, respectively). Also displayed are our sources (blue), confirmed cluster members from optical spectroscopy (red), and member candidates (orange). Our sources are selected from both shape and photometric requirements (see the text). The member candidates are selected assuming the presence of a 4000 Å break feature and the effect of this on the magnitudes.

Standard image High-resolution image

3.3.2. Photometric Requirement

Source selection with ACS photometry. The redshifted rest-frame 4000 Å break feature (∼8500 Å at the cluster redshift) at z = 1.132 is bracketed by F606W and F814W. The early-type members occupy a narrow locus in the color–magnitude diagram shown in the top panel of Figure 3. We select objects bluer and fainter than the red sequence (F606W − F814W < 1.0 and F606W > 25) as our background sources (blue dots). This enables us to remove only red cluster galaxies and thus the resulting catalog may include blue cluster members. In the peripheral region where only F606W is available, we select sources purely based on their magnitudes 25.0 < F606W < 28.0. Therefore, we need to correct for the contamination for both red and blue members.

Figure 3.

Figure 3. Color–magnitude relation for the objects in the central SPT2106 field. The top (bottom) panel shows the result for the central ∼3 × 3 arcmin2 (∼2 × 2 arcmin2) region seen in F606W − F814W vs. F606W (F105W − F140W vs. F140W). Sources (blue circles) are selected based on these color–magnitude relations (top: F606W − F814W < 1.0, 25.0 < F606W < 28.0; bottom: F105W − F140W < 0.5, 23.5 < F140W < 28.0) and the shape selection criteria (see the text). The spectroscopically confirmed cluster galaxies from Foley et al. (2011) are depicted in red. The confirmed members are well isolated in F606W − F814W color, which brackets the rest-frame 4000 Å break feature. The object at F606W ∼ 24 with F606W − F814W ∼ 0.7 is the late-type member confirmed by its [O ii] line (F11). Interestingly, the red members occupy the narrow locus also in the F105W − F140W color, although the two filters are not optimal in capturing their rest-frame 4000 Å break feature.

Standard image High-resolution image

In order to assess the cluster member contamination, we utilize two control fields: the HUDF (Rafelski et al. 2015) and the Great Observatories Origins Deep Survey (GOODS; Giavalisco et al. 2004). After applying the same selection criteria to the control fields, we compare the resulting magnitude distributions with those from our sources as shown in the top panel of Figure 4. If the contamination is significant in our source selection, it should appear as a significant number density excess beyond the sample variance.

Figure 4.

Figure 4. Magnitude distribution of source galaxies. Top: F606W magnitude comparison between SPT2106 and the three control fields: HUDF, GOODS-S, and GOODS-N. The light gray histogram shows the distribution in the peripheral region where only F606W is available (i.e., no color information). The dark gray histogram is obtained for the central region where we use the F606W − F814W color to remove the cluster red sequence. Error bars are computed using Poissonian statistics. Solid lines represent the results for the control field when we apply only the magnitude selection criterion whereas dashed lines show the results when we use both object magnitudes and colors. We interpret the consistency between the dark gray histogram and the dashed lines as indicating that the blue member contamination is not significant. Note that the large discrepancies at F606W > 26.5 result from the difference in depth. See the text for the description of our correction for the red galaxy contamination in the peripheral region. Bottom: comparison of the F140W magnitude distribution between SPT2106 and the two control fields: HUDF and GOODS-S. The source selection based on F105W − F140W provides a magnitude distribution that is consistent with the HUDF one.

Standard image High-resolution image

In the central region (blue box in Figure 1) where the F606W − F814W color is available, the magnitude distribution (dark gray) is consistent with those from the control fields in the F606W ≲ 26 regime. The small difference between GOODS-N and GOODS-S suggests that the sample variance might be much smaller than the Poissonian scatter, shown with error bars. For fainter sources, the densities in the control field greatly outnumber those in our cluster field because the images of the control field are deeper. These tendencies are also observed in our previous high-z cluster WL studies (e.g., Jee et al. 2011). Therefore, in the central region where both F606W and F814W are available, we decide to proceed with our WL study without any correction for blue cluster contamination.

In the peripheral region, we only apply the magnitude cut (25.0 < F606W < 28.0), and the resulting distribution is shown in light gray. Obviously, this magnitude-only selection includes red galaxies that would have been discarded if F814W had been available. The excess due to the inclusion of these red sources is clearly seen (dark gray versus light gray in the top panel of Figure 4). However, one should not incorrectly attribute the difference exclusively to the red cluster members because many non-cluster members can also have their color satisfying the condition F606W − F814W > 1.0. In Section 3.4, we find that in fact the second issue dominates the excess and the red cluster member contamination is not significant. This is not surprising because the fraction of the early-type galaxies is expected to be low in the faint magnitude regime and also decreases rapidly with clustocentric radius.

Source selection with WFC3 NIR photometry. Since all objects observed with WFC3 are located inside the central region and have color, we select the sources using both color and magnitude criteria: F105W − F140W < 0.5 and F140W > 23.5. Similarly to the ACS-based selection, these thresholds in color and magnitude are chosen to select sources bluer and fainter than the locus of the red sequence (bottom panel of Figure 3). Again, we check for the presence of a possible contamination by blue cluster members by comparing the resulting magnitude distributions with control field statistics. The bottom panel of Figure 4 indicates that the source number density is consistent with that from HUDF at F140W ≲ 26. At F140W ≳ 26, the source density becomes gradually lower because of the shallower depth.

3.4. Source Redshift Estimation

We determine the effective redshift and the width of the redshift distribution of our source population using the HUDF photo-z catalog (Rafelski et al. 2015). Given that the field size of HUDF is small (∼12.8 arcmin2), one might be concerned about the sample variance. We will show in Section 5.4.1 that the mass uncertainty due to the sample variance (up to ∼7% in mass) is much less than the statistical error. As mentioned in Section 3.3.2, we apply the same selection criteria to the HUDF galaxies.

Because of the absence of ACS/F814W in the HUDF, we perform a photometric transformation to estimate the F814W magnitude using the HUDF F606W and F775W photometry. The transformation to F814W from F606W and F775W depends on the spectral energy distribution (SED) and redshift. The HUDF catalog includes photometry (AB magnitudes of ACS and WFC3 instruments), photometric redshift, and best-fit spectral type. Utilizing the information on redshift and best-fit spectral type (model SED), we perform synthetic photometry (multiplying the filter throughput to the redshifted SED) to obtain a mathematical relation among F814W, F606W, and F775W.

We compute the lensing efficiency β as follows:

Equation (5)

Note that for galaxies with redshift less than the cluster redshift, we assign zero because they are not lensed. The upper panel of Figure 5 shows β as a function of the F606W magnitude. As observed, β should increase with magnitude because sources at higher redshifts are fainter on average. The blue filled circles represent the cases where we have color measurements (thus the red sequence is removed). In the peripheral region, similar β values are observed when only magnitudes are used for selection (red circles). When computing the representative value of $\langle \beta \rangle $ for the entire source population, we apply proper weights to different magnitude bins because of the difference in depth between the HUDF and our cluster field using the following equation:

Equation (6)

where βi is the average lensing efficiency at the ith magnitude bin in Figure 5 and wi is the weight needed to take into account the difference in depth, which is the ratio of the number density of our sources to that of the HUDF galaxies selected with the same criteria.

Figure 5.

Figure 5. Lensing efficiency (β) estimated for source population as a function of magnitude using HUDF as a control field for ACS (top) and WFC3 (bottom). The lensing efficiency increases with magnitude, which is in accordance with our expectation that fainter galaxies are at higher redshifts. In the central region, we use both object colors and magnitudes to select sources (blue filled circles) whereas in the peripheral region (open red circles) we only apply magnitude cuts. We apply the same criteria to objects in HUDF. See the text for the description of how we use this color–magnitude–redshift or magnitude–redshift relation to derive the source redshift distribution after taking into account various effects such as depth difference, red galaxy contamination, etc.

Standard image High-resolution image

In the peripheral region where a color is not available, we apply an additional correction for inclusion of red galaxies as follows. This magnitude-based selection contains red background, red foreground, and red cluster galaxies as well as blue sources that would have been selected as the source population if the color had been available. These blue sources consist of blue foreground/background galaxies and blue cluster members. Since the red foreground and red cluster galaxies are not lensed, the lensing efficiency at each magnitude bin in the peripheral region is computed as below:

Equation (7)

where ${\beta }_{{bs},i}$ (${n}_{{bs},i}$) and ${\beta }_{{rb},i}$ (${n}_{{rb},i}$) are the average lensing efficiencies (number densities) at the ith magnitude bin for blue sources and red background galaxies, respectively. ${n}_{p,i}$ is the total number density within each magnitude bin in the peripheral region. ${\beta }_{{bs},i}$ is the same as the ${\beta }_{i}$ value measured from the central region. ${n}_{{bs},i}/{n}_{p,i}$ corresponds to the ratio of the number density in the central region (dark gray) to that in the peripheral region (light gray) of the cluster field in Figure 4. The number density of red background galaxies in the peripheral region (${n}_{{rb},i}$ in the second term of Equation (7)) is not directly observable. Nevertheless, we can approximate the ratio ${n}_{{rb},i}/{n}_{p,i}$ using the HUDF photo-z catalog. Since the catalog does not contain red cluster members, in principle this approximation leads to overestimation. However, the fraction of red cluster members is estimated to be small (<5%) in the peripheral region. Thus, the overestimation is negligibly small (∼4.7%).

Using the above methods, we obtain $\langle \beta \rangle =0.201$ for the central region, corresponding to an effective source-plane redshift zeff = 1.591, which is translated to the critical surface density ${{\rm{\Sigma }}}_{c}\simeq 4752\,{M}_{\odot }\,{{\rm{pc}}}^{-2}$. For the peripheral region (magnitude selection only), $\langle \beta \rangle $ is 0.139, corresponding to zeff = 1.413 (${{\rm{\Sigma }}}_{c}\simeq 6865\,{M}_{\odot }\,{{\rm{pc}}}^{-2}$). If we had ignored the correction due to red source inclusion, we would have obtained a ∼40% higher value of $\langle \beta \rangle =0.196$.

Although this assumption of a single source plane is convenient, ignoring the width of the distribution creates non-negligible bias in our cluster mass estimation because the lensing efficiency is a nonlinear function of source redshift. We obtain the width of the redshift distribution to be $\left\langle {\beta }^{2}\right\rangle =0.070$ and 0.048 for the central and peripheral regions, respectively. Seitz & Schneider (1997) derived the first-order correction to be

Equation (8)

where g' and g are the observed and true shears, respectively.

For the sources detected in F140W, we follow the same procedure using the HUDF photo-z catalog. Similarly to the case for F606W, the mean β value increases with magnitude as shown in the lower panel of Figure 5. We obtain $\langle \beta \rangle =0.161$, $\left\langle {\beta }^{2}\right\rangle =0.057$, and zeff = 1.470 for the F140W sources.

4. Results

4.1. Mass Reconstruction

We reconstruct the mass distribution of the cluster by averaging ellipticities of the background sources and then convolving the ellipticity map using the relation below:

Equation (9)

where ${D}^{* }({\boldsymbol{x}})$ is the complex conjugate of the convolution kernel defined as $D({\boldsymbol{x}})=-1/{\left({x}_{1}-{{ix}}_{2}\right)}^{2}$ and $\gamma ({\boldsymbol{x}})$ is the mean ellipticity (shear) distribution of the background sources. For our mass reconstruction, we use the FIATMAP code (Fischer & Tyson 1997), which implements the convolution in real space. We verify that this FIATMAP result is highly consistent with the one derived from our implementation of the algorithm of Kaiser & Squires (1993) in Fourier space. Since the reconstruction is a noisy process, it is important to assess the significance of the observed features. For this assessment, we generate an uncertainty (rms) map of the mass reconstruction by performing 1000 bootstrap resamples with randomly (while allowing redundancy) selected source galaxies.

Figure 6 shows our mass reconstruction. We create overlays with the color composite, X-ray emission, F814W luminosity, and number density distributions. The luminosity and number density maps are made by selecting cluster member candidates that meet the following criteria: 24 < F606W < 26 and 1.4 < F606W − F814W < 2.2 (see also Figure 3); the brightest cluster galaxy (BCG; Song et al. 2012) at (R.A., decl.) = (21:06:04.66, −58:44:28.32) is manually included (not initially selected by the criteria because of its extreme luminosity, F606W ∼ 22).

Figure 6.

Figure 6. Two-dimensional mass reconstruction of SPT2106 using the FIATMAP code (Fischer & Tyson 1997). All figures show the central 3' × 3' region. The symbols represent different centroids that we use to investigate the impact of this choice on mass measurement (see the text). (a) Mass contours overlaid on the color composite. This color composite is created with HST ACS/WFC F606W (blue), ACS/WFC F814W (green), and WFC3/IR F105W (red). The outermost contour corresponds to 2.5σ significance and the significance levels increase inward by 0.5σ. The contour label is convergence κ. Since we do not break the mass-sheet degeneracy, the scale is somewhat arbitrary. The mass distribution consists of two mass clumps: "main" and "western" extension. The main clump is also resolved into the northwestern and southeastern subclumps. The southeastern subclump has a slightly larger significance than the northwestern one. (b) Same as (a) except that the mass contours are overlaid on the Chandra X-ray image processed by the CIAO package with removal of point sources. The X-ray emission is highly elongated and the peak coincides with the northwestern subclump. (c) Same as (b) except that the mass contours are overlaid on the F814W luminosity density of the cluster galaxy candidate. The smoothing scale of the luminosity density map is FWHM ∼ 37''. (d) Same as (a) except that the mass contours are overlaid on the number density map (smoothed with the same kernel applied to the luminosity density map).

Standard image High-resolution image

The X-ray emission map is obtained from the Chandra archive.9 The observation was performed in 2010 and 2011 (ObsID of 12180 and 12189, respectively) with VFAINT mode using the ACIS-I instrument and a total exposure time of ∼73 ks. The Chandra X-ray data reduction is conducted using the Chandra Interactive Analysis of Observations (CIAO; Fruscione et al. 2006) ver. 4.11. After combining and reprojecting to the same tangent plane using the merge_obs tool, we create an X-ray emission map within the energy range [0.5–7.0] keV and obtain an exposure-corrected map. We use the wavdetect and aconvolve tools to remove point sources and to adaptively smooth the map, respectively. The morphological characteristics in the WL mass reconstruction, the cluster galaxies, and the X-ray emission are discussed below.

4.1.1. Comparison with Cluster Galaxies

Our high-resolution mass reconstruction enables a detailed comparison with the cluster galaxy distributions. Both luminosity and number density maps show that the galaxy distribution is not symmetric and can be described as consisting of the main clump and the western extension. Our WL mass map clearly detects these two structures. As observed in both luminosity and number density maps, the mass clump corresponding to the western galaxy concentration is also much weaker than the main mass clump. Our bootstrapping analysis shows that the significance of the western mass clump is $\sim 3.4\sigma $.

In Figure 6, the distribution of the western galaxies (both luminosity and number density) is offset from that of the mass contours by ∼30'' (distance between the two centroids). We investigate the centroid uncertainties of mass and galaxies in the western substructure by using bootstrapping analysis. The centroid is determined by 2D circular Gaussian fitting. Among the parameters that we described in Section 3.2, only centroid (x, y) is set free; we set zero for background, the highest pixel value for amplitude, and the effective smoothing scales for the semimajor and minor axes. The bottom panel of Figure 7 shows that the 1σ contours for mass and galaxy touch each other.

Figure 7.

Figure 7. WL mass centroid significance test for the main (top) and western (bottom) clumps. The blue contours represent the centroid distributions for the main (top) and western (bottom) mass clumps in our 1000 bootstrapping runs. The peaks in X-rays, SZ, galaxy luminosity, and galaxy number density are located near the 1σ centroid contour of the main clump. We also display the centroid distributions of the number density for both the main and western clumps with gray contours from our bootstrapping resamples. The number density centroids are statistically consistent with WL mass centroids.

Standard image High-resolution image

The mass map further resolves two substructures within the main mass clump. Our bootstrapping analysis shows that the northwestern (southeastern) peak is detected with a significance of ∼5.7σ (∼6σ). The orientation of the vector connecting the two substructures is similar to the direction (northwest to southeast) of the elongation seen in the galaxy distributions. The distance between the two substructure centroids is ∼150 kpc (∼20''). The effective smoothing scale of our mass reconstruction is FWHM ∼ 17''. If we increase the effective smoothing scale above ∼20'', the two peaks merge into a single peak elongated in the northwest–southeast direction.

Interestingly, the centroid of the northwestern substructure lies near the luminosity peak whereas that of the southeastern substructure is closer to the peak in number density. The difference between the luminosity and number density distributions arises from the BCG being much more luminous than the other member candidates by at least ∼2 mag, although more galaxies are concentrated near the southeastern WL peak. Figure 7 shows our significance test of the mass peak centroids based on bootstrapping. An important caveat to remember in comparison with cluster galaxies is that our galaxy map is based on photometric selection targeted for the red sequence and subject to non-member contamination and omission of blue members. Currently, the number of spectroscopically confirmed members is 18 (F11).

4.1.2. Comparison with X-Ray Emission

The asymmetric X-ray surface brightness distribution of SPT2106 was considered as an indication of merger activity in F11. Comparison of the X-ray image with our mass reconstruction and galaxy distributions also supports this merger hypothesis. As seen in Figure 6(b), the Chandra image shows elongated X-ray emission whose peak location is consistent with the centroid of the northwestern mass substructure. No strong X-ray emission is found in the other substructure. In some cluster mergers with small impact parameters, a bright cool core is often found near the less massive system while there is relatively very weak X-ray emission around the more massive (thus hot) component (e.g., Jee et al. 2014; Golovich et al. 2017; van Weeren et al. 2017). Also, the elongation of the X-ray emission is consistent with the orientation of the vector connecting the two substructures seen in both mass and galaxy distributions. Thus, the absence of any significant X-ray peak near the southeastern substructure may indicate that the two substructures might have passed through each other.

4.2. Mass Estimation

We derive the cluster mass by assuming that the radial shear profile is approximated by the Navarro–Frenk–White (NFW, Navarro et al. 1997) model. The NFW profile describes the average halo distribution in numerical simulations of dark matter and has been a popular choice in many WL mass studies. However, if our cluster deviates substantially from the profile, this will give rise to a model bias. We will discuss the impact of this model bias and other miscellaneous sources on our mass uncertainty in Section 5.4. Although an on-going merger activity is indicated from our WL mass map, cluster member candidates, and X-ray emission, we proceed with a single-halo approximation because the two substructures in the main clump are too close to be modeled as separate halos (∼150 kpc). We will also deal with its impact on the mass systematics in Section 5.4.

Since the two free parameters (i.e., concentration c and scale radius rs) of the NFW profile are highly correlated with each other, it is difficult to determine them simultaneously with noisy data. Therefore, many authors perform one-parameter fitting assuming that the two parameters follow a tight relation. Typically, the relation is expressed in terms of the evolution of the correlation between cluster mass and concentration with redshift. Since this so-called mass–concentration (Mc) relation is only the average behavior and individual clusters show large scatter around the mean relation, using the relation is an additional source of systematics in cluster mass estimation. In this paper, although we also present results from this traditional method to enable comparison with previous studies, our main result is obtained without assuming any particular Mc relation. This requires us to perform a Markov Chain Monte Carlo (MCMC) analysis with the two parameters—scale radius rs and concentration c—and compute a marginalized cluster mass (we refer to this two-parameter fitting result as 2PMD hereafter). The resulting statistical uncertainty is larger than when one uses the Mc relation (hereafter 1PMD).

Typical WL studies first construct a one-dimensional (1D) radial tangential shear profile and then fit a model to it. This makes the result sensitive to the choice of binning. To avoid this binning effect, we fit a 2D model to individual galaxy shapes without applying any binning scheme. Our log-likelihood function ${ \mathcal L }$ is given as (Kim et al. 2019)

Equation (10)

where gms (gos) is the sth component of the predicted (observed) reduced shear for the source redshift z at the ith galaxy position (xi, yi) as a function of the cluster mass MSPT2106. The ellipticity dispersion (shape noise) is σSN = 0.25 whereas σe is the ellipticity measurement noise of each object. In addition to removal of the binning effect, this method allows us to assign individual redshifts to source galaxies. This merit is important in the current study because we combine source catalogs from different selection schemes, which thus produce three different effective redshifts.

For each source, the tangential shear is obtained as below:

Equation (11)

where e1(2) is the ellipticity component and θ is the position angle of the source with respect to the reference cluster center. Obviously, the exact shape of the profile depends on the choice of the reference cluster center. We use the following four reference centers: the global peak location of our WL mass (the southeastern subclump), the number density center of the early-type member candidates, the X-ray emission peak (Bartalucci et al. 2017), and the centroid of the SZ decrement (Bleem et al. 2015). The coordinates of these positions are listed in Table 1 and indicated with various symbols in Figure 6.

Table 1.  Impacts of the Choice of Centroid on the Mass Estimation in Our MCMC Analysis

Centroids R.A. Decl. Concentration R200c M200c R500c M500c
  (J2000) (J2000)   (Mpc) (${10}^{14}\,{M}_{\odot }$) (Mpc) (${10}^{14}\,{M}_{\odot }$)
WL massa 21:06:06.44 −58:44:46.36 ${1.88}_{-0.27}^{+0.65}$ ${1.4}_{-0.1}^{+0.1}$ ${10.4}_{-3.0}^{+3.3}$ ${0.8}_{-0.1}^{+0.1}$ ${6.0}_{-1.7}^{+1.9}$
Galaxy Numberb 21:06:05.56 −58:44:53.16 ${2.02}_{-0.30}^{+0.86}$ ${1.5}_{-0.1}^{+0.1}$ ${12.1}_{-3.3}^{+3.5}$ ${0.9}_{-0.1}^{+0.1}$ ${7.1}_{-1.8}^{+1.9}$
X-rayc 21:06:05.28 −58:44:31.70 ${1.82}_{-0.25}^{+0.57}$ ${1.4}_{-0.2}^{+0.1}$ ${10.1}_{-3.0}^{+3.2}$ ${0.8}_{-0.1}^{+0.1}$ ${5.7}_{-1.7}^{+1.9}$
SZd 21:06:04.94 −58:44:42.36 ${1.95}_{-0.27}^{+0.67}$ ${1.4}_{-0.1}^{+0.1}$ ${11.7}_{-3.2}^{+3.3}$ ${0.9}_{-0.1}^{+0.1}$ ${6.8}_{-1.7}^{+1.9}$

Notes.

aWL mass center (the southeastern peak). bPeak in galaxy number density. cX-ray emission peak (Bartalucci et al. 2017). dSZ centroid (Bleem et al. 2015).

Download table as:  ASCIITypeset image

We limit the fitting range to 60'' < r < 240''. The lower limit is needed because of a number of issues (e.g., Jee et al. 2011). Two of the most important are (1) that the cluster member contamination may be most significant very near the cluster center and (2) that the location of the true cluster center is unknown. Outside the upper limit (r > 240''), the annulus cannot complete a circle due to the limits of our imaging coverage.

4.2.1. Two-parameter MCMC Analysis

We use the open-source Python package emcee (Foreman-Mackey et al. 2013) when performing our MCMC analysis. We use flat priors for both scale radius rs and concentration c. The prior interval of c is set to 0 < c < 10. We verify that the upper limit is sufficiently high and most of our MCMC samples stay below this upper limit. The lower limit is set to zero because the concentration in the NFW halo, by definition, should have a positive value. We limit the range of the scale radius to 10'' < rs < 110''. Together with the interval of the concentration parameter, this rs range results in cluster masses bound within ∼1011 M to ∼1017 M.

Figure 8 shows the posterior distributions from our 500,000 MCMC samples in M200c and c200c for the four choices of the cluster center. All four results are consistent and show that SPT2106 is indeed an extremely massive system. For example, when we select our WL mass peak as the cluster center, we obtain ${M}_{200c}={10.4}_{-3.0}^{+3.3}\,\times $ ${10}^{14}\,{M}_{\odot }$. The marginalized masses are summarized in Table 1.

Figure 8.

Figure 8. Posterior distribution from 500,000 MCMC samples of our two-parameter mass determination. We consider the four centroids: WL mass, number distribution of the cluster member candidates, X-ray, and SZ peak. The best-fit values and marginalized uncertainties of concentration c200c and mass M200c are indicated by black crosses. We also compare the results obtained from three mass–concentration relations using one-parameter fitting (1PMD, colored circles).

Standard image High-resolution image

4.2.2. One-parameter Tangential Shear Fitting

Although we claim that our two-parameter result presented in Section 4.2.1 is the least biased mass, here we present our mass estimates based on the one-parameter fitting method to enable fair comparisons with previous studies. We consider the three Mc relations from Duffy et al. (2008, hereafter D08), Dutton & Macciò (2014, hereafter DM14), and Diemer & Kravtsov (2015, hereafter DK15). In Figure 9 we display the radial tangential shear profile for the four different centers. The resulting masses are summarized in Table 2 for the three Mc relations and also for the singular isothermal sphere (SIS) model. Our results show that the cluster mass is insensitive to the choice among the four centers considered in this paper whereas the profile assumption is an important systematic factor. We obtain the highest masses for the D08 relation and the lowest for the DK15 relation, although their error bars marginally overlap.

Figure 9.

Figure 9. Reduced tangential shear profiles centered at four different centroids. Filled circles represent the tangential shear (gT) while open diamonds show the cross shear (gX, the 45° rotation of the source images). The vertical dotted lines are the cutoff radii. We show the best-fit models using the singular isothermal sphere (SIS, gray) and NFW halos with three different mass–concentration relations; we use blue, purple, and green for the D08, DM14, and DK15 relations, respectively.

Standard image High-resolution image

Table 2.  Mass Estimates Based on One-parameter 2D Fitting for Various Choices of Centroids and Mc Relations

  2D SIS 2D NFW
Centroids ${\sigma }_{v}$ ${M}_{200c,\mathrm{SIS}}$ ${M}_{200c,{\rm{D}}08}$ ${M}_{200c,\mathrm{DM}14}$ ${M}_{200c,\mathrm{DK}15}$
  (${\rm{km}}\,{{\rm{s}}}^{-1}$) (${10}^{14}\,{M}_{\odot }$) (${10}^{14}\,{M}_{\odot }$) (${10}^{14}\,{M}_{\odot }$) (${10}^{14}\,{M}_{\odot }$)
WL mass 1152 ± 104 ${7.7}_{-1.9}^{+2.3}$ ${10.6}_{-2.6}^{+3.5}$ ${8.8}_{-2.1}^{+2.8}$ ${7.3}_{-1.9}^{+1.9}$
Galaxy Number 1227 ± 93 ${9.3}_{-2.0}^{+2.3}$ ${12.5}_{-2.7}^{+3.6}$ ${10.5}_{-2.2}^{+2.9}$ ${8.7}_{-1.9}^{+1.9}$
X-ray 1134 ± 105 ${7.4}_{-1.9}^{+2.3}$ ${10.2}_{-2.5}^{+3.4}$ ${8.4}_{-2.0}^{+2.8}$ ${7.0}_{-1.9}^{+1.9}$
SZ 1202 ± 95 ${8.8}_{-1.9}^{+2.3}$ ${11.9}_{-2.7}^{+3.5}$ ${9.9}_{-2.2}^{+2.8}$ ${8.2}_{-1.9}^{+1.9}$

Download table as:  ASCIITypeset image

5. Discussion

5.1. Comparison with Other Non-WL Studies

Since its discovery from the initial SPT survey, there have been several efforts to improve the mass measurement of SPT2106. The mass constrained by SZ data (e.g., Williamson et al. 2011; Reichardt et al. 2013; Bleem et al. 2015) ranges from M200c = 9.0 to 10.6 × ${10}^{14}\,{M}_{\odot }$ (M500c = 5.4 to 8.4 × ${10}^{14}\,{M}_{\odot }$). The broad range of these mass estimates stems from different selections of SZ data (single band versus multiple bands) and/or different scaling relations. Studies with X-ray scaling relations (F11; Ruel et al. 2014; Bartalucci et al. 2017) suggest somewhat higher masses (M200c ∼ 18 × ${10}^{14}\,{M}_{\odot }$), although the estimate by Amodeo et al. (2016) based on the hot-gas temperature profile using the NFW model gives a result nearly a factor of two smaller. Based on the velocity dispersion, F11 measure the mass of the cluster to be ${M}_{200c}={1.4}_{-0.8}^{+1.7}\,\times \,{10}^{15}\,{M}_{\odot }$. Although this mass is consistent with our WL estimate, we believe that both the small number (18 members) and the possible merger activity described in Section 4.1 make it difficult to infer the cluster mass based on the velocity dispersion alone.

In Figure 10 we present a summary of these previous results (for both M200c and M500c) in chronological order. Also displayed is the previous WL measurement by S18, whose central value is somewhat lower than our value, although the statistical error bars overlap. Detailed discussions on the comparison of the WL results are presented in Section 5.2. We find that our WL mass is consistent with the previous results.

Figure 10.

Figure 10. Mass comparison (top: M200c, bottom: M500c) of SPT2106 measured with various proxies. We choose the two-parameter MCMC result measured at our WL mass peak to represent our work. Our WL mass estimate is consistent with previous measurements.

Standard image High-resolution image

5.2. Comparison with the S18 WL Result

SPT2106 was included in the sample of the 13 high-redshift clusters studied by S18 with WL. Their mass of SPT2106 is ${8.8}_{-4.6}^{+5.0}\,\times $ ${10}^{14}\,{M}_{\odot }$, whose central value is ∼85% of our result. Also, their mass uncertainty is ∼50% larger than ours. Although their error bars overlap with ours (our central value corresponds to the 1σ upper limit of the S18 result), the difference deserves scrutiny because S18 and our study share the same HST/ACS imaging data. Therefore, in this section we provide detailed comparisons and attempt to trace the origin of the discrepancies. We focus on the following four aspects that can sensitively affect the measurement of cluster mass and its uncertainty: (1) shear calibration, (2) source redshift estimation, (3) Mc relation, and (4) source selection.

5.2.1. Shear Calibration

S18 use a moment-based approach to shape measurement while our shape measurement is based on a model-fitting method. The moment-based approach was pioneered by Kaiser et al. (1995, hereafter KSB) and has been modified by a number of authors (e.g., Hoekstra et al. 1998; Erben et al. 2001; Schrabback et al. 2007). The specific KSB branch used by S18 is described in detail by Schrabback et al. (2010), where the authors apply the method to measure cosmic shears. One important difference made in S18 from the version of Schrabback et al. (2010) is the use of the pixel-based correction for CTE degradation of Massey et al. (2014). In Schrabback et al. (2010), the correction is applied at the catalog level. The pixel-based correction for CTE degradation is similar to our method, although our WL pipeline employs the STScI pipeline correction, which implements the algorithm of Ubeda & Anderson (2012). The fidelity test of this STScI correction in the context of WL is presented in Jee et al. (2014).

Since the galaxy shape catalog of S18 is not public, it is impossible to compare shape measurements for individual galaxies. Nevertheless, it is possible to compare aggregate statistics when we closely mimic the S18 procedure for tangential shear measurement. We match their source selection, source redshift estimation ($\langle \beta \rangle =0.282$), shape measurement filter (ACS/F606W), applied cosmology (ΩM = 0.3, ΩΛ = 0.7, and h = 0.7), reference point (X-ray peak from Chiu et al. 2016), and Mc relation (Diemer & Kravtsov 2015) between the two analyses. We read off the S18 tangential shears from Figure G8 of their paper. Because we compare the shears only in the region where both F606W and F814W colors are available, we choose the maximum distance to be 100''. Although S18 exclude the shear at θ < 60'' in their mass estimation, we use all shear profiles at θ < 100'' to increase the statistical significance. This is justified because we are only interested in comparing the amplitude of the shears (shear calibration) between the two studies. The resulting mass difference becomes ∼10% (our mass is lower). This discrepancy is ∼16% of the statistical error.

5.2.2. Source Redshift Estimation

S18 estimate the lensing efficiency $\langle \beta \rangle $ using the photometric redshift catalog from the Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS, Skelton et al. 2014). They apply the same source selection criteria to the CANDELS photo-z catalog and obtain $\langle \beta \rangle =0.282$ for their source population. When we apply the S18 color and magnitude selection to the CANDELS photo-z catalog, we obtain $\langle \beta \rangle =0.260$, which would increase the cluster mass by ∼8%. We believe that the discrepancy of $\langle \beta \rangle \sim 0.02$ is due to the difference in the shape criteria. A larger difference ($\langle \beta \rangle =0.323$) is found when we replace the CANDELS catalog with the HUDF one. This higher $\langle \beta \rangle $ value would lower the cluster mass by ∼15%. The difference in $\langle \beta \rangle $ between CANDELS and HUDF may be attributed to the systematics in the photo-z catalogs or the sample variance. Note that since we use the HUDF catalog for our analysis and since our final mass (central value) is higher than the S18 one by ∼18%, the mass shift due to the increase in $\langle \beta \rangle $ is in the opposite direction.

5.2.3. Mc Relation

S18 perform their 1D tangential shear fitting utilizing only the DK15 Mc relation, whereas we present results using three Mc relations and also the one without the Mc relation. As shown in Section 4.2.2 and Table 2, our mass estimates become lowest when the DK15 relation is used. The D08 and DM14 relations give mass estimates larger than the DK15 relation by ∼21% and ∼10%, respectively. We confirm the same trend also with the S18 tangential shears (read off from Figure G8 of their paper). S18 reports ${8.8}_{-4.6}^{+5.0}\,\times $ ${10}^{14}\,{M}_{\odot }$ using the DK15 relation and X-ray center. For the same choices, our mass estimate is (7.0 ± 1.9) × ${10}^{14}\,{M}_{\odot }$. Our central value is lower than the S18 value by ∼20%. Since in Sections 5.2.1 and 5.2.2 we show that the differences in shear calibration and $\langle \beta \rangle $ estimation make our mass lower than the S18 one by ∼10% and ∼15%, respectively, it is plausible that this ∼20% difference in mass might be attributed to the combined effect of the two.

As mentioned in Section 4.2.2, we regard the result obtained without any Mc relation as the main mass estimate because we believe that it is the most unbiased value. When we use the X-ray center, the central value of this result (${10.1}_{-3.0}^{+3.2}\times {10}^{14}\,{M}_{\odot }$) is ∼45% higher than the one obtained from the DK15 relation ((7.0 ± 1.9) × ${10}^{14}\,{M}_{\odot }$). Consequently, the difference in the Mc relation is the leading cause of the discrepancy between our results and those of S18.

5.2.4. Source Selection and S/N

The two WL studies use different source selection (especially color cut) criteria; S18 select bluer (F606W − F814W < 0.2) and brighter (24.0 < F606W < 26.5) sources while we include redder (F606W − F814W < 1.0) and fainter (25.0 < F606W < 28.0) objects. S18 claim that their color cut maximizes the background galaxy fraction and that the expected (non-background) contamination rate is ∼2.4%.

When the color–redshift relation is tight and the photometric scatter is negligible, we agree with S18 that the S18 color cut is optimal for maximizing the background fraction. As shown in the top panel of Figure 11, these conditions are satisfied for the HUDF galaxies. Most sources bluer than F606W − F814W = 0.2 are above the cluster redshift z = 1.132 (dark gray). However, this color–redshift relation is substantially diluted if the field becomes shallower. Even for the GOODS-S field, which is deeper than our cluster field, the scatter is severe (light gray).

Figure 11.

Figure 11. Source selection color criteria vs. S/N in WL mass measurement. Top: color–redshift relation of the GOODS-S and HUDF galaxies within the magnitude range 25.0 < F606W < 28.0. We use the published photo-z catalogs. The red solid line represents our color threshold (F606W − F814W < 1.0) while the blue solid line shows the S18 threshold (F606W − F814W < 0.2). Three blue dashed lines indicate our experimental color thresholds, whose impacts are shown in the middle and bottom panels. The vertical red dashed line is the redshift of SPT2106 (z = 1.132). Note that the S18 threshold is motivated by the tight color–redshift relation in HUDF. Apparently, the relation in GOODS-S is significantly diluted. Middle: fraction of the "border" objects (see the text for the definition) as a function of color threshold. To avoid clutters, arbitrary offsets are added to the horizontal axis. The fraction of "border" sources increases as the F606W − F814W color threshold decreases. This effect is more significant if the field is shallower. We also display the results when we apply the magnitude cut (24.0 < F606W < 26.5) of S18 (open squares). Bottom: S/N in WL mass as a function of color threshold. The result shows the combined impact of purity, shot noise, and $\langle \beta \rangle $ uncertainty due to the sample variance (see the text). We estimate the errors in S/N computation assuming Poissonian statistics. Although lowering the F606W − F814W color threshold may increase the purity of background sources, the increase in shot noise and $\langle \beta \rangle $ uncertainty causes the total S/N to decrease.

Standard image High-resolution image

Given the large increase in scatter for the color–redshift relation, we do not think that the F606W − F814W < 0.2 cut is still optimal for the cluster field. To investigate the issue quantitatively, we define "border" objects as sources whose 1σ photometric error bars touch the color thresholds. A "border" source can be included or excluded depending on the direction of the scatter. The fraction of "border" objects increases as the image depth becomes shallower because of the increase in photometric error. The middle panel of Figure 11 shows this trend for several choices of the F606W − F814W color constraint. In the case of the bluest selection (F606W − F814W < 0.2), the border fraction is nearly ∼50% while the fraction is negligible for our case (F606W − F814W < 1). Consequently, we believe that the purity (background fraction) is nonnegligibly compromised by the increased "border" fraction when one uses a very tight color–redshift relation only observed in extremely deep fields, such as HUDF.

Apart from the decrease in the border fraction mentioned above, our selection has two additional advantages: a reduction in shot noise and improved accuracy in $\langle \beta \rangle $ estimation. Our source density is more than a factor of 9 higher than the S18 one, which results in a decrease in shot noise by more than a factor of 3. Also, the increase in source galaxy number leads to a significant reduction in the uncertainty of the $\langle \beta \rangle $ value because of the decrease in sample variance between the cluster and control fields.

Therefore, including red galaxies (F606W − F814W < 1) in our source selection enables us to increase the overall S/N in cluster mass estimation compared to the case where only blue sources are selected (F606W − F814W < 0.2). To illustrate this point, we display the S/N value of the mass estimation as a function of the color cut in the bottom panel of Figure 11. Although the purity of the background galaxies (i.e., numerator in S/N) decreases as the selection includes redder galaxies, the overall S/N value increases because of the reduction in the denominator, which comprises the shot noise and the $\langle \beta \rangle $ uncertainty.

5.3. Rarity

SPT2106 has been regarded as a rare system within the current ΛCDM hierarchical structure formation paradigm (e.g., F11; Holz & Perlmutter 2012). Here we revisit the issue using our WL mass. Previous rarity analyses of this system have been based on non-WL results.

The expected abundance of a cluster of mass M and redshift z over the sky coverage fraction fsky can be evaluated using the following integral (Hotchkiss 2011):

Equation (12)

where dV/dz is the comoving volume element and dn/dM is the halo mass function. In order to compute the mass function, we use the open-source Python package called HMFcalc (Murray et al. 2013). We adopt the fitting function of Tinker et al. (2008) and the cosmology of Planck Collaboration et al. (2016). The redshift of the cluster z = 1.132 is chosen to be the minimum redshift and M200c is used for the abundance estimation.

We evaluate the expected number of SPT2106-like clusters and their corresponding discovery probabilities assuming Poissonian statistics in the parent survey volume (SPT-SZ survey, 2500 deg2). For the threshold mass Mmin, we use the 1σ lower limit of our result ${M}_{200c}={10.4}_{-3.0}^{+3.3}\,\times $ ${10}^{14}\,{M}_{\odot }$. With these conditions, we estimate the expected abundance to be ∼1.3, which yields the discovery probability ∼72% for the parent survey volume.

Mortonson et al. (2011) argue that Eddington bias (Eddington 1913) is an important factor affecting a discovery probability when the mass function is steep, which makes up-scattering events more frequent than down-scattering events. Following the prescription of Mortonson et al. (2011), we obtain an Eddington bias-corrected abundance of ∼2.8 in the parent 2500 deg2 survey area.

Our rarity analysis shows that SPT2106 is certainly rare, but should not be regarded as an outlier within the current cosmology. In some previous studies (e.g., F11; Holz & Perlmutter 2012), the existence of SPT2106 was viewed as an extremely rare event (∼7% within the SPT survey volume). This is because the mass estimate of F11 is ∼24% larger than our WL result, which causes a non-negligible difference in abundance estimation when the mass function is steep.

5.4. Systematic Uncertainties

The errors quoted in Tables 1 and 2 include statistical errors only. Below, we discuss possible sources of systematics for our WL analysis and quantify their contributions to the total error budget. In summary, by adding in quadrature, we estimate that the total systematic uncertainty of our mass is ∼9% at r = 840 kpc.

5.4.1. Impacts of the Sample Variance on Source Redshift Estimation

If found to be large, the sample variance weakens the reliability of our determination of source redshift. In Section 3.4, we obtain the information on source redshift using the HUDF photo-z catalog by assuming that the color–magnitude–redshift relation is identical between the cluster field and HUDF. Because the two fields are of similar size, we are interested in quantifying how much the source redshift varies when a different sky patch is chosen. Our previous studies (e.g., Jee et al. 2014, 2017) have shown that in fact the sample variance is subdominant compared to the statistical errors. Here we repeat the same test to quantify the level of the sample variance on the SPT2106 field by comparing $\langle \beta \rangle $ measured from the HUDF, GOODS-S, and GOODS-N control fields.

We examine the sample variance on two angular scales. The first angular scale is the size of the GOODS fields; the second is the distance between GOODS-N and GOODS-S. For the sample variance estimation on the first scale, we divide each GOODS field into 9–11 subregions and measure the fluctuation of the $\langle \beta \rangle $ values. For ACS sources (F606W − F814W), the standard deviation of the distribution in $\langle \beta \rangle $ is ∼0.01 for each GOODS field. For WFC3 sources (F105W − F140W), the distribution width decreases by a factor of two (${\sigma }_{\langle \beta \rangle }\sim 0.005$). We attribute this reduction to the greater depth of the WFC3/IR data, which gives a higher total volume because of its long line-of-sight (LOS) baseline. These $\langle \beta \rangle $ fluctuations are translated to less than ∼7% in mass, which shows that the sample variance is negligible on the first angular scale (corresponding to the size of the GOODS field). For both ACS and WFC3 sources, the difference in the average $\langle \beta \rangle $ value between GOODS-N and GOODS-S is ∼0.01, similar to the level of the fluctuations observed within each GOODS field. Although future studies still need to investigate the sample variance on the above and other angular scales with more than just these two fields, we have not found any strong evidence that the sample variance is a significant source of systematics when WL sources are selected from deep imaging data (sufficient volume along the LOS direction) and their redshift estimation is based on deep control fields.

5.4.2. Systematics Due to Mc Relations

The use of Mc relations can be a source of systematic uncertainties. In Section 4.2 we measure the mass estimates of SPT2106 with and without the Mc relations. Figure 8 shows the distributions of posteriors on the Mc plane and their marginalized values. Also displayed are the mass estimates for three Mc relations. The masses obtained from the three Mc relations are consistent with the results derived from our two-parameter model. On the other hand, the two concentration values favored by the DM14 and DK15 Mc relations are outside the 1σ contours.

Although applying an Mc relation is an efficient and popular method to determine a cluster mass, we argue that the mass obtained without the assumption is least biased when the relation is highly uncertain as in the current case. Because massive high-z clusters are rare even in the current large state-of-the-art N-body simulations, statistical properties such as mass function, Mc relations, etc. are only loosely determined. For example, the Mc relations for massive, high-z clusters come from the power laws extrapolated from the results for less massive clusters; note that the shapes of the two power laws in DM14 and DK15 are different (e.g., flattening, upturn, etc.) in the high-redshift regime. In addition, the Mc relations are valid for the "mean" (or "median") properties of simulated clusters. Thus there is no guarantee that SPT2106 follows the mean Mc relation even if the extrapolated relation holds.

5.4.3. Cluster Model Bias

In Section 4.2, we assume a spherical NFW profile for measuring the mass of SPT2106. This assumption, however, is only applicable to a cluster with average properties. It is possible that SPT2106 might deviate greatly from this mean cluster profile, which if true is an additional source of uncertainties in our mass determination. According to Becker & Kravtsov (2011), this cluster model bias amounts to ∼20% for massive clusters. Some impacts of the assumed profile on our cluster mass estimation can be seen in Table 2, where we compare mass estimates using SIS and NFW profiles with various Mc relations. For example, the SIS mass estimate is ∼30% lower than our two-parameter NFW result; we already discussed the impacts of the various Mc relations in Section 5.4.2.

As an attempt to assess the amount of systematics due to the cluster model bias, we perform aperture mass densitometry (hereafter AMD), which minimizes the impact from the assumption of any particular halo model. AMD uses tangential shears of sources (Equation (11)) and computes a projected overdensity of the region inside a specific aperture radius with respect to the control annulus. For the estimation of the mean surface density in the control annulus, we inevitably have to assume a particular halo model. Nevertheless, the assumption plays a much less significant role than in the pure model-fitting method.

The projected overdensity within the r = r1 aperture is given by

Equation (13)

where $\left\langle {\gamma }_{T}\right\rangle $ is the azimuthal average of tangential shears, r1 is the aperture radius, and r2 and rmax are the inner and outer radii of the annulus, respectively. The aperture radius should be sufficiently separated from the inner radius of the annulus because the AMD uses tangential shears between the aperture radius r1 and the inner radius r2 for evaluation of the integral as expressed in Equation (13). The statistics ${\zeta }_{c}({r}_{1},{r}_{2},{r}_{\max })$ is then converted into the aperture mass within the radius r1 through the equation below:

Equation (14)

where Σcrit is the critical surface density obtained in Equation (2).

We choose the inner and outer radii of the control annulus to be 155'' (∼1.27 Mpc) and 175'' (∼1.44 Mpc), respectively. The average surface density of the annulus is estimated to be $\bar{\kappa }=0.009$ from our two-parameter NFW fitting result (Table 1). We iteratively update the tangential shear using γ = (1 − κ)g because what we measure is not a strict shear γ but a reduced shear g = γ/(1 − κ). The process stops when the density is converged. Readers are referred to Clowe et al. (2000) and Jee et al. (2005) for details of the AMD implementation.

Figure 12 shows the resulting cumulative projected mass profile. Also displayed are the profiles derived from the best-fit NFW models for comparison. At r = 840 kpc, the AMD mass agrees with our NFW result obtained with two-parameter fitting within ∼6%, which we adopt as the systematic error due to the model bias. The shaded region represents the 1σ uncertainties computed from bootstrapping runs. The statistical error at the radius is ∼22%. We find a ∼21% difference at the same radius when we compare the AMD result with the NFW result based on the DK15 Mc relation. These comparisons demonstrate that an NFW profile assumption might not be a significant source of systematics in our determination of the SPT2106 mass. And the excellent agreement with the AMD profile shows that the intrinsic mass profile of SPT2106 might be better described with the NFW parameters derived from our two-parameter fitting, although the result from the DK15 Mc relation is still marginally consistent with the AMD profile.

Figure 12.

Figure 12. Projected mass of SPT2106. We evaluate the projected mass as a function of radius, the distance from the WL mass peak. The gray solid line represents the result after we project the best-fit NFW model (Table 1). The blue solid line is obtained when we use the aperture mass densitometry (see the text). Shaded regions depict 1σ uncertainty intervals. The two results are highly consistent with each other. At R500c ∼ 840 kpc, the difference is only ∼6%, much less than the statistical uncertainties (∼22%). The agreement shows that the potential model bias due to our NFW assumption is not a major source of systematics.

Standard image High-resolution image

The good agreement of the AMD profile with our parametric (NFW) result also suggests that the mass profile of SPT2106 can be well approximated by a single halo. In general, treating well-separated merging clusters as a single halo is an important source of bias in estimating the total mass of the systems (e.g., Jee et al. 2014). However, since in SPT2106 the two substructures are very close (∼150 kpc), we believe that this single-halo approximation is not a significant source of systematic error.

6. Summary

We have presented a detailed WL study of the massive high-z cluster SPT2106 based on new HST data. From the deep HST imaging data, we achieve a source density of 169 arcmin−2 in the WFC3/IR field, which enables us to perform a high-resolution mass reconstruction and improve the precision in mass estimation.

The overall mass distribution of the cluster is characterized by the main mass clump and the extension ∼0.6 Mpc west of it. These two structures are spatially correlated with the cluster galaxy distributions. The main clump is further resolved into two substructures. The position of the northwestern mass peak agrees with that of the X-ray peak, which also coincides with the BCG, whereas the southeastern peak seems to be associated with the peak in number density. This WL substructure supports the merger hypothesis suggested in previous studies.

We estimate the cluster mass with various assumptions on its profile and argue that the result obtained with the two-parameter NFW profile fitting method (i.e., without any assumption on the Mc relation) is likely to be the least biased mass because the Mc relation is highly uncertain for extremely massive clusters at high redshift. We support this claim with our parameter-free result based on AMD. Our virial mass ${M}_{200c}=({10.4}_{-3.0}^{+3.3}\pm 1.0)\times $ ${10}^{14}\,{M}_{\odot }$ (the second error bar is the systematic uncertainty) agrees nicely with the previous measurements obtained from SZ and X-ray studies. Our statistical uncertainty is ∼40% smaller than the previous WL result. The difference is attributed to a significant increase in source density due to the availability of the deep WFC3 imaging data and improvement in shape measurement. We find that, given our new WL mass estimate, the cluster SPT2106 is certainly rare, but should not be regarded as an outlier within the current ΛCDM cosmology.

Support for the current HST program was provided by NASA through a grant from the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Incorporated, under NASA contract NAS5-26555. We thank Kyle Finner and Cristiano Sabiu for useful discussions. M.J.J. acknowledges support for the current research from the National Research Foundation (NRF) of Korea under the programs 2017R1A2B2004644 and 2017R1A4A1015178. J.K. acknowledges support from Shin Donguk scholarship at Yonsei University and the NRF of Korea grant funded by the Ministry of Science and ICT, Korea under the program 2019R1C1C1010942.

Footnotes

Please wait… references are loading.
10.3847/1538-4357/ab521e