Articles

CLASH: THREE STRONGLY LENSED IMAGES OF A CANDIDATE z ≈ 11 GALAXY

, , , , , , , , , , , , , , , , , , , , , , and

Published 2012 December 13 © 2013. The American Astronomical Society. All rights reserved.
, , Citation Dan Coe et al 2013 ApJ 762 32 DOI 10.1088/0004-637X/762/1/32

0004-637X/762/1/32

ABSTRACT

We present a candidate for the most distant galaxy known to date with a photometric redshift of z = 10.7+0.6−0.4 (95% confidence limits; with z < 9.5 galaxies of known types ruled out at 7.2σ). This J-dropout Lyman break galaxy, named MACS0647-JD, was discovered as part of the Cluster Lensing and Supernova survey with Hubble (CLASH). We observe three magnified images of this galaxy due to strong gravitational lensing by the galaxy cluster MACSJ0647.7+7015 at z = 0.591. The images are magnified by factors of ∼80, 7, and 2, with the brighter two observed at ∼26th magnitude AB (∼0.15 μJy) in the WFC3/IR F160W filter (∼1.4–1.7 μm) where they are detected at ≳12σ. All three images are also confidently detected at ≳6σ in F140W (∼1.2–1.6 μm), dropping out of detection from 15 lower wavelength Hubble Space Telescope filters (∼0.2–1.4 μm), and lacking bright detections in Spitzer/IRAC 3.6 μm and 4.5 μm imaging (∼3.2–5.0 μm). We rule out a broad range of possible lower redshift interlopers, including some previously published as high-redshift candidates. Our high-redshift conclusion is more conservative than if we had neglected a Bayesian photometric redshift prior. Given CLASH observations of 17 high-mass clusters to date, our discoveries of MACS0647-JD at z ∼ 10.8 and MACS1149-JD at z ∼ 9.6 are consistent with a lensed luminosity function extrapolated from lower redshifts. This would suggest that low-luminosity galaxies could have reionized the universe. However, given the significant uncertainties based on only two galaxies, we cannot yet rule out the sharp drop-off in number counts at z ≳ 10 suggested by field searches.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

Current models of structure formation suggest that the first galaxies formed at z ≳ 10 when the universe was ≲ 470 Myr old (Wise & Abel 2007; Wise et al. 2008; Greif et al. 2008, 2010; and for recent reviews, see Bromm & Yoshida 2011 and Dunlop 2012). Observations may be closing in on these first galaxies with one z ∼ 10 candidate detected in the Ultra Deep Field (UDFj-39546284; Bouwens et al. 2011a) and another strongly lensed by a galaxy cluster (MACS1149-JD; Zheng et al. 2012).

Intriguingly, the number density of z ∼ 10 galaxies detected in unlensed fields is several times lower than predicted based on extrapolations from lower redshifts, assuming a luminosity function with one or more parameters evolving linearly with redshift (Bouwens et al. 2008, 2011a; Oesch et al. 2012a). This suggests that the star formation rate density (SFRD) built up more rapidly from z ∼ 10 to 8 than it did later between z ∼ 8 and 2. This is consistent with some theoretical predictions (Trenti et al. 2010; Lacey et al. 2011). However, Robertson & Ellis (2012) suggest such a sharp drop-off would be in tension with z < 4 gamma-ray burst rates as correlated with SFRD and extrapolated to higher redshifts. Direct detections and confirmations of z ≳ 10 galaxies are required to more precisely constrain the SFRD at that epoch.

The observed luminosity functions at z ∼ 7 and 8 feature steep faint end slopes of α ∼ −2 (Bouwens et al. 2011b; Bradley et al. 2012b), steeper than at lower redshifts, a trend consistent with model predictions (Trenti et al. 2010; Jaacks et al. 2012). If these luminosity functions can be extrapolated to z ≳ 10, then low-luminosity galaxies (MUV fainter than −16 AB) could have reionized the universe (Bouwens et al. 2012a; Kuhlen & Faucher-Giguère 2012), assuming a sufficient fraction of their UV photons could escape their host galaxies to the surrounding medium (see also Conroy & Kratter 2012). Otherwise, a more exotic source of reionizing energy may have been required, such as self-annihilating dark matter (Iocco 2010; Natarajan 2012).

Reionization was likely well underway by z ≳ 10 but with over half the universe still neutral (Robertson et al. 2010; Pandolfi et al. 2011; Mitra et al. 2012). Improving our understanding of the early universe and this phase change is one of the pressing goals of modern cosmology.

Observations with the Wide Field Camera 3 (WFC3; Kimble et al. 2008) installed on the Hubble Space Telescope (HST) have significantly advanced our understanding of the z ≳ 7 universe, over 13 billion years in the past. The UDF and surrounding deep fields have yielded over 100 robust z > 7 candidates as faint as 29th magnitude AB (Bunker et al. 2010; Labbé et al. 2010; Bouwens et al. 2011b; Oesch et al. 2012a). Analyses of wider space-based surveys such as CANDELS (Grogin et al. 2011; Koekemoer et al. 2011), BoRG (Trenti et al. 2012), and HIPPIES (Yan et al. 2011) have helped fill out the brighter end of the luminosity function (Oesch et al. 2012b; Bradley et al. 2012b; Yan et al. 2011).

Of the handful of z > 7 galaxies spectroscopically confirmed to date, most have been discovered in even wider near-infrared surveys carried out with the ground-based telescopes Subaru (Shibuya et al. 2012; Ono et al. 2012), Very Large Telescope (VLT; Vanzella et al. 2011), and UKIRT (Mortlock et al. 2011). Surveys with the VISTA telescope are also beginning to yield high-redshift candidates (Bowler et al. 2012).

Complementary to these searches of "blank" fields are searches behind strongly lensing galaxy clusters (Kneib et al. 2004; Bradley et al. 2008, 2012a; Bouwens et al. 2009; Bradač et al. 2009, 2012; Maizy et al. 2010; Richard et al. 2011; Hall et al. 2012; Zitrin et al. 2012a; Wong et al. 2012; Zackrisson et al. 2012; and for a recent review, see Kneib & Natarajan 2011). The drawbacks of lensed searches are reduced search area in the magnified source planes and some uncertainty in the estimate of that search area introduced by the lens modeling. But the rewards are galaxies that are strongly magnified, often by factors of 10 or more. Lensed searches are significantly more efficient in yielding high-redshift candidates bright enough for spectroscopic confirmation, including A1703-zD6 (Bradley et al. 2012a) at z = 7.045 (Schenker et al. 2012).

The Cluster Lensing and Supernova survey with Hubble (CLASH; Postman et al. 2012) is a large Hubble program imaging 25 galaxy clusters in 16 filters, including five in the near infrared (0.9–1.7 μm). Five of these, including MACSJ0647.7+7015 (z = 0.591; Ebeling et al. 2007), were selected on the basis of their especially strong gravitational lensing power as observed in previous imaging, with the primary goal of discovering highly magnified galaxies at high redshift. To date, some of the more notable strongly lensed galaxies found in CLASH include a doubly imaged galaxy with a spectroscopic redshift of z = 6.027 observed at ∼24.6 mag (Richard et al. 2011), which is possibly ∼800 Myr old (although see Pirzkal et al. 2012); a quadruply imaged z ∼ 6.2 galaxy observed at 24th magnitude (Zitrin et al. 2012a); and the z ∼ 9.6 candidate galaxy MACS1149-JD observed at ∼25.7 mag (Zheng et al. 2012). The z ∼ 9.6 candidate is strongly lensed by MACSJ1149.6+2223, another CLASH cluster selected for its high magnification strength.

Here we report the discovery of MACS0647-JD, a candidate for the earliest galaxy yet detected at a redshift of z = 10.7+0.6−0.4 (95% confidence), just 427−30+21 million years after the big bang. It is strongly lensed by MACSJ0647.7+7015, yielding three multiple images observed at F160W AB mag ∼25.9, 26.1, and 27.3, magnified by factors of ∼8, 7, and 2. The brightest image is similar in flux to MACS1149-JD (F160W mag ∼25.7) at z = 9.6 ± 0.2 (68% confidence) and roughly 15 times (3 mag) brighter than the z = 10.3 ± 0.8 (68% confidence) candidate in the UDF (Bouwens et al. 2011a).

MACS0647-JD is a J-dropout as all three lensed images are securely detected in F160W and F140W but drop out of detection in the J-band F125W and all 14 bluer HST filters. We show this photometry is most likely due to the Lyman-α break redshifted to ∼1.46 μm at z ∼ 11. This Lyman dropout technique (Meier 1976; Giavalisco 2002) pioneered by Steidel et al. (1996) at z ∼ 3 has been used with a high success rate to identify high-redshift candidates later spectroscopically confirmed out to z ∼ 7. However, care must be taken not to confuse dropouts with intrinsically red (evolved and/or dusty) galaxies at intermediate redshift (Schaerer et al. 2007; Dunlop et al. 2007; Chary et al. 2007; Capak et al. 2011; Boone et al. 2011; Hayes et al. 2012). In the case of MACS0647-JD, we show that it is extremely difficult, if not impossible, for low-redshift interlopers to reproduce the observed colors, especially the J125H160 ≳ 3 magnitude break of MACS0647-JD. We also test our analysis method by reanalyzing previously published J-dropouts that later proved to be at intermediate redshift. Our Bayesian photometric redshift (BPZ; Benítez 2000; Coe et al. 2006) analysis correctly shows that intermediate redshift solutions are preferred for those objects, while higher redshift solutions are preferred for MACS0647-JD.

We describe our HST and Spitzer Space Telescope (Spitzer) observations in Section 2 and present photometry in Section 3. We derive the photometric redshift in Section 4 and consider a wide range of possible interlopers in Section 5. We present our gravitational lensing analysis in Section 6. In Section 7, we derive physical properties of MACS0647-JD based on additional photometric analysis. In Section 8, we compare our observed number density of z ∼ 11 galaxies to that expected, and we constrain the z > 9 SFRD. Finally, we present conclusions in Section 9.

Where necessary, we assume a concordance ΛCDM cosmology with h = 0.7, Ωm = 0.3, ΩΛ = 0.7, where H0 = 100 h km s−1 Mpc−1. In this cosmology, 1'' = 3.93 kpc at z = 10.8 and 6.62 kpc at the cluster redshift z = 0.591.

2. OBSERVATIONS

As part of the CLASH program, HST observed the core of MACSJ0647.7+7015 (Figure 1) during 19 orbits spread among eight different visits between 2011 October 5 and November 29 (General Observer program 12101). Imaging was obtained with the Wide Field Camera 3 (WFC3; Kimble et al. 2008) and Advanced Camera for Surveys (ACS; Ford et al. 2003) in 15 filters spanning 0.2–1.7 μm, including five near-infrared WFC3/IR filters spanning 0.9–1.7 μm. These data sets were supplemented by prior ACS imaging obtained in the F555W (0.56 μm) and F814W (0.81 μm) filters to total depths of ∼3.5 and 5.5 orbits, respectively (GO 9722 P.I. Ebeling; GO 10493, 10793 P.I. Gal-Yam). These observations are detailed in Table 1.

Figure 1.

Figure 1. Lenstool strong lensing mass model of MACSJ0647.7+7015 and multiply imaged galaxies as identified in this work using the Zitrin et al. (2009) method, including two strong lensing systems identified in Zitrin et al. (2011a). Each strongly lensed galaxy is labeled with a number and color coded by redshift (scale at bottom right). Letters are assigned to the multiple images of each galaxy. Dashed circles indicate predicted locations of counterimages not unambiguously identified. Overplotted are critical curves from our Lenstool model indicating thin regions of formally infinite magnification for background galaxies at z = 2.0 (cyan), 3.5 (green), and 11.0 (red). Mirror images of galaxies straddle these critical curves. The Hubble color image was produced using Trilogy (Coe et al. 2012) and is composed of ACS and WFC3/IR filters as given at top right.

Standard image High-resolution image

Table 1. Observed Filters and Integration Times

Filter Wavelengtha Exposure
F225W 0.24 μm 3805 s
F275W 0.27 μm 3879 s
F336W 0.34 μm 2498 s
F390W 0.39 μm 2545 s
F435W 0.43 μm 2124 s
F475W 0.47 μm 2248 s
F555W 0.54 μm 7740 s
F606W 0.59 μm 2064 s
F625W 0.63 μm 2131 s
F775W 0.77 μm 2162 s
F814W 0.81 μm 12760 s
F850LP 0.90 μm 4325 s
F105W 1.06 μm 2914 s
F110Wb 1.15 μm 1606 s
F125W 1.25 μm 2614 s
F140W 1.39 μm 2411 s
F160W 1.54 μm 5229 s
IRAC ch1 3.55 μm 18000 s
IRAC ch2 4.50 μm 18000 s

Notes. aEffective "pivot" wavelength (Tokunaga & Vacca 2005). bVisit A2 only, excluding visit A9 (Section 3.1).

Download table as:  ASCIITypeset image

We processed the images for debias, (super)flats, and darks using standard techniques, then co-aligned and combined them to a scale of 0farcs065 pixel−1; see Koekemoer et al. (2002, 2011) for further information on the astrometric alignment and drizzle algorithms that were used and Postman et al. (2012) for specific details on their implementation in CLASH. We also produced inverse variance maps (IVMs) based on the observed sky level, identified cosmic rays, detector flat field, read noise, dark current, and bad pixels. These IVMs may be used to estimate the level of uncertainty in each pixel before accounting for correlated noise and any Poisson source noise.

Imaging at longer wavelengths was obtained by Spitzer with the InfraRed Array Camera (IRAC; Fazio et al. 2004) ch1 (3.6 μm) and ch2 (4.5 μm) with total exposure times of 5 hr at each wavelength (program 60034, P.I. Egami). These observations were divided into two epochs separated by ∼5.5 months (2009 November 10 and 2010 April 23). We combined the Basic Calibrated Data (BCD) using MOPEX (Makovoz & Khan 2005) to produce mosaicked images.

As of 2012 July, CLASH had obtained 16-band HST observations for 17 clusters, including MACSJ0647.7+7015 and three other "high-magnification" strong lensing clusters, as given in Table 2. We searched for high-redshift galaxies in the WFC3/IR fields of view (FOVs) of all 17 of these clusters (L. D. Bradley et al. 2013, in preparation).

Table 2. Seventeen Clusters Searched in This Work

High Magnification?a Clusterb Redshift
  Abell 383 (0248.1−0331) 0.187
  Abell 611 (0800.9+3603) 0.288
  Abell 2261 (1722.5+3207) 0.244
  MACSJ0329.7−0211 0.450
Y MACSJ0647.8+7015 0.591
Y MACSJ0717.5+3745 0.548
  MACSJ0744.9+3927 0.686
  MACSJ1115.9+0129 0.355
Y MACSJ1149.6+2223 0.544
  MACSJ1206.2−0847 0.439
  MACSJ1720.3+3536 0.387
  MACSJ1931.8−2635 0.352
Y MACSJ2129.4−0741 0.570
  MS2137.3−2353 0.313
  RXJ1347.5−1145 0.451
  RXJ1532.9+3021 0.363
  RXJ2129.7+0005 0.234

Notes. aCLASH clusters were selected based on either X-ray or strong lensing properties. The latter "high magnification" clusters are marked with Y's here. For details, see Postman et al. (2012). bRA and decl. (J2000) are given in parentheses for the Abell clusters, encoded as they are in the names of the other clusters.

Download table as:  ASCIITypeset image

Out of ∼20,000 detected sources, we identified MACS0647-JD (Figure 2) as having an exceptionally high photometric redshift (Section 4). Our selection was based on spectral energy distribution (SED) fitting as used in some previous high-redshift searches (e.g., McLure et al. 2006; Dunlop et al. 2007; Finkelstein et al. 2010). We did not impose specific magnitude limits, color cuts, or other detection thresholds on our selection as in other works (e.g., Bunker et al. 2010; Yan et al. 2011; Bouwens et al. 2012b; Oesch et al. 2012b).

Figure 2.

Figure 2. Three images of MACS0647-JD as observed in various filters with HST. The leftmost panels show the summed 11 hr (17-orbit) exposures obtained in eight filters spanning 0.4–0.9 μm with the Advanced Camera for Surveys. The five middle columns show observations with the Wide Field Camera 3 IR channel in F105W, F110W, F125W, F140W, and F160W, all shown with the same linear scale in electrons per second. The F125W images were obtained at a single roll angle, and a small region near JD2 was affected by persistence due to a moderately bright star in our parallel observations immediately prior (see also Figure 4). The right panels zoom in by a factor of 2 to show F110W+F140W+F160W color images scaled linearly between 0 and 0.1 μJy.

Standard image High-resolution image

3. PHOTOMETRY

3.1. HST Photometry

3.1.1. Photometric Analysis

We used SExtractor version 2.5.0 (Bertin & Arnouts 1996) to detect objects in a weighted sum of all five HST WFC3/IR images. Along the edge of each object, SExtractor defines an isophotal aperture consisting of pixels with values above a detection threshold. We set this threshold equal to the rms measured locally near each object. Isophotal fluxes (and magnitudes) are measured within these isophotal apertures. SExtractor derives flux uncertainties by adding in quadrature the background rms derived from our inverse variance maps and the Poisson uncertainty from the object flux.

Since our images are drizzled to a 0farcs065 pixel scale, which is 2–3 times smaller than the WFC3 point-spread function (PSF), the resulting images contain significant correlated noise. The weight maps produced by drizzle represent the expected variance in the absence of correlated noise. To account for the correlated noise, one may apply a correction factor as in Casertano et al. (2000).

Previous authors have also noted that SExtractor tends to underestimate flux uncertainties (Feldmeier et al. 2002; Labbé et al. 2003; Gawiser et al. 2006) by as much as a factor of 2–3 (Becker et al. 2007). In this work, we obtained empirical measurements of the flux uncertainties using the following method which also captures the effects of correlated noise.

SExtractor has the ability to measure the local background within a rectangular annulus (default width 24 pixels) around each object. We constructed a rectangle of the same size, but rather than calculate the rms of the individual pixels, we obtained samples of the background flux within this region using the isophotal aperture shifted to new positions. In other words, we moved the isophotal aperture to every position within this rectangle, sampling the flux at each position. We discarded measurements for which the aperture includes part of any object, as we are interested in measuring the background flux. Finally, we measured the rms of these measurements and added in quadrature the object's Poisson uncertainty to obtain the total flux uncertainty for that object. We found that this technique indeed yielded larger flux uncertainties than reported by SExtractor, typically by factors of 2–3 in the WFC3/IR filters and by lower factors in ACS and WFC3/UVIS.

We also used this method to determine object fluxes. The mean of the flux measurements in the nearby apertures was adopted as the local flux bias, which we subtracted from the flux measurement in the object itself. We found that this yielded photometry very similar to that obtained using SExtractor, agreeing well within the photometric uncertainties. While we used this photometry for all subsequent analyses, we also verified that our derived photometric redshifts did not vary significantly (after excluding the F110W second epoch exposures; Section 3.1.3) if we instead utilized photometry derived directly from SExtractor.

We corrected for Galactic extinction of E(BV) = 0.11 in the direction of MACSJ0647.7+7015 as derived using the Schlegel et al. (1998) IR dust emission maps. For each filter, the magnitudes of extinction per unit E(BV) are given in Postman et al. (2012, their Table 5). (These values should be ∼10% lower in the NUV and optical according to Schlafly & Finkbeiner 2011.) This extinction reddens the observed colors at the few percent level in the near-IR. Thus, the effects on the J-dropout are negligible. The extinctions range from 0.05 to 0.11 mag in the WFC3/IR images; 0.16 to 0.46 mag in the ACS images; and 0.50 to 0.83 mag in WFC3/UVIS. We note that the extinction may be somewhat uncertain due to patchy galactic cirrus in the direction of MACSJ0647.7+7015.

3.1.2. Photometric Results

Our resulting 17-band HST photometry is given in Table 3 and Figure 3. All three J-dropouts are detected at >10σ in F160W, >6σ in F140W, and <3σ in all other filters. JD is not detected above 1σ in any filter blueward of F140W.

Figure 3.

Figure 3. Observed HST photometry (filled circles and triangles) plotted against the expected fluxes (open blue squares) from a young starburst galaxy spectrum (gray line) redshifted to z ∼ 11. HST filter transmission curves are plotted in the upper panel, normalized to their maxima, and with black dots indicating the effective "pivot" wavelengths. Photometry of the J-dropouts observed through these filters (Table 3) is plotted as the larger circles and triangles for positive and negative observed fluxes, respectively, with 1σ error bars. For some points, horizontal "error bars" are plotted to reiterate the filter widths. The gray line is a model spectrum of a young starburst at z = 11.0, the best fit to the summed photometry. The integrals of this spectrum through our filters give the model predicted fluxes plotted as blue squares. Other galaxy types at z ∼ 11 yield similar predicted HST fluxes, as the shape of the spectrum cannot be constrained by the HST photometry alone. Redshifted Lyman-α at 0.1216 μm(1 + z) ∼ 1.46 μm is indicated by the vertical dashed line.

Standard image High-resolution image

Table 3. Coordinates, Observed Filters, and Photometry of the J-dropouts

  JD1 JD2 JD3 JD1 + JD2 + JD3a
RA (J2000) 06:47:55.731 06:47:53.112 06:47:55.452  
Decl. (J2000) +70:14:35.76 +70:14:22.94 +70:15:38.09  
F225W −129 ± 51 nJy (−2.5σ) −40 ± 50 nJy (−0.8σ) 12 ± 32 nJy (0.4σ) −157 ± 78 nJy (−2.0σ)
F275W −95 ± 51 nJy (−1.9σ) −31 ± 42 nJy (−0.8σ) 49 ± 24 nJy (2.0σ) −77 ± 70 nJy (−1.1σ)
F336W 2 ± 37 nJy (0.0σ) 49 ± 29 nJy (1.7σ) −25 ± 18 nJy (−1.4σ) 25 ± 50 nJy (0.5σ)
F390W −8 ± 20 nJy (−0.4σ) 1 ± 19 nJy (0.1σ) 1 ± 10 nJy (0.1σ) −6 ± 29 nJy (−0.2σ)
F435W 0 ± 26 nJy (0.0σ) 43 ± 24 nJy (1.8σ) 5 ± 14 nJy (0.4σ) 48 ± 38 nJy (1.3σ)
F475W −2 ± 14 nJy (−0.1σ) −27 ± 16 nJy (−1.7σ) 7 ± 8 nJy (0.9σ) −22 ± 23 nJy (−1.0σ)
F555W −3 ± 9 nJy (−0.3σ) 12 ± 7 nJy (1.7σ) 6 ± 4 nJy (1.4σ) 15 ± 12 nJy (1.3σ)
F606W 3 ± 16 nJy (0.2σ) 13 ± 20 nJy (0.6σ) −1 ± 6 nJy (−0.1σ) 15 ± 26 nJy (0.6σ)
F625W −35 ± 21 nJy (−1.7σ) −52 ± 24 nJy (−2.2σ) 23 ± 10 nJy (2.3σ) −64 ± 33 nJy (−1.9σ)
F775W 4 ± 30 nJy (0.2σ) −16 ± 52 nJy (−0.3σ) 4 ± 10 nJy (0.3σ) −8 ± 61 nJy (−0.1σ)
F814W 0 ± 8 nJy (0.1σ) −2 ± 5 nJy (−0.3σ) −2 ± 3 nJy (−0.8σ) −3 ± 10 nJy (−0.3σ)
F850LP −3 ± 30 nJy (−0.1σ) 1 ± 29 nJy (0.0σ) 6 ± 15 nJy (0.4σ) 4 ± 45 nJy (0.1σ)
F105W 11 ± 12 nJy (0.9σ) 14 ± 13 nJy (1.1σ) 3 ± 5 nJy (0.6σ) 28 ± 18 nJy (1.6σ)
F110Wb −8 ± 10 nJy (−0.8σ) 3 ± 9 nJy (0.3σ) 7 ± 4 nJy (1.9σ) 2 ± 14 nJy (0.1σ)
F125W −3 ± 10 nJy (−0.3σ) 7 ± 16 nJy (0.5σ) 2 ± 5 nJy (0.4σ) 6 ± 20 nJy (0.3σ)
F140W 63 ± 10 nJy (6.0σ) 50 ± 8 nJy (6.7σ) 26 ± 4 nJy (6.1σ) 139 ± 14 nJy (9.9σ)
  =26.90 ± 0.17 mag AB =27.15 ± 0.17 mag AB =27.86 ± 0.17 mag AB =26.04 ± 0.11 mag AB
F160W 162 ± 13 nJy (12.4σ) 136 ± 9 nJy (15.1σ) 42 ± 4 nJy (10.1σ) 341 ± 16 nJy (21.3σ)
  =25.88 ± 0.09 mag AB =26.07 ± 0.07 mag AB =27.34 ± 0.10 mag AB =25.07 ± 0.05 mag AB
IRAC ch1 <277 nJyc <166 nJy <166 nJy <363 nJy
IRAC ch2 <245 nJyc 436 ± 139 nJy(3.1σ) <138 nJy 436 ± 314 nJy (1.4σ)

Notes. Fluxes in nanoJanskys (nJy) may be converted to AB magnitudes via mAB ≈ 26–2.5log10(Fν/(145 nJy)). Magnitude uncertainties, where given, are non-Gaussian but are approximated as 2.5log10(e) times the fractional flux uncertainties. aSum of all three images with uncertainties added in quadrature. bVisit A2 only, excluding visit A9, which exhibits significantly elevated and non-Poissonian backgrounds due to earthshine (Section 3.1). cIncludes uncertainties from modeling and subtracting a nearby brighter galaxy. More conservative estimates of these uncertainties were also considered in the analysis (Section 3.2).

Download table as:  ASCIITypeset image

All three J-dropouts are confidently detected in two filters (F140W and F160W) observed at six different epochs over a period of 56 days (Figure 4). No significant temporal variations are observed in position or brightness, ruling out solar system objects and transient phenomena such as supernovae, respectively (see Figures 5 and 6 and Section 5).

Figure 4.

Figure 4. MACS0647-JD as observed in each of the individual epochs of F160W and F140W obtained over a 56 days period. These observations were obtained at two different telescope roll angles, which alternate between the stamps shown here. A small region of the WFC3/IR images in our first roll angle was affected by persistence due to a moderately bright star in our parallel observations immediately prior. These pixels happen to fall within 1'' of JD2 at that roll angle (marked in gray here and flagged as unreliable). Excluding this roll angle for JD2 does not significantly affect the derived photometry.

Standard image High-resolution image
Figure 5.

Figure 5. Flux measurements in the individual epochs observed over a period of 56 days. Filters are colored F160W (red), F140W (yellow), F125W (green), and F110W (blue) as both individual data points and solid bands, as determined for the summed observations. The F110W exposures obtained in the second epoch (visit A9) were found to have significantly elevated and non-Poissonian backgrounds due to earthshine (Section 3.1). These were excluded in our analysis; we adopted the F110W fluxes measured in the first epoch (visit A2).

Standard image High-resolution image
Figure 6.

Figure 6. Relative centroid measurements for the detections in F160W (red) and F140W (yellow) in individual epochs (circles) and summed observations (squares). Centroids measured in the summed NIR images are also plotted as gray diamonds. The offsets are generally less than one of our drizzled pixels (0farcs065), roughly half the native WFC3/IR pixel size (∼0farcs13).

Standard image High-resolution image

3.1.3. Exclusion of F110W Second Epoch

Based on an initial standard reduction of the HST images and standard SExtractor photometry, MACS0647-JD2 was detected in F110W at 5σ, while JD1 and JD3 were not significantly detected (0.9σ and 1.7σ, respectively). Our empirical rederivations of the photometric uncertainties, including proper accounting for correlated noise (Section 3.1.1), reduced the significance of this detection to 2.5σ. However, we ultimately we concluded that this marginal detection was completely spurious due to significantly elevated and non-Poissonian backgrounds due to earthshine in two out of five F110W exposures, both obtained during the second epoch (see below). After excluding these exposures, the detection significance drops to 0.3σ, consistent with background noise. For reference, see the WFC3/IR images in Figure 2.

Even based on the initial "standard" analysis described above, we determined that MACS0647-JD is at z < 9 with a likelihood of ∼10−9 based on a joint photometric redshift analysis of all three images (Section 4). This likelihood decreased further to 3 × 10−13 based on our improved analysis. These values are summarized in Table 4. The spuriously high flux measurements may be seen in Figure 5.

Table 4. Effects of F110W Aberrant Second Epoch

F110W detection σ JD1 JD2 JD3 P(z < 9)a
SExtractor photometry 0.9 5.0 1.7 1 × 10−9
Empirical uncertainties 0.5 2.5 1.0 4 × 10−8
Excluding second epoch −0.8 0.3 1.9 3 × 10−13

Notes. JD2 is spuriously detected in F110W images processed using standard techniques. This is due to significantly elevated non-Poissonian backgrounds in the second epoch of observations due to earthshine. We exclude this epoch in our analysis. See Section 3.1 aBased on the summed photometry of all three images, and assuming that MACS0647-JD is a galaxy well described by our templates. See Sections 4 and 5.

Download table as:  ASCIITypeset image

The final observations of MACSJ0647.7+7015 were two 502 s exposures in F110W obtained during visit A9, the second epoch for that filter. We found these to have significantly elevated backgrounds of 1.9 (σ-clipped mean) ±0.44 (rms) and 6.4 ± 0.27 electrons s−1, respectively, compared with the more typical values around 1.5 ± 0.08. These high backgrounds were due to earthshine, or sunlight reflected from the Earth. The first observation was obtained during twilight as the telescope pointed within 67°–59° of the bright limb of Earth. This Earth limb angle continued to steadily decrease from 47° to 24° during the second observation which was obtained during daylight. In the observation log, the diagnostic Earth bright limb flag was raised halfway through the second exposure. We also examined the 10 individual readouts of 100 s each obtained over the course of both exposures and found that the mean background increased steadily from 0.9 to 7.5 electrons s−1. The resulting elevated background rms values of 0.44 and 0.27 electrons s−1 in the two exposures are the highest and sixth highest relative to the median values for a given filter in 1582 CLASH observations to date of 17 clusters. None of the three F160W observations obtained at the beginning of visit A9 exhibit elevated backgrounds because they were obtained at night (twilight had yet to set in) and Earth is less bright in F160W.

Specifically, when we compared the measured rms values to what would be expected from scaling the background intensity levels, we found that these rms values are several times higher than would be expected in the case of Poissonian statistics. We attribute this to the fact that the sky background was increasing in a strongly nonlinear fashion during the exposure, whereas the up-the-ramp slope fitting algorithm implemented in "calwf3" implicitly assumes that the count rate is constant when converting measured counts into counts per second (see Dressel 2011). Since this assumption is violated, the pixel-to-pixel variations in the final count-rate image no longer scale as expected for Poissonian statistics, as demonstrated by the much higher rms values. Since these data no longer conform to Poissonian statistics, we were able to demonstrate that attempting to combine them with the other data did not yield an improvement in signal-to-noise ratio but instead produced combined data sets with non-Poissonian statistics, from which we were not able to obtain reliable photometry.

We therefore exclude the two F110W visit A9 exposures from our analysis and derive photometry instead from the weighted sum of the three visit A2 exposures.

3.2. Spitzer Photometry

To derive photometry in the longer wavelength Spitzer IRAC images (Figure 7), we performed both GALFIT PSF fitting and aperture photometry on JD2 and JD3. No significant flux is detected for either object in either channel except for a 3σ detection of JD2 in ch2: mag = 24.8 ± 0.3. Aperture photometry (2farcs4 diameter aperture) yields mag = 25.8 ± 0.3, subject to an approximate 0.7 mag correction, roughly consistent with the GALFIT-derived photometry.

Figure 7.

Figure 7. Spitzer IRAC ch1 (3.6 μm) and ch2 (4.5 μm) images of MACS0647-JD compared to the HST WFC3/IR F160W (1.6 μm) image. Two intensity scalings and zooms are shown. Left: both 26'' × 26'' F160W cutouts are scaled linearly in photon counts to the same range as in Figure 2. And for each Spitzer filter, the same count range is used in each row. The background photon counts are significantly higher near JD1 and JD2 (top row) due to intracluster light and scattered starlight. MACS0647-JD is not detected brightly in the Spitzer images, supporting the high-redshift solution. The only possible detection we report is for JD2 at 3.1σ in ch2 (Table 3). JD1 is contaminated by light from other nearby galaxies which we modeled and subtracted to estimate JD's photometry. Right: in each of these 5'' × 5'' closeups, the intensity is scaled independently to the observed range within the central 3'' × 3''.

Standard image High-resolution image

JD1 is significantly contaminated by light from a nearby cluster galaxy. We modeled this galaxy using GALFIT, subtracted it from the image, and measured photometry in 2farcs4 diameter apertures, yielding a null detection plus uncertainty. We also added a simulated 25th magnitude source and used GALFIT to derive its photometry. We conservatively combined the uncertainties from these two measurements in quadrature to yield total uncertainties (1σ upper limits) of 277 and 245 nJy in ch1 and ch2, respectively (3σ limits of mag 24.2 and 24.1). We also experimented with inflating these uncertainties further by 1 mag (3σ limits of 23.2 and 23.1 mag). This would increase the JD1 z < 9 likelihood (see Section 4) from ∼3 × 10−7 to 2 × 10−5, and the likelihood based on the integrated photometry of all three images from 3 × 10−13 to 2 × 10−9.

4. PHOTOMETRIC REDSHIFT

We perform two independent analyses of the HST+Spitzer photometry to estimate the photometric redshift of MACS0647-JD. These two methods, BPZ (Section 4.1) and LePHARE (Section 4.2), were the top 2 performers out of 17 methods tested in Hildebrandt et al. (2010). They yielded the most accurate redshifts with the fewest outliers given a photometric catalog for galaxies with known spectroscopic redshifts.

According to our gravitational lensing models (Section 6), MACS0647-JD1, 2, and 3 are likely three multiple images of the same strongly lensed background galaxy. Thus, in this section, we present photometric redshift likelihoods for each individual image, as well as jointly for the two brighter images and for all three images.

4.1. Bayesian Photometric Redshifts (BPZ)

We used BPZ (Benítez 2000; Coe et al. 2006) for our primary photometric redshift analyses. We modeled the observed HST+Spitzer photometry of MACS0647-JD using model SEDs from PEGASE (Fioc & Rocca-Volmerange 1997), which have been significantly adjusted and recalibrated to match the observed photometry of galaxies with known spectroscopic redshifts from FIREWORKS (Wuyts et al. 2008). The FIREWORKS data set includes 0.38–24 μm photometry of galaxies down to mag ∼24.3 (5σ K band) and spectroscopic redshifts out to z ∼ 3.7. In analyses of large data sets with high quality spectra, this template set yields ≲1% outliers, demonstrating that it encompasses the range of metallicities, extinctions, and star formation histories (SFHs) observed for the vast majority of real galaxies. (In Section 5.1, we explore a still broader range of galaxy properties using a synthetic template set that has not been recalibrated to match observed galaxy colors.) These templates include nebular emission lines as implemented by Fioc & Rocca-Volmerange (1997).

The Bayesian analysis tempers the SED model quality of fit with an empirically derived prior P(z, T|m) on the galaxy redshift and type given its (delensed) magnitude. Our prior was constructed as in Benítez (2000) and updated based on likelihoods P(z, T|m) observed in COSMOS (Ilbert et al. 2009), GOODS-MUSIC (Grazian et al. 2006; Santini et al. 2009), and the UDF (Coe et al. 2006). According to this prior (extrapolated to higher redshifts), all galaxy types of intrinsic (delensed) magnitude ∼28.2 are over 80 times less likely to be at z ∼ 11 than z ∼ 2. Thus, our analysis is more conservative regarding high-redshift candidates than an analysis which neglects to implement such a prior (implicitly assuming a flat prior in redshift). The prior likelihoods for MACS0647-JD are uncertain both due to the prior's extrapolation to z ∼ 11 and uncertainty in MACS0647-JD's intrinsic (delensed) magnitude. Yet, it serves as a useful approximation that is surely more accurate than a flat prior.

Based on this analysis, we derived photometric redshift likelihood distributions as plotted in Figure 8 and summarized in Table 5. The images JD1, JD2, and JD3 are best fit by a starburst SED at z ∼ 10.9, 11.0, and 10.1, respectively. After applying the Bayesian prior, we find that JD1 and JD2 are most likely starbursts at z ∼ 10.6 and 11.0, respectively. A z ∼ 2.5 elliptical template is slightly preferred for JD3; however, z = 11 is within the 99% confidence limits (CLs). Observed at ∼27.3 mag, we may not expect this fainter image to yield as reliable a photometric redshift.

Figure 8.

Figure 8. Top row: photometric redshift probability distributions based on our BPZ analysis (Section 4.1) of HST+Spitzer photometry for each image. Cumulative probabilities P(< z) are shaded gray; probabilities P(z) per unit 0.01 in redshift are drawn as black lines; and likelihoods for individual SED templates are drawn as colored lines. In this work, we use four elliptical templates (Ell), one E/S0, two spirals (Sbc and Scd), and four starbursts (SB). We then interpolate nine templates between each pair of adjacent templates. Bottom left: joint likelihoods for all three images. Bottom center: likelihoods based on the summed photometry of all three images. For all of these likelihoods, we assume the prior plotted at bottom right for galaxies of an intrinsic (delensed) magnitude of ∼28.2. This prior was empirically derived from large surveys with photometric and spectroscopic redshifts and extrapolated to higher redshifts.

Standard image High-resolution image

Table 5. Individual and Joint Redshift Likelihoods

Image 95% CL 99% CL P(z < 9)
JD1 (F160W ∼ 25.9)a 10.62+0.83−0.34 [10.11–11.67] 3 × 10−7
JD2 (F160W ∼ 26.1)a 10.99+0.50−0.77 [ 9.99–11.69] 3 × 10−4
JD3 (F160W ∼ 27.3)a  2.48+7.95−0.42 [ 1.81–11.07] 7 × 10−1
P(JD1)×P(JD2)b 10.66+0.68−0.31 [10.21–11.53] 4 × 10−10
P(JD1)×P(JD2)×P(JD3)b 10.42+0.66−0.19 [10.12–11.30] 2 × 10−8
P(JD1 + JD2)c 10.99+0.43−0.61 [10.22–11.59] 2 × 10−11
P(JD1 + JD2 + JD3)c 10.71+0.59−0.37 [10.20–11.47] 3 × 10−13

Notes. BPZ results assuming that MACS0647-JD is well modeled by our SED templates (Sections 4 and 5). All likelihoods include a Bayesian prior, which assumes that galaxies of this (unlensed) magnitude are over 80 times more likely to be at z ∼ 2 than at z ∼ 11. See also Figure 8. aApproximate AB magnitudes are given in parentheses. (See also Table 3.) Note JD3 is significantly fainter. bJoint likelihood of multiple images weighted equally. cLikelihood based on integrated photometry of multiple images.

Download table as:  ASCIITypeset image

In Table 4, we also provide joint likelihoods based on the brighter two images and all three images equally weighted. To properly downweight the fainter image, we also analyzed the integrated photometry of all three images (with uncertainties added in quadrature). Based on this analysis including our Bayesian prior, and assuming MACS0647-JD is a galaxy well described by our template set (see also Section 5.1), we found z = 10.7+0.6−0.4 (95% CL) with a ∼3 × 10−13 likelihood that MACS0647-JD is at z < 9. This likelihood corresponds to a 7.2σ confidence that MACS0647-JD is at z > 9. The joint likelihood analysis (weighting all images equally) yields a similar 95% CL [10.2–11.1] and a more conservative P(z < 9) ∼2 × 10−8 or z > 9 at 5.5σ.

The strong confidence in the high-redshift solution requires the combined HST and Spitzer photometry. Without the Spitzer photometry, the z > 9 likelihood would drop to 95% for the summed HST photometry. Similarly, we would find P(z > 9) ∼91% for JD1 individually. However, the most likely solutions for JD2 and JD3 would be early types at z ∼ 4. We would expect such galaxies to be ∼23 mag in the Spitzer observations, which is extremely unlikely (as quantified above) given the measured photometry (see also Section 3.2).

4.2. LePHARE

We also used LePHARE (Arnouts et al. 1999; Ilbert et al. 2006, 2009) to independently estimate the photometric redshifts. For this analysis, we used an SED template library primarily from Ilbert et al. (2009) as optimized for the COSMOS survey (Scoville et al. 2007a, 2007b; Koekemoer et al. 2007). This includes three ellipticals and seven spirals as generated by Polletta et al. (2007) using the GRASIL code (Silva et al. 1998), as well as 12 starburst galaxies with ages ranging from 30 Myr to 3 Gyr generated by GALAXEV based on Bruzual & Charlot (2003). We supplemented these with four additional elliptical templates for a total of seven ellipticals.

We added dust extinction in 10 steps up to E(BV) = 0.6. (Stronger degrees of extinction are explored in Section 5.1.1.) Four different dust laws were explored: Calzetti et al. (2000); Calzetti plus two variations on a 2170 Å bump; and Prevot et al. (1984) as observed for the SMC.

We adopted the Benítez (2000) prior as implemented in LePHARE. The results were consistent with those from BPZ: z = 10.6+0.6−0.2 (JD1), z = 10.6+0.5−0.3 (JD2), and z = 10.1+0.3−0.3 (JD3), each at 68% CL. A secondary solution of z ∼ 2.5 was reported for JD3 with a peak likelihood 10 times less than that of the best-fit high-redshift solution.

5. LOWER REDSHIFT INTERLOPERS RULED OUT

In this section, we consider a broad range of z < 11 possibilities. As found in Section 4, the z < 9 likelihood is formally ∼3 × 10−13 assuming MACS0647-JD is a galaxy well modeled by our SED templates. Though strongly disfavored, a z ∼ 2.5 early type and/or dusty galaxy is the most likely alternative, as we discuss further in Section 5.1. We reanalyzed previously published J-dropouts and found them most likely to be at intermediate redshift (Section 5.2). Objects within the Galaxy are less likely, as this would require three objects with extremely rare colors (Figure 13) at positions consistent with strongly lensed multiple images according to our lens models (Section 6). Nevertheless, we found that the only stars or brown dwarfs consistent with the observed colors are rare, transient post-asymptotic giant branch (AGB) flare-ups, though these would be far more luminous if observed within the Galaxy (Section 5.3). Solar system objects would have likely exhibited parallax motion and are inconsistent with the observed colors (Section 5.4). Intermediate-redshift long-duration multiply imaged supernovae (Section 5.5) and emission line galaxies (Section 5.6) are also extremely unlikely. We conclude that MACS0647-JD is most likely either at z ∼ 11 or exhibits unique photometry yet to be observed in any other known object.

5.1. Intermediate Redshift Galaxy?

5.1.1. SED Constraints

While we found P(z < 9) ∼ 3 × 10−13, the next best alternative to z ∼ 11 is an early-type galaxy (ETG) at z ∼ 2.5 (Figure 9). At z ∼ 2.65, the 4000 Å break is redshifted to 1.46 μm, coinciding with Lymanα (1216 Å) redshifted to z ∼ 11.0. However, 4000 Å breaks are not expected to be as strong as observed for MACS0647-JD (Figures 10 and 11). JD1 features a J125H160 ≳ 3 magnitude break between F125W and F160W as well as a ∼1 mag break between F140W and F160W. Thus, low-redshift ETGs yield a significantly worse SED fit than z ∼ 11 for all three images as quantified in Figure 8 and Table 5.

Figure 9.

Figure 9. Observed NIR photometry from HST WFC3/IR and Spitzer IRAC (filled circles and triangles) compared to the expected fluxes (open squares) from two SEDs: the z = 11.0 starburst from Figure 3 (blue) and a z = 2.5 early-type galaxy (red). Note that the JD3 plot is scaled differently along the y-axis.

Standard image High-resolution image
Figure 10.

Figure 10. Observed WFC3/IR colors (shaded 68% confidence regions) for JD1 (solid lines), JD2 (dashed lines), and JD3 (dotted lines) plotted against those predicted with the BPZ template set from young starburst (blue) to early type (yellow–orange–red) as a function of redshift. The three panels plot flux ratios in F125W/F160W (left), F125W/F140W (middle), and F140W/F160W (right). The corresponding colors in magnitudes are given along the right axes.

Standard image High-resolution image
Figure 11.

Figure 11. Observed colors in WFC3/IR F125W − F160W and F160W − IRAC ch1 plotted as black lines (95% confidence limits) versus those predicted from the current BPZ template library (lines colored as a function of redshift and made thicker for earlier galaxy types).

Standard image High-resolution image

To explore an even broader range of galaxy SED models than used in Section 4, we utilized the flexible stellar population synthesis (FSPS) models from Conroy et al. (2009) and Conroy & Gunn (2010). They provide simple stellar population (SSP) models that span ages of 5.5 ⩽ log(age/yr) ⩽10.175 and metallicities of 0.0002 ⩽ Z ⩽ 0.03 (where Z = 0.019). Nebular emission lines are not included. We convolved their SSP models with SFHs ranging from the single early burst (SSP) to exponentially declining ("τ models"), continuous (constant rate), and exponentially rising ("inverted τ models"). The latter rising SFH likely describes high-redshift galaxies best according to both observations (Maraston et al. 2010; Papovich et al. 2011; Reddy et al. 2012) and simulations (Finlator et al. 2011). Finally, we added a variable degree (up to AV = 30 mag) of Calzetti et al. (2000) dust extinction with RV = AV/EBV = 4.05.

To uncover the most likely solutions in different regions of this multidimensional parameter space, we began with relatively coarse grid searches with redshift intervals of 0.1 and ∼9 steps in each of the four other free parameters. We then zoomed in on the higher likelihood regions, found again to be roughly z ∼ 2.5 and 11. Finally, we ran Powell (1964) minimizations to find the best-fitting SEDs at each of these redshifts.

We supplemented these SEDs with a suite of smooth τ models with stochastic bursts superposed (e.g., Kauffmann et al. 2003; Salim et al. 2007), as well as truncated ("quenched") SFHs designed to reproduce the colors of post-starburst (K+A) galaxies.

Our results with this combined template set confirm that a z ∼ 11 model fits MACS0647-JD best, while evolved and/or dusty galaxies at z ∼ 2.5 provide the best alternatives but are still significantly worse statistically. The best-fitting intermediate-redshift template to the summed photometry (z ∼ 2.7; ∼400 Myr old; AV ∼ 0.8 mag) with χ2 = 57.6 is only ∼10−9 times as likely as the best-fitting z ∼ 11 template (z ∼ 10.9; ∼6 Myr old; AV = 0) yielding χ2 = 16.9 with ≳ 14 degrees of freedom given the 19 photometric measurements and ≲ 5 free parameters (see discussion in Andrae et al. 2010).

The uncertainties on the z ∼ 11 SED parameters are quantified in Section 7.3. A proper calculation of the redshift likelihoods based on these templates would require an estimate of the prior likelihoods in this multidimensional parameter space, which is beyond the scope of this work. And while these templates probe a broad parameter space, we derive our primary photometric redshift estimates in Section 4 from templates that have been well calibrated to match the observed photometry of galaxies with spectroscopic redshifts.

We note that there is no evidence that z > 2 ETGs have significantly different SEDs than our ETG models calibrated at lower redshifts. The highest redshift ETG observed to date is HUDF-1446 with a spectroscopic redshift of z = 2.67 (Damjanov et al. 2011). Coe et al. (2006) published a photometric redshift of z = 2.74 ± 0.44 for this object using BPZ, in good agreement with the true redshift. Their ETG templates yielded a good fit to the ACS (B435V606i'775z'850) and NICMOS (J110H160) photometry, including the J110H160 = 1.89 ± 0.13 break with H160 = 23.074 ± 0.098 and significant detections in all ACS filters.

5.1.2. Lower Stellar Mass than Observed z > 2 ETGs

If MACS0647-JD were at z ∼ 2.5 (despite the low likelihood of this from SED fitting) it would likely be the least massive early-type host galaxy observed to date at z > 1. Spectroscopically confirmed z > 1.4 ETGs to date have stellar masses >2 × 1010M (Damjanov et al. 2011). HUDF-1446 at z = 2.67, for example, is ∼8 × 1010M.

Our subset of lens models that allow for MACS0647-JD to be at z ∼ 3 (Section 6.3) suggest that the magnification of the brightest two images would be μ > 30. Thus, it would be intrinsically ∼300 times fainter than HUDF-1446, with a correspondingly lower stellar mass on the order of ∼2 × 108M (and still <109M if we assume a more conservative magnification factor of μ ∼ 10; see also z ∼ 11 mass estimates in Section 7.2).

Quiescent galaxies of such low masses at z > 2 would be a surprising discovery. Observations to date demonstrate (e.g., Peng et al. 2010) that star formation is only significantly quenched by feedback in more massive galaxies, or alternatively as a galaxy is harassed as a satellite of a larger halo. MACS0647-JD is not observed to be a satellite of a galaxy group.

5.2. Comparisons to Previously Published J-dropouts

The previous highest redshift candidate, UDFj-39546284 (Bouwens et al. 2011a), was detected at 5.8σ in a single HST band (WFC3/IR F160W) dropping out of F125W and bluer filters also with non-detections in Spitzer yielding a photometric redshift of z = 10.3 ± 0.8. The ultimate inclusion of the F140W filter on WFC3 (Brown & Baggett 2006) and in the CLASH observing program enables us to securely identify MACS0647-JD as the highest redshift galaxy candidate to date. At z ∼ 11.0, Lymanα is redshifted to ∼1.46 μm, causing the galaxy light to drop out of ∼2/3 of the F140W bandpass as well as ∼1/5 of F160W. The ratio between these two filling factors (0.8/0.33 ∼ 2.4, corresponding to ∼1.0 mag) places tight, model-independent constraints on the wavelength of the (redshifted) Lyman break and thus the redshift of MACS0647-JD (Figure 10). The five NIR HST filters used by CLASH also enabled Zheng et al. (2012) to discover a J-dropout lensed by MACSJ1149.6+2233 and robustly measure its photometric redshift to be z = 9.6 ± 0.2 (68% CL).

Laporte et al. (2011) identified a J-dropout lensed by Abell 2667 based on VLT (FORS2 and HAWK-I), ACS/F850LP, and Spitzer IRAC (ch1–ch4) photometry. Hayes et al. (2012) then measured a spectroscopic redshift of z = 2.082 for that galaxy, A2667-J1. Laporte et al. (2011) had already emphasized that z > 9 possibilities were excluded based on the significant (6.0σ) ACS detection. We concur with this conclusion after reanalyzing their photometry as provided in Hayes et al. (2012). Only by excluding the ACS data point and assuming no Bayesian redshift prior do z > 9 solutions have significant probability (Figure 12). If Spitzer IRAC ch3 and ch4 were not available (as is the case with MACS0647-JD) in addition to the ACS detection being unavailable, then the z > 9 likelihood would rise further, yet still be insignificant once the prior is included. The z > 9 likelihood is enhanced further, but only modestly, if the IRAC ch1 and ch2 uncertainties are inflated to yield only 3σ detections (as is the case for our JD2 IRAC ch2). In Figure 12, we compare the observed NIR photometry of A2667-J1 and MACS0647-JD1. Our multiband HST photometry of the latter yields significantly tighter upper limits on the non-detections and adds a key data point at 1.4 μm, resulting in a far greater z ∼ 11 likelihood even when accounting for the Bayesian prior which disfavors them (Figure 8).

Figure 12.

Figure 12. Left:comparison of near-infrared photometry of lensed J-dropouts MACS0647-JD1 (this work) and A2667-J1 (Laporte et al. 2011; Hayes et al. 2012). Also overplotted are three SED fits from Figure 2 of Hayes et al. (2012) to the photometry of A2667-J1 at its spectroscopic redshift z = 2.082. The Hayes et al. (2012) photometry plotted here is all from VLT (FORS2 and HAWK-I) except for the 6.0σ detection in HST/ACS F850LP, the upper diamond with a darker border at 0.91 μm. Right: photometric redshift probability distribution for A2667-J1 based on our reanalysis of the photometry provided in Hayes et al. (2012) with and without a Bayesian prior. The top panel uses all the available photometry. The middle panel omits the 6.0σ detection in ACS/F850LP. The bottom panel omits both ACS and Spitzer IRAC ch3 and ch4. The spectroscopic redshift z = 2.082 is indicated by the red vertical lines.

Standard image High-resolution image

We also applied our analysis methods to the photometry of other J-dropouts in the literature. Schaerer et al. (2007) showed A1835-#17 was fit well by a dusty (AV ∼ 3.6 mag) starburst at z ∼ 0.8. Dickinson et al. (2000) presented both z ≳ 2 and ≳ 10 solutions for HDF-N J123656.3+621322. And HUDF-JD2 (Mobasher et al. 2005) has since been shown likely to be a z ∼ 1.7 luminous infrared galaxy (Chary et al. 2007). For all three of these J-dropouts, our analysis yields low redshift (2 ≲ z ≲ 4 or very dusty z ≲ 1) solutions which are strongly preferred given our Bayesian prior.

5.3. Stars Or Brown Dwarfs?

MACS0647-JD1, JD2, and JD3 are most likely multiple images of a strongly lensed background galaxy, well behind the z = 0.591 cluster. Their observed colors are extremely rare in our multiband HST catalogs of 17 clusters observed to date (Figure 13). And they lie at or near the predicted positions of multiply lensed images (Section 6). It would be highly unlikely to find three foreground (unlensed) objects with such rare colors coincidentally at these positions. Still we consider here possible interlopers within the Galaxy, namely stars, brown dwarfs, and (in Section 5.4) solar system objects including Kuiper Belt objects and Oort cloud objects.

Figure 13.

Figure 13. Observed NIR colors of the J-dropouts (red diamonds with 1σ uncertainties) plotted against those observed for all other 20,746 CLASH sources brighter than 28th magnitude in both F160W and F140W and also observed in F125W (filled circles and density map). The horizontal axis gives the ratio of the F160W flux to the maximum flux in all bluer WFC3/IR filters. The vertical axis gives a similar flux ratio but for F140W. Three objects with colors similar to the J-dropouts appear to be spurious IR artifacts based on visual inspection, and we mark these with X's.

Standard image High-resolution image

JD1 and JD2 are perhaps marginally resolved with deconvolved FWHM ≲ 0farcs2 (0farcs3 observed with a 0farcs2 PSF). We performed two independent analyses attempting to determine whether the observed FWHM was large enough to definitively distinguish it from the stellar locus. These analyses reached different conclusions. Therefore, we turn to other lines of evidence to rule out stars and smaller objects.

Stars are relatively plentiful in this field as the Galactic latitude is relatively low (+25fdg1). We used the online tool TRILEGAL21 (Girardi et al. 2005) to calculate that we may expect ∼5 late-type M dwarfs of ∼26th magnitude or fainter within our FOV. However, the predicted colors are J125H160 ∼ 0.4, a break significantly weaker than that observed.

In Figure 14, observed and expected colors of stars and brown dwarfs (including types M, L, T, and Y) are plotted versus those observed for the J-dropouts. No dwarf color is able to reproduce the observed J-dropout colors. According to models, the colors of extremely cold (∼200 K) Y dwarfs come close to matching the red observed HST NIR colors, but these are expected to be significantly brighter in IRAC by up to 10 mag. The coldest dwarfs yet discovered are Y dwarfs including WISEP J1828+2650 at ∼300 K (Cushing et al. 2011; Kirkpatrick et al. 2012) with colors as plotted in Figure 14 from ground-based JH and WISE W2 4.6 μm observations.

Figure 14.

Figure 14. Observed colors in J125, H160, and at 4.5 μm plotted as black lines (95% confidence contours) versus those observed and predicted for stars and brown dwarfs. Colors derived from stellar spectra observed with IRTF (Cushing et al. 2005; Rayner et al. 2009) are plotted as open magenta circles for dwarfs and open black star symbols for giants and supergiants. Blue error bars are observed photometry (ground-based and WISE) for Y dwarfs (Kirkpatrick et al. 2012). Open black diamonds are post-AGB flare-ups with dust ejecta observed with 2MASS and WISE; the upper diamond is "Sakurai's object" (Duerbeck & Benetti 1996) and the lower diamond is WISE J1810−3305 (Gandhi et al. 2012). Simulated dwarf spectra from Hubeny & Burrows (2007) are plotted as filled circles colored as a function of temperature.

Standard image High-resolution image

Of the stellar spectra observed with the Infrared Telescope Facility (IRTF; Cushing et al. 2005; Rayner et al. 2009), the M8III red giant WX Piscium (IRAS 01037+1219; Ulrich et al. 1966; Decin et al. 2007) comes closest to matching the observed colors of MACS0647-JD. However, such a large, bright star (M ∼ −4) would need to be well outside the Galaxy (∼10 Mpc distant) to be observed at 26th magnitude in F160W (as argued in Dickinson et al. 2000 and Bouwens et al. 2011a for two previous z ∼ 10 candidates). If MACS0647-JD were within the Galaxy (out to ∼10 kpc), it would have an absolute magnitude of M ∼ +11 or fainter, consistent with a red dwarf in terms of magnitude but not color as shown above.

A few red giants in the post-AGB phase have been observed to flare up apparently as the result of a helium burning "thermal pulse," which triggers the ejection of a dust shell. "Sakurai's object" (Duerbeck & Benetti 1996) and WISE J1810−3305 (Gandhi et al. 2012) do have similar colors to our J-dropouts. But again these are very bright events, observed at 0.34 ± 0.01 and 2.74 ± 0.06 Jy, respectively, in the H band with the Two Micron All Sky Survey (2MASS). Believed to be a few kpc distant, they would need to be removed to several Mpc to be observed at ∼0.1 μJy as our J-dropouts. These are also rare events, lasting on the order of 100 years (but varying more rapidly), such that only these two have been reported to date. It would be highly unlikely to detect three such events occurring at the same time in the same HST field.

5.4. Solar System Objects?

If the J-dropouts were solar system objects, we would expect to have detected their proper motions in our six epochs of F160W/F140W imaging spanning 56 days (Figures 4 and 6). Only an Oort cloud object at ∼50,000 AU would be orbiting the Sun sufficiently slowly for us not to have detected its motion. But Oort cloud objects are expected to be significantly fainter (∼58th magnitude assuming a diameter of ∼20 km; Sheppard 2010) and have different colors than those observed here. Even a maximally large rocky planet with a diameter of ∼200,000 km and 100% albedo would only be ∼37th mag. Larger objects would be brown dwarfs, which we ruled out in Section 5.3 based on their colors. Oort cloud objects are also expected to have colors different than those observed for MACS0647-JD. Benecchi et al. (2011) measured HST/NICMOS F110W−F160W colors of 80 trans-Neptunian objects and found that they have HST F110W−F160W colors clustered around ∼0.6 with none redder than 0.8. To be observed at ∼26th mag, an Oort cloud object would have to be the size of a small moon. Even if such objects exist, they are almost certainly rare, or they would have been discovered by now. It would be highly improbable to discover the first three within a single HST FOV.

5.5. Lensed Supernova?

The J-dropouts do not exhibit any significant temporal variations in brightness either over our 56 days of observations (Figure 5), ruling out most transient phenomena such as supernovae. However, Type IIP supernovae can plateau to a roughly constant magnitude for ∼100 days (e.g., Arcavi et al. 2012) which we would observe to last ∼100(1 + z) days due to cosmic time dilation (Blondin et al. 2008). A Type Ia supernova at z ∼ 4 would be observed to have magnitudes and colors similar to those observed in HST for MACS0647-JD. A Type IIP plateau supernova would likely be bluer, but could perhaps match the observed HST colors. We would expect to detect it as a bright object at longer wavelengths, but the Spitzer images were obtained 1.5 years earlier, perhaps before the star went supernova.

This intriguing scenario is ruled out by the gravitational lens time delays due to MACSJ0647.7+7015, which we estimate to be on the order of 1–10 years between JD1 and JD2 and ∼50 years between these and JD3. Our subset of lens models (Section 6), which allow for z ∼ 4, also suggest that the intrinsic fluxes of all three images are roughly consistent with one another (at least to within a magnitude). Thus, the supernova plateau (several magnitudes brighter than the host galaxy) would have to have lasted 50/(1 + z) ∼ 10 years for us to observe it simultaneously in all three images. Even if this were somehow possible, we would then expect to have detected the earlier (least time-delayed) images with Spitzer.

5.6. Emission Line Galaxy?

In principle, an active galactic nucleus/starburst galaxy with an undetected continuum and two or more extremely strong nebular emission lines redshifted into F140W and F160W could reproduce the observed HST colors. The only plausible configuration is that Hβ (4861 Å) and [O iii] (4959 Å) are redshifted to within F140W and F160W, while [O iii] (5007 Å) is redshifted beyond F140W but within F160W. This is possible for the narrow redshift range 2.20 < z < 2.22. At this redshift, F140W and F160W have rest-frame widths of ∼1229 and ∼889 Å, respectively. In the case of JD, we measure the flux blueward of F140W to be −3.4 ± 3.7 nJy. We conservatively adopt <7.4 nJy as the 2σ upper limit on the continuum flux. Boosting the F140W flux to the observed ∼63 nJy would require emission lines with a combined equivalent width (EW) of >1229 × (63/7.4 − 1) ∼ 9525 Å. Similarly, increasing the F160W flux to the observed ∼162 nJy would require a combined EW >889 × (162/7.4 − 1) ∼ 19114 Å (∼10−15 erg s−1 cm−2). Thus, in our configuration assuming a continuum flux of 7.4 nJy:

Equation (1)

Equation (2)

Equation (3)

Equation (4)

Equation (5)

where the line ratio in Equation (4) is dictated by the relative transition probabilities.

An [O iii] (5007 Å) line with EW >9000 Å would be several times greater than the strongest emission lines observed to date, approaching EW ∼2000 Å, for [O iii] (5007 Å) and Hα (6563 Å) (Atek et al. 2011; van der Wel et al. 2011; Shim et al. 2011; Fumagalli et al. 2012). If we consider instead the 5σ continuum limit of 18.4 nJy, we would still require EW ∼3959 Å for [O iii] (5007 Å) and EW ∼1659 Å for Hβ. The strongest Hβ lines are robustly predicted to have EW ≲ 800 Å even for extremely young stellar populations, according to models like Starburst9922 (Leitherer et al. 1999, 2010). Finally, given such bright lines, one would also expect a significant contribution of [O ii] (3727 Å) in F110W and F125W, which is not observed.

6. GRAVITATIONAL LENS MODELING

We identified 24 strongly lensed images of nine background galaxies (Section 6.1), used them to derive lens models using three different methods (Section 6.2), and derived results including magnifications in (Section 6.3). Importantly, our lens models show that MACS0647-JD1, 2, and 3 are observed in relative positions as expected if they are strongly lensed multiple images of the same galaxy at z ∼ 11.

6.1. Strongly Lensed Multiple Images

Zitrin et al. (2011a) presented a preliminary gravitational lens mass model of the MACSJ0647.7+7015 cluster core based on pre-CLASH HST/ACS F555W+F814W imaging and their identifications of two background galaxies strongly lensed to produce multiple images. Based on CLASH imaging in 15 additional HST filters and additional lens modeling using the Zitrin et al. (2009) method, we have now identified seven more galaxies that have been multiply imaged, and we have measured robust photometric redshifts for all nine galaxies. This enables us to model the mass distribution (primarily dark matter) and thus lensing properties in greater detail.

In addition to the three images of MACS0647-JD at z ∼ 11, we observe 21 multiple images of eight background galaxies with photometric redshifts ranging from 2 ≲ z ≲ 6.5 (Table 6 and Figure 1). The candidate z ∼ 6.5 system is notable in its own right, consisting of two images observed at magnitudes of ∼26.3 and 27.3 in the NIR.

Table 6. Twenty-four Multiple Images of Nine Strongly Lensed Galaxies

    RA Decl. Photometric
  ID (J2000.0) (J2000.0) Redshifta
  1a 06 47 51.87 +70 15 20.9 2.2 ± 0.1
  b 06 47 48.54 +70 14 23.9 2.2 ± 0.1
  c 06 47 52.01 +70 14 53.8 2.2 ± 0.1
  2a 06 48 00.33 +70 15 00.7 4.7 ± 0.1
  b 06 48 00.33 +70 14 55.4 4.7 ± 0.1
  c 06 47 58.62 +70 14 21.8 4.7 ± 0.1
  3a 06 47 53.85 +70 14 36.2 3.1 ± 0.1
  b 06 47 53.41 +70 14 33.5 3.1 ± 0.1
  4a 06 47 42.75 +70 14 57.7 1.9 ± 0.1
  b 06 47 42.93 +70 14 44.5 1.9 ± 0.1
  c 06 47 45.37 +70 15 25.8 1.9 ± 0.1
  5a 06 47 41.04 +70 15 05.5 6.5 ± 0.15
  b 06 47 41.16 +70 14 34.4 6.5 ± 0.15
JD1 6a 06 47 55.74 +70 14 35.7 11.0 ± 0.3
JD2 b 06 47 53.11 +70 14 22.8 11.0 ± 0.3
JD3 c 06 47 55.45 +70 15 38.0 11.0 ± 0.3
  7a 06 47 50.91 +70 15 19.9 2.2 ± 0.15
  b 06 47 47.73 +70 14 23.2 2.2 ± 0.15
  c 06 47 48.69 +70 14 59.8 2.2 ± 0.15
  8a 06 47 48.61 +70 15 15.8 2.3 ± 0.1
  b 06 47 47.34 +70 15 12.5 2.3 ± 0.1
  9a 06 47 43.79 +70 15 00.4 5.9 ± 0.15
  b 06 47 44.98 +70 14 23.2 5.9 ± 0.15
  c 06 47 49.06 +70 15 37.7 5.9 ± 0.15

Note. aJoint likelihoods for each system with approximate 68% uncertainties.

Download table as:  ASCIITypeset image

For each of systems 3, 5, and 8, our lens models predict a third faint counterimage, but we cannot unambiguously identify it among several possible candidates. To be conservative, we do not include these uncertain identifications as constraints on our lens models. Inclusion of these candidates does not significantly affect the lens models. In Figure 1, we indicate the predicted locations of these counterimages.

6.2. Strong Lens Modeling Methods

Based on the observed positions of all 24 strongly lensed images, we model the mass of MACSJ0647.7+7015 using three different methods: the Zitrin et al. (2009) method, Lenstool (Kneib et al. 1993; Jullo et al. 2007), and LensPerfect (Coe et al. 2008, 2010). The first two methods are "parametric" in that they assume light traces mass, which has proved to be a very good prior. For example, some of the earliest efforts to model cluster lenses found that assigning masses to individual luminous cluster galaxies significantly improved the reproduction of strongly lensed images (Kassiola et al. 1992; Kneib et al. 1996). LensPerfect makes no assumptions about light tracing mass, exploring a broader range of mass models and perfectly reproducing the observed positions of all multiple images positions as input.

The Zitrin et al. (2009) mass model parameterization consists of three components: the cluster galaxies, a smooth cluster halo, and an external shear. Cluster galaxies were identified according to the "red sequence" in F814W–F555W color–magnitude space, then verified with photometric redshifts. Each cluster galaxy was modeled as a power-law density profile, its mass scaling with flux observed in F814W. In this work, we also allowed the masses of the two brightest central galaxies to vary independently. The cluster halo component was derived from this galaxy component by smoothing the latter with either a polynomial spline or a Gaussian. The two components were allowed to scale independently before being added. In all, there were eight free parameters: the mass scalings of the galaxy and halo components, the masses of the two brightest central galaxies, the power law of the galaxy density profiles, the degree of the polynomial spline or Gaussian smoothing width, and the amplitude and direction of the external shear.

The Lenstool model consisted of an ellipsoidal Navarro–Frenk–White halo (Navarro et al. 1996) plus cluster galaxies modeled as truncated pseudo-isothermal elliptical mass distributions (PIEMDs; Kassiola & Kovner 1993). We assumed core radii rcore = 300 pc and luminosity scaling relations as in Jullo et al. (2007): velocity dispersion σ0L1/4 and cutoff radius rcutL1/2, resulting in all galaxies having equal mass-to-light ratios. The normalizations of these two scaling relations were free parameters along with the cluster halo position (x, y), ellipticity (e, θ), scale radius, and concentration. There were eight free parameters in all.

6.3. Lens Model Results

Given the observed position of any one of the MACS0647-JD images, the Lenstool model accurately predicts and reproduces the positions of the other two images to an rms of 1farcs3, as minimized for z = 11.59+0.12−1.53. This scatter is consistent with ∼1farcs4 expected due to lensing by line-of-sight structures and variation in the mass-to-light ratio of cluster galaxies (Jullo et al. 2010; Host 2012).

The lens model and inferred redshift for MACS0647-JD do not change significantly if the MACS0647-JD images are excluded as constraints. In this case, the eight-parameter lens model remains well constrained by the 21 other multiple images which provide 26 constraints (see discussion below).

Using the Zitrin et al. (2009) method, two sets of acceptable models are found in different regions of the model parameter space. One set prefers z ∼ 11 for MACS0647-JD, while the other prefers z ∼ 3.5. The latter mass models have flatter profiles.

Our LensPerfect analysis confirms that this is a degeneracy between the MACS0647-JD redshift and the cluster mass distribution. A wide range of redshifts including z = 3.5, 11.0, and 11.6 is permitted given the strong lensing data. When fixing the redshift to any of these values, LensPerfect produces reasonable lens models (physical and with light approximately tracing mass) that perfectly reproduce all 24 observed positions of the nine strongly lensed galaxies. When the MACS0647-JD redshift is set lower, the cluster mass distribution is more spread out yielding a flatter profile. We confirm that the parametric models have similar differences, in part due to their parameterizations of the cluster mass distribution.

Including the redshift of MACS0647-JD, both parametric models have ≲ 9 free parameters. (This number should be considered a maximum given covariances among the parameters. See discussion in Andrae et al. 2010.) There are 30 constraints =2 × (24 − 9), where the constraints are the two coordinates (x,y) from each of the 24 multiple images minus the nine unknown source positions. Thus, each model has ≳ 21 degrees of freedom (30 constraints − 9 parameters). The Lenstool model reproduces all lensed image positions to an rms of 1farcs17. Assuming a scatter of 1farcs4 as explained above, this yields χ2 ≈ 24 × (1farcs17/1farcs4)2 = 16.8 with 21 degrees of freedom, for a reduced χ2ν ≲ 16.8/21 ≈ 0.8.

The Zitrin spline model with the flat mass profile preferring z ∼ 3.5 for MACS0647-JD obtains an rms of 1farcs1 for χ2 ≈ 15, also with 21 degrees of freedom, yielding χ2ν ≲ 0.7. When the models are forced to adopt z ∼ 11, the best fit is found with a Gaussian-smoothed model, yielding an rms of 2farcs9, for χ2 ≈ 103 and a reduced χ2ν ≲ 4.9.

Assuming MACS0647-JD is at z = 11.0, the Lenstool model (Figure 1) estimates magnifications of ∼8.4, 6.6, and 2.8 for JD1, JD2, and JD3, respectively, with uncertainties of ∼20%. These magnifications are consistent with an F160W =20 ± 4 nJy (28.2 ± 0.2 mag) source magnified by factors of ∼ 8.1, 6.8, and 2.1 to match the observed F160W fluxes within their ∼10% uncertainties (Table 3).

LensPerfect models perfectly reproduce all 24 observed image positions as input. These data constrain the mass distribution and profile well globally but only to a resolution of ∼20'', or ∼130 kpc, roughly the average separation between the strongly lensed images. This resolution is insufficient to obtain robust estimates of the image magnifications, which are strong functions of local mass gradients. Here these magnifications are better estimated by adopting priors of light tracing mass as in the parametric methods.

7. PHYSICAL PROPERTIES OF MACS0647-JD

We estimate the rest-frame UV luminosity and SFR (Section 7.1), infer a rough stellar mass (Section 7.2), and place upper limits on the physical size (Section 7.4). In Section 7.3, we explore SED parameter degeneracies and place modest constraints on other properties of MACS0647-JD.

7.1. Rest-frame UV Luminosity; Star Formation Rate

As described above, we estimate the intrinsic (unlensed) magnitude of MACS0647-JD to be 28.2 ± 0.2 in F160W. At z ∼ 11, the Lymanα break falls within the F160W bandpass, attenuating the observed flux; the rest-frame UV (0.16 μm) continuum flux is ∼0.25 mag brighter. To convert this flux to rest-frame UV absolute magnitude MUV, AB, we add three terms. Most significantly, the magnitude is brighter by the distance modulus ∼50.3. The flux per unit frequency is also dimmer by a factor of 1 + z (∼2.7 mag) simply because the rest frame samples a higher frequency. We also derive a small color term of ∼0.1 mag as we switch from the blueshifted F160W filter (∼0.13 μm) to a tophat filter centered on 0.16 μm for comparison with previous measurements. Combining these terms, we find MUV ∼ −19.5. Converting this to UV luminosity at a distance of 10 pc, we find LUV ∼ 2.8 × 1028 erg s−1 Hz−1.

This rest-frame UV luminosity can be generated by an SFR of ∼4 M yr−1 assuming a Salpeter (1955) initial mass function (IMF) with mass limits 0.1–100 M (Kennicutt 1998). The ionizing efficiency could be increased by a factor of ∼1.8 for a Chabrier (2003) IMF or ∼3 for a top-heavy IMF (Bruzual & Charlot 2003; Schaerer 2003; Stiavelli et al. 2004), with the latter generally realized in simulations of high-redshift galaxies (e.g., Abel et al. 2002; Bromm et al. 2002). Stellar rotations may also increase this efficiency by a factor of ∼2–5 (Levesque et al. 2012). Given these and other uncertainties, an SFR of ∼1 M yr−1 or lower could generate LUV derived for MACS0647-JD.

This luminosity LUV is ∼L* or perhaps a few times brighter than L*, depending on which extrapolation we assume to estimate this characteristic luminosity at z ∼ 11 (Bouwens et al. 2008, 2011a; Robertson et al. 2010; Bradley et al. 2012b). Based on the estimated luminosity function (Section 8) and our lens magnification model, we find that a z ∼ 11 galaxy lensed to 26th magnitude does in fact have an ∼80% likelihood of being intrinsically brighter than L*.

7.2. Stellar Mass

Meaningful observational constraints on the stellar mass of MACS0647-JD would require rest-frame optical photometry redward of 0.4 μm (beyond the Balmer and 4000 Å breaks), or 4.8 μm observed. However, we may infer a stellar mass estimate as follows.

Specific star formation rates (sSFRs) of 2–3 Gyr−1 (that is, 2–3 M formed per year per 109M total stellar mass) are observed on average for galaxies over a remarkably broad range of redshifts (2 ≲ z ≲ 7; see, e.g., Stark et al. 2009; González et al. 2010; McLure et al. 2011; Bouwens et al. 2012b). If this "plateau" continues out to z ∼ 11 and MACS0647-JD has a typical sSFR of ∼2 or 3 Gyr−1, then this combined with our derived SFR would imply a stellar mass on the order of ∼109M. The average stellar mass of L* galaxies was ∼109M at z ∼ 7–8 and rose to a few times 1010M by z ∼ 2 (González et al. 2010; Labbé et al. 2010; Finkelstein et al. 2010). Based on this trend, we may expect the average stellar mass of L* galaxies at z ∼ 11 to be less than 109M. If this holds true for MACS0647-JD, it would suggest a higher sSFR, more in line with expectations from simulations that are in some tension (but perhaps only mild tension) with the observed sSFR plateau (e.g., Khochfar & Silk 2011; Davé et al. 2011; Weinmann et al. 2011; Behroozi et al. 2012).

We conclude that the stellar mass of MACS0647-JD is most likely on the order of 108–109M. The lower end of this mass range is more compatible with expectations from cosmological simulations and galaxy formation models. Based on simulations (e.g., Klypin et al. 2011), we may expect to find a dark matter halo of virial mass ∼1010M or so within our search volume of a few times 1000 Mpc3. This would comfortably host a galaxy of stellar mass 108M or so. A stellar mass of 109M would be larger than expected.

7.3. Other SED-based Constraints

In Section 5.1, we explored a broad range of galaxy properties to rule out lower redshift interlopers with a high degree of confidence. In this section, we quantify the degeneracies in those parameters (see also Pirzkal et al. 2012; Pforr et al. 2012) along with the modest constraints we obtain on them, assuming that MACS0647-JD is indeed at high redshift.

We modeled the observed photometry using the FSPS models from Conroy et al. (2009) and Conroy & Gunn (2010). As described in Section 5.1, we convolved their SSP models with a range of SFHs, including a single early burst, exponentially declining, constant, and exponentially rising. Though little dust is expected at these redshifts (e.g., Bouwens et al. 2012b), we tested this assumption explicitly by adding a variable degree of Calzetti (2001) dust extinction. We did not add nebular emission lines. Only [O ii] (3727 Å) redshifted to ∼4.5 μm might be a significant concern, though we only modestly detect a 4.5 μm flux in JD2. We assume a flat linear prior in age up to the age of the universe at each redshift.

Our constraints on redshift, age, SFH (τ), metallicity, and extinction are shown in Figure 15 based on the integrated photometry of all three images. We confirm that significant dust extinction is unlikely, with a rest-frame V-band extinction of AV < 0.25 mag (95% confidence). Constraints on the other parameters are modest, with slight preferences for low metallicity and rising or continuous SFH in a maximally old galaxy (limited to ≲400 Myr by the age of the universe).

Figure 15.

Figure 15. Constraints on redshift, age, star formation history (exponential scale factor τ), metallicity (where Z ≈ 0.019), and dust extinction (in V-band magnitudes). This is based on fitting the HST+Spitzer photometry integrated over the three images to the flexible stellar population synthesis models of Conroy et al. (2009) and Conroy & Gunn (2010) convolved with an exponential star formation history ∝exp (− t/τ) and with Calzetti (2001) dust extinction added (RV = 4.05). The star formation history was either decaying (τ/age = 0.3, 1, 3), rising (τ/age = −0.3, −1, −3), constant star formation rate (CSF; τ = ), or occurring in a single burst at "birth" (simple stellar population, SSP; τ = 0), with equal likelihoods for all eight possibilities. We assumed a flat linear prior for age (though it is plotted as log here) up to the age of the universe (as a function of redshift). Within each panel, the marginalized likelihood is plotted as a color map (scaled linearly) and confidence contours of 68% and 95% are overplotted as blue and white lines, respectively.

Standard image High-resolution image

7.4. Physical Size

After correcting for the observed PSF, JD1 and JD2 have observed half-light radii ≲ 0farcs1, or delensed ≲ 0farcs03 (≲ 0.1 kpc). Based on extrapolations from lower redshifts (Oesch et al. 2010a; Mosleh et al. 2012), we expect an average half-light radius of roughly 〈r1/2〉 ∼ 0.26 kpc for a galaxy with a stellar mass of ∼109M. MACS0647-JD is likely somewhat less massive (Section 7.2). Scatter in galaxy sizes is large: ∼0.3 dex, or a factor of ∼2, as found for well-studied samples at 3 ≲ z ≲ 5 (Ferguson et al. 2004). So our derived r1/2 ≲ 0.1 kpc is on the small side, though not beyond expectations (see Figure 16). Furthermore, we may only be detecting a bright star-forming knot in a larger galaxy. These knots typically have sizes of ∼0.1 kpc as observed in high-redshift (5 < z < 8) lensed galaxies (Franx et al. 1997; Bradley et al. 2008, 2012a).

Figure 16.

Figure 16. Upper limit on MACS0647-JD half-light radius (deconvolved and delensed) compared to observed mean galaxy sizes from Oesch et al. (2010a) and Mosleh et al. (2012) extrapolated to z ∼ 11. The intrinsic scatter in galaxy sizes is a factor of ∼2 as measured for well-studied samples at z ∼ 4 (Ferguson et al. 2004).

Standard image High-resolution image

8. NUMBER COUNT AND STAR FORMATION RATE DENSITIES AT z ≳ 10

Our discovery of a 26th magnitude z ≈ 11 candidate in 17 cluster lensing fields (∼78 arcmin2) agrees with rough expectations given observed luminosity functions (LFs) at lower redshifts extrapolated to higher redshift and propagated through our lens models.

The LF at z ∼ 8 has recently been robustly constrained at both faint (Bouwens et al. 2011b) and bright (Bradley et al. 2012b) magnitudes (see also Oesch et al. 2012b). Based on the combined HST data from the HUDF09, WFC3 Early Release Science Program (ERS), CANDELS, and BoRG pure parallel fields, Bradley et al. (2012b) find that the z ∼ 8 LF follows a Schechter (1976) function with a normalization ϕ* = 4.3+3.5−2.1 × 10−4 Mpc−3, characteristic rest-frame UV absolute magnitude M*UV = −20.26+0.29−0.34, and faint end slope α = −1.98+0.23−0.22. These data and LF with uncertainties are plotted in blue in Figure 17.

Figure 17.

Figure 17. Observed number counts for both lensed (squares) and field (circles/arrows) galaxies at z ≈ 8, 9.6, and 10.8 as a function of observed F160W AB magnitude. CLASH discoveries of candidates at z ≈ 10.8 (MACS0647-JD; this work) and z ≈ 9.6 (MACS1149-JD; Zheng et al. 2012) in ∼78 arcmin2 (17 cluster lensing fields) are plotted as the red and yellow squares, respectively, in units of counts per unit magnitude, redshift, and square arcminute. The blue square indicates the single robust z ≈ 7.5 candidate (A1689-zD1; Bradley et al. 2008) identified in 11 non-CLASH cluster fields covering 21 arcmin2 (Bouwens et al. 2009). Cyan and orange circles represent counts in unlensed fields at z ∼ 8 and 10.3, respectively, from the BoRG survey (Bradley et al. 2012b) as well as CANDELS, ERS, and HUDF09 (Oesch et al. 2012a) including UDFj-39546284 (Bouwens et al. 2011a). The arrows give both 1σ and 90% confidence upper limits from Oesch et al. (2012a). The bottom dashed curves are expected counts with uncertainties from the Bradley et al. (2012b) z ∼ 8 luminosity function and extrapolated to z ∼ 9.6 and 10.8 assuming an evolution of dM*/dz = 0.30, similar to that found in, e.g., Bouwens et al. (2008, 2012a; also see Figure 18). This yields F160W m* ∼ 27.0, 27.7, and 28.3 at these redshifts, plotted as a small star along each curve. We then simulate the lensing of these expected counts in our CLASH WFC3/IR survey area using mass models of the 17 clusters and excluding the area (∼17%) covered by foreground objects. These are given as the upper solid curves, which are consistent with the CLASH discoveries (red and yellow squares).

Standard image High-resolution image

We then assumed an evolving LF in which M* varies linearly with redshift while ϕ* and α are fixed. Previous work (e.g., Bouwens et al. 2008, 2012b) has shown that dM*/dz ∼ 0.3 yields good agreement to data at 4 ≲ z ≲ 8 (see also Figure 18). So we first assumed that this holds out to z ∼ 11.

Figure 18.

Figure 18. CLASH constraints on the cosmic star formation rate density at z ∼ 9.6 and 10.8 compared to previous estimates from 0 ≲ z ≲ 10 as compiled and corrected for dust extinction by Bouwens et al. (2012b). We adopt the z ∼ 8 LF from Bradley et al. (2012b) and assume M* evolves with linearly redshift. MACS0647-JD constrains dM*/dz = 0.33+0.35−0.15, while MACS1149-JD (Zheng et al. 2012) constrains dM*/dz = 0.62+0.62−0.24, yielding a joint constraint of dM*/dz = 0.44+0.29−0.15. We integrate these LFs down to 0.05 L*(z = 3), or MUV, 1400 ≈ −17.7 AB mag, to obtain rest-frame UV luminosity densities (magenta and green points with the gray shaded region giving the joint constraint). The yellow point is an alternative estimate of the z ∼ 9.6 SFRD from Zheng et al. (2012) rescaled by 12/17 to account for the larger search volume behind 17 clusters. An extrapolation from z ∼ 8 assuming dM*/dz = 0.30 is shown as the dashed blue line. Points along the bottom blue curve are observed while the upper orange curve is corrected for extinction by Bouwens et al. (2012b). We assume that a star formation rate of one solar mass per year (left axis) produces a UV (0.14 μm) luminosity of 8 × 1027 erg s−1 Hz−1 (right axis), as from a Salpeter (1955) IMF truncated between 0.1 and 125 M. A Chabrier (2003) IMF would yield lower SFRD by a factor of ∼1.8. Previous data are from Schiminovich et al. (2005), Oesch et al. (2010b), Reddy & Steidel (2009), Bouwens et al. (2007, 2011a), and Oesch et al. (2012a) as described in Bouwens et al. (2012b, see their Figure 19).

Standard image High-resolution image

We convolved this evolving LF through our lens models for the 17 CLASH clusters studied in this work, accounting for both the brightening of sources and the reduction in search area due to the magnifications. Some of these models have been published (Zitrin et al. 2011a, 2012b; Coe et al. 2012; Zheng et al. 2012; and this work) and the rest will be detailed in upcoming work. We applied masks to the lensed regions, restricting our search area to the WFC3/IR observations and excluding regions covered by foreground objects (∼17% of the total area) according to our SExtractor segmentation maps. Our total search area for 17 clusters is ∼78 arcmin2 (as observed and lensed).

The resulting expected lensed number counts for z ∼ 8, 9.6, and 10.8 are plotted in Figure 17. These are consistent with CLASH observed ∼26th magnitude number counts at z ∼ 10.8 (this work) as well as z ∼ 9.6 (MACS1149-JD; Zheng et al. 2012).

However, the observed z ∼ 10 number counts in the field are a factor of ∼4 lower than expected based on an evolving LF such as this, suggesting a sharp drop-off in star formation density at these redshifts (Bouwens et al. 2011a; Oesch et al. 2012a). To test for such a drop-off, we allowed for more (or less) rapid evolution in M* at z > 8, still as extrapolated from the Bradley et al. (2012b) LF at z ∼ 8. We found that MACS0647-JD constrains dM*/dz = 0.33+0.35−0.15, and MACS1149-JD constrains dM*/dz = 0.62+0.62−0.24, yielding a joint constraint of dM*/dz = 0.44+0.29−0.15.

We then integrated these LFs (with uncertainties) down to 0.05 L*z = 3 (MUV = −17.7) to obtain SFRDs that can be compared directly to previously published estimates (Figure 18). We found SFRD =(1.4+1.3−1.1) × 10−3M yr−1 Mpc−3 at z ∼ 11 and (1.1+1.0−0.9) × 10−3M yr−1 Mpc−3 at z ∼ 9.6. For consistency with the other measurements derived and compiled by Bouwens et al. (2012b), we assumed that an SFR of one solar mass per year produces a UV (0.14 μm) luminosity of 8 × 1027 erg s−1 Hz−1, as from a Salpeter (1955) IMF truncated between 0.1 and 125 M.

Our SFRD estimate at z ∼ 9.6 is consistent with an independent estimate of (1.8+4.3−1.1) × 10−3M yr−1 Mpc−3 based on MACS1149-JD presented in Zheng et al. (2012). Only 12 clusters were searched in that work. To account for the larger volume now searched without additional ∼26th magnitude z ∼ 10 candidates, we rescaled this estimate by a factor of 12/17, yielding (1.3+3.0−0.8) × 10−3M yr−1 Mpc−3. This value is plotted in Figure 18.

Our joint constraint from both galaxies on SFRD at z > 8 is also plotted as the gray shaded region in Figure 18. Given the large uncertainties, we cannot confidently discriminate between the trend observed at lower redshifts (dM*/dz ∼ 0.30) and the sharp drop-off suggested by the paucity of z ∼ 10 galaxies detected in the field.

The dominant uncertainty in our SFRD measurement is the Poisson uncertainty of ∼0.7 dex (a factor of ∼5) given our single detection. Subdominant uncertainties include cosmic variance in 17 independent fields (e.g., Trenti & Stiavelli 2008), uncertainties in the lens models (e.g., Bradač et al. 2009), and incompleteness in our ability to detect ∼26th magnitude galaxies (25.4 < F160W < 26.4) after masking out areas corresponding to foreground galaxies. Accounting for overlapping regions in the source plane (multiple images) would slightly increase our derived SFRD.

9. CONCLUSIONS

We have discovered a candidate for the earliest galaxy yet known at z = 10.7+0.6−0.4 (95% CLs) when the universe was 427−30+21 Myr old. The galaxy is strongly lensed by MACSJ0647.7+7015 producing three magnified images, including two observed at ∼26th magnitude AB (∼162 and 136 nJy) in HST WFC3/IR F160W imaging (∼1.4–1.7 μm). The intrinsic (delensed) magnitude is ∼20 nJy (∼28.2 mag) based on our lens models for a galaxy at z ∼ 11. The unattenuated continuum is ∼0.25 mag brighter (lensed ∼0.2 μJy in the brightest image).

Spitzer/IRAC upper limits further support the high-redshift solution. We tested a broad range of lower redshift interlopers, including some previously published as high-redshift candidates, and showed that none is able to reproduce the observed HST+Spitzer photometry. Galaxies of known types at z < 9.5 are formally ruled out at 7.2σ, with the next most likely alternative being an early type and/or dusty galaxy at z ∼ 2.5. Our Bayesian priors assume that a z ∼ 2 galaxy is over 80 times more likely than a z ∼ 11 galaxy, making our z ∼ 11 claim more conservative than if such a prior were neglected. For MACS0647-JD to be at z < 9.5, it appears that it would have to belong to a new class of objects not yet observed.

The discoveries of both MACS0647-JD at z ∼ 10.7 (this work) and MACS1149-JD at z ∼ 9.6 (Zheng et al. 2012) in CLASH observations of 17 clusters to date are consistent with extrapolations of luminosity functions observed at lower redshifts (Bouwens et al. 2012a; Bradley et al. 2012b), assuming a linear evolution of M* with redshift, and as convolved through our lens models. If these extrapolations are valid to z ≳ 10, then low-luminosity galaxies could have reionized the universe (Bouwens et al. 2012a; Kuhlen & Faucher-Giguère 2012). However, these extrapolations are in conflict with the paucity of z ∼ 10 galaxies discovered in unlensed fields, suggesting a rapid buildup in star formation density between z ∼ 10 and 8 (Bouwens et al. 2011a; Oesch et al. 2012a). Our data do not allow us to discriminate between these two scenarios, given the large uncertainties dominated by the Poisson statistics of these two detections.

MACS0647-JD is likely close to the characteristic luminosity for a z ∼ 11 galaxy (∼1–3 L*), producing a few M yr−1, with an inferred stellar mass of roughly ∼108–109M, and a half-light radius of ≲ 100 pc (deconvolved and delensed). This is smaller by a factor of a few than the average size expected as extrapolated from lower redshifts (Oesch et al. 2010a; Mosleh et al. 2012) with an intrinsic scatter in sizes of perhaps a factor of ∼2 (Ferguson et al. 2004). The size of ≲ 100 pc is similar to the sizes of bright knots observed in lensed galaxies at 5 < z < 8 (Franx et al. 1997; Bradley et al. 2008, 2012a).

Thanks to the magnified views afforded us by gravitational lensing, this galaxy may be studied further with existing and future large telescopes, including the James Webb Space Telescope (JWST; Gardner et al. 2006; Stiavelli 2009) and extremely large ground-based telescopes constructed in the northern hemisphere. Unfortunately due to its high declination of +70, it is not accessible to southern telescopes such as the Atacama Large Millimeter Array (ALMA).

This z ∼ 11 candidate approaches the redshift limit of galaxies detectable by Hubble's WFC3/IR camera. Galaxies at z > 12 would drop out completely of the F140W filter and to an increasing degree in F160W until z ∼ 13, when all the light redward of Lyman-α would be redshifted beyond the observable wavelength range.

We thank Nor Pirzkal for useful discussions regarding the WFC3/IR backgrounds and contributing the single emission line galaxy scenario. We thank the referee for useful comments which helped us improve the manuscript.

The CLASH Multi-Cycle Treasury Program is based on observations made with the NASA/ESA Hubble Space Telescope. The Space Telescope Science Institute is operated by the Association of Universities for Research in Astronomy, Inc. under NASA contract NAS 5-26555. ACS was developed under NASA contract NAS 5-32864.

This work is also based in part on observations made with the Spitzer Space Telescope, which is operated by the Jet Propulsion Laboratory, California Institute of Technology under a contract with NASA.

A.Z. is supported by contract research Internationale Spitzenforschung II-1 of the Baden-Württemberg Stiftung. We wish to acknowledge the support of the Michigan State University High Performance Computing Center and the Institute for Cyber-Enabled Research. S.S. was supported by the DFG cluster of excellence Origin and Structure of the Universe. The work of L.A.M. was carried out at Jet Propulsion Laboratory, California Institute of Technology, under a contract with NASA. The Dark Cosmology Centre is funded by the DNRF.

Facilities: HST (WFC3, ACS) - Hubble Space Telescope satellite, Spitzer (IRAC) - Spitzer Space Telescope satellite

Footnotes

Please wait… references are loading.
10.1088/0004-637X/762/1/32