This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.
Brought to you by:

The Near-infrared Transmission Spectra of TRAPPIST-1 Planets b, c, d, e, f, and g and Stellar Contamination in Multi-epoch Transit Spectra

, , , and

Published 2018 October 4 © 2018. The American Astronomical Society. All rights reserved.
, , Citation Zhanbo Zhang et al 2018 AJ 156 178 DOI 10.3847/1538-3881/aade4f

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

1538-3881/156/4/178

Abstract

The seven approximately Earth-sized transiting planets in the TRAPPIST-1 system provide a unique opportunity to explore habitable- and nonhabitable-zone small planets within the same system. Its habitable-zone exoplanets—due to their favorable transit depths—are also worlds for which atmospheric transmission spectroscopy is within reach with the Hubble Space Telescope (HST) and James Webb Space Telescope (JWST). We present here an independent reduction and analysis of two HST Wide Field Camera 3 (WFC3) near-infrared transit spectroscopy data sets for six planets (b through g). Utilizing our physically motivated detector charge-trap correction and a custom cosmic-ray correction routine, we confirm the general shape of the transmission spectra presented by de Wit et al. Our data reduction approach leads to a 25% increase in the usable data and reduces the risk of confusing astrophysical brightness variations (e.g., flares) with instrumental systematics. No prominent absorption features are detected in any individual planet's transmission spectra; by contrast, the combined spectrum of the planets shows a suggestive decrease around 1.4 μm similar to an inverted water absorption feature. Including transit depths from K2, the SPECULOOS-South Observatory, and Spitzer, we find that the complete transmission spectrum is fully consistent with stellar contamination owing to the transit light source effect. These spectra demonstrate how stellar contamination can overwhelm planetary absorption features in low-resolution exoplanet transit spectra obtained by HST and JWST and also highlight the challenges in combining multi-epoch observations for planets around rapidly rotating spotted stars.

Export citation and abstract BibTeX RIS

1. Introduction

The TRAPPIST-1 system (2MASSI J23062928–0502285, 2MUCD 12171) hosts seven known nearly Earth-sized transiting exoplanets (Gillon et al. 2016, 2017). Four of these planets (b, c, d, and e) are in or near the liquid water habitable zone (e.g., Alberti et al. 2017; Wolf 2017), although the stellar activity and ultraviolet radiation of the star (e.g., Bourrier et al. 2017b; O'Malley-James & Kaltenegger 2017), as well as the formation and initial volatile budget (e.g., Ciesla et al. 2015; Ormel et al. 2017) and subsequent volatile loss of the planets (e.g., Bourrier et al. 2017a), remain concerns for their habitability. The TRAPPIST-1 host star—an M8-type ultracool dwarf at the stellar/substellar boundary—has a very small radius (R* ∼ 1.14 ± 0.04 RJup = 0.117 ± 0.004 R; Filippazzo et al. 2015), leading to exceptionally deep transit depths (0.3%–0.8%) for its small planets. These favorable transit depths, in combination with the relatively bright host star (V = 18.8, but J = 11.35) and frequent transits (planet orbital periods between 1.6 and 15 days; Gillon et al. 2017), make the TRAPPIST-1 planetary system exceptionally well suited for follow-up infrared transit spectroscopy. Of particular importance for such observations are photometrically very stable and sensitive infrared space telescopes: the Hubble Space Telescope (HST) and the James Webb Space Telescope (JWST). High-precision spectroscopy with these facilities may be able to probe atmospheric composition (gas-phase absorbers: O3, scattering, and particulates) in the inner TRAPPIST-1 planets, including those in the habitable zone (Barstow & Irwin 2016; Morley et al. 2017).

These studies find that the most prominent absorption features that may be present and detectable in these atmospheres are water, ozone, and carbon dioxide absorption bands. While the detection of one or more of the features could distinguish between Earth-, Venus-, or Titan-like atmospheres (Morley et al. 2017), even the lack of features may be interesting: stringent nondetections of absorption features could be interpreted as a lack of stratospheric water (e.g., Madhusudhan et al. 2014), the veiling effect of high-altitude hazes (Kreidberg et al. 2014; Yang et al. 2015), or the lack of a significant atmosphere. Recent ambitious HST transmission spectroscopy programs showed encouraging results demonstrating that instrumental systematics can be successfully corrected even for very long combined integrations (i.e., very low photon noise; e.g., Kreidberg et al. 2014; Stevenson et al. 2014; Morley et al. 2017).

With the JWST guaranteed time observations and early-release science observations determined and the community working on the JWST Cycle-1 open time proposals, the assessment of the feasibility of the photon noise–limited transit spectroscopy with HST and JWST is of paramount importance for the field.

Over the past months, two new results have impacted extrapolations from past HST programs toward future, even more ambitious HST and JWST programs. Zhou et al. (2017) demonstrated a solid-state physics–based correction algorithm for the HST charge-trapping processes that introduce the so-called "ramp effect," the dominant HST systematics in time-resolved observations. This model offers a more efficient use of the telescope and enables observers to correct for different systematics occurring in different orbits, in contrast to the previously widely utilized empirical correction that assumed identical systematics in all orbits beyond the first. This model has recently begun to be applied in HST/WFC3 transmission spectroscopic studies (e.g., Spake et al. 2018).

However, a new study by Rackham et al. (2018) highlighted a major astrophysical noise source. These authors provided a comprehensive exploration of the impact of stellar heterogeneity on high-precision near-infrared spectroscopy of M-dwarf transiting planets and showed that this method may ultimately be limited by the fact that heterogeneous stellar photospheres introduce a spectral contamination into the transmission spectra (i.e., the "transit light source effect"). In fact, the study by Rackham et al. (2017) showed that repeatable, high-quality visual spectra of the sub-Neptune GJ 1214b (also orbiting an M-dwarf host star) are only consistent with stellar contamination, not with planetary features, providing the first clear example of the effect that may also impact other high-quality exoplanet transmission spectra (Apai et al. 2017, Pinhas et al. 2018, in press).

Therefore, the central questions that emerge on the atmospheric characterization of TRAPPIST-1 (and similar, to-be-discovered M-dwarf habitable planet systems) and could be addressed, at least partly, before JWST are as follows. What are the compositions of the individual atmospheres, and is there evidence for differences in the seven atmospheres? What effects will limit the precision with which HST and JWST will be able to probe these atmospheres?

In this study, we present an independent reduction and analysis of two recently obtained HST infrared spectroscopic data sets published in de Wit et al. (2016, 2018). Our reduction builds on the new and physically motivated detector charge-trap correction (Zhou et al. 2017), which provides an improved correction for the primary systematics affecting HST high-precision spectroscopy. In addition, we provide a comprehensive assessment of the potential impact of stellar activity on observations and stellar spectral contamination of the transmission spectra due to the heterogeneous photosphere of TRAPPIST-1.

2. Observations

The data presented in this study were obtained in two HST Wide Field Camera 3 (WFC3) programs (GO-14500 and GO-14873, PI: de Wit) targeting the TRAPPIST-1 system. In the following, we refer to the two programs as Program 1 and Program 2, respectively. Program 1 consists of one visit, executed on 2016 May 4, and covers the overlapping transits of planets TRAPPIST-1 b and c. The results were initially published in de Wit et al. (2016). Program 2 consists of four visits, executed between 2016 December and 2017 January, and covers the transits of planets TRAPPIST-1 d, e, f, and g (de Wit et al. 2018). In the two programs, the six inner planets (TRAPPIST-1 b–g) have been observed at least once during transit. In addition, the observations include two overlapping transits of the planet pairs b and c and e and g. For convenience, we label the seven transits in chronological order as Transit 1, 2, 3, 4, 5, 6, and 7, as listed in Table 1. As normal for HST observations, the phase coverages of the transit light curves were limited by Earth occultations.

Table 1.  Observation Log

P ID Visit No. Obs. Date Planet Transit No. of Orbits No. of Exposuresa Note
14500 0 2016 May 04 b, c 1, 2 4 74  
14873 1 2016 Dec 04 d 3 7 114 Only last three orbits not affected by CRs
  2 2016 Dec 29 g, e 4, 5 5 84 No apparent SAA influence
  3 2017 Jan 09 f 6 6 93 First two orbits discarded with a possible transit
  4 2017 Jan 10 e 7 5 69 Last orbits discarded including a possible transit

Note.

aThe number of exposures excludes those that were discarded due to guiding failure or compromised data quality.

Download table as:  ASCIITypeset image

All transmission spectra were obtained using the WFC3 infrared G141 grism, which covers wavelengths from 1.1 to 1.7 μm. The observations utilized state-of-the-art strategies for WFC3 IR transit spectroscopy, including spatial scanning (to avoid saturation and increasing observing efficiency), detector subarraying (to avoid memory saturation), and the acquisition of a direct image at the beginning of each orbit to provide an accurate wavelength calibration for the slitless spectra. For each spectroscopic image, the exposure time was 112 s, and the scanning rate was 0farcs027 s−1, yielding a scanning length on the detector of 3farcs02, or ∼25 pixels. Spatial scans were conducted in the bidirectional scanning mode in Program 1, while Program 2 adopted the single-directional scanning mode, resulting in slightly different cadences (151 s for Program 1, 176 s for Program 2) for observations in the two programs. Table 1 lists the key details of the observations.

The first, third, and fourth visits of Program 2 were severely affected by cosmic rays (CRs) due to HST's passage through the South Atlantic Anomaly (SAA7 ), which in several of these orbits also negatively affected the HST guiding performance, resulting in unrecoverable data. Due to particularly severe CR damage, we had to discard the following data subsets: Orbits 1–4 in Visit 1, Orbits 1 and 2 in Visit 2, and Orbit 5 in Visit 4.

3. Data Reduction

We downloaded the data from both programs from the Mikulski Archive for Space Telescopes. Our data reduction procedure started from the ima frames produced by the CalWFC3 pipeline version 3.4. The ima frames are bias, dark, and nonlinearity corrected and include all nondestructive reads. Each spectroscopic ima file contains seven readouts. We discarded the "zeroth" read because the detector was reset during this read (Deustua et al. 2016). Following Deming et al. (2013), we formed subexposures by differencing adjacent reads. There were seven major steps in our data reduction procedure: (i) wavelength calibration, (ii) flat-field correction, (iii) CR removal, (iv) image registration, (v) light-curve extraction, (vi) ramp-effect correction, and (vii) light-curve fitting and transmission spectra extraction. Steps (i)–(iv) were applied to individual subexposures, while the subsequent steps were applied to the combined data. In the following, we review these key steps.

3.1. Wavelength Calibration and Flat-field Correction

We derived wavelength solutions based on the position of the target in the direct images. We adopted up-to-date wavelength calibration coefficients from Wilkins et al. (2014). The centroids of the target point source in the direct images were determined by fitting two-dimensional Gaussian profiles. In Program 1, the direct-image frame had a different aperture from the spectroscopic frame. For those observations, we adjusted the direct-image coordinates accordingly.

We adopted a third-order polynomial function in wavelength for the flat-field correction. For each visit, we separately calculated a wavelength calibration–dependent flat-field correction, i.e., differences introduced primarily by the variations in the position of the target on the direct images. We next applied the correction to individual nondestructive reads. While applying the flat-field correction, we also identified and corrected for low data quality pixels: pixels in the flat-field frame that deviated more than 20% from unity were flagged (most of these also had nonzero data quality flags, i.e., were also flagged by the CalWFC3 pipeline). We also flagged any additional pixels identified by the CalWFC3 pipeline as bad pixels, hot pixels, pixels with unstable response, or bad or uncertain flat values.8 Finally, we replaced the flagged pixels by interpolating over the neighboring unflagged pixels.

3.2. CR Correction

Correcting for CRs was a crucial step in our reduction, particularly for the orbits heavily affected by SAA passage. We identified seven orbits that suffered from SAA passage using the fits header flag and the number of CR hits. On average, there were at least 20 visually apparent CR hits per frame in these orbits. We developed a suite of custom algorithms to identify and remove the CRs and evaluate the efficiency of the CR corrections.

First, we applied iterative bidirectional median filtering to each nondestructive read and identified pixels as CR-affected if they exceeded the median-filtered image level by a threshold of 11σ (determined through the CR removal assessment described below). For each iteration, pixels that were previously marked as CRs were excluded from median filter calculations. We typically repeated this iterative filtering procedure at least three times (see discussion below for the connection between the algorithm's performance and the number of iterations used).

For each identified CR, we replaced the CR-affected pixel value with the weighted average (freplace) of the same pixel in the exposures preceding (f−1) and following (f1) the image, as described by the relation

Equation (1)

in which t0 refers to the time of the CR-affected exposure and the subscripts ±1 denote the two adjacent (in time) exposures. The weights for the averaging are effectively the inverse of the time difference between the exposures. Program 1 adopted a bidirectional scanning mode; i.e., forward and reverse scanning directions were applied alternately. In this case, the preceding and following exposures had slightly different scanned image regions due to the upstream/downstream effect (McCullough & Mackenty 2012). To account for this effect, we corrected the CR hits separately for images taken with different scanning directions. Figure 1 shows example images before and after CR removal.

Figure 1.

Figure 1. Comparison of image subsets before (left) and after (right) CR corrections. All image subsets are from frame idde01koq.

Standard image High-resolution image

We assessed our CR correction algorithm quantitatively by injecting and removing CR hits with a CR template. We constructed the CR template using one uncorrected frame (observation identifier: iddea1meq) that was taken during an SAA crossing and contaminated by over 1000 CR hits of different sizes and amplitudes. We removed the portion of the image with the stellar spectrum (pixel coordinates [135:170, 50:200]) and replaced it with randomly selected copies of regions out of the spectrum. We set all pixels with a flux below 2000 e (∼3× sky background) as zero and used the resulting image as the CR template. The CR template had 1150 CR hits, representing the most severe CR-affected case. We added the template to 20 cleaned frames and applied our CR removal algorithm. We evaluated the performance of our CR identification and correction algorithm based on three criteria: CR identification rate, false-positive rate, and correction efficiency (ηc). The latter we defined on a pixel-to-pixel basis as

Equation (2)

in which fclean, fdirty, and fcorrected refer to the pixel fluxes in the input, CR-injected, and output images, respectively.

Three parameters influence the effectiveness of the algorithm: the size of the median filter, the CR identification threshold, and the number of iterations. We optimized the algorithm through a three-dimensional grid search, with the size of the median filter ranging from 3 to 25 pixels, the thresholds ranging from 3σ to 20σ, and the number of iterations ranging from 1 to 10. The most effective combination included an 11 pixel median filter, an 11σ threshold, and a minimum of three iterations. Our algorithm yields identification rates of 98.5% on average, false-positive rates below 1.5%, and correction efficiencies above 90% for over 99.9% of the pixels across the entire image for non-SAA exposures. Within the image region containing the stellar spectrum, the identification rate dropped slightly to approximately 90%. Typically, there were less than 40 pixels identified as CR hits in the spectrum region in an exposure obtained out of SAA passage, indicating that the total numbers of missed CR pixels and false-positive pixels are 5 and 1 pixels, respectively. Since the region we used for the spectral extraction has a total size of 60 × 140 = 8400 pixels, we found that unidentified and false-positive CR hits had a negligible influence on our results.

In addition, SAA passage severely affected the pointing accuracy of HST for a few orbits in Program 2. For example, during Orbit 5 of Visit 4, the pointing shifted by ∼14 pixels (2''). Shifts with similar amplitudes were observed in the first four orbits of Visit 1 and the first two orbits of Visit 3 in Program 2, which all indicate failures in the HST fine guidance. In these orbits, the light curve also drops by ∼1%. We identify two potential causes for these systematics. First, after a pointing shift, the spectrum moved to the part of the detector that was not previously illuminated. This increased the ramp effect induced by charge trapping, because charge traps in these newly illuminated pixels have not yet been filled (Zhou et al. 2017). Second, WFC3's IR flat field has intrinsic uncertainties of ∼1.0% (Deustua et al. 2016), which introduce light-curve systematics for such large pointing shifts. The ramp-effect correction would, in principle, be correctable using the RECTE model (Zhou et al. 2017) with additional free parameters describing the image drifts. However, the existing science data and calibrations did not provide a viable option for alleviating the increased flat-field uncertainty. Therefore, we excluded the first four orbits in Visit 1, the first two orbits in Visit 3, and the last orbit in Visit 5 from the remainder of our analysis.

We consulted transit times listed in the NASA Exoplanet Archive (Akeson et al. 2013) and identified three transits that may lie within the discarded orbits. For completeness, we list these transits in Table 2.

Table 2.  Possible Transits within Discarded Data Sets

Planet Mid-time Mid-transit Time Uncertainty Mid-transit Timea T a
  (UT) (days) (JD) (days) (R*) Possible Visit and Orbit
f 2017 Dec 04 03:18 0.00032 2457726.64 9.20669 68.4 Orbit 3, Visit 1
d 2017 Jan 09 18:47 0.001799 2457763.28 4.04961 39.55 Orbit 2, Visit 3
e 2017 Jan 10 13:43 0.000567 2457764.07 6.099615 51.97 Orbit 5, Visit 4

Note.

aFrom Wang et al. (2017), the three planets display large transit timing variations (up to half an hour). The mid-transit times do not take TTVs into account, and actual timings could be different by up to 40 minutes from the times given here.

Download table as:  ASCIITypeset image

3.3. Sky Background Removal

We identified and removed the sky background using a σ-clipping algorithm. Pixels within 5σ of the median image level after 10σ clip iterations were considered as background. The median value of the background pixels was then subtracted from the image.

3.4. Image Drift Calibration

The remaining observations suffered from HST pointing drifts in both the x and y directions at levels of 0.05–0.1 pixels per orbit. Such drifts, especially in the wavelength dispersion direction, introduced systematic slopes in the spectrally binned light curves when left uncorrected. To correct for the drifts, we measured the shifts between each image and the reference image (first image in each data set) by cross correlation. We then used bicubic interpolation to shift and align the images to the reference image.

3.5. Light-curve Extraction

We generated white-light and spectrally binned light curves. We summed the CR-cleaned, background-subtracted, and aligned subexposures back to a total exposure image. We created 12 10 pixel wide bands from the scanned area ranging from 1.1 to 1.70 μm. We then obtained the light curves by summing pixels inside a 60 pixel wide window for every band. The uncertainty of each point on the light curve includes photon noise, dark current, and readout noise. As wavelength solutions differ slightly for each visit, the central wavelengths of the 10 pixel wide bins vary at levels of (∼0.01 μm) for the different visits. We list the central wavelengths of the bands in Table 3. In this way, five raw light curves were derived, each with 12 bins.

Table 3.  Center Wavelengths of Different Bands

Band     Wavelength (Å)    
Program GO-14500   GO-14873    
Band/Visit 0 1 2 3 4
1 11505 11465 11377 11408 11442
2 11970 11930 11841 11873 11907
3 12434 12394 12305 12337 12371
4 12898 12858 12770 12801 12835
5 13363 13322 13234 13265 13300
6 13827 13787 13698 13730 13764
7 14292 14251 14163 14194 14228
8 14756 14715 14627 14658 14692
9 15220 15179 15092 15123 15157
10 15685 15644 15556 15587 15621
11 16149 16108 16020 16051 16085
12 16613 16572 16485 16516 16550

Download table as:  ASCIITypeset image

We note that the bins applied here are slightly different from those used by de Wit et al. (2016, 2018), who used 11 (0.05 μm wide) bins in the 1.15–1.7 μm range for Transits 1 and 2 and 10 (also 0.05 μm wide) bins in the 1.15–1.65 μm range for the subsequent transits. In general, determining bin sizes with an integer number of pixels is more widely adopted in HST/WFC3 transmission spectroscopic studies (Deming et al. 2013; Mandell et al. 2013; Kreidberg et al. 2014). Considering that the 0.05 μm bin size is not an integer multiple of the spectral resolution unit of the G141 grism, without knowing the exact binning and interpolations applied there, we could not use identical bins in the light-curve extraction steps. Therefore, to compare our results with those of de Wit et al. (2016, 2018), we instead interpolated our pixel-binned transmission spectra. The interpolation had a negligible effect due to the coarse wavelength resolution of the spectra. We discuss this point further in Section 4.3.

3.6. Ramp-effect Correction

The raw light curves show prominent ramp-effect systematics (Figure 2), typical of HST/WFC3/IR time-resolved observations (e.g., Berta et al. 2012; Apai et al. 2013). This systematic is caused by two populations of charge carriers that are trapped and then, with some delay, released by impurities in the HgCdTe detectors (Zhou et al. 2017). We corrected these systematics using the RECTE model (Zhou et al. 2017), which models the history of illumination, trapping, and release for each pixel. This model offers a consistent solution to correct the ramp-effect systematics from the perspective of the physical cause, instead of fitting empirically determined exponential/polynomial functions, which are used in most HST/WFC3 transiting exoplanet studies to date. The use of this correction is also a major difference between our data reduction and that of de Wit et al. (2016), who used exponential functions to model and correct for the ramp effect. De Wit et al. (2018), which was published after the submission of this paper, used another model bearing more resemblance to the RECTE model to remove the ramp effects, and we compare them in Section 7.3.

Figure 2.

Figure 2. Light curves (green dots) and best-fitting RECTE ramp-effect correction. The predicted charge-trap effects for the two scanning directions are plotted separately (red and blue curves). This figure includes Transits 1 and 2 from planets c and b, respectively.

Standard image High-resolution image

Zhou et al. (2017) described the charge-trapping processes with six parameters (Es,tot, Ef,tot, ηs, ηf, τs, and τf9 ) representing the trap numbers, trapping efficiency, and charge release time for slow and fast charge-trap populations. Zhou et al. (2017) found these parameters to vary little in different observations and considered them to be intrinsic to the WFC3 detector. Therefore, we fixed these six parameters and provided the adopted values in Table 4. The free parameters that determine the systematic profiles are as follows.

  • 1.  
    f: the incoming flux on each pixel as a function of time. We consider it to be a constant here.
  • 2.  
    Es,0 and Ef,0: the initial numbers of trapped charges.
  • 3.  
    ΔEs and ΔEf: the number of additional charges trapped during interorbit gaps due to unintended detector illumination.
  • 4.  
    v: the slope of the visit-long trend.

Table 4.  RECTE Model Parameters

Parameter Value Parameter Value
Es,tot 1525.38 Ef,tot 162.38
ηs 0.013318 ηf 0.008407
τs 16300 τf 281.463

Download table as:  ASCIITypeset image

Table 5.  Parameters Used as Input in Transit Profile Fitting from Gillon et al. (2016, 2017)

Planet Inclinationa Tb a
  (deg) (days) (R*)
b 89.65+0.22−0.27 1.5109 20.50
c 89.67 ± 0.17 2.4218 28
d 89.75 ± 0.16 4.04961 39.55
e ${89.86}_{-0.12}^{+0.10}$ 6.099615 51.97
f 89.680 ± 0.034 9.20669 68.4
g 89.710 ± 0.025 12.35294 83.2

Notes.

aGaussian-distributed priors. bFixed parameters.

Download table as:  ASCIITypeset image

We found the best-fit RECTE profiles for the light curves of each band. While the RECTE algorithm essentially models charge-trapping processes in individual pixels, it was not feasible (or necessary) to fit the light curves at the single-pixel level. First, the ramp effect at the pixel level is overwhelmed by other systematics, particularly telescope jitter. Second, the accuracy of the single-pixel level ramp-effect correction is negatively influenced by photon noise in these data. As Zhou et al. (2017) found no evidence for the charge-trap parameters varying between pixels, we therefore adopted an average band-level charge-trapping correction instead of a single-pixel level correction.

For observations using bidirectional scanning, exposures conducted in the opposite scanning directions bear an intrinsic flux level difference of ∼0.5%. For these cases, we assumed different f and v values for the light curves observed in different scanning directions but calculated the charge-trapping/release processes for the two scanning directions together.

We found the best-fit parameters using a Markov chain Monte Carlo (MCMC) with 500 walkers for 600 steps, with the first 300 steps discarded as burn-in. The MCMC runs were performed using the emcee package (Foreman-Mackey et al. 2013). Examples of the best-fit RECTE profiles are shown in Figure 2. In most bands, the ratio of the average value of the standard deviation to the average of the photon noise is within the range of 0.8–1.2, i.e., our complete procedure (including CR and charge-trapping corrections) robustly reaches the photon noise level or very near to it.

For each transit, we also derived a broadband light curve by computing the weighted average of all bands, adopting the inverse variance as the weight.

3.7. Transit Profile Fitting and Spectrum Extraction

Our final reduction steps were fitting the transit profiles and extracting the transmission spectra. We first fitted the broadband light curves by generating model transit light curves using the Python package batman (Kreidberg 2015), in which the light-curve shape model is based on Mandel & Agol (2002). The fitting procedure was performed with an MCMC algorithm using the emcee package (Foreman-Mackey et al. 2013). The transit profile model contained nine parameters, namely transit mid-time t0, orbital period T, relative planet size rp/r*, semimajor axis a, eccentricity e, inclination i, argument of periapsis ω, and quadratic limb-darkening coefficients (LDCs) u1, u2.

We found that due to the lack of ingress or egress data in some transits, the LDCs could not be constrained well by the light curves. Consequently, the LDCs obtained in different transits are not always consistent. It is important to note that models predict that LDCs of late M stars will vary significantly with wavelength in the 1.1–1.7 μm range and that the transit depths derived are anticorrelated with LDCs (Figure 3). Therefore, errors in LDCs may introduce apparent spectral features in the transmission spectra. To carefully examine the effect of LDCs, we experimented with three different LDC treatments: (i) interpolating LDC values and uncertainties provided in de Wit et al. (2016)—derived using PHOENIX stellar models (Husser et al. 2013)—to our bandpasses and using these as Gaussian-distributed priors; (ii) fixing the LDC to the best-fit values reported in de Wit et al. (2016); and (iii) independently deriving the LDCs by fitting PHOENIX specific intensity model stellar spectra10 (disk-integrated, multiplied by the HST/G141 bandpass, and normalized) to the HST/G141 out-of-transit spectrum and fitting a quadratic limb-darkening law to the limb-darkening profile of the best-fit model. We then fixed the LDCs in the transit fit using the derived values.

Figure 3.

Figure 3. Posterior distributions of the transit profile of TRAPPIST-1 e in Transit 7.

Standard image High-resolution image

The comparison of the results based on the different LDCs showed that the final spectra are only weakly affected by the adopted LDCs: in every band, the transit depths derived from the different methods agreed with each other to levels better than 1σ. We adopt the first limb-darkening treatment described above as our nominal procedure and present the results from this approach here.

The limited phase coverage of ingress and egress in some visits that complicated the LDC studies also precluded precise measurements of the transit durations. This uncertainty likewise hampered the precise determination of the orbital inclinations. Therefore, we adopted the distributions obtained by Gillon et al. (2017) as priors in our fitting procedure.

We fixed all eccentricities to be zero, as justified by the small values found by Gillon et al. (2017), which rendered ω irrelevant. We also fixed the orbital periods and semimajor axes to the values given in Gillon et al. (2017). We did not directly adopt external constraints on the mid-transit times, as transit timing variations (TTVs) are large in the system and not yet well understood. The fit parameters are summarized in Table 5. With rp/r*, t0, i, and the LDCs as the only free parameters, we performed an MCMC search. We adopted 500 walkers and ran them for 2000 steps, the first 1000 of which were treated as burn-in. In the evaluation of the fit quality, we did not use the uncertainty estimates directly derived from the pipeline (which are dominated by photon noise), as these did not include the assessment of the residual systematic noise (even though these are found to be very small). Instead, for each light curve in each band of each visit, we opted to calculate the standard deviation of the data in the baseline (pre- and post-transit) and adopted this value as a uniform relative uncertainty applicable to all data points in the light curves.

We present a corner plot of the MCMC posterior distributions in Figure 3 and an example of the transit profile fit in Figure 4.

Figure 4.

Figure 4. Broadband (upper) and spectral band (lower) light-curve profile fits for planets TRAPPIST-1 b and c. The observations and best-fit profiles are shown in the left panels, and the fitting residuals are shown in the right panels. Similar figures for the other three visits containing the other 6 transits are available in Appendix B (Figures 1417).

Standard image High-resolution image

After fitting the broadband transit light curves, we fitted the transits in the individual spectral bins in each transit, keeping the mid-transit times fixed to the values found in the broadband transit fits. To test the reliability of these fits, we carried out a Shapiro–Wilk test on the residuals of the fittings (Shapiro & Wilk 1965). Most of our fits passed the test with p-values exceeding 0.1. We inspected each of the few exceptions (p-values less than 0.1) visually and found that a few outlying data points, probably caused by stellar flares or other activity, were the reason that the transit models did not provide complete fits. We further investigate possible stellar activity in the light curves in Section 5.4.

Finally, we subtracted the best-fit broadband transit depth value (rp/r*) from each spectral bin's transit fit (to determine relative, spectrally dependent differences in the transit depths) to derive the transmission spectra of the seven transits.

4. Results

We measured the transit depths and mid-transit times of all seven transit events in both the broadband and individual spectral bins. In the transit profile fits, we reached an average reduced χ2 of 0.99, and our residuals were typically 1.05 times the photon noise level. In total, we derived seven transmission spectra of six planets, including two of TRAPPIST-1 e. In addition to the individual planets' spectra, in the following, we also present their combined spectrum. We list the results of our broadband model fits in Table 6 and compare them to the results from the literature (Gillon et al. 2016, 2017) and mid-transit times in the Online Exoplanet Archive (Akeson et al. 2013).

Table 6.  Comparison of Broadband Model Fit Results to Literature Values

Transit Planet Mid-time Mid-time Uncertainty Mid-time Literature rp/r* Best-fit rp/r*
    UT days BJDTDB Gillon et al. (2016, 2017) de Wit et al. (2016, 2018) This Study
1 c 2457512.88051 0.000352 2457512.8807 ${0.0828}_{-0.0006}^{+0.0006}$ ${0.0854}_{-0.0014}^{+0.0014}$ ${0.0849}_{-0.0012}^{+0.0012}$
2 b 2457512.88712 0.000176 2457512.8876 ${0.0852}_{-0.0005}^{+0.0005}$ ${0.0895}_{-0.0012}^{+0.0012}$ ${0.0879}_{-0.0011}^{+0.0012}$
3 d 2457726.83624 0.001232 2457726.8400 ${0.0605}_{-0.0015}^{+0.0015}$ ${0.0631}_{-0.0006}^{+0.0007}$ ${0.0622}_{-0.0005}^{+0.0006}$
4 g 2457751.81998 0.00105 2457751.8397 ${0.0884}_{-0.0016}^{+0.0015}$ ${0.0885}_{-0.0007}^{+0.0008}$ ${0.0888}_{-0.0007}^{+0.0007}$
5 e 2457751.87282 0.000545 2457751.8701 ${0.0720}_{-0.002}^{+0.002}$ ${0.0689}_{-0.0006}^{+0.0007}$ ${0.0694}_{-0.0005}^{+0.0005}$
6 f 2457763.46460 0.00038 2457763.4462 ${0.0820}_{-0.0014}^{+0.0014}$ ${0.0803}_{-0.0011}^{+0.0011}$ ${0.0802}_{-0.0004}^{+0.0004}$
7 e 2457764.07205 0.00117 2457764.0671 ${0.0720}_{-0.002}^{+0.002}$ ${0.0707}_{-0.0007}^{+0.0008}$ ${0.0715}_{-0.0006}^{+0.0006}$

Note. Literature rp/r* values are from Gillon et al. (2016, 2017) and de Wit et al. (2016, 2018), while mid-transit times are from the online exoplanet archive (Akeson et al. 2013).

Download table as:  ASCIITypeset image

4.1. Broadband Transit Depths

We measured the WFC3 broadband transit depths of TRAPPIST-1 b, c, d, e, f, and g with an average precision of 123 ppm. We note that overlapping Transits 1 and 2 have larger uncertainties in transit depths than the other transits, which are all due to single planets. Most TRAPPIST-1 planets' transit depths we measured in the WFC3 G141 broadband are consistent with those measured in Spitzer Channel 2 (central wavelength 4.5 $\mu {\rm{m}}$) light curves (Gillon et al. 2017). However, our transit model's rp/r* for Transits 1, 2, and 3 (0.0849 ± 0.0012, 0.0879 ± 0.0012, and 0.0622 ± 0.0005) are deeper than the corresponding Spitzer transit depth measurement for the same planets (0.0828 ± 0.0006, 0.0852 ± 0.0005, and 0.0605 ± 0.0015) by over 1σ. Differences between the HST and Spitzer bands should come as no surprise, as these may be introduced either by planetary absorption features or—more likely—by stellar activity and heterogeneity, which we will explore in greater detail in Section 5.4.

4.2. Mid-transit Times

Through our transit light-curve modeling, we obtained high-precision mid-transit time measurements, which were converted from Modified Julian Dates (MJD) to Barycentric Julian Dates (BJDTDB) using the algorithms described in Eastman et al. (2010). The results are summarized in Table 6.

The mid-transit times have a typical uncertainty of 0.0002 days (or 17 s). The uncertainties are typically dominated by the limited phase coverage (due to HST's visibility windows), i.e., light curves missing either the ingress or egress. Different phase coverage of the individual transits causes the quality of the constraints on mid-transit times to vary. Better constraints were achieved for the transits of planets TRAPPIST-1 b and c because they have better-than-typical transit phase coverages, owing to their shorter transit durations; for these planets, the transit mid-times are constrained with uncertainties of only 0.0001 days (or 8 s).

In contrast, Transit 7—corresponding to a transit of TRAPPIST-1 e—lacks both the ingress and ingress, resulting in a greater uncertainty (0.002 day) on the mid-transit time.

We found substantial TTV signals (observed–predicted time differences) in the observed transits when comparing our results with those from the Online Exoplanet Archive, which are calculated assuming strict periodicity (Akeson et al. 2013). In particular, the best-fit mid-transit time for Transit 4 (TRAPPIST-1 g), occurring on 2016 December 9, deviates from that predicted from the best-fit Spitzer transit mid-time (Gillon et al. 2017) and the planet's orbital period by ∼30 minutes, as was noticed and discussed in Wang et al. (2017).

4.3. Transmission Spectra

We obtained WFC3/IR G141 transmission spectra for TRAPPIST-1 b, c, d, e, f, and g. The spectra are shown in Figure 5, with the individual spectra offset by arbitrary levels for clarity. Each spectra has 12 bins covering wavelengths from 1.1 to 1.17 μm, corresponding to a spectral resolution of Δλ = 50 Å (or spectral resolving power of R = Δλ/λ = 22–34; Figure 4). Table 13 lists original spectra with no interpolation, measured by transit depths of different bands, and Table 14 lists the spectra with wavelength interpolation.

Slight differences in the wavelength calibrations of the individual visits resulted in small differences of wavelength solutions among them (Table 3). To directly compare our results with those by de Wit et al. (2016) and to combine spectra from multiple visits, we interpolated our single-transit transmission spectra to align our wavelength bins with those of de Wit et al. (2016). We adopted the third-order spline interpolation for both transit depths and uncertainties. We justify the interpolation of transit depths in two respects. First, the adopted bin size is coarse and significantly larger than the average difference of the interpolated wavelengths and original wavelengths. Second, the transmission spectra and their uncertainties have small spectral variations. The interpolation introduced negligible modifications to the transit depths and uncertainties. In order to allow direct comparison with the spectra published in de Wit et al. (2016), we interpolated our transmission spectra to match the wavelength bins used in that study (Figures 57).

Figure 5.

Figure 5. The HST/WFC3 G141 transmission spectra for TRAPPIST-1 b to g (top to bottom). Wavelength bands are aligned with those used in de Wit et al. (2016). The observed transmission spectra are plotted in circles. The gray curves are water transmission models (Kempton et al. 2017), scaled to provide the best fit to the data. All spectra have been mean subtracted, and vertical offsets have been applied for clarity.

Standard image High-resolution image
Figure 6.

Figure 6. Combined HST/WFC3 G141 transmission spectra for TRAPPIST-1 planets. Wavelength bands are aligned with those used in de Wit et al. (2016). For the upper panel, all planets' spectra are included. For the middle panel, Transits 3, 4, 5, and 6 from planets d, g, f, and e are included. For the lower panel, spectra of planets b and c, whose transits overlapped during observations, are excluded. The gray curves are water transmission models (Kempton et al. 2017), scaled to provide the best fit to the data. All spectra have been mean subtracted.

Standard image High-resolution image
Figure 7.

Figure 7. Comparison of the spectra of TRAPPIST-1 b and c in our work and de Wit et al. (2016). The two reductions result in spectra that are statistically consistent with each other. All spectra have been mean subtracted.

Standard image High-resolution image
Figure 8.

Figure 8. Transmission spectra of planet TRAPPIST-1 e from two visits. Green and cyan points are the spectra from Transits 5 and 7, respectively. The spectra are offset in wavelength for clarity.

Standard image High-resolution image

In the following, we also explore the combined spectra of the TRAPPIST-1 planets, as well as the combined spectra of different subsets. We combined the spectra by summing the transit depths from the six planets in each of the bands (already aligned to those in de Wit et al. 2016), giving the combined transit depth

Equation (3)

and uncertainty

Equation (4)

For planet e, for which we have two transits, we used the inverse variance–weighted average of the two as the transmission spectrum.

Combined spectra may provide information about shared spectral features: combining seven spectra could increase the signal-to-noise of the spectra by up to a factor of 2.6.

The combined spectra of all seven transits are shown in the top panel of Figure 6.

Given the differing quality of the spectra, we also explored combinations of different subsets of the transits. As noted above, the transits of planets b and c overlapped in time, and their spectra show an apparent anticorrelation. Additionally, the parameters of Transit 7 (planet e) are determined less precisely than those of the other transits due to its lack of ingress or egress coverage. To allow the assessment of the impacts of these data quality differences and possible systematics on the combined spectra, we show two additional spectral combinations in Figure 6. The middle panel shows the combined spectra of planets d, e, f, and g (with spectra for planets b, c, and e from Transits 1, 2, and 7 excluded). The lower panel in Figure 6 shows the combined spectrum of Transits 1 and 2, i.e., planets c and b, respectively.

Water absorption is the most prominent expected spectral feature in the planetary atmospheres in this wavelength range. As such, we show in Figure 6 a model of water absorption for comparison (gray lines). For this comparison, we adopted a water transmission model calculated using ExoTransmit (Kempton et al. 2017), with an amplitude scaled to provide the best fit to the observed spectra. None of the three combined spectra in Figure 6 resemble the water absorption spectrum: while the water absorption would result in larger rp/r* values around 1.4 μm, all three spectra suggests a decrease in rp/r* values. We conclude that no water absorption or other planetary molecular absorption features are visible in any of the individual or combined spectra. In Sections 5.3 and 6, we explore the upper limits we can place on planetary water absorption, as well as the impact of stellar contamination on these spectra.

5. Discussion

In this and the following sections, we discuss the key points of our study: comparison of our data reduction to that of de Wit et al. (2016) and the resulting data quality, comparisons of multiple transits of the same planet, placing upper limits on water absorption bands, and discussing in detail stellar activity and contamination in the HST/WFC3 IR spectra of the TRAPPIST-1 planets. Finally, we place these results in the context of future HST and JWST transit spectroscopy of the TRAPPIST-1 and similar planetary systems.

5.1. HST/WFC3 IR Transiting Exoplanet Data Reduction Comparison

One of the key differences between our study and that of de Wit et al. (2016) is that we used the RECTE model to correct for the HST/WFC3 IR ramp effect, while the de Wit et al. (2016) study discarded the first orbits of each visit and relied on an empirical fit to correct the systematics in the subsequent orbits, implicitly assuming those to be identical to the ramp seen in Orbit 2. Here we compare the results of the two data reduction approaches.

First, by using RECTE, we successfully corrected the ramp effect in each visit's first orbit, which was discarded in de Wit et al. (2016), as well as in almost all published HST/WFC3 transit spectroscopic observations. As a result, we increased the available useful data and effectively improved the efficiency of the HST observations by about 25%. This increase translated to a better orbital phase coverage and an improved accuracy of the transit baseline levels. Furthermore, the additional baseline observations also enabled a more thorough exploration of the stellar activity and spectral changes (see Section 5.4) than would have been possible had those orbits been discarded.

Second, in addition to the increase in the data quantity and efficiency, we seek to compare the resulting data quality. At this point, any such comparison must be limited to Visit 1 (from Program 1), the only visit for which reduced data had been published at the time of submission for the current work.11 With an uncertain astrophysical signal underlying possible residual systematics, such comparisons are not trivial. We proceed here by assuming that, to first order, the astrophysical signal is well understood and purely consists of the planetary transit that follows the analytic models of Mandel & Agol (2002); we will then discuss the limitations of this approach.

Under the assumptions laid out, the residuals of the observed and modeled light curves contain no systematics and should be photon noise–limited. Therefore, we use the standard deviation of the light-curve fit residuals as a metric to compare the data quality between our reduction and that of de Wit et al. (2016). De Wit et al. kindly provided their detailed results, enabling accurate comparison studies. We note that de Wit et al. (2016) reported a larger broadband residual standard deviation (240 ppm) than that calculated from data in Figure 1 of de Wit et al. (2016; 215 ppm). We conservatively adopt the latter for the comparison. The first three rows of Table 7 compare the standard deviations of residuals between the two studies for Orbits 1, 2, and 4, as well as several orbit combinations.

Table 7.  The Comparison of Standard Deviations of Residuals between Our Study and That of de Wit et al. (2016), Utilizing Different Ramp-effect Corrections

Orbit This Study de Wit et al. (2016)
  (ppm) (ppm)
1 225 /
2 342 257
4 136 159
1, 2, & 4 254 /
2 & 4 266 ± 25a 215  
2 & 4, flare excluded 195 ± 16a 180  

Notes. Note that this comparison is only easily interpretable if no astrophysical systematics (e.g., flares, spots) are present. The standard deviations from de Wit et al. (2016) have been provided by the authors (private communication).

aThe uncertainties are derived by calculating the standard deviations of the standard deviations of randomly selected 80% subsets of the data.

Download table as:  ASCIITypeset image

Table 8.  Planetary Parameters for Water Absorption Calculations

Planet Mass Teff Δd Radius AH ${ \mathcal U }({A}_{H})$
  M K ppm R    
b 0.79 ± 0.27 400.1 ± 7.7 −413 ± 266 1.086 ± 0.035 −9.2 ± 6.8 11.2
c 1.63 ± 0.63 341.9 ± 6.6 −212 ± 245 1.056 ± 0.035 −11.1 ± 13.6 29.7
d 0.33 ± 0.15 288.0 ± 5.6 17 ± 113 0.772 ± 0.030 0.2 ± 1.0 3.2
e 0.24 + 0.56−0.24 251.3 ± 4.9 −209 ± 129 0.918 ± 0.039 −1.9 ± 4.6 11.9
e ${0.24}_{-0.24}^{+0.56}$ 251.3 ± 4.9 −303 ± 126 0.918 ± 0.039 −2.8 ± 6.6 17
f 0.36 ± 0.12 219.0 ± 4.2 −130 ± 190 1.045 ± 0.038 −2.3 ± 3.5 8.2
g 0.566 ± 0.0038 198.6 ± 3.8 77 ± 197 1.127 ± 0.041 2.5 ± 6.6 22.3

Download table as:  ASCIITypeset image

Several points are notable about this comparison. If our underlying assumptions were correct, the residuals in all orbits should be the same. However, differences in Orbits 1, 2, and 4 are visible even within the same reductions. We attribute these differences in part to the scatter of the standard deviations themselves and in part to the fact that physical processes other than the transit are present in the data, as discussed below.

The immediate comparison of the standard deviations of the residuals between the two studies (rows 1–3 of Table 7) have overall very similar levels, considering the limited number of data points from which the standard deviations are calculated. Data from our reduction have lower standard deviations in Orbit 4 than those from de Wit et al. (2016; 136 ppm versus 159 ppm) but larger residuals in Orbit 2 (342 ppm versus 257 ppm). For the combined residuals of Orbits 2 and 4 (row 5), our results still have a slightly larger standard deviation than that of de Wit et al. (2016; 266 ± 25 ppm versus 215 ppm), corresponding to about a 1σ level difference (when assuming similar uncertainties for the standard deviations themselves). We note that Orbit 2's correction quality is worse in both reductions than that of Orbit 4, suggesting that the inherent data quality or other processes also play an important role.

Thus, based on a superficial comparison, one may conclude that (1) both reductions reach similar precision, (2) underlying differences in the data quality are more important than the type of ramp-effect correction applied, and (3) the empirical ramp-effect correction performs sometimes slightly better than the physical model for the charge trapping (not considering the benefits of the nondiscarded first orbit).

Closer inspection of the residuals in Figure 9 reveals isolated groups of outlying data points. These may mark potential small stellar flare events (discussed further in Section 5.4), such as an event with 4σ outliers above the baseline in Orbit 2 of Visit 0. It is instructional to inspect this event: in fact, the outliers contribute the most to the increased standard deviation in the residuals in this orbit. By excluding the four data points around suspected flare events, the standard deviations of the combined Orbit 2 and 4 data decreased by 40% to 195 ppm (last row of Table 7). If the same data points are excluded from the de Wit et al. (2016) data, the resulting standard deviation is, within the uncertainties, the same as that resulting from the RECTE fit. We conclude that the empirical ramp correction may occasionally provide a lower standard deviation because it can potentially fit the ramp-effect correction and stellar flares together.

Figure 9.

Figure 9. Identified stellar activity events. Left panels: two 3σ flare events were found in the light curves, one in Visit 0 of Program 1 and another in Visit 3 of Program 2. The purple lines show the smoothed baseline constructed using third-order spline smoothing. Red circles mark the identified events. Right panel: spectral information of the identified event.

Standard image High-resolution image

Thus, in our RECTE-based reduction, the higher standard deviations are an indication that the model applied (charge-trapping model + transit) is incomplete; in contrast, the empirical systematics fit and transit have the capability to absorb different sources of astrophysical signal and instrument systematics without distinguishing these, resulting in a slightly lower standard deviation of the residuals. Compared to empirical corrections, the physically motivated RECTE-based correction is less likely to be skewed by astrophysical events. This comparison highlights another advantage of the RECTE model over traditional empirical fits: given the well-determined detector response, it will be less likely to overcorrect and remove astrophysical processes (or other types of instrumental systematics). In fact, in Orbit 4, which has no signs of stellar flares, our RECTE model indeed resulted in a residual standard deviation that is the lowest of all measured here (136 ppm).

The transmission spectra from our reduction and that of de Wit et al. (2016) agree within the uncertainties (Figure 7). For planets b and c, we performed Kolmogorov–Smirnov (K-S) tests on the spectra from this study and de Wit et al. (2016). The tests resulted in p-values of 0.9094 and 0.8286; i.e., they did not show any evidence for the data points being drawn from statistically different parent samples.

5.2. Comparison of Spectra from Two Transits of Planet e

The fact that our data sets contain two transit events (Transits 5 and 7) for planet TRAPPIST-1 e offers an opportunity to examine the similarities between the transmission spectra from the two epochs. Figure 8 shows the two spectra of TRAPPIST-1 e interpolated to the same wavelength bins. While the two spectra share some features, such as average transit depth and nondetection of planetary water absorption, they differ in the blue parts (<1.35 μm) of the spectra. There appears to be a difference in the overall slope between the two transit spectra.

To quantify the agreement between the two spectra, we calculated the bin-by-bin difference between them and report an average difference of 2.4σ. A K-S test yields a p-value of 0.01, i.e., supporting the conclusion that the two data sets are drawn from different parent populations. Manual inspection confirms that the blue parts of the spectra contribute most of the difference (Figure 8). This difference may be an indication of a time-evolving stellar contamination signal, which we discuss further in Sections 6.3 and 7.2.

5.3. No Evidence for Water Absorption Features in Individual or Combined Spectra

In our data, all six planets (TRAPPIST-1 b–g) have spectra that show no obvious absorption features in the 1.1–1.65 μm wavelength range. We examine the possibility of planets sharing similar spectral features by combining the spectra from the six planets.

We also compared our observed transmission combined spectra with a scaled water transmission model (Kempton et al. 2017), as shown in Figure 6. The deviations, as expressed in χ2, are 2.2, 1.3, and 1.0, which are all worse than those of the flat model (0.85, 0.56, and 0.45, respectively). We attribute this to the contamination from stellar heterogeneity, which we will discuss in more detail in Section 6.

Water is of the most interest among molecules with absorption features in the WFC3 G141 bandpass. Since no spectral features were observed, we estimate the upper limits of water absorption for each of the planets instead. Following Fu et al. (2017), we quantified the water absorption amplitude AH, i.e., the transit depth difference in and out of the water absorption band in terms of scale heights H, as

Equation (5)

We estimated the scale height of each planet based on TTV masses from Wang et al. (2017). We set the host star radius to R* = 0.114 ± 0.006 R (Gillon et al. 2016) to calculate the absolute radii of the planets and the host star effective temperature to 2559 K (Gillon et al. 2017). For the mean molecular weight, we adopted 18 amu, corresponding to that of water. We calculated ${ \mathcal U }({A}_{H})$, the 3σ upper bounds of AH, for planets TRAPPIST-1 b–g.

The results, along with essential information, are presented in Table 8. We do not find evidence for a planetary water absorption feature in any of the transmission spectra; i.e., AH is consistent with or less than 0 at the 1σ level in each case. However, the wide uncertainties on AH allow for the possibility of significant water absorption, as illustrated by the upper limits. Improved precisions on the transmission spectra may place tighter constraints on planetary absorption features, though additional observations will have to contend with the time-resolved activity and photospheric heterogeneity of the host star.

5.4. TRAPPIST-1 Stellar Activity

We examined TRAPPIST-1's out-of-transit broadband light curves and searched for possible flares and dimming events. We identified the baseline using the scipy UnivariateSpline routine, which smooths the light curves with third-order splines. For the spline fit, the numbers of knots were optimized with inverse standard deviation weighting. We then computed the weighted average of deviations from unity within each window. Windows that had deviations above 3σ were selected as candidates for flare or dimming events. In total, two potential flares were found. We summarize the key properties of these potential events in Table 9 and show their broadband light curves in Figure 9. This figure also shows the spectrum of each event.

Table 9.  Identified Stellar Activity Information

Type Center Time Deviation Visit and Orbit
  MJD σ
Flare 57512.4 4.06 Visit 0, Orbit 2
Flare 57763.0 3.02 Visit 3, Orbit 3

Download table as:  ASCIITypeset image

The possible events were sampled in only a few readouts, and their signal-to-noise ratio remains low, inhibiting their robust classification. We only note here a naïve expectation for these spectra, which is that (micro-)flares, representing hotter-than-photospheric plasma, would show an overall "bluer" continuum. This expectation may be met by these events, but the generally low quality of the spectra does not allow for meaningful characterizations.

In total, 20 orbits are used in our analysis, amounting to approximately 850 minutes of observations. We estimate the stellar occurrence rate of marginally detectable (micro)flares to be on the order of 1/425 minutes, i.e., an event every seven hours, on average. We note here that the identification of the events leading to this statistic benefited from the use of the RECTE correction, which allowed us to include data from four additional orbits in our analysis and provided a ramp-effect correction that was not affected by the flare events.

6. Stellar Contamination of the Transmission Spectra

TRAPPIST-1 demonstrates a 1.40 day periodic photometric variability in the I+z bandpass with a full amplitude of roughly 1% (Gillon et al. 2016, Extended Data Figure 5). Rackham et al. (2018) found that this observed variability is consistent with rotational modulations due to a heterogeneous stellar photosphere with whole-disk spot and faculae covering fractions of ${F}_{\mathrm{spot}}={8}_{-7 \% }^{+18 \% }$ and ${F}_{\mathrm{fac}}={54}_{-46 \% }^{+16 \% }$, respectively. These authors also found that spots and faculae, if present in regions of the stellar disk that are not occulted by the transiting planets, can alter transit depths by roughly 1–15× the strength of planetary atmospheric features, thus dominating the observed wavelength-dependent variations in transit depth (i.e., the "transit light source effect").

The observed transmission spectrum Dλ,obs is the multiplicative combination of the nominal transit depth Dλ (i.e., the square of the true wavelength-dependent planet-to-star radius ratio) and the stellar contamination spectrum epsilonλ (Rackham et al. 2018). As the primary purpose of this exercise was to investigate the possible stellar contribution to the transmission spectrum, we assumed an achromatic transit depth D for each planetary spectrum. Thus, we modeled the observed transmission spectra as

Equation (6)

and assumed a stellar origin for all variations from a flat transmission spectrum.

Rackham et al. (2018) presented a formalism for calculating the stellar contamination spectrum in the specific case that no heterogeneities—spots or faculae—are present within the transit chord, or, if they are, they can be identified in the light curve and properly taken into account (their Equation (3)). Of course, the precision of observations may not allow stellar surface heterogeneities within the transit chord to be reliably detected. In general, the stellar contamination spectrum epsilonλ is given by the spectral ratio of the region occulted by the exoplanet relative to the integrated stellar disk. If we assume the transit chord is composed of the same spectral components as the integrated disk, namely spots, faculae, and immaculate photosphere, but we allow their covering fractions to differ from the whole-disk values, then the generalized stellar contamination spectrum is given by

Equation (7)

in which Sλ,phot, Sλ,spot, and Sλ,fac refer to the spectra of the photosphere, spots, and faculae, respectively; Fspot and Ffac refer to the whole-disk covering fractions of spots and faculae, respectively; and fspot and ffac refer to the spot and faculae covering fractions within the transit chord. We adopt this generalized stellar contamination framework in this analysis.

6.1. Composite Photosphere and Atmospheric Transmission Model

With this framework, we investigated the possible contribution of photospheric heterogeneities to the observed transmission spectra of the TRAPPIST-1 planets using the composite photosphere and atmospheric transmission (CPAT) model (Rackham et al. 2017). We used an MCMC approach developed with the PyMC (Patil et al. 2010) Python package to fit the CPAT model to our observations. The free parameters of the model and their priors are given in Table 10. For the prior on the photosphere temperature Tphot, we adopted the stellar effective temperature Teff from Gillon et al. (2017), with a width of 5× of the reported uncertainty to allow the algorithm to thoroughly explore the parameter space. We adopted the same uncertainty for priors on the spot and facula temperatures, Tspot and Tfac, with means given by Tspot = 0.86 × Tphot and Tfac = Tphot + 100 K, following Rackham et al. (2018). For priors on the spot and faculae covering fractions, both within the transit chord and for the whole disk, we adopted normalized estimates of the covering fractions found for TRAPPIST-1 by Rackham et al. (2018).

Table 10.  Priors for Stellar Contamination Model Fits

Parameter Description Prior Unit
D Nominal transit depth Uniform (0, 100) %
Tphot Photosphere temperature TruncNorm (2559, 250) K
Tspot Spot temperature TruncNorm (2201, 250) K
Tfac Facula temperature TruncNorm (2659, 250) K
Fspot Whole-disk spot covering fraction TruncNorm (8, 13) %
Ffac Whole-disk faculae covering fraction TruncNorm (54, 31) %
fspot Transit-chord spot covering fraction TruncNorm (8, 13) %
ffac Transit-chord faculae covering fraction TruncNorm (54, 31) %
η Spectra error inflation factor Uniform (1, 100)

Note. Uniform (a, b) distributions are uniform between a and b. TruncNorm (μ, σ) distributions are normal distributions with means μ and standard deviations σ. The priors for Tphot, Tspot, and Tfac were truncated in the range [1000, 3000 K], given by the temperature limits of the DRIFT-PHOENIX model grid. We enforced Tfac > Tphot and Tphot > Tspot with likelihood penalties. Similarly, the covering fractions were allowed to vary over the range [0%, 100%], and we enforced Fspot + Ffac < = 100% and fspot + ffac < = 100% with likelihood penalties.

Download table as:  ASCIITypeset image

The CPAT model describes the emergent disk-integrated spectrum of the photosphere as the sum of three distinct components—the immaculate photosphere, spots, and faculae—each covering some fraction of the projected stellar disk. We utilized the grid of DRIFT-PHOENIX model stellar spectra (Hauschildt & Baron 1999; Woitke & Helling 2003, 2004; Helling & Woitke 2006; Helling et al. 2008a, 2008b; Witte et al. 2009, 2011) with solar metallicity ([Fe/H] = 0.0) to generate spectra for each component. We parameterized the three components by their temperatures (Tphot, Tspot, and Tfac for the photosphere, spots, and faculae, respectively), which we allowed to vary, and linearly interpolated between models with different temperatures to produce the component spectra. For all components, we linearly interpolated between models with log g = 5.0 and $\mathrm{log}g=5.5$ to produce spectra matching the surface gravity of TRAPPIST-1 ($\mathrm{log}g=5.21$), which we calculated from the star's mass and radius (Gillon et al. 2017).

While fitting the transmission spectra, we required that the stellar parameters also produced a disk-integrated stellar spectrum matching the median observed out-of-transit stellar spectrum (λ = 1.15–1.70 μm) of TRAPPIST-1. We computed the disk-integrated spectrum as

Equation (8)

ignoring projection effects owing to the positions of photospheric heterogeneities, which are not constrained by our model. Both the observed and model stellar spectra were normalized to the median flux between 1.27 and 1.31 μm for comparison. To account for discrepancies between the high-precision HST observations and stellar models, we multiplied the observational uncertainties by an error inflation factor η, which was allowed to vary between 1 and 100 (see Table 10).

We utilized two variations of this modeling framework. In the first, which we call the flat model here, we set the active-region covering fractions within the transit chord to the whole-disk values (i.e., fspot = Fspot and ffac = Ffac). Thus, epsilonλ = 1 (see Equation (7)), and there was no stellar contribution to the observed transmission spectra. The achromatic transit depth D solely determined the transmission spectra model, while the stellar parameters in the model only affected the fit to the out-of-transit stellar spectrum. In the second framework, which we call the contamination model, we allowed the active-region covering fractions within the transit chord to differ from the whole-disk values. In this case, both planetary and stellar parameters affected the model fit to the observed transmission spectra (Equation (6)), and the stellar parameters (save the transit-chord covering fractions) determined the fit to the stellar spectrum. Flat models have seven free parameters, contamination models have nine, and each fit includes 144 data points—14 for the transmission spectrum and 130 for the stellar spectrum (see Section 6.2).

For each transmission spectrum that we considered, we performed an MCMC optimization procedure for both of these modeling frameworks. In each procedure, we marginalized over the log likelihood of a multivariate Gaussian. We ran three chains of 5 × 105 steps with an additional 5 × 104 steps discarded as the burn-in. We checked for convergence using the Gelman–Rubin statistic $\hat{R}$ (Gelman & Rubin 1992) and considered chains to be well mixed if $\hat{R}\lt 1.03$.

6.2. Multi-instrument Transit Measurements

We initially performed this analysis using only the HST transit depths presented here and the Spitzer 4.5 μm transit depths provided by Delrez et al. (2018b). After the submission of this manuscript, Ducrot et al. (2018) presented K2 (0.42–0.9 μm) transit depths and I + z (0.8–1.1 μm) transit depths from the SPECULOOS-South Observatory (SSO; Delrez et al. 2018a) for each of the planets discussed here. In the following analysis, we consider the full K2+SSO+HST +Spitzer transmission spectra. Thus, each transmission spectrum includes 14 transit depths: 11 HST/WFC3 depths from this analysis, along with a K2, SSO, and Spitzer 4.5 μm depth.

We note that Ducrot et al. (2018) examined the impact of stellar contamination from TRAPPIST-1 by comparing the K2+SOO+HST+Spitzer spectra of TRAPPIST-1 b and c to a model from a pre-peer-reviewed version of this work (see Figure 4 in Ducrot et al. 2018). They found a 20σ discrepancy between the observed K2 transit depth and the model prediction and claimed that the stellar contamination model can be "firmly discarded." For the model that was used in the study of Ducrot et al., we assumed that spots and faculae were present only in the nonocculted stellar disk and not in the transit chords; i.e., the spectra of the transit chords were the same as the immaculate photosphere spectrum. During the reviewing process and prior to the publication of Ducrot et al. (2018), we updated the model to a more general form in which spots and faculae affect both the nonocculted disk and the transit chords (see Equation (7)). The analysis of the K2+SOO+HST+Spitzer data set using the more general stellar contamination model is presented here. Appendix C provides the results of this same analysis considering only the HST+Spitzer transmission spectra, along with the accompanying predictions for the K2 and I+z transit depths. The results of the analyses for the two data sets are fully consistent. Most notably, the best-fit contamination models from the HST+Spitzer analysis offer accurate predictions of the K2 and I+z transit depths for the combined transmission spectra.

We also note that transit depth determinations depend on i and a/Rs, which may vary between analyses of transit data. In our analysis, we fixed these values, which are hard to constrain with the HST observations, to those reported by Gillon et al. (2017). As the K2, SSO, and Spitzer observations all covered a longer baseline, however, Delrez et al. (2018b) and Ducrot et al. (2018) both benefited from larger data sets including many repeated transits, which allowed them to fit for more parameters. To be more specific, both these works adopted impact parameter as a free parameter in MCMC analyses. They also included the stellar mass and radius as parameters, either fixed or with a prior distribution, which combine with the orbital periods reported by Gillon et al. (2017) to yield the system scale a/Rs for each planet. For a comparison of the orbit parameters, we refer the reader to Table 5, which contains the parameters that we adopted from Gillon et al. (2017), and Table 1 of Delrez et al. (2018b), which details the fitted parameter values from that analysis.12 In short, we find that the inclinations and system scales that we adopt differ, on average, from those of Delrez et al. (2018b) by 0.9σ and 0.2σ, respectively. We note that these subtle differences might lead to minor differences in the following analysis.

6.3. Fits to Single-planet Transmission Spectra

We first fit CPAT models to each of the seven single-transit transmission spectra presented in this work, as well as the weighted mean spectrum of TRAPPIST-1 e from Transits 5 and 7. We included the corresponding K2, SSO (Ducrot et al. 2018), and Spitzer 4.5 μm transit depths (Delrez et al. 2018b) in the fitting procedure. Table 11 summarizes the posterior distributions of the fitted parameters and provides the Akaike information criterion (Akaike 1974) corrected for small sample sizes (AICc; Sugiura 1978), Bayesian information criterion (BIC; Schwarz 1978), χ2, and its corresponding p-values for each fit. We use the information criteria (ICs) to evaluate the efficacy of the increased model complexity of the contamination model compared to the flat model. Following convention (e.g., Liddle 2007), for both ICs—i.e., AICc and BIC—we interpret ΔIC > + 5 and ΔIC > + 10 relative to the best model as "strong" and "decisive" evidence against the current model, respectively.

Table 11.  Results of Stellar Contamination Model Fits to Single-planet Spectra

Data Set Model Fitted Parameter AICc ΔAICc BIC ΔBIC χ2 p
    D (%) Tphot (K) Tspot (K) Tfac (K) Fspot Ffac fspot ffac η            
b flat ${0.744}_{-0.006}^{+0.005}$ ${2118}_{-127}^{+87}$ ${1962}_{-131}^{+111}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −736.0 6.2 −716.0 0.9 150.0 0.21
b cont. ${0.677}_{-0.023}^{+0.022}$ ${2433}_{-245}^{+240}$ ${2009}_{-99}^{+150}$ ${2950}_{-25}^{+50}$ ${39}_{-10}^{+10}$ ${49}_{-10}^{+7}$ ${9}_{-9}^{+5}$ ${48}_{-7}^{+8}$ ${23}_{-2}^{+1}$ −742.2 −716.9 136.4 0.45
c flat ${0.705}_{-0.005}^{+0.005}$ ${2117}_{-125}^{+87}$ ${1961}_{-136}^{+106}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-1}^{+1}$ −746.0 1.9 −726.1 −3.5 141.8 0.37
c cont. ${0.663}_{-0.020}^{+0.021}$ ${2271}_{-205}^{+147}$ ${1960}_{-165}^{+91}$ ${2964}_{-20}^{+36}$ ${32}_{-10}^{+10}$ ${47}_{-7}^{+6}$ ${8}_{-8}^{+4}$ ${47}_{-7}^{+6}$ ${23}_{-2}^{+1}$ −747.9 −722.6 133.4 0.52
d flat ${0.388}_{-0.003}^{+0.003}$ ${2118}_{-127}^{+86}$ ${1962}_{-133}^{+108}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −723.2 24.6 −703.2 19.3 180.9 0.01
d cont. ${0.309}_{-0.016}^{+0.015}$ ${2551}_{-157}^{+253}$ ${2000}_{-78}^{+107}$ ${2937}_{-27}^{+63}$ ${48}_{-9}^{+11}$ ${49}_{-12}^{+9}$ ${8}_{-8}^{+4}$ ${52}_{-10}^{+12}$ ${23}_{-2}^{+1}$ −747.8 −722.5 151.4 0.16
e (T5)a flat ${0.477}_{-0.004}^{+0.004}$ ${2117}_{-126}^{+86}$ ${1961}_{-131}^{+110}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −759.8 −3.2 −739.9 −8.7 141.4 0.38
e (T5)a cont. ${0.498}_{-0.020}^{+0.021}$ ${2125}_{-120}^{+84}$ ${1982}_{-124}^{+118}$ ${2972}_{-15}^{+28}$ ${16}_{-12}^{+8}$ ${47}_{-6}^{+5}$ ${12}_{-12}^{+5}$ ${43}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −756.6 −731.2 139.5 0.38
e (T7)b flat ${0.504}_{-0.004}^{+0.004}$ ${2116}_{-125}^{+87}$ ${1960}_{-135}^{+107}$ ${2975}_{-14}^{+25}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −743.3 12.6 −723.4 7.1 158.9 0.10
e (T7)b cont. ${0.449}_{-0.021}^{+0.021}$ ${2559}_{-118}^{+163}$ ${2032}_{-75}^{+100}$ ${2937}_{-27}^{+63}$ ${46}_{-9}^{+9}$ ${51}_{-12}^{+8}$ ${10}_{-10}^{+4}$ ${38}_{-8}^{+11}$ ${23}_{-2}^{+1}$ −755.9 −730.5 139.7 0.37
ec flat ${0.493}_{-0.003}^{+0.003}$ ${2116}_{-127}^{+85}$ ${1960}_{-133}^{+108}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −767.5 −5.6 −747.5 −10.9 142.0 0.37
ec cont. ${0.480}_{-0.023}^{+0.022}$ ${2267}_{-209}^{+149}$ ${1981}_{-172}^{+102}$ ${2965}_{-20}^{+35}$ ${30}_{-11}^{+10}$ ${48}_{-8}^{+6}$ ${9}_{-9}^{+4}$ ${43}_{-6}^{+6}$ ${23}_{-2}^{+1}$ −761.9 −736.6 140.6 0.35
f flat ${0.641}_{-0.005}^{+0.005}$ ${2117}_{-125}^{+87}$ ${1961}_{-130}^{+111}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −746.7 0 −726.7 −5.3 144.0 0.33
f cont. ${0.627}_{-0.027}^{+0.026}$ ${2395}_{-254}^{+199}$ ${2022}_{-106}^{+159}$ ${2959}_{-23}^{+41}$ ${35}_{-11}^{+11}$ ${50}_{-9}^{+6}$ ${10}_{-10}^{+5}$ ${40}_{-7}^{+8}$ ${23}_{-2}^{+1}$ −746.7 −721.4 135.8 0.46
g flat ${0.774}_{-0.006}^{+0.006}$ ${2116}_{-126}^{+87}$ ${1960}_{-131}^{+109}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −750.4 -7.8 −730.4 −13.1 137.3 0.48
g cont. ${0.755}_{-0.027}^{+0.027}$ ${2176}_{-173}^{+109}$ ${1987}_{-133}^{+122}$ ${2969}_{-17}^{+31}$ ${23}_{-13}^{+12}$ ${48}_{-6}^{+5}$ ${10}_{-10}^{+4}$ ${47}_{-7}^{+6}$ ${23}_{-2}^{+1}$ −742.6 −717.3 139.1 0.39

Notes. Posterior distributions are provided for 16 MCMC optimization procedures resulting from two model frameworks and eight data sets. The model frameworks are the flat model, in which the transmission spectrum is unaffected by photospheric features, and the contamination model (identified as cont. here), in which the covering fractions of spots and faculae are allowed to differ from the whole-disk covering fractions. The eight data sets are the seven individual transits and the combined TRAPPIST-1 e data set. Medians and 68% confidence intervals of the fitted parameters are quoted. The AICc corrected for small sample sizes, BIC, χ2, and corresponding p-value for each model are provided. Each model has 144 data points (14 for the transmission spectrum and 130 for the stellar spectrum). Flat models have 137 degrees of freedom, and contamination models have 135.

aTransit 5. bTransit 7. cWeighted mean of Transits 5 and 7.

Download table as:  ASCIITypeset image

Of the single-planet spectra, both ICs only indicate decisive evidence against the flat model for one case: the TRAPPIST-1 d data set. Considering its χ2 value, the flat model for this data set is ruled out at 99% confidence (p = 0.01). For TRAPPIST-1 e, of the two transits that were observed in two visits separated by 12 days, one shows evidence to support the contamination model (Transit 7), but the other (Transit 5) does not. This difference result for two transits of the same planet could suggest temporal variability of stellar contamination. For the remaining data sets, both ICs generally support the same model—the exception being the TRAPPIST-1 c data set—though they do not both rise to the level of decisive evidence. In general, the additional complexity of the contamination models results in lower χ2 values, as expected, though the ICs show that the additional complexity is not decisively warranted by the data for any of the data sets besides that of TRAPPIST-1 d.

6.4. Fits to Combined Transmission Spectra

The observed effect of a stellar contamination signal scales with the transit depth (Equation (6)). The impact of stellar contamination is therefore more readily observable in the spectra of exoplanets with deeper transit depths. In the case of TRAPPIST-1, assuming that a steady-state stellar contamination signal similarly affects all the individual transmission spectra, we can coadd the individual transmission spectra to increase the signal-to-noise ratio. An examination of the combined transit spectrum can reveal whether the regions probed by the transit chords have different spectra from the average spectrum of the stellar disk.

Thus, in addition to the single-planet spectra, we fit both model frameworks to seven combinations of TRAPPIST-1 transmission spectra. The first combination is the sum of transit depths for all TRAPPIST-1 planets observed with HST, b–g, using the weighted mean spectrum of TRAPPIST-1 e from Transits 5 and 7. The resulting spectrum utilizes all of the available data and is, in effect, what one would observe if TRAPPIST-1 b–g transited simultaneously. This approach probes for shared spectral features, similar to the analysis of the double transit of TRAPPIST-1 b and c by de Wit et al. (2016). In this case, we are primarily interested in a stellar contribution that affects all transmission spectra similarly, such as surface active regions that are outside of all the planetary transit chords. This can be more easily studied in the combined spectra because the stellar contamination signal combines multiplicatively with any planetary transmission spectrum. The remaining combinations exclude the contribution from one of the six planets in turn, allowing us to examine the effect of the individual planets on the combined result. For each combination, we also included the sum of the corresponding K2 and SSO (Ducrot et al. 2018) and Spitzer 4.5 μm transit depths (Delrez et al. 2018b) in the fitting procedure and added the uncertainties of the individual transit depths in quadrature.

Figure 10 shows the TRAPPIST-1 b–g combined transmission spectrum, the out-of-transit stellar spectrum, and the best-fit flat and contamination models. The K2, SSO, and Spitzer transit depths are 3.6σ, 1.0σ, and 7.1σ below the mean of the combined HST transit depth, respectively. The combined HST transmission spectrum displays a notable decrease in transit depth around 1.4 μm, which coincides with a strong water absorption band. This "inverted" water feature is the opposite of the water absorption signature commonly observed in transiting exoplanet atmospheres (e.g., Sing et al. 2016). The offsets between instruments and the apparent 1.4 μm decrease are also evident in all five-planet combined transmission spectra (Figure 11), which illustrates that it is not due solely to the spectrum of an individual planet. We find that the offsets between instruments and the apparent 1.4 μm decrease are both reproduced well by the contamination model for each combined spectrum.

Figure 10.

Figure 10. Stellar contamination model jointly fit to K2+SSO+HST+Spitzer TRAPPIST-1 combined transmission spectra and the observed HST stellar spectrum. The left panel shows the combined transmission spectrum (blue points) and best-fitting contamination and flat models (black solid and gray dashed lines, respectively). The inset panel highlights the HST/WFC3 G141 data. The right panel shows the observed HST/WFC3 G141 out-of-transit stellar spectrum of TRAPPIST-1 (blue line) with a scaled uncertainty determined by the MCMC optimization procedure (shaded region). The best-fit disk-integrated model stellar spectra for the contamination (black lines) and flat models (gray lines) are indistinguishable.

Standard image High-resolution image
Figure 11.

Figure 11. Stellar contamination models jointly fit to K2+SSO+HST+Spitzer TRAPPIST-1 five-planet combined transmission spectra and the observed HST stellar spectrum. Data and models are offset for clarity. The left panel shows the combined transmission spectra and models, the middle panel highlights the HST/WFC3 G141 transmission spectrum, and the right panel shows the HST/WFC3 G141 out-of-transit stellar spectrum. The best-fit disk-integrated model stellar spectra for the contamination (black lines) and flat models (gray lines) are indistinguishable. The figure elements are the same as those in Figure 10.

Standard image High-resolution image

Table 12 provides the complete results of the model fits to the combined spectra. For each combination, the AICc and BIC both prefer the contamination model. According to the ICs, each data set provides decisive evidence against the flat model (ΔAICc > + 10, ΔBIC > + 10). In each case, the ICs show that the data warrant the inclusion of the additional parameters in the contamination models.

Table 12.  Results of Stellar Contamination Model Fits to Combined Spectra

Combination Model Fitted Parameter AICc ΔAICc BIC ΔBIC χ2 p
    D (%) Tphot (K) Tspot (K) Tfac (K) Fspot Ffac fspot ffac η            
b–g flat ${3.748}_{-0.013}^{+0.013}$ ${2117}_{-128}^{+84}$ ${1961}_{-132}^{+108}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −687.7 39.1 −667.8 23.6 177.8 0.01
b–g cont. ${3.467}_{-0.059}^{+0.058}$ ${2425}_{-178}^{+168}$ ${2006}_{-93}^{+127}$ ${2957}_{-25}^{+43}$ ${38}_{-8}^{+8}$ ${48}_{-8}^{+6}$ ${10}_{-10}^{+4}$ ${45}_{-6}^{+6}$ ${23}_{-2}^{+1}$ −726.8 −701.4 130.7 0.59
c–g flat ${3.000}_{-0.011}^{+0.012}$ ${2116}_{-123}^{+89}$ ${1960}_{-128}^{+113}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −705.0 26.2 −685.0 20.8 164.8 0.05
c–g cont. ${2.794}_{-0.051}^{+0.054}$ ${2419}_{-192}^{+179}$ ${2013}_{-98}^{+133}$ ${2954}_{-25}^{+46}$ ${38}_{-9}^{+8}$ ${49}_{-9}^{+6}$ ${10}_{-10}^{+5}$ ${45}_{-6}^{+6}$ ${23}_{-2}^{+1}$ −731.2 −705.9 130.4 0.59
b, d–g flat ${3.043}_{-0.012}^{+0.012}$ ${2117}_{-125}^{+86}$ ${1961}_{-134}^{+107}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-1}^{+1}$ −697.0 29.1 −677.1 23.6 172.0 0.02
b, d–g cont. ${2.808}_{-0.054}^{+0.053}$ ${2480}_{-162}^{+206}$ ${2033}_{-85}^{+119}$ ${2949}_{-27}^{+51}$ ${39}_{-8}^{+8}$ ${50}_{-10}^{+7}$ ${11}_{-11}^{+5}$ ${45}_{-6}^{+7}$ ${23}_{-2}^{+1}$ −726.1 −700.7 134.5 0.50
b–c, e–g flat ${3.366}_{-0.013}^{+0.012}$ ${2118}_{-129}^{+84}$ ${1961}_{-136}^{+106}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −700.0 26.8 −680.1 21.3 166.4 0.04
b–c, e–g cont. ${3.165}_{-0.055}^{+0.055}$ ${2425}_{-211}^{+173}$ ${2014}_{-101}^{+146}$ ${2954}_{-25}^{+46}$ ${37}_{-9}^{+8}$ ${49}_{-9}^{+7}$ ${10}_{-10}^{+5}$ ${44}_{-6}^{+6}$ ${23}_{-2}^{+1}$ −726.8 −701.4 131.2 0.58
b–d, f–g flat ${3.254}_{-0.012}^{+0.012}$ ${2117}_{-124}^{+87}$ ${1961}_{-133}^{+107}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −693.6 34.5 −673.6 29.1 172.7 0.02
b–d, f–g cont. ${2.999}_{-0.052}^{+0.052}$ ${2444}_{-183}^{+181}$ ${2015}_{-92}^{+125}$ ${2952}_{-26}^{+48}$ ${39}_{-9}^{+8}$ ${49}_{-9}^{+7}$ ${10}_{-10}^{+5}$ ${46}_{-6}^{+6}$ ${23}_{-2}^{+1}$ −728.1 −702.7 130.1 0.60
b–e, g flat ${3.105}_{-0.012}^{+0.012}$ ${2117}_{-125}^{+87}$ ${1960}_{-133}^{+106}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −693.6 36.8 −673.6 31.4 174.6 0.02
b–e, g cont. ${2.849}_{-0.051}^{+0.051}$ ${2407}_{-191}^{+147}$ ${2001}_{-95}^{+125}$ ${2956}_{-24}^{+44}$ ${38}_{-9}^{+8}$ ${48}_{-8}^{+6}$ ${9}_{-9}^{+4}$ ${46}_{-6}^{+6}$ ${23}_{-2}^{+1}$ −730.4 −705.0 130.3 0.60
b–f flat ${2.970}_{-0.012}^{+0.011}$ ${2117}_{-124}^{+90}$ ${1960}_{-133}^{+108}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −690.6 40.3 −670.6 35.9 178.5 0.01
b–f cont. ${2.721}_{-0.049}^{+0.051}$ ${2390}_{-164}^{+141}$ ${1987}_{-96}^{+119}$ ${2959}_{-23}^{+41}$ ${38}_{-8}^{+8}$ ${47}_{-7}^{+6}$ ${9}_{-9}^{+4}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −731.9 −706.5 129.8 0.61

Note. The combined spectra are listed in the first column. The combination b–g utilizes all of the current data, including the weighted mean of the two transmission spectra of TRAPPIST-1 e. The others are five-planet combinations, which in turn exclude the spectrum of a single planet. The remaining table elements are the same as those in Table 11.

Download table as:  ASCIITypeset image

Table 15.  Results of Stellar Contamination Model Fits to Single-Planet Spectra Using Only HST and Spitzer Data

Data Set Model Fitted Parameter AICc ΔAICc BIC ΔBIC χ2 p
    D (%) Tphot (K) Tspot (K) Tfac (K) Fspot Ffac fspot ffac η            
b flat ${0.745}_{-0.006}^{+0.006}$ ${2117}_{-128}^{+85}$ ${1961}_{-134}^{+108}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −707.8 −8.6 −687.9 −14.0 148.3 0.21
b cont. ${0.675}_{-0.023}^{+0.021}$ ${2210}_{-241}^{+190}$ ${1980}_{-136}^{+119}$ ${2964}_{-20}^{+36}$ ${29}_{-16}^{+16}$ ${47}_{-8}^{+5}$ ${10}_{-10}^{+4}$ ${52}_{-9}^{+11}$ ${23}_{-2}^{+1}$ −699.2 −673.9 151.5 0.13
c flat ${0.705}_{-0.005}^{+0.005}$ ${2118}_{-126}^{+87}$ ${1961}_{-134}^{+107}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −718.6 −6.8 −698.8 −12.2 139.5 0.38
c cont. ${0.663}_{-0.020}^{+0.020}$ ${2227}_{-236}^{+154}$ ${1973}_{-160}^{+98}$ ${2965}_{-19}^{+35}$ ${29}_{-13}^{+15}$ ${47}_{-7}^{+5}$ ${9}_{-9}^{+4}$ ${48}_{-9}^{+9}$ ${23}_{-2}^{+1}$ −711.8 −686.6 140.4 0.31
d flat ${0.388}_{-0.003}^{+0.003}$ ${2117}_{-127}^{+86}$ ${1961}_{-132}^{+109}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −696.4 25.7 −676.5 20.4 178.5 0.01
d cont. ${0.309}_{-0.016}^{+0.015}$ ${2653}_{-98}^{+151}$ ${2030}_{-62}^{+91}$ ${2931}_{-31}^{+62}$ ${53}_{-8}^{+11}$ ${54}_{-16}^{+12}$ ${9}_{-9}^{+4}$ ${42}_{-18}^{+19}$ ${23}_{-2}^{+1}$ −722.1 −696.9 149.5 0.16
e (T5)a flat ${0.478}_{-0.004}^{+0.004}$ ${2117}_{-125}^{+87}$ ${1961}_{-133}^{+109}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −732.6 −4.2 −712.8 −9.6 139.6 0.38
e (T5)a cont. ${0.493}_{-0.021}^{+0.021}$ ${2123}_{-120}^{+82}$ ${1982}_{-120}^{+119}$ ${2973}_{-15}^{+27}$ ${15}_{-12}^{+8}$ ${47}_{-6}^{+5}$ ${12}_{-12}^{+5}$ ${44}_{-7}^{+6}$ ${23}_{-2}^{+1}$ −728.4 −703.2 139.0 0.34
e (T7)b flat ${0.506}_{-0.004}^{+0.004}$ ${2117}_{-125}^{+86}$ ${1960}_{-136}^{+105}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-5}^{+5}$ ${23}_{-2}^{+1}$ −721.4 6.6 −701.5 1.3 152.0 0.15
e (T7)b cont. ${0.447}_{-0.022}^{+0.021}$ ${2559}_{-135}^{+187}$ ${2030}_{-81}^{+113}$ ${2941}_{-27}^{+59}$ ${46}_{-10}^{+11}$ ${51}_{-12}^{+9}$ ${10}_{-10}^{+4}$ ${39}_{-12}^{+14}$ ${23}_{-2}^{+1}$ −728.0 −702.8 139.2 0.34
ec flat ${0.494}_{-0.003}^{+0.003}$ ${2116}_{-124}^{+88}$ ${1960}_{-133}^{+108}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −742.8 −7.5 −723.0 13.0 137.7 0.42
ec cont. ${0.479}_{-0.023}^{+0.021}$ ${2214}_{-178}^{+127}$ ${1970}_{-161}^{+98}$ ${2968}_{-18}^{+32}$ ${27}_{-11}^{+11}$ ${47}_{-7}^{+5}$ ${9}_{-9}^{+4}$ ${45}_{-7}^{+6}$ ${23}_{-2}^{+1}$ −735.3 −710.0 139.1 0.34
f flat ${0.643}_{-0.005}^{+0.006}$ ${2116}_{-126}^{+85}$ ${1961}_{-132}^{+109}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −728.5 −8.3 −708.6 −13.7 134.8 0.49
f cont. ${0.625}_{-0.025}^{+0.025}$ ${2196}_{-193}^{+116}$ ${1984}_{-141}^{+112}$ ${2969}_{-17}^{+31}$ ${24}_{-13}^{+12}$ ${48}_{-6}^{+5}$ ${10}_{-10}^{+4}$ ${46}_{-7}^{+7}$ ${23}_{-2}^{+1}$ −720.2 −694.9 137.1 0.39
g flat ${0.776}_{-0.006}^{+0.006}$ ${2117}_{-128}^{+86}$ ${1961}_{-131}^{+111}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −726.3 -4.7 −706.4 -10.3 134.7 0.49
g cont. ${0.747}_{-0.028}^{+0.027}$ ${2138}_{-138}^{+90}$ ${1981}_{-124}^{+118}$ ${2971}_{-15}^{+29}$ ${18}_{-13}^{+9}$ ${47}_{-6}^{+5}$ ${11}_{-11}^{+5}$ ${49}_{-7}^{+7}$ ${23}_{-2}^{+1}$ −721.6 −696.3 134.2 0.45

Notes. The table elements are the same as those in Table 11. Each model has 142 data points (12 for the transmission spectrum and 130 for the stellar spectrum). Flat models have 135 degrees of freedom, and contamination models have 133.

aTransit 5. bTransit 7. cWeighted mean of Transits 5 and 7.

Download table as:  ASCIITypeset image

Table 16.  Results of Stellar Contamination Model Fits to Combined Spectra Using HST+Spitzer Data Only

Combination Model Fitted Parameter AICc ΔAICc BIC ΔBIC χ2 p
    D (%) Tphot (K) Tspot (K) Tfac (K) Fspot Ffac fspot ffac η            
b–g flat ${3.754}_{-0.013}^{+0.013}$ ${2119}_{-125}^{+88}$ ${1961}_{-133}^{+110}$ ${2974}_{-14}^{+26}$ ${16}_{-14}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −671.9 30.5 −652.1 25.0 169.0 0.03
b–g cont. ${3.474}_{-0.059}^{+0.059}$ ${2408}_{-252}^{+235}$ ${2002}_{-134}^{+142}$ ${2955}_{-24}^{+45}$ ${38}_{-10}^{+9}$ ${48}_{-10}^{+7}$ ${10}_{-10}^{+5}$ ${44}_{-7}^{+9}$ ${23}_{-2}^{+1}$ −702.4 −677.1 131.3 0.52
c–g flat ${3.005}_{-0.011}^{+0.012}$ ${2116}_{-126}^{+86}$ ${1960}_{-135}^{+106}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −687.9 17.2 −668.0 11.9 157.2 0.09
c–g cont. ${2.796}_{-0.054}^{+0.054}$ ${2351}_{-250}^{+175}$ ${1987}_{-157}^{+120}$ ${2958}_{-22}^{+42}$ ${36}_{-10}^{+9}$ ${48}_{-9}^{+6}$ ${9}_{-9}^{+4}$ ${45}_{-6}^{+8}$ ${23}_{-2}^{+1}$ −705.1 −679.9 132.8 0.49
b, d–g flat ${3.048}_{-0.012}^{+0.012}$ ${2118}_{-126}^{+88}$ ${1962}_{-130}^{+112}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −680.4 20.2 −660.6 14.7 163.9 0.05
b, d–g cont. ${2.810}_{-0.055}^{+0.054}$ ${2377}_{-275}^{+198}$ ${1997}_{-163}^{+117}$ ${2957}_{-23}^{+43}$ ${37}_{-10}^{+10}$ ${48}_{-9}^{+7}$ ${9}_{-9}^{+4}$ ${45}_{-7}^{+9}$ ${23}_{-2}^{+1}$ −700.6 −675.3 136.7 0.40
b–c, e–g flat ${3.372}_{-0.013}^{+0.013}$ ${2116}_{-126}^{+88}$ ${1959}_{-132}^{+108}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −687.1 14.4 −667.2 9.0 154.7 0.12
b–c, e–g cont. ${3.168}_{-0.054}^{+0.057}$ ${2294}_{-223}^{+152}$ ${1956}_{-165}^{+97}$ ${2962}_{-21}^{+38}$ ${33}_{-10}^{+8}$ ${47}_{-8}^{+5}$ ${8}_{-8}^{+4}$ ${45}_{-6}^{+7}$ ${23}_{-2}^{+1}$ −701.5 −676.2 133.5 0.47
b–d, f–g flat ${3.259}_{-0.013}^{+0.013}$ ${2116}_{-126}^{+85}$ ${1961}_{-131}^{+109}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −675.9 25.8 −656.0 20.4 165.6 0.04
b–d, f–g cont. ${3.007}_{-0.054}^{+0.054}$ ${2392}_{-269}^{+223}$ ${1997}_{-151}^{+128}$ ${2956}_{-24}^{+44}$ ${37}_{-10}^{+10}$ ${48}_{-10}^{+7}$ ${9}_{-9}^{+5}$ ${45}_{-7}^{+9}$ ${23}_{-2}^{+1}$ −701.7 −676.4 133.0 0.48
b–e, g flat ${3.110}_{-0.012}^{+0.012}$ ${2115}_{-122}^{+88}$ ${1959}_{-132}^{+108}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −673.1 30.9 −653.2 25.5 170.1 0.02
b–e, g cont. ${2.850}_{-0.053}^{+0.051}$ ${2406}_{-238}^{+230}$ ${2001}_{-127}^{+140}$ ${2954}_{-25}^{+46}$ ${38}_{-10}^{+9}$ ${48}_{-9}^{+7}$ ${9}_{-9}^{+4}$ ${45}_{-7}^{+9}$ ${23}_{-2}^{+1}$ −704.0 −678.7 131.9 0.51
b–f flat ${2.976}_{-0.012}^{+0.012}$ ${2118}_{-126}^{+87}$ ${1961}_{-128}^{+112}$ ${2974}_{-14}^{+26}$ ${16}_{-15}^{+7}$ ${46}_{-6}^{+5}$ ${23}_{-2}^{+1}$ −672.2 31.3 −652.3 26.0 171.4 0.02
b–f cont. ${2.733}_{-0.052}^{+0.050}$ ${2502}_{-153}^{+259}$ ${2028}_{-91}^{+151}$ ${2950}_{-25}^{+50}$ ${41}_{-10}^{+10}$ ${50}_{-10}^{+8}$ ${10}_{-10}^{+5}$ ${43}_{-8}^{+11}$ ${23}_{-2}^{+1}$ −703.5 −678.3 132.7 0.49

Note. The table elements are the same as those in Table 12. Each model has 142 data points (12 for the transmission spectrum and 130 for the stellar spectrum). Flat models have 135 degrees of freedom, and contamination models have 133.

Download table as:  ASCIITypeset image

Table 17.  Predictions for K2 (0.42–0.9 μm) and I+z (0.8–1.1 μm) Transit Depths from Fits to Combined HST+Spitzer Transmission Spectra

Combination Model K2 Transit Depth (%) I+z Transit Depth (%)
    (0.42–0.9 μm) (0.8–1.1 μm)
b–g flat 3.75 3.75
b–g cont. 3.43 3.63
c–g flat 3.00 3.00
c–g cont. 2.79 2.91
b, d–g flat 3.05 3.05
b, d–g cont. 2.81 2.95
b–c, e–g flat 3.37 3.37
b–c, e–g cont. 3.15 3.27
b–d, f–g flat 3.26 3.26
b–d, f–g cont. 3.00 3.16
b–e, g flat 3.11 3.11
b–e, g cont. 2.87 3.03
b–f flat 2.98 2.98
b–f cont. 2.69 2.91

Download table as:  ASCIITypeset image

6.4.1. Impact of the Stellar Spectrum

For all model fits, both to the individual and combined transmission spectra, the fitted value of η is ${23}_{-2}^{+1}$ (Tables 11 and 12). Notably, this is the case for all flat models, in which the stellar parameters are determined by the out-of-transit stellar spectrum alone and have no effect on the fit to the transmission spectra. This implies that the observational errors of the HST/WFC3 G141 spectrum of TRAPPIST-1 must be largely inflated to match the DRIFT-PHOENIX model spectra, even when using one composed of multiple component spectra, which should provide more flexibility to the model fits. Inspection of the right panel of Figure 10 shows that the data and model agree on the general shape of the spectrum but disagree on many details and the continuum level for the bluest and reddest wavelengths. For further clarity, Figure 12 illustrates these same spectra but with the true uncertainty on the observations shown. Also shown is a DRIFT-PHOENIX model spectrum for a star with TRAPPIST-1's effective temperature and surface gravity (T = 2559 K, $\mathrm{log}g=5.21;$ Gillon et al. 2017). This single-component model disagrees with the data significantly between roughly 1.34 and 1.6 μm, which demonstrates why even the flat models prefer multicomponent stellar spectra, as shown by the fitted values of the Fspot and Ffac parameters in Tables 11 and 12.

Figure 12.

Figure 12. Comparison of the observed TRAPPIST-1 HST/WFC3 G141 spectrum to the models. The observed out-of-transit stellar spectrum is shown in blue; the uncertainties on the observed spectrum are smaller that the line width. The black line shows the best-fit disk-integrated model spectrum from the contamination model fit to the combined TRAPPIST-1 b–g data set (Table 12). All other three-component fitted models from this analysis are indistinguishable. By contrast, the DRIFT-PHOENIX model stellar spectrum for a star with T = 2559 K and log g = 5.21 (salmon line) provides a relatively poor fit to the data.

Standard image High-resolution image

The stellar spectrum contains many more data points than the transmission spectrum. Therefore, disagreement between the observed and model stellar spectra could strongly influence our results. In principle, the η parameter guards against this by effectively deweighting the stellar spectrum in the MCMC optimization procedure. Nonetheless, we investigated how the inclusion of the stellar spectrum in the model framework affects the results using the TRAPPIST-1 b–g data set. Considering only the transmission spectrum, the flat model has 14 data points and one fitted parameter, giving 13 degrees of freedom. The ${\chi }^{2}$ value for this model is 51.8, indicating that it is conclusively ruled out (p < 10−5). The AICc and BIC for this model are −129.3 and −129.0, respectively. By contrast, the contamination model has eight fitted parameters and thus gives 6 degrees of freedom when considering only the 14 data points of the transmission spectrum. The χ2 value for the contamination model is 4.2, indicating that it is not ruled out by the data (p = 0.64). Additionally, the AICc and BIC for this model are −135.2 and −158.1, respectively. A comparison of the ICs for the two models yields ΔAICc = +5.9 and ΔBIC = +29.1 in favor of the contamination model. Thus, relative to the contamination model, the flat model is ruled out decisively by the BIC and strongly by the AICc, which more stringently penalizes model complexity. This exercise shows that the additional complexity of the contamination model is warranted by the transmission spectrum alone. For the remainder of this analysis, however, we opt to utilize the additional information in the stellar spectrum and discuss the results taking into account both the transmission and stellar spectra.

6.4.2. Impact of Instrumental Offsets

Given the significant offsets between the HST transit depths and those from other instruments, one might reasonably wonder how strongly the model results rely on the offsets between the instruments. To investigate this, we conducted this same analysis on the combined TRAPPIST-1 b–g data set using only the HST data for the transmission spectrum. In this case, the ICs prefer the flat model (ΔAICc > + 2, ΔBIC > + 8). In other words, the additional complexity of the contamination model is not warranted by the HST spectra alone, even though it results in smaller χ2 values; the offsets between the HST and Spitzer measurements are integral to the interpretation of stellar contamination impacting the transmission spectra. However, we also note that the contamination models fit to the HST+Spitzer data set accurately predicted the K2 and SSO transit depths (see Appendix C). The best-fit parameters do not differ significantly between the analyses of the HST+Spitzer data sets and the full K2+SSO+HST+Spitzer data sets. It is unlikely that systematics among four instruments mimic an astrophysical signal. Therefore, we argue for an astrophysical origin and against instrumental systematics as the source of the overall shape of the TRAPPIST-1 combined transmission spectrum.

Additionally, one might also wonder if any offset induced by instrumental systematics could be mistakenly interpreted as evidence for stellar contamination. In other words, can any combination of offsets be well fit by a stellar contamination model? To examine this point, we conducted the same analysis on two hypothetical variants of the TRAPPIST-1 b–g data set. In the first, we held the HST and Spitzer transit depths to their measured values and perturbed the K2 and SSO depths to 50% of their measured values while keeping the original measurement uncertainties for all data points. The χ2 values for the flat and contamination models are 1115 and 392 for 137 and 135 degrees of freedom, respectively, indicating that both models are conclusively ruled out (p ≪ 10−10). In the second variant, we perturbed the K2 and SSO depths to 200% of their measured values while keeping the original HST and Spitzer points. The MCMC chains do not converge for the contamination model in this case ($\hat{R}=1.05$ for D and $\hat{R}=1.07$ for Fspot, ffac, and fspot), illustrating the difficulty of fitting such a spectrum with a stellar contamination model. Nonetheless, taking the fits at face value, the χ2 values (3219 and 1436 for the flat and contamination models, respectively) indicate that both models are again conclusively ruled out (p ≪ 10−10). This exercise shows that arbitrary offsets cannot be well fit by the additional parameters afforded by the stellar contamination model. Instead, the observed offset between HST and Spitzer transit depths provides a specific prediction for the K2 and SSO depths (see Appendix C for details), which is borne out by the observations.

6.4.3. Physical Interpretation

Returning to the interpretation of the fits to the full K2+SSO+HST+Spitzer transmission spectra, we note that the posterior distributions for the stellar parameters (Tphot, Tspot, Tfac, Fspot, Ffac, fspot, and ffac) in the contamination models listed in Table 12 are consistent for all combined transmission spectra. They broadly agree on the same general picture for TRAPPIST-1, namely a heterogeneous photosphere comprised of three components: a hot component or faculae (T ∼ 3000 K) covering ∼50% of the projected stellar disk, a cool component or spots (T ∼ 2000 K) covering ∼40%, and an intermediate component or immaculate photosphere (T ∼ 2400 K) covering the remaining ∼10% of the disk. Within the region transited by the TRAPPIST-1 planets, spots are less prevalent, covering only ∼10% of the transit chord, which results in a spectral mismatch between the disk-integrated stellar spectrum and the light source for the transit spectroscopy observations.

Figure 13 illustrates examples of the joint posterior distributions for both modeling frameworks. For the contamination model, the distributions of two parameters, fspot and Tfac, deserve special notice. Here fspot piles on the lower bound of the allowed parameter space in the contamination models and is consistent at 1σ confidence with no spots being present within the transit chord in each case. The facula temperature is notable because it piles on the upper bound of the allowed parameter space, even in the flat models. This suggests that the HST spectrum of TRAPPIST-1 shows evidence for a high-temperature component beyond the limit of the DRIFT-PHOENIX model grid, a result in broad agreement with the bright spots proposed by Morris et al. (2018a) from an analysis of K2 and Spitzer photometry. Given the limitations of the model grid to sample high-enough values for Tfac, the fitted values of Ffac and ffac are likely overestimated.

Figure 13.

Figure 13. Posterior distributions of free parameters in CPAT model fits to the observed transmission spectrum of TRAPPIST-1 b–g. Results are shown for the flat and contamination models in the left and right panels, respectively.

Standard image High-resolution image

The discrepancy between fspot and Fspot suggests that the 28% of the projected stellar disk (or 56% of a half hemisphere) covered by the transit chords (Delrez et al. 2018b) is less spotted than the whole disk and seems to point to the presence of an active region not represented within the transit chord. As we discuss in Section 6.5, this arrangement bears some resemblance to the active high latitudes and circumpolar spot structures observed on fully convective M dwarfs (Phan-Bao et al. 2009; Barnes et al. 2015, 2017) and polar spots observed on active earlier-type stars (see Strassmeier 2009 and references therein). Still, while the ICs show that models that allow for stellar contamination in the transmission spectra are strongly preferred over those that assume no stellar contamination, the tendency of Tfac and fspot to pile on the boundaries of their allowed ranges highlights the limitations of this analysis (discussed in more detail in Section 6.7) and cautions against overinterpreting the specific component temperatures or covering fractions determined by this analysis.

Nonetheless, these results demonstrate that, in principle, the features in the TRAPPIST-1 transmission spectra can result from a heterogeneous stellar photosphere and not from transmission through the planetary atmospheres. Of course, both stellar and planetary signals can contribute to the observed spectra, and future efforts to jointly constrain the contributions of each source may bear fruit. However, these results urge caution for planetary interpretations of observed features in near-infrared transmission spectra from low-mass host stars. We consider this point more broadly in Section 7.

6.5. TRAPPIST-1 Results in Context

We present here a relatively simple model to explain a complicated physical phenomenon, namely the heterogeneity of the surface of TRAPPIST-1 and the its effect on the transmission spectra of the TRAPPIST-1 planets. Despite the limitations of the approach, we show that models that allow for imprints of stellar features in the combined TRAPPIST-1 transmission spectra are strongly preferred to those that do not. The model fits point to large spot and faculae covering fractions, suggesting a highly heterogeneous photosphere for TRAPPIST-1. In the following paragraphs, we provide further context for interpreting this result.

Photometric variability shows that M dwarfs have highly heterogeneous surfaces. McQuillan et al. (2013) measured rotation periods and variability amplitudes for 1570 M dwarfs in the Kepler sample with masses between 0.3 and 0.5 M. They found amplitudes ranging from 1.0 to 140.8 mmag in the Kepler bandpass, with a median amplitude of 7.6 mmag or 0.70% (McQuillan et al. 2013, Table 2). Newton et al. (2016) measured rotation periods for 387 field mid-to-late M dwarfs (M < 0.35 M) in the MEarth bandpass (roughly i+z) and found that they typically vary in brightness by 1%–2% as they rotate. Using an ensemble of model M-dwarf photospheres with randomly distributed active regions, Rackham et al. (2018) found that a 1% I-band variability amplitude corresponds to spot covering fractions ${F}_{\mathrm{spot}}={14}_{-7 \% }^{+16 \% }$ and faculae covering fractions ${F}_{\mathrm{fac}}={63}_{-25 \% }^{+5 \% }$ across all M-dwarf spectral types, assuming a typical spot radius of 2°. Importantly, they found that the relation between active-region coverages and variability amplitudes is not linear, as variability monitoring only traces the nonaxisymmetric component of stellar surface features (see also Jackson & Jeffries 2012). Therefore, the typical amplitudes of M dwarfs in the Kepler and MEarth samples indicate that variable M dwarfs can be covered in large part by active regions.

As noted above, TRAPPIST-1 demonstrates a photometric variability of roughly 1% in the I+z bandpass. This has generally been interpreted as active regions rotating into and out of view (Gillon et al. 2016; Vida et al. 2017), though Morris et al. (2018a) suggested that the variability of TRAPPIST-1 may be driven by a magnetic activity timescale rather than a rotation period. Assuming the variability owes to rotation, Rackham et al. (2018) found that it could be caused by spot and faculae with covering fractions of ${F}_{\mathrm{spot}}={8}_{-7 \% }^{+18 \% }$ and ${F}_{\mathrm{fac}}={54}_{-46 \% }^{+16 \% }$, respectively. We used this information as priors in the current analysis, so the results presented here are not completely independent. However, we point out these previous results to show that the active-region coverages found in this analysis are neither unexpected for TRAPPIST-1 given its variability level nor unique among the population of M dwarfs.

The difference between the best-fit values that we find for Fspot and fspot for the combined transmission spectra argues for variation in the latitudinal distribution of spots for TRAPPIST-1. This should not be surprising: Doppler imaging (Vogt & Penrod 1983) of early M dwarfs provides examples of both stars with spots emerging at preferential latitudes and those with spots at all latitudes (Barnes & Collier Cameron 2001; Barnes et al. 2004). However, the few Doppler images that exist for fully convective M dwarfs suggest that spots may emerge preferentially at high latitudes. To date, six mid-to-late M dwarfs have been studied with this technique: V374 Peg (M4V; Morin et al. 2008), G 164-31 (M4V; Phan-Bao et al. 2009), GJ 791.2A (M4.5V; Barnes et al. 2015, 2017), the binary GJ 65A and GJ 65B (M5.5V and M6V; Barnes et al. 2017), and LP 944-20 (M9V; Barnes et al. 2015). Three of these six targets (G 164-31, GJ 65A, and LP 944-20) only display spots at high-latitude or circumpolar regions. Additionally, GJ 791.2A shows a high-latitude circumpolar spot structure, as well as low-latitude spots. For earlier spectral types, which can be studied more readily with Doppler imaging techniques, polar spots are commonly observed on active stars (see Strassmeier 2009 and references therein) and may be stable and long-lived (Jeffers et al. 2007).

In the case of TRAPPIST-1, the combined transit chords of the seven TRAPPIST-1 planets probe a substantial portion (28%) of the projected stellar disk (Delrez et al. 2018b). However, given the frequent and prominent latitudinal variations in active-region occurrence observed in M dwarfs, it is unsurprising that our analysis suggests significant differences between the 28% equatorial zone and the polar regions of TRAPPIST-1. Therefore, while we cannot precisely constrain the latitudes of active regions in the current analysis, we suggest that features like the active high latitudes observed on fully convective M dwarfs or polar spots observed on active earlier-type stars may account for the different spot covering fractions that we fit to the transit chord and whole disk of TRAPPIST-1.

Spectral template fitting provides another approach for studying the heterogeneity of stellar surfaces. Observations of TiO molecular bands in dwarf stars with photospheres warmer than T ∼ 3500 K have been used to constrain spot sizes and covering fractions (Vogt 1979; Ramsey & Nations 1980; Vogt 1981). Using spectra of inactive G, K, and M dwarfs as templates, Neff et al. (1995) and later O'Neal et al. (1996, 1998) showed that TiO bands in medium-resolution (R ∼ 10,000) spectra of active G and K stars point to covering fractions of cool spots as large as 64%. Gully-Santiago et al. (2017) studied the T Tauri star LkCa 4 and found that spectra features in high-resolution near-infrared spectra were produced by hot (∼4100 K) and cool (∼2700–3000 K) photospheric components, with the cool component covering ∼80% of the stellar surface. The TiO bands in R ∼ 1000 spectra of a large sample of stars in the Pleiades show that spot covering fractions of ∼50% are common among K and M stars at ∼125 Myr (Fang et al. 2016).

In this work, we find that the low-resolution HST spectrum of TRAPPIST-1 is best fit by a template combination of three DRIFT-PHOENIX model stellar spectra. This is the case even for our flat models, in which the spot and faculae parameters are determined by fitting the stellar spectrum alone. While TRAPPIST-1 is both later-type and older than the stars in the examples referenced above, these previous efforts show that the inferred heterogeneity for TRAPPIST-1 is not extreme but rather typical of active stars.

6.6. Comparison with Morris et al. (2018c)

Recently, Morris et al. (2018c) investigated the problem of stellar contamination from TRAPPIST-1 using a novel transit light-curve "self-contamination" technique (Morris et al. 2018b). They determined Rp/Rs for TRAPPIST-1 b–h (individually) using Spitzer transit light curves based on the durations of the ingress/egress and the durations from the mid-ingress to mid-egress, for which stellar heterogeneity has a negligible effect, and compared it to the Rp/Rs values determined by the transit depths. They found consistent Rp/Rs measurements from the two methods and claimed a nondetection of stellar contamination. However, the Rp/Rs values calculated by the light-curve self-contamination method have large uncertainties, e.g., ∼10% versus 1.4% in this work for TRAPPIST-1 b, and therefore do not have the sensitivity to probe the level of stellar contamination in transmission spectra that we study here. In this light, the nondetection of Morris et al. (2018c) should be more properly viewed as a weak upper limit on stellar contamination, rather than evidence for the lack of it. Put simply, none of the individual transit depths (HST, Spitzer, K2, and SOO) that we used in this work would satisfy the criterion used by Morris et al. (2018c) for rejecting the null hypothesis (nondetection of stellar contamination), although we show that the combined transmission spectra strongly favor a stellar contamination model.

6.7. Limitations of the Model

While we build upon previous attempts to characterize the contribution of stellar heterogeneity to transmission spectra by imposing the constraint provided by the out-of-transit stellar spectrum, this model has several notable limitations. We utilize disk-integrated stellar spectra for the spectral components, which may neglect unique spectral features that can emerge from magnetically active regions (e.g., Norris et al. 2017). Similarly, we do not consider any chromospheric contribution to the stellar contamination spectrum. We neglect the effect of limb darkening and consider the photosphere to be static during the transit events, though the observed activity and relatively short rotation period of TRAPPIST-1 may allow for photospheric evolution on a timescale important for transit observations. We note that future efforts could improve on this initial analysis in these respects and more.

Nonetheless, despite the model limitations, we find that models including stellar heterogeneity effects provide significantly better fits to the combined TRAPPIST-1 transmission spectra than flat planetary transmission models. Interestingly, the spot and faculae covering fractions that we infer from the transmission spectra have considerably lower uncertainties than those inferred through modeling the star's rotational variability (Rackham et al. 2018). While the rotational variability of TRAPPIST-1 demonstrates the presence of photospheric heterogeneities, its magnitude poorly constrains their covering fractions. We find that the observed transmission spectra place tighter constraints on the stellar heterogeneity.

This interpretation of the spectra also offers testable predictions. We find the best-fit models to display notable decreases in the visual transit depths of the TRAPPIST-1 planets, which represent an observational challenge given the faintness of this ultracool dwarf in the visual, but may be observable with large ground-based telescopes. Broadband photometry over repeated transits can also overcome this challenge, and in Appendix C, we show that stellar contamination models fit to the HST+Spitzer data sets accurately predict the recently measured K2 (0.42–0.9 μm) and I+z (0.8–1.1 μm) combined transit depths (Ducrot et al. 2018). Looking forward, we find "inverse water features" akin to the 1.4 μm feature in the unexplored spectral region between 1.7 and 4 μm, which can be studied with JWST.

7. Outlook: HST and JWST Transit Spectroscopy

7.1. Stellar Contamination as Dominant Astrophysical Noise

The contamination seen in the combined TRAPPIST-1 data emerges due to the transit light source effect: the difference between the baseline stellar spectrum (disk-integrated) and the spectrum of the transit chord (actual light source for the transmission spectroscopy), as described by Rackham et al. (2018) in detail. The presence of ∼200 ppm-level (inverted) water features as stellar contamination in the HST/WFC3 spectra should serve as a red flag for investigators planning major future HST/WFC3 or JWST transit spectroscopy campaigns on TRAPPIST-1 or similar host stars. The possible range of stellar spectral contaminations has been discussed by Rackham et al. (2018), and here we will briefly discuss two aspects not addressed there: connection with atmospheric retrievals and emission spectroscopy.

Atmospheric retrievals use a Bayesian exploration of possible atmospheric models to identify best-fitting models and derive confidence intervals (and posterior probability distributions) for the atmospheric pressure–temperature structures, molecular abundances, and cloud properties. These models have been shown to be powerful in characterizing atmospheres (e.g., Madhusudhan & Seager 2009; Benneke & Seager 2012; Waldmann et al. 2015) and are widely anticipated to be the primary tools for interpreting high-quality transmission spectra from HST and JWST (e.g., Greene et al. 2016; Morley et al. 2017). The stellar contamination seen in the TRAPPIST-1 planets and sub-Neptune GJ 1214b (Rackham et al. 2017), as well as those predicted by Rackham et al. (2018) for most M-dwarf stars, highlight the importance of including this effect in retrievals. Indeed, without modeling and correcting for the relatively strong (∼200 ppm-level) inverted water absorption feature introduced by the stellar contamination, one cannot hope to accurately measure the water abundance (probably <80 ppm levels; Morley et al. 2017) in the transmission spectra of small planets. Recent efforts to include the effects of stellar contamination in retrievals, however, show promise in separating stellar and planetary contributions to transmission spectra (Pinhas et al. 2018, in press; Bixel et al. 2018, submitted; Espinoza et al. 2018, submitted).

We point out, furthermore, that stellar emission (eclipse) spectroscopy will be affected much less by stellar contamination, as the planet is its own light source for emission or eclipse spectroscopy. In fact, for the limiting case of infinitely slowly rotating stars with a constant (nonevolving) starspot/facular pattern, there will be no stellar contamination. Therefore, JWST emission spectroscopy of the TRAPPIST-1 planets is likely to be much easier to interpret than HST and JWST transmission spectroscopy. We caution, however, that for rapidly rotating stars—such as TRAPPIST-1—the evolution of the stellar spectrum due to its heterogeneity (e.g., Apai et al. 2013; Yang et al. 2015) may also pose a nonnegligible astrophysical systematic.

7.2. Combining Multi-epoch Transit Data from Planets Orbiting Heterogeneous Host Stars

A central question in HST and JWST transiting exoplanet observations is whether data from multiple transits can be combined. If the instrumental systematics can be robustly corrected and the astrophysical systematics are negligible, data can be coadded. This approach has been used in the HST/WFC3 IR transmission spectroscopy study of the sub-Neptune GJ 1214b by Kreidberg et al. (2014), where observations from 12 transits of the planet were coadded. In fact, the overall success of the empirical ramp-effect correction and the physically motivated RECTE model are very encouraging and suggest that photon noise–limited performance could be achievable for large, multi-epoch campaigns.

However, the astrophysical noise—stellar contamination through the transit light source effect (Rackham et al. 2018)—emerges as a major concern. Not only will the stellar contamination features often dominate the planetary absorption features in amplitude (as explored in detail in Rackham et al. 2018), but the contamination itself may change as the star rotates and its starspot/faculae coverage evolves over the course of a transit observation.

With a rotational period of 1.4 ± 0.05 days (Gillon et al. 2016), TRAPPIST-1 is a relatively rapidly rotating host star at the stellar/substellar boundary. It is worthwhile to briefly outline the implications of this rapid rotation by comparing four different timescales: starspot evolution, planetary orbits, stellar rotation, and the lengths of planetary transit observations.

Stellar contamination in transit spectra will depend on the heterogeneity of the projected stellar disk during the observations. A straightforward mechanism that alters the brightness and spectral distribution on the visible hemisphere of a star is the appearance, evolution, and disappearance of starspots and surrounding faculae. Although we have a limited understanding of starspot lifetimes, it seems reasonable to assume that their evolution may occur over timescales of 10–20 days, based on the similar processes observed in the Sun (Bradshaw & Hartigan 2014). This timescale is generally longer than the orbital periods of the TRAPPIST-1 planets. Therefore, we tentatively conclude that starspot evolution may only have a limited impact on stellar contamination variations in the case that consecutive planetary transits can be observed; nonetheless, it will have a major impact in coadding data from transits separated by months or years.

The stellar rotation, however, will introduce a more problematic source of time-varying stellar contamination. The rotational period of TRAPPIST-1 is shorter than the orbital periods of all but the closest planet (b), which means that even during subsequent transits of the same planet, a different (essentially random) stellar longitude will face the observer, i.e., the stellar contamination will be different between even the closest transits of the same planet.

The rapid rotation of TRAPPIST-1 poses another interesting problem: the rotational period (1.4 ± 0.05 days; Gillon et al. 2016) is comparable to the duration of a typical transit observation (five HST orbits, or about 1/3 of a day). This means that even during a single transit, the star will rotate by about 60°. Thus, about 1/3 of the visible stellar hemisphere will be different between the beginning and the end of the transit. This rapid change means that stellar contamination will not only be different between different transits but will change even during a single transit, making potential starspot modeling and correction even more difficult. We note here that the same effect may also complicate planet eclipse spectroscopy, albeit to a lesser extent.

With its rapid rotation, it may be tempting see TRAPPIST-1 as a worst-case scenario in terms of stellar contamination; however, it should be considered as a typical example of very late M dwarf or brown dwarf planet hosts. Photometric monitoring programs have found that many brown dwarfs are much faster rotators, with rotational periods of a few hours being not uncommon (Metchev et al. 2015). Spectroscopic monitoring of brown dwarfs (with the same HST/WFC3 IR grism as used for this study) showed that most, if not all, brown dwarfs have heterogeneous atmospheres (Buenzli et al. 2014), a result that was reinforced by a larger but less sensitive ground-based broadband photometry survey (Radigan et al. 2014) and a large and unbiased Spitzer photometric survey (Metchev et al. 2015). At least some, and perhaps most, of these brightness and spectral modulations are attributable to the rearrangement of condensate clouds driven by global circulation patterns (Apai et al. 2017).

Therefore, rotational modulations and stellar heterogeneity evolution should be expected to be very common for ultracool planet hosts—the coolest host stars and brown dwarfs. The rapidly evolving stellar contamination in the transit spectra of exoplanets orbiting such ultracool hosts will be a major challenge to obtaining the multi-epoch data necessary to build up the data quality required for atmospheric abundance analysis (Morley et al. 2017). In fact, this goal may only be achievable if we can develop a robust understanding of ultracool dwarf heterogeneities and derive a reliable contamination correction method for this astrophysical noise source.

7.3. Comparison with de Wit et al. (2018)

De Wit et al. (2018) was published after the first submission this work. They independently reduced and analyzed data from program GO-14873. They adopted a ramp removal method that is similar to ours and different from that in de Wit et al. (2016). We identify a few minor differences between the reduction processes, none of which led to significant differences in the reduced data, but the differences in the analysis led to different interpretations.

  • 1.  
    The most crucial difference between our study and that of de Wit et al. (2018) is the interpretations of the transmission spectra. De Wit et al. (2018) compared observed spectra with planetary models and flat lines, while we compared the observations with flat spectra, planetary atmosphere model spectra with water absorption features, and stellar contamination model spectra. We agree with de Wit et al. (2018) that flat spectra are preferred over spectra with planetary water absorption features, but we also show that stellar contamination models are favored over flat spectra for the combined TRAPPIST-1 transmission spectra.
  • 2.  
    Each visit in de Wit et al. (2018) was designed to include two transit events, and the three events identified in Table 2 are genuine transits. While we discarded some data sets heavily affected by severe CR events and the resulting guiding failure due to SAA passages, de Wit et al. (2018) attempted to recover all the data. Although the Visit 1 data—which covered the transit by TRAPPIST-1 f—was successfully recovered, it was not possible to recover the other two events.
  • 3.  
    De Wit et al. (2018) aligned every single readout, while we applied the alignment on each ima file frame. As for the CR correction, in de Wit et al. (2018), both spatial and temporal interpolation were used to replace the pixels affected by CRs.
  • 4.  
    De Wit et al. (2018) adopted a ramp correction formula that resembles the correction predicted by the RECTE model (Zhou et al. 2017). They reported an average standard deviation of residuals in the broadband light curve of 220 ppm, while we reached 198 ppm—very similar values given the uncertainties. For the individual (narrowband) bins, they reported average uncertainties of 545, 526, 493, and 494 ppm for Visits 1 to 4, respectively; by comparison, we find average uncertainties of 500, 550, 487, and 454 ppm for the same data sets. Thus, the residuals from the two reductions are very similar.

8. Summary

The TRAPPIST-1 system offers exceptionally deep transits and a relatively bright host star for multiple approximately Earth-sized planets. We present here HST/WFC3 IR near-infrared grism spectroscopy of the TRAPPIST-1 transiting planets based on rereduction of archival data (de Wit et al. 2016, 2018) covering a total of seven transits.

The key findings of our study are as follows.

  • 1.  
    We detected transits for the six inner planets: TRAPPIST-1 b, c, d, e, f, and g. Our data include two transits for planet e.
  • 2.  
    We provide improved broadband transit depths and mid-transit times for each of the six planets. We find evidence for transit timing variations for planets f and g.
  • 3.  
    Comparisons of the transit light curves of planets b and c between those published by de Wit et al. (2016) and our study show the benefits of the RECTE charge-trap correction: increased orbital phase coverage, higher observing efficiency, and similar or better systematics correction. Compared to the empirical correction, the RECTE model is less affected by astrophysical signals (e.g., flares). In addition, RECTE can be combined with systematics marginalization methods (e.g., Gibson 2014; Wakeford et al. 2016; Sheppard et al. 2017) or methods based on Gaussian processes (e.g., Beatty et al. 2017; Evans et al. 2017; Nikolov et al. 2018) to provide a physically based model for the ramp-effect component.
  • 4.  
    Our data reduction reaches a typical precision of about 230–340 ppm for individual planets and, after excluding possible flare events, a precision of 180–210 ppm.
  • 5.  
    We note two short-duration brightening events in the broadband light curves, which we identify as candidate flare events. Flare events can complicate the transit light-curve fits and can be confused with higher instrumental systematics levels by traditional empirical fits.
  • 6.  
    No significant planetary absorption features are present in the individual transit spectra of the six planets.
  • 7.  
    In the combined spectra of the six planets, there is no obvious evidence for water or other absorption features.
  • 8.  
    In the combined transmission spectra, we identify a suggestive decrease in transit depth at 1.4 μm relative to the adjacent continuum. We find that this feature in itself does not provide decisive evidence for a heterogeneous stellar photosphere impacting the transmission spectra, though it is consistent with stellar contamination models fit to the K2+SSO+HST+Spitzer data set covering a wider wavelength range.
  • 9.  
    We present spectral fits to the combined K2, SSO, HST/WFC3 IR grism, and Spitzer 4.5 μm transit depths assuming an intrinsically flat planetary spectrum but allowing for the presence of starspots and faculae in the star. We find evidence for stellar contamination in the TRAPPIST-1 d transmission spectrum and, more notably, in the combined transmission spectrum of TRAPPIST-1 b–g. The model interpretation for the combined spectrum is robust for all five-planet combinations excluding the impact of a single planet in turn. The composite photosphere produced by the stellar contamination models also matches the out-of-transit spectrum of TRAPPIST-1, as well as starspot and faculae populations inferred from its observed photometric variability level.
  • 10.  
    The facular and starspot covering fractions required by the stellar contamination model are consistent with those expected for late M-type stars and their optical/near-infrared photometric variability, demonstrating that stellar contamination from the transit light source effect (Rackham et al. 2018) poses a major challenge to HST and JWST high-precision exoplanet transit spectroscopy.
  • 11.  
    We also point out that coadding transit spectra from multiple epochs for planets orbiting rapidly rotating late-type host stars will be complicated by rapidly changing stellar contamination: for TRAPPIST-1, the contamination will even significantly change during the length of a single-transit observation.

In summary, our study provides an independent reduction (based on a physically motivated charge-trap model) of the first transmission spectra of Earth-sized habitable-zone transiting exoplanets, providing a nearly complete spectral library of the known planets in the TRAPPIST-1 system. Our findings, however, highlight stellar contamination as a dominant astrophysical noise source and illustrate the challenge in combining multi-epoch spectroscopy for planets orbiting rapidly rotating host stars.

This research has made use of the NASA Exoplanet Archive, which is operated by the California Institute of Technology under contract with the National Aeronautics and Space Administration under the Exoplanet Exploration Program. BR acknowledges support from the National Science Foundation Graduate Research Fellowship Program under grant No. DGE-1143953. DA acknowledges support from the Max Planck Institute for Astronomy, Heidelberg, for a sabbatical visit. The results reported herein benefited from collaborations and/or information exchange within NASA's Nexus for Exoplanet System Science (NExSS) research coordination network, sponsored by NASA's Science Mission Directorate. Support for Programs 14241 and 15060 was provided by NASA through a grant from the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS5-26555. Based on observations made with the NASA/ESA Hubble Space Telescope, obtained in GO programs 14500 and 14873 at the Space Telescope Science Institute.

Facilities: HST/WFC3 - , Spitzer - , Kepler - .

Software: Numpy, Scipy, Matplotlib, Astropy, emcee, batman, pymc.

Appendix A: Transmission Spectra

Transmission spectra from all seven transits are tabulated here in Table 13 (original) and Table 14 (interpolated).

Table 13.  Transmission Spectra

Wavelength Transit Upper Lower
Å Depth Error Error
Transit 1, TRAPPIST-1 c
11505 0.00709 0.00031 0.00032
11970 0.0074 0.00028 0.00029
12434 0.00719 0.00024 0.00024
12898 0.00762 0.00021 0.00021
13363 0.00728 0.00033 0.00032
13827 0.00707 0.00031 0.00032
14292 0.00707 0.00034 0.00033
14756 0.00686 0.00032 0.00029
15220 0.00741 0.00031 0.00032
15685 0.00704 0.00029 0.00028
16149 0.00702 0.0003 0.00028
16613 0.00774 0.00034 0.00033
Transit 2, TRAPPIST-1 b
11505 0.00783 0.00032 0.00033
11970 0.00771 0.00032 0.00031
12434 0.00803 0.00025 0.00025
12898 0.00745 0.00023 0.00022
13363 0.00731 0.00036 0.00035
13827 0.00728 0.00035 0.00034
14292 0.00745 0.00037 0.00036
14756 0.00781 0.00032 0.00033
15220 0.00776 0.00036 0.00035
15685 0.00806 0.00031 0.00032
16149 0.0081 0.00031 0.00034
16613 0.00769 0.00035 0.00036
Transit 3, TRAPPIST-1 d
11465 0.00378 0.00012 0.00012
11930 0.00444 0.00013 0.00015
12394 0.00419 0.00009 0.00009
12858 0.00401 0.00011 0.00013
13322 0.00392 0.00014 0.00014
13787 0.00424 0.00013 0.00013
14251 0.00356 0.00018 0.00018
14715 0.00391 0.00014 0.00014
15179 0.0036 0.00015 0.00014
15644 0.00401 0.00011 0.00011
16108 0.00393 0.00014 0.00014
16572 0.00361 0.00023 0.00024
Transit 4, TRAPPIST-1 g
11377 0.00824 0.00035 0.00038
11841 0.0076 0.00021 0.00023
12305 0.00769 0.00021 0.00023
12770 0.00803 0.00022 0.00021
13234 0.00773 0.00021 0.00021
13698 0.00776 0.00025 0.00024
14163 0.00803 0.00027 0.00027
14627 0.00781 0.00025 0.00025
15092 0.00753 0.00026 0.00026
15556 0.00771 0.00028 0.0003
16020 0.00821 0.00024 0.00023
16485 0.00769 0.00025 0.00026
Transit 5, TRAPPIST-1 e
11377 0.00476 0.0002 0.0002
11841 0.00466 0.00012 0.00012
12305 0.00458 0.00012 0.00012
12770 0.00483 0.00013 0.00012
13234 0.00469 0.00012 0.00014
13698 0.00448 0.00015 0.00016
14163 0.00479 0.0002 0.00019
14627 0.00489 0.00015 0.00015
15092 0.00508 0.00016 0.00016
15556 0.00484 0.00018 0.00017
16020 0.00496 0.00014 0.00015
16485 0.00497 0.00016 0.00014
Transit 6, TRAPPIST-1 f
11408 0.00637 0.00023 0.00022
11873 0.00664 0.00021 0.00019
12337 0.00642 0.00024 0.00024
12801 0.0065 0.00023 0.00022
13265 0.00676 0.00022 0.00021
13730 0.00634 0.00024 0.00024
14194 0.00638 0.00026 0.00027
14658 0.00634 0.00026 0.00024
15123 0.00605 0.0002 0.0002
15587 0.00667 0.0002 0.00019
16051 0.00635 0.00021 0.00021
16516 0.00681 0.00025 0.00025
Transit 7, TRAPPIST-1 e
11442 0.00546 0.00016 0.00016
11907 0.00517 0.00014 0.00016
12371 0.00552 0.00013 0.00013
12835 0.00514 0.00013 0.00013
13300 0.00526 0.00013 0.00013
13764 0.00487 0.00017 0.00017
14228 0.00483 0.00017 0.00017
14692 0.00494 0.00013 0.00013
15157 0.00494 0.00013 0.00013
15621 0.00506 0.00013 0.00014
16085 0.00511 0.00012 0.00011
16550 0.00503 0.00021 0.00021

Download table as:  ASCIITypeset images: 1 2

Table 14.  Transmission Spectra in the Interpolated Bins

Transit Depth Upper Error Lower Error
Transit 1, TRAPPIST-c
0.00708 0.00031 0.00032
0.00738 0.00028 0.00029
0.00723 0.00023 0.00023
0.00761 0.00023 0.00023
0.00717 0.00033 0.00033
0.0071 0.00031 0.00032
0.00692 0.00034 0.00031
0.00716 0.00031 0.00031
0.00725 0.0003 0.0003
0.00694 0.00029 0.00027
0.00751 0.00033 0.00032
Transit 2, TRAPPIST-1 b
0.00784 0.00032 0.00033
0.00774 0.00032 0.00031
0.00799 0.00024 0.00024
0.00737 0.00025 0.00025
0.0073 0.00037 0.00036
0.0073 0.00035 0.00034
0.00764 0.00035 0.00035
0.00779 0.00034 0.00034
0.00793 0.00033 0.00033
0.00814 0.0003 0.00033
0.00783 0.00034 0.00036
Transit 3, TRAPPIST-1 d
0.00389 0.00013 0.00013
0.00442 0.00013 0.00014
0.00413 0.00009 0.00009
0.00394 0.00013 0.00014
0.00409 0.00013 0.00013
0.00393 0.00016 0.00015
0.00372 0.00016 0.00016
0.00371 0.00014 0.00014
0.00386 0.00012 0.00012
0.004 0.00013 0.00012
0.00364 0.00021 0.00022
Transit 4, TRAPPIST-1 g
0.00799 0.00029 0.00031
0.00756 0.0002 0.00022
0.00787 0.00022 0.00022
0.00791 0.00021 0.00021
0.00768 0.00023 0.00023
0.00797 0.00027 0.00026
0.00791 0.00025 0.00025
0.00756 0.00026 0.00025
0.00766 0.00028 0.0003
0.0082 0.00024 0.00024
0.00764 0.00025 0.00027
Transit 5, TRAPPIST-1 e
0.00477 0.00016 0.00017
0.0046 0.00012 0.00012
0.00468 0.00013 0.00012
0.00482 0.00012 0.00013
0.00451 0.00013 0.00015
0.00467 0.00019 0.00019
0.00486 0.00017 0.00016
0.00507 0.00015 0.00015
0.00487 0.00018 0.00017
0.00495 0.00014 0.00015
0.00496 0.00016 0.00014
Transit 6, TRAPPIST-1 f
0.00651 0.00021 0.0002
0.00659 0.00022 0.00021
0.00639 0.00024 0.00024
0.00666 0.00022 0.00021
0.00656 0.00023 0.00022
0.00631 0.00025 0.00026
0.00641 0.00026 0.00025
0.00604 0.00022 0.00021
0.00659 0.0002 0.00019
0.0064 0.00021 0.0002
0.00675 0.00025 0.00024
Transit 7, TRAPPIST-1 d
0.00531 0.00016 0.00016
0.00526 0.00014 0.00015
0.00545 0.00013 0.00013
0.00516 0.00013 0.00013
0.00513 0.00015 0.00014
0.00479 0.00018 0.00017
0.00491 0.00014 0.00014
0.00494 0.00012 0.00012
0.00502 0.00013 0.00014
0.00511 0.00011 0.00012
0.00504 0.00019 0.00019

Note. The central wavelengths of the 11 bins are an arithmetic series from 11500 to 16500 Å.

Download table as:  ASCIITypeset images: 1 2

Appendix B: Supplementary Figures

Here we illustrate the broadband and spectral band light-curve profiles for transits observed in Visits 1, 2, 3, and 4. (Figures 1417).

Figure 14.

Figure 14. Same as Figure 4 for Visit 1, broadband and spectral band fit for planet TRAPPIST-1 d.

Standard image High-resolution image
Figure 15.

Figure 15. Same as Figure 4 for Visit 2, broadband and spectral band fit for planets TRAPPIST-1 g and e.

Standard image High-resolution image
Figure 16.

Figure 16. Same as Figure 4 for Visit 3, broadband and spectral band fit for planet TRAPPIST-1 f.

Standard image High-resolution image
Figure 17.

Figure 17. Same as Figure 4 for Visit 4, broadband and spectral band fit for planet TRAPPIST-1 e.

Standard image High-resolution image

Appendix C: Analysis of HST+Spitzer Transmission Spectra

Here we summarize the results of the same stellar contamination analysis described in Section 6 as originally performed using transmission spectra comprised of only the HST transit depths from this work and the Spitzer 4.5 μm transit depths from Delrez et al. (2018b). Tables 15 and 16 summarize the results for the single-planet and combined transmission spectra, respectively. As with the full analysis in Section 6, the contamination models are preferred for the TRAPPIST-1 d single-planet spectrum and all variations of the combined spectra. The predictions for K2 and I+z transit depths from the best-fit flat and contamination to the combined transmission spectra are provided in Table 17. These are shown in Figures 18 and 19, along with the HST+Spitzer data, best-fit models, and K2 and I+z depths from Ducrot et al. (2018). Examples of the posterior distributions for the HST+Spitzer fits are provided in Figure 20.

Figure 18.

Figure 18. Stellar contamination model jointly fit to HST+Spitzer TRAPPIST-1 combined transmission spectra and the observed HST stellar spectrum. The unfitted K2 (0.42–0.9 μm) and I+z (0.8–1.1 μm) combined transit depths (Ducrot et al. 2018) are overplotted as blue crosses, along with the predictions for the flat and contamination models (gray and black squares, respectively) for comparison. The remaining figure elements are the same as in Figure 10.

Standard image High-resolution image
Figure 19.

Figure 19. Stellar contamination models jointly fit to HST+Spitzer TRAPPIST-1 five-planet combined transmission spectra and the observed HST stellar spectrum. The unfitted K2 (0.42–0.9 μm) and I+z (0.8–1.1 μm) combined transit depths (Ducrot et al. 2018) are overplotted as colored crosses, along with the predictions for the flat and contamination models (gray and black squares, respectively) for comparison. The remaining figure elements are the same as in Figure 11.

Standard image High-resolution image
Figure 20.

Figure 20. Posterior distributions of free parameters in CPAT model fits to HST+Spitzer combined transmission spectra of TRAPPIST-1 b–g and the observed HST stellar spectrum. Results are shown for the flat and contamination models in the left and right panels, respectively. None of the parameter distributions differ significantly from those in Figure 13, which were fitted using the complete K2+SSO+HST+Spitzer data set.

Standard image High-resolution image

Footnotes

  • The SAA is the lowest region to which Earth's inner Van Allen Belt extends. The SAA passages by HST result in enhanced CR hits on the detector (Deustua et al. 2016).

  • These correspond to data quality flags 4, 16, 32, and 512, respectively (Deustua et al. 2016).

  • The subscripts "s" and "f" denote two charge-trap types, slow and fast, which describe the release speed. Detailed descriptions of the two trap types are provided in Zhou et al. (2017).

  • 10 
  • 11 

    A comparison with the work of de Wit et al. (2018) is provided in Section 7.3.

  • 12 

    The fitted parameters of i and a/Rs are not provided by Ducrot et al. (2018).

Please wait… references are loading.
10.3847/1538-3881/aade4f