This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy. Close this notification

ACCESS I. AN OPTICAL TRANSMISSION SPECTRUM OF GJ 1214b REVEALS A HETEROGENEOUS STELLAR PHOTOSPHERE

, , , , , , , , , , and

Published 2017 January 11 © 2017. The American Astronomical Society. All rights reserved.
, , Citation Benjamin Rackham et al 2017 ApJ 834 151 DOI 10.3847/1538-4357/aa4f6c

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/834/2/151

ABSTRACT

GJ 1214b is the most studied sub-Neptune exoplanet to date. Recent measurements have shown its near-infrared transmission spectrum to be flat, pointing to a high-altitude opacity source in the exoplanet's atmosphere, either equilibrium condensate clouds or photochemical hazes. Many photometric observations have been reported in the optical by different groups, though simultaneous measurements spanning the entire optical regime are lacking. We present an optical transmission spectrum (4500–9260 Å) of GJ 1214b in 14 bins, measured with Magellan/IMACS repeatedly over three transits. We measure a mean planet-to-star radius ratio of ${R}_{p}/{R}_{s}=0.1146\pm 2\times {10}^{-4}$ and mean uncertainty of $\sigma ({R}_{p}/{R}_{s})=8.7\times {10}^{-4}$ in the spectral bins. The optical transit depths are shallower on average than observed in the near-infrared. We present a model for jointly incorporating the effects of a composite photosphere and atmospheric transmission through the exoplanet's limb (the CPAT model), and use it to examine the cases of absorber and temperature heterogeneities in the stellar photosphere. We find the optical and near-infrared measurements are best explained by the combination of (1) photochemical haze in the exoplanetary atmosphere with a mode particle size r = 0.1 μm and haze-forming efficiency ${f}_{\mathrm{haze}}=10 \% $ and (2) faculae in the unocculted stellar disk with a temperature contrast ${\rm{\Delta }}T={354}_{-46}^{+46}$ K, assuming 3.2% surface coverage. The CPAT model can be used to assess potential contributions of heterogeneous stellar photospheres to observations of exoplanet transmission spectra, which will be important for searches for spectral features in the optical.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

Transmission spectroscopy, in which we study transiting planets at multiple wavelengths, provides a powerful tool for placing constraints on the nature of close-in exoplanets. The apparent radius of a transiting exoplanet at a given wavelength ${R}_{p}(\lambda )$ is a function of its atmospheric mean molecular cross section $\sigma (\lambda )$ and scale height $H=\tfrac{{k}_{{\rm{B}}}T}{\mu g}$, where kB is Boltzmann's constant, T is the temperature, μ is the atmospheric mean molecular mass, and g is the local gravitational acceleration. Therefore, by examining how an exoplanet blocks the light from its host star at multiple wavelengths, we directly probe both the chemical composition and physical structure of its atmosphere.

In the optical wavelength regime (∼0.3–1.0 μm), this technique provides access to strong atomic lines and molecular bands, as well as cloud and haze processes, revealing a diversity of exoplanet atmospheres (Sing et al. 2016). Detections have been reported of Na i (Charbonneau et al. 2002; Redfield et al. 2008; Sing et al. 2008, 2012, 2016; Jensen et al. 2011; Huitson et al. 2012; Zhou & Bayliss 2012; Nikolov et al. 2014), K i (Sing et al. 2011, 2015), and H2O (Stevenson et al. 2016) in the atmospheres of hot Jupiters. Evans et al. (2016) have presented evidence for TiO/VO in WASP-121b, while non-detections of TiO/VO in other hot, giant exoplanets (Huitson et al. 2013; Sing et al. 2013) could point to breakdown by stellar activity (Knutson et al. 2010) or the presence of a high-altitude opacity source, either due to lofted cloud decks or photochemical hazes (Seager & Sasselov 2000; Fortney 2005; Howe & Burrows 2012; Morley et al. 2013, 2015). In clear atmospheres, measurements at shorter optical wavelengths directly probe the physics of scattering processes (Seager & Sasselov 2000; Hubbard et al. 2001; Lecavelier Des Etangs et al. 2008), thereby allowing measurements of the atmospheric mean molecular mass (Benneke & Seager 2012). For low-mass transiting exoplanets, this information can provide the key for distinguishing between rocky and gaseous bulk compositions (Benneke & Seager 2013).

GJ 1214b (Charbonneau et al. 2009) provides an excellent opportunity to study a low-mass planet with transmission spectroscopy. Orbiting an M4.5 dwarf only 14.55 parsecs away (Anglada-Escudé et al. 2013), the large transit depth of the planet and apparent brightness of its host star make it very suitable for in-depth study through transmission spectroscopy. Given its relatively small mass (6.3 M) and large radius (2.8 R; Anglada-Escudé et al. 2013), the low bulk density of GJ 1214b requires that it contain a substantial gas component, though different admixtures of rock, ice, and volatiles—owing to different formation histories—can explain its bulk density equally well (Rogers & Seager 2010). If GJ 1214b possessed a clear, hydrogen-dominated atmosphere with a ∼150–200 km scale height, whether obtained through direct accretion from the protoplanetary nebula or secondary outgassing, absorption features in its transmission spectrum could vary as a function of wavelength by as much as 0.3% of the host star's flux (Miller-Ricci & Fortney 2010). Assuming a hydrogen-dominated and clear atmosphere, such a planet should in principle produce absorption features detectable by current ground-based and space-based instrumentation.

Attempts to constrain the transmission spectrum of GJ 1214b, however, have revealed a remarkably flat spectrum. Starting with the first results provided by Bean et al. (2010), observations of GJ 1214b in the optical and near-infrared have found a featureless spectrum (Bean et al. 2011; Crossfield et al. 2011; Désert et al. 2011; Berta et al. 2012; Murgas et al. 2012; Colón & Gaidos 2013; de Mooij et al. 2013; Fraine et al. 2013; Teske et al. 2013; Cáceres et al. 2014; Gillon et al. 2014; Wilson et al. 2014; Nascimbeni et al. 2015). Reported spectral features in the near-infrared (Croll et al. 2011; de Mooij et al. 2012) have not been reproduced by follow-up measurements in the same bandpasses (Narita et al. 2013). Recently, Kreidberg et al. (2014) reported the most precise measurements to date obtained during 12 transits with HST/WFC3, which demonstrate a lack of observable features from 1.1 to 1.7 μm that rules out cloud-free scenarios for both hydrogen-dominated and high mean molecular mass atmospheres. While exquisite precisions exist for GJ 1214b's transmission spectrum in the near-infrared where its red host star is very bright (H = 9.1), its transmission spectrum remains poorly constrained in the blue optical, where the host star is exceedingly faint (B = 16.4). The existing optical measurements rely on wide-band photometry and have been compiled from a variety of sources, complicating the detection of spectral features and making the measurements more prone to systematics in the measurement of the transit depth.

Modeling efforts have found that a high-altitude, optically thick layer, whether composed of photochemically produced hydrocarbon hazes (Howe & Burrows 2012; Miller-Ricci Kempton et al. 2012; Morley et al. 2013, 2015) or equilibrium condensate clouds (Morley et al. 2013, 2015), can account for the flat near-infrared transmission spectrum by obscuring spectral features that originate lower in the atmosphere. Such a layer could exist for both hydrogen-dominated and high mean molecular mass atmospheres (Morley et al. 2013), so the presence of a high-altitude opacity source does not by itself tell us about atmospheric composition. Interestingly, however, the only V-band (4730–6860 Å; centered at 5500 Å) measurement to date (Teske et al. 2013) points to a relatively shallow transit depth, which should be precluded by an high-altitude, optically thick layer.

Here we present an optical transmission spectrum (4500–9260 Å) of GJ 1214b measured with Magellan/IMACS over three transits, which represents the first transmission spectrum of this sub-Neptune measured simultaneously across the optical wavelength range. We find the transit depths across this range are generally shallower than comparable values in the near-infrared. As no physical model of the exoplanet's atmosphere can reproduce these measurements and the flat near-infrared spectrum simultaneously, we investigate the contribution of a heterogeneous stellar photosphere, including faculae and starspots, to the observed transmission spectrum. We find unocculted faculae in the stellar photosphere to be most consistent with the shallower transit depths we observe in the optical. In Section 2, we describe the data collection. We detail the data reduction and our detrending procedure in Sections 3 and 4, respectively, and present our results in Section 5. We discuss the physical interpretation of the spectrum in Section 6, presenting a model for incorporating the effects of a composite photosphere and atmospheric transmission through the exoplanet's limb and applying it to the cases of absorber and temperature heterogeneities in the stellar photosphere. We summarize our findings and their implications in Section 7.

2. DATA COLLECTION

The observations of GJ 1214b were collected as part of ACCESS, the Arizona-CfA-Cátolica Exoplanet Spectroscopy Survey. In this section we first give a brief summary of ACCESS before describing the specific observations of GJ 1214b in detail.

2.1. ACCESS

ACCESS is a collaborative project between the University of Arizona, the Harvard–Smithsonian Center for Astrophysics, the Pontificia Universidad Católica de Chile, and the Carnegie Institution for Science, with the aim of measuring optical transmission spectra from a representative sample of transiting exoplanets. Our targets include 30 planets with masses and radii between 6 and 450 M and 2.5 and 23 R, and effective temperatures (Teff) between 600 and 2800 K. ACCESS utilizes ground-based, multi-object spectrographs (MOS) to simultaneously collect spectra from the exoplanet host star and many comparison stars in the same field of view, enabling corrections for systematic noise sources arising from the instrument or variable weather conditions. Our survey design emphasizes repeated observations to ensure the reliability of our findings. We demonstrated the feasibility of this technique in a pilot study on WASP-6b (Jordán et al. 2013), which precisely measured the optical transmission spectrum of that transiting hot Jupiter in a single transit with Magellan/IMACS, despite strongly variable transparency during part of the transit. The resulting spectrum for WASP-6b was most consistent with scattering, a result confirmed later by HST (Nikolov et al. 2015; Sing et al. 2016).

2.2. Observational Design

We observed three transits of GJ 1214b with the Inamori-Magellan Areal Camera and Spectrograph (IMACS), a versatile wide-field imager and spectrograph permanently mounted on the Magellan Baade Telescope. It includes two cameras, one at each of the f/2 and f/4 foci, which can each be used in MOS mode with custom-designed slit masks. The f/2 camera covers a $27^{\prime} $ diameter circular field and provides spectra with resolving powers up to $R\sim 1200$. The f/4 provides higher resolution spectra, up to $R\sim 5000$, over a smaller, $15^{\prime} $ square field. The detector of each camera is composed of eight 2 K × 4 K CCDs, forming an 8 K × 8 K mosaic (Dressler et al. 2006).

For these observations we used in turn each of the IMACS cameras in MOS mode. We observed two transits with the f/4 camera and one with the f/2 camera; therefore we designed two masks, one for each of the cameras. In the mask design we considered three criteria: (1) to include the widest spectral range for the target and comparison stars, (2) to include as many comparison stars as possible, and (3) to eliminate slit losses using extra-wide slits ($5^{\prime \prime} $ for the f/4 mask and $10^{\prime \prime} $ for the f/2 mask). We also used larger lengths for the slits ($12^{\prime \prime} $ for the f/4 mask and $22^{\prime \prime} $ for the f/2 mask) in order to adequately sample the sky background.

We selected comparison stars using multiple criteria. Starting from an initial list of all stars in the UCAC4 catalog (Zacharias et al. 2013) in the same field of view as the target, we eliminated all known binary stars. We then made magnitude cuts, including only stars less than 0.5 mag brighter and 1 magnitude fainter than the target in the V-band. Finally, we prioritized the remaining candidates by their distance (D) from the target star in $B-V/J-K$-color space,

Equation (1)

in which the subscripts c and t refer to the comparison and target stars, respectively, with the closest stars in this parameter space receiving the highest priorities. This ranking is especially important in the case of GJ 1214b because the host star is much redder than other stars in its field. Using these rankings, we adjusted the pointing offset and rotation of the IMACS mask to maximize the wavelength coverage for the target and a maximum number of comparison stars. Our final configurations allowed us to cut slits for the target and 14 comparison stars on the f/4 mask and 24 comparison stars on the wider-field f/2 mask. Finally, we included 25 $5\times 5^{\prime \prime} $ square slits in each mask, matching the position of relatively faint stars in the field. Those boxes are used to align the masks.

2.3. Observations

Details for the three transit observations are provided in Table 1. We carried out all observations in spectroscopic mode with no filter. We utilized 2 × 2 binning to reduce read-out times, and chose integration times to provide a maximum of roughly 25,000–30,000 counts in analog-to-digital units (ADU; gain $=\,0.56\,{e}^{-1}/$ADU for f/4 setup, $1.0\,{e}^{-1}/$ADU for f/2 setup) per resolution element on the target spectrum, which was the brightest of the spectra. A sample spectrum for the target is shown in Figure 1. For Transits 1 and 2, we used the 150 line/mm grating, which provided usable coverage of 4500–9260 Å on the target spectrum with a chip gap from 7054 to 7254 Å (Figure 2). Given the wide slits used, the spectral resolution for each exposure was set by the seeing (see Table 1), with the average being $R\sim 240$. For Transit 3, we utilized the f/2 camera for two reasons: (1) combining results from two different cameras provides an additional level of cross-validation for the resulting transmission spectrum, and (2) the larger field of view for the f/2 camera allowed us to obtain more comparison spectra from potentially better comparison stars. We used the 300 line/mm grism (blazed at 17fdg5), which gave spectral coverage of 4500–9260 Å on the target spectrum with a chip gap from 6506 to 6596 Å (Figure 2). With 2 × 2 binning and variable seeing ($0\buildrel{\prime\prime}\over{.} 5\mbox{--}1\buildrel{\prime\prime}\over{.} 0$), the spectral resolution for each exposure varied between $R\sim 300$–700, with an average of $R\sim 480$.

Figure 1.

Figure 1. Example of data showing a portion of the slit containing the spectrum of GJ 1214. The spectral trace and the slit borders are shown as dashed lines. A typical region used to measure the sky background is shown as vertical bars. This sub-image is taken from the Transit 1 data set.

Standard image High-resolution image
Figure 2.

Figure 2. Median spectra of GJ 1214 from Transits 1 (red), 2 (green), and 3 (blue). We observed Transits 1 and 2 using the IMACS f/4 camera, and Transit 3 with the f/2 camera. The exposure times we utilized account for the difference in maximum counts between Transits 1 and 2. The overall shape of the spectrum from Transit 3 differs from that of Transits 1 and 2 due to the transmission profile of the IMACS f/2 camera. Vertical lines indicate chip gaps.

Standard image High-resolution image

Table 1.  Observing Log for GJ 1214b Data Sets from Magellan/IMACS

Transit Date Obs. Camera Disperser Airmass Exposure Readout + Frames Seeing
    Start/Enda       Times (s) Overhead (s)    
1 2013 Apr 25 06:18–10:26 f/4 150 line/mm grat. 1.21–1.62 30–40 34 235b ∼0farcs6–0farcs8
2 2013 May 22 03:14–08:01 f/4 150 line/mm grat. 1.21–1.62 35–40 31 243c ∼0farcs7–0farcs9
3 2014 Apr 03 05:57–09:10 f/2 300 line/mm 1.21–1.88 63 29 126 ∼0farcs5–1farcs0
        grism + 17.5          

Notes.

aDates and observation start/end times are given in UTC. bThe last 14 frames, which were taken during twilight, displayed a systematic trend due to imperfect sky subtraction and were not included in the light curve analysis. cThe last 40 frames displayed a systematic trend in the target's light curve and were not included in the light curve analysis.

Download table as:  ASCIITypeset image

Each night we collected bias frames, dark frames, and quartz lamp flat frames with the science mask and the same setup used for the science observations. The bias frames demonstrated that the bias levels are essentially constant across the detector, so we adopted constant bias levels using the median of the overscan region on each science frame. The dark frames showed the dark count to be negligible for the exposure times used, so no dark subtraction was applied. We reduced the data with and without flat field corrections, and found the detrended light curves of GJ 1214 with the f/4 setup (Transits 1 and 2) displayed more correlated noise when flat-fielded. Flat-fielding the f/2 data (Transit 3), however, reduced the correlated noise contribution to the final light curve. More precisely, applying a flat field correction increased the variance of the flicker noise models (see Section 4) for Transits 1 and 2 by 4% and 22%, respectively, and decreased it for Transit 3 by 21%. For the rest of this analysis, we used the non-flat-fielded spectra for Transits 1 and 2 (f/4 camera) and the flat-fielded spectra for Transit 3 (f/2 camera). To obtain a wavelength solution, we took exposures of He, Ne, and Ar lamps before and after the science observations, using calibration masks identical to the science masks but with $0\buildrel{\prime\prime}\over{.} 5$ slits.

3. DATA REDUCTION

We reduced each data set with a custom, Python-based pipeline following the procedure employed in an earlier analysis of WASP-6b (Jordán et al. 2013). Here we summarize the key steps in the reduction procedure.

3.1. Spectrum Tracing

Bias levels were estimated and removed for each integration using the median of the overscan region of each chip. For each frame, the position of each spectrum was then traced by identifying the centroid of the spectrum's spatial profile for each resolution element (2 × 2 binned pixel) in the dispersion or spectral direction, and robustly fitting a second order polynomial to the identified centroids (Figure 1), taking into account chip gaps when necessary. The left and right slit borders were identified using the SciPy15 implementation of a Prewitt filter, which approximates the spatial gradient of images and produces maximal values at edges. In this case, the Prewitt-filtered images display strong positive values at slit borders. Traces of the slit borders were obtained by robustly fitting polynomials to the positions of the maxima flanking the spectral trace in the Prewitt-filtered images.

3.2. Sky Subtraction

The sky spectrum was identified for each resolution element in the spectral direction using the median of all resolution elements within the slit borders, excluding an aperture centered on the spectral trace containing the stellar spectrum. The central aperture sizes were selected to minimize the correlated noise contribution to the final light curve. Central apertures of 20 ($4\buildrel{\prime\prime}\over{.} 4$), 24 ($5\buildrel{\prime\prime}\over{.} 3$), and 24 ($9\buildrel{\prime\prime}\over{.} 6$) resolution elements were used for the Transit 1, 2, and 3 data sets, respectively. The sky spectrum was subtracted from the signal within the central aperture, leaving only the profile of the stellar spectrum. The final extracted spectrum was then obtained by summing the spectrum profile within the central aperture in the spatial direction. Optimal extraction (Marsh 1989) did not give noticeable gains over the simple extraction for our high signal-to-noise spectra.

3.3. Wavelength Calibration

Given the wide slits used, skylines recorded during the observations had too low of a spectral resolution to be useful for wavelength calibration. Therefore, the arc lamps taken before and after the science observations with the narrow-slit ($0\buildrel{\prime\prime}\over{.} 5$) calibration mask were used to calibrate the extracted spectra. Lorentzian profiles were fitted to each spectral line to determine their centroids in the dispersion direction. Using these pixel positions and the known vacuum wavelengths, the wavelength solution for each spectrum was found by an iterative process in which a sixth order polynomial was fitted to the wavelengths as a function of pixel position, the data point with the greatest deviation from the fit was removed, and the process was repeated until the root mean square error value of the fit was less than 2000 m s−1 ($\sim 0.05\,$Å). Depending on the wavelength coverage of the particular spectrum, between 50 and 60 spectral lines were assigned a pixel position, and roughly 35–45 were utilized in the final fit.

The wavelength solution found with the arc lamps was used for the first science spectrum of the night, and the remaining science spectra were cross-correlated with the first to determine their respective wavelength shifts. As a function of time, the positions of the spectra drifted slowly in the dispersion direction, so a third order polynomial was fitted to the shifts identified via the cross-correlations, and the "smoothed" wavelength shifts provided by this fit were used to interpolate all spectra into a common wavelength grid using b-splines. This step removed wavelength shifts of roughly 10 Å between spectra over the course of the night. To identify any residual wavelength shifts on the order of 1 Å, the Fourier transform of each spectrum was multiplied by the Fourier transform of the median spectrum, and the peak of the convolution function was used to identify the remaining wavelength shift required.

These steps ensured that all spectra from a single star were calibrated to the same reference frame. As a final step, the spectra from all stars were calibrated to the same physical reference frame by identifying shifts between the H-alpha absorption line minimum of the median spectra and the vacuum wavelength of H-alpha, and then interpolating the spectra onto a common wavelength grid using b-splines. For GJ 1214, in which the H-alpha line was not evident, the Na 8200 Å doublet minimum was used for this process instead.

4. DATA DETRENDING

After extracting and wavelength-calibrating the spectra, we generated sets of light curves for GJ 1214 and the comparison stars to identify each transit and related systematics. We first generated white light curves, integrating all the light between 4500 and 9260 Å in each spectrum. We also generated spectroscopically resolved light curves using 14 bins with varying widths that allowed for similar precisions on the light curve parameters between bins (see Section 4.2.3). All these light curves were systematics-dominated. We developed the formalisms described in the following sections to model out these systematics.

4.1. Signal Modeling Frameworks

We employed two modeling frameworks to understand and remove systematic trends from the light curve of GJ 1214, which we label the "PCA-based" and the "polynomial-based" frameworks.

4.1.1. PCA-based Modeling Framework

Following the framework detailed by Jordán et al. (2013), we modeled the observed target light curve $l(t)$ as

Equation (2)

in which t is time, F is the underlying flux from the target star, $T(t,\theta )$ is the transit signal defined by the vector of transit parameters θ, $S(t)$ is the perturbation signal due to the combination of all systematic variations, and $\epsilon (t)$ is the stochastic noise component. We account for both uncorrelated variations ("white noise") and correlated variations ("red noise") in the light curve with the $\epsilon (t)$ term and modeled them following the wavelet-based method of Carter & Winn (2009)16 (see Section 4.2.1). We assumed the underlying flux from the target star to be constant, so any actual variations in the star's flux were therefore encapsulated by the noise component. We assumed the perturbation signal to be a linear combination of signals owing to different instrumental and atmospheric effects, which can be represented mathematically as

Equation (3)

in which ${s}_{i}(t)$ represents the different signals and ${\alpha }_{i}$ their respective scaling coefficients. Expressed in logarithmic space, each of these multiplicative signals become additive, so defining $L\equiv \mathrm{log}l(t)$, the base-10 logarithm of the observed target light curve can be written as

Equation (4)

The systematic variations represented by the perturbation signal include common variations experienced by the target and comparison spectra due to instrumental and atmospheric effects. Therefore, within this framework, the observed signal from each k comparison star can be modeled as

Equation (5)

in which Fk is the baseline flux from the comparison star, ${\alpha }_{k}$ is the set of unique scaling coefficients for the systematic variations as they apply to this comparison star, and ${\epsilon }_{k}(t)$ is the unique noise signal associated with this light curve. Thus the mean-subtracted light curve from each comparison star (${L}_{k}-\mathrm{log}{F}_{k}$) can be used to estimate the perturbation signal $S(t)$, which can then be subtracted from L in Equation (4).

We performed a principal component analysis (PCA) of the mean-subtracted comparison light curves to estimate the independent signals ${s}_{i}(t)$ comprising the perturbation signal $S(t)$. With N comparison stars, we could estimate at most N principal components via PCA. As detailed in Section 4.2.1, we let the scaling coefficients ${\alpha }_{i}$ and the baseline flux from the host star F float as free parameters in the Markov chain Monte Carlo (MCMC) procedure, so we enforced the maximum number of principal components for use in the model to be $M\leqslant N-1$. However, we determined the optimal number of components to be the minimum number that could achieve the same predictive power as the best set of M available components.

We estimated the predictive power of each set of components using a k-fold cross-validation procedure. First, we split the out-of-transit target light curve into 20 segments (or "folds"). For each number of available principal components (from 1 to M), we fit the α scaling coefficients for the available components to the target light curve using 19 "training" folds, and recorded the error (in units of normalized flux) between the target light curve and fit for the single "validation" fold. Repeating this process using each of the 20 folds in turn as a validation fold gave us an estimate and confidence interval of the prediction error for each number of principal components. We identified the number of principal components that produced the smallest median prediction error ("the best set"), and then used the lowest number of principal components with a median prediction error indistinguishable at the $1\sigma $ level from that of the best set.

The final consideration in the PCA-based procedure was to select the comparison stars to ultimately use in generating the principal components. Each comparison light curve provides a noisy estimate of the perturbation signal, so it follows that the combination of comparison stars can be optimized to provide the best estimate with the least noise. Through successive iterations, we found that for each night the brightest comparison stars produced principal components that were most capable of accurately predicting perturbations in the target light curve. Ultimately, we used the brightest 4, 5, and 4 comparison stars to estimate the perturbation signals for Transits 1, 2, and 3, respectively. We found these comparison stars to also provide the best results when utilizing the polynomial-based modeling framework we explored.

4.1.2. Polynomial-based Modeling Framework

We also modeled the observed target light curve following the procedure employed by Bean et al. (2010) in their successful VLT/FORS observations of GJ 1214b. This framework is empirically motivated, as the data show that simply dividing the target light curve by the sum of the comparison light curves removes most of the variations in the out-of-transit flux from the target, leaving only a smoothly varying long-term trend. Like Bean et al. (2010), we find that this trend can be modeled well as a second order polynomial function of time, and attribute it to the color difference between this very red target star and the available comparison stars in the field. In this framework, the target light curve can be expressed as

Equation (6)

and the comparison light curves as

Equation (7)

in which ${\alpha }_{i}$ now describes the polynomial coefficients and the other terms have the same meaning as indicated previously. Dividing by the sum of comparison light curves, the detrended target light curve can be expressed as

Equation (8)

in which the noise term ${\epsilon }_{c}(t)$ now represents the combined noise from the target and comparison light curves, and the constant F and Fk terms have been subsumed into the ${\alpha }_{0}$ coefficient. We note that this formalism is an approximation, since $S(t)$ will be different for the target and reference stars in real data (as expressed by the ${\alpha }_{i,k}$ coefficients in Equation (5)) and will not divide out exactly.

Both modeling frameworks are tested in the following. We compare their effectiveness in Section 5.1 and report the results of the polynominal-based detrending procedure in this paper.

4.2. Markov Chain Monte Carlo Procedure

4.2.1. General Procedure

For both modeling frameworks, the transit parameters were estimated for each night using an MCMC optimization procedure. The transit signal was modeled following the formalism of Mandel & Agol (2002), which accounts for the effect of limb darkening via a quadratic law of the form

Equation (9)

in which μ is the cosine of the angle between the stellar surface normal and the line of sight to the observer, and u1 and u2 are the limb darkening coefficients. Fixing the limb darkening coefficients has been shown to bias measurements of the transit depth (Espinoza & Jordán 2015). Yet, when left as free parameters, these coefficients are strongly correlated in MCMC retrievals (Pál 2008; Kipping 2013). However, following a rotation onto new principal axes

Equation (10a)

Equation (10b)

with $\phi =35\buildrel{\circ}\over{.} 8$ (Kipping 2013), ${\omega }_{1}$ and ${\omega }_{2}$ are essentially uncorrelated, and the first parameter can account for variations induced by the transit geometry, while the second remains constant (Howarth 2011). Therefore we used these rotated coefficients to describe the effect of limb darkening, leaving ${\omega }_{1}$ as a free parameter and fixing ${\omega }_{2}$ to values obtained from a PHOENIX atmospheric model (Husser et al. 2013) for $\mu \geqslant 0.1$, with stellar parameters closest to those identified by Rojas-Ayala et al. (2012): ${T}_{\mathrm{eff}}=3300\,$ K, $\mathrm{log}g=5.0$, and $[M/H]=0.0$. We used a uniform prior on ${\omega }_{1}$, with boundary values set by the triangular sampling method of Kipping (2013).

In each MCMC procedure, we adopted the fixed parameters on the transit model, which included system scale ($a/{R}_{s}$), inclination (i), orbital period (P), eccentricity (e), and argument of periastron (ω), from Kreidberg et al. (2014) to allow for direct comparisons of results. The planet-to-star radius ratio (${R}_{p}/{R}_{s}$), rotated limb darkening coefficient (${\omega }_{1}$), scaling coefficients for principal components or polynomial terms (α), and photometric uncertainty ($\epsilon $) were left as free parameters. The baseline flux (F) was also left as a free parameter in the PCA-based procedure. We placed a Gaussian prior on ${R}_{p}/{R}_{s}$, centered on the median transit depth reported by Kreidberg et al. (2014), with an uncertainty of ${\sigma }_{{R}_{p}/{R}_{s}}=0.01$ to allow the algorithm to thoroughly explore the parameter space, and truncated by the range [0, 1]. We also placed $5\sigma $ Gaussian priors on each α coefficient. In the PCA-based procedure, we determined the mean and standard deviation for the priors using the distributions of the α coefficients obtained for the comparison stars. In the polynomial-based procedure, we performed a bootstrap analysis on the out-of-transit data to determine the mean and standard deviation for the priors. We resampled the out-of-transit data with replacement 1000 times and fit a second order polynomial function of time to each sample via least-squares, recording the fit coefficients. We utilized the mean and standard deviation of the coefficient distributions to determine the $5\sigma $ Gaussian priors for the MCMC. Finally, we placed a $5\sigma $ Gaussian prior on F in the PCA-based procedure using the median and standard error of the out-of-transit flux.

We ran five chains of 130,000 steps and discarded the first 30,000 steps as the burn-in. For each step in the chain, the likelihood of the residuals given the model was calculated via the same likelihood function given by Equation (41) of Carter & Winn (2009), which parameterizes the contributions of uncorrelated and time-correlated noise sources to the observed light curve via the parameters ${\sigma }_{w}$ and ${\sigma }_{r}$, respectively. We placed uniform priors bounded by the interval [0, 1] on both of these parameters.

We combined the results from all chains to determine the posterior values. We evaluated convergence between chains using the Gelman–Rubin statistic $\hat{R}$ (Gelman & Rubin 1992), and considered the chains to be well-mixed if $\hat{R}\leqslant 1.03$ for all parameters. We constructed the posterior distribution by sampling each chain at intervals spaced by $10\times $ the half-life of the autocorrelation in the chain. This wide spacing ensured independent sampling. We adopted the median and 68.27% confidence interval defined by the 15.87th and 84.14th percentiles of the posterior distribution as the final posterior value and uncertainty for each parameter.

4.2.2. White-light Light Curve Analysis

We first analyzed the white light curve, using flux from the complete spectra (4500–9260 Å) of GJ 1214 and the comparison stars. We performed the procedure as outlined previously, additionally including the time of mid-transit (t0) as a free parameter. We placed a uniform prior on t0 spanning the complete observation.

4.2.3. Spectroscopic Light Curve Analysis

Following the white light analysis, we repeated the MCMC procedure for each wavelength bin. We allowed the same parameters to float, except t0, which we fixed to the value obtained from the white light analysis.

In determining the width of wavelength bins for the spectroscopic analysis, we aimed to (1) maximize the number of bins, while (2) collecting enough signal in the bins to discriminate between model transmission spectra with signals on the order of ${\rm{\Delta }}({R}_{p}/{R}_{s})\sim 0.005$. Taking into account our usable wavelength coverage of 4500–9260 Å and the chip gaps for the f/4 and f/2 setups, we determined that dividing the spectra into 14 bins with roughly even signal to be optimal: we use 10 bins with a mean width of $\,200$ Å (174–243 Å) at wavelengths greater than $7250\,\mathring{\rm A} $, where the spectra are brightest, and four bins with a mean width of 616 Å (310–1157 Å) at shorter wavelengths, where the spectra are considerably fainter (Table 2).

Table 2.  Planet-to-Star Radius Ratios

Bin ${R}_{p}/{R}_{s}$
N ${\lambda }_{\min }$ ${\lambda }_{\max }$ Transit 1 Transit 2 Transit 3 Final
1 4500.0 5657.0 ${0.1121}_{-0.0020}^{+0.0020}$ ${0.1129}_{-0.0021}^{+0.0021}$ ${0.1172}_{-0.0027}^{+0.0026}$ ${0.1136}_{-0.0013}^{+0.0013}$
2 5657.0 6196.0 ${0.1119}_{-0.0018}^{+0.0017}$ ${0.1139}_{-0.0018}^{+0.0017}$ ${0.1140}_{-0.0021}^{+0.0021}$ ${0.1132}_{-0.0011}^{+0.0011}$
3 6196.0 6506.0 ${0.1153}_{-0.0015}^{+0.0015}$ ${0.1125}_{-0.0025}^{+0.0024}$ ${0.1149}_{-0.0013}^{+0.0013}$ ${0.1147}_{-0.0009}^{+0.0009}$
4 6596.0 7054.0 ${0.1112}_{-0.0015}^{+0.0015}$ ${0.1153}_{-0.0025}^{+0.0024}$ ${0.1157}_{-0.0019}^{+0.0019}$ ${0.1133}_{-0.0010}^{+0.0011}$
5 7254.0 7485.0 ${0.1147}_{-0.0012}^{+0.0012}$ ${0.1151}_{-0.0016}^{+0.0017}$ ${0.1138}_{-0.0008}^{+0.0008}$ ${0.1142}_{-0.0006}^{+0.0006}$
6 7485.0 7728.0 ${0.1130}_{-0.0013}^{+0.0013}$ ${0.1161}_{-0.0013}^{+0.0014}$ ${0.1149}_{-0.0012}^{+0.0011}$ ${0.1146}_{-0.0007}^{+0.0007}$
7 7728.0 7967.0 ${0.1155}_{-0.0013}^{+0.0013}$ ${0.1154}_{-0.0011}^{+0.0011}$ ${0.1138}_{-0.0011}^{+0.0012}$ ${0.1148}_{-0.0007}^{+0.0007}$
8 7967.0 8142.0 ${0.1157}_{-0.0016}^{+0.0015}$ ${0.1174}_{-0.0012}^{+0.0013}$ ${0.1138}_{-0.0010}^{+0.0011}$ ${0.1153}_{-0.0007}^{+0.0007}$
9 8142.0 8316.0 ${0.1150}_{-0.0019}^{+0.0018}$ ${0.1172}_{-0.0019}^{+0.0019}$ ${0.1146}_{-0.0010}^{+0.0010}$ ${0.1152}_{-0.0008}^{+0.0008}$
10 8316.0 8502.0 ${0.1133}_{-0.0013}^{+0.0013}$ ${0.1162}_{-0.0016}^{+0.0016}$ ${0.1153}_{-0.0011}^{+0.0011}$ ${0.1148}_{-0.0008}^{+0.0007}$
11 8502.0 8692.0 ${0.1150}_{-0.0014}^{+0.0014}$ ${0.1159}_{-0.0016}^{+0.0015}$ ${0.1168}_{-0.0014}^{+0.0013}$ ${0.1159}_{-0.0008}^{+0.0008}$
12 8692.0 8868.0 ${0.1136}_{-0.0014}^{+0.0013}$ ${0.1161}_{-0.0018}^{+0.0018}$ ${0.1128}_{-0.0013}^{+0.0014}$ ${0.1138}_{-0.0008}^{+0.0008}$
13 8868.0 9065.0 ${0.1164}_{-0.0017}^{+0.0017}$ ${0.1161}_{-0.0017}^{+0.0017}$ ${0.1141}_{-0.0017}^{+0.0016}$ ${0.1155}_{-0.0010}^{+0.0010}$
14 9065.0 9260.0 ${0.1165}_{-0.0023}^{+0.0022}$ ${0.1153}_{-0.0016}^{+0.0016}$ ${0.1147}_{-0.0014}^{+0.0014}$ ${0.1153}_{-0.0010}^{+0.0009}$

Note. For individual transits, we report the medians and 68% confidence intervals on the posterior distributions from the MCMC optimization procedure. The final radius ratios are the weighted means of the three transit measurements (see Section 5.4 for details).

Download table as:  ASCIITypeset image

4.2.4. Other Limb Darkening Prescriptions

Along with the limb darkening prescription described previously, we investigated two additional methods for handling the effect of stellar limb darkening on the transit light curve. In the first approach we fixed the limb darkening coefficients ${\omega }_{1}$ and ${\omega }_{2}$ during the MCMC analysis to the values from the PHOENIX atmospheric model (Husser et al. 2013). In the second we utilized the triangular sampling method proposed by Kipping (2013), which involves fitting for new limb darkening coefficients ${q}_{1}={({u}_{1}+{u}_{2})}^{2}$ and ${q}_{2}=0.5{u}_{1}{({u}_{1}+{u}_{2})}^{-1}$ with uniform priors on the interval [0,1] in order to efficiently sample the physically bounded parameter space. The ${R}_{p}/{R}_{s}$ values for the final transmission spectrum did not differ significantly among the three limb darkening prescriptions. With respect to the nominal results, adopting fixed limb darkening coefficients caused the ${R}_{p}/{R}_{s}$ values on average to differ by $5\times {10}^{-4}$ ($0.6\sigma $). Similarly, the triangular sampling method led to ${R}_{p}/{R}_{s}$ values that differed by $3\times {10}^{-4}$ ($0.3\sigma $). Both methods enlarged the $1\sigma $ uncertainties on the transmission spectrum by $0.2 \% $. In this paper we report the ${R}_{p}/{R}_{s}$ values determined via our nominal limb darkening prescription described in Section 4.2.1.

5. RESULTS

5.1. Comparison of the Detrending Methods

We find the polynomial-based detrending procedure to be the most effective at removing long-term trends in the light curves. By contrast, strong long-term trends in the light curves of GJ 1214 remain after applying the PCA-based procedure. We attribute these to effects introduced by the color difference between the target and comparison stars, which are not encapsulated in the principal components of the comparison light curves. As the wavelength-based likelihood function we employ (Carter & Winn 2009) is particularly sensitive to time-correlated systematics, this results in inflated parameter uncertainties. Accordingly, the uncertainties on ${R}_{p}/{R}_{s}$ we find with the PCA-based procedure are on average $1.4 \% $ larger than those from the polynomial-based procedure. The final ${R}_{p}/{R}_{s}$ values from the PCA-based procedure differ on average from those of the polynomial-based procedure by $1.6\times {10}^{-3}$ ($1.3\sigma $). We conducted the analysis described in Section 6 on the white-light and spectroscopic light curves extracted from both detrending procedures. The choice of detrending procedure does not affect the interpretation of the spectrum or the conclusions of this paper, though the inflated uncertainties from the PCA-based procedure lead to looser constraints. Given (1) the consistency between detrending methods, (2) the potential for time-correlated systematics to bias the measured transit depths, and (3) the tighter constraints provided by the polynomial-based detrending procedure, we report the results from the polynomial-based procedure and adopt them for the white light and spectroscopic analyses.

5.2. White-light Light Curve Fitting

The t0 values determined via the polynomial-based and PCA-based detrending procedures agree within $0.5\sigma $. We report the mid-transit times from the polynomial-based method in Table 3 and use them for the remainder of this work. Figure 3 displays the white light data for each night and their best-fitting transit models.

Figure 3.

Figure 3. White light curves (top) and residuals (bottom) from Transits 1 (left), 2 (center), and 3 (right). The top panels show the normalized flux measurements of GJ 1214 after subtracting the correlated noise component (colored points) and the best-fitting transit model for each night (black lines). The bottom panels show the residuals between the normalized flux measurements and best-fitting transit models (points), along with the correlated noise components identified by the wavelet analysis (black lines).

Standard image High-resolution image

Table 3.  Mid-transit Times from White Light Analyses

Transit Date (UTC) t0 (BJDUTC)
1 2013 Apr 25 ${2456407.85423}_{-0.00008}^{+0.00008}$
2 2013 May 22 ${2456434.72098}_{-0.00012}^{+0.00013}$
3 2014 Apr 03 ${2456750.80196}_{-0.00008}^{+0.00008}$

Note. For the spectroscopic light curve analyses, the mid-transit times were fixed to these values.

Download table as:  ASCIITypeset image

5.3. Spectroscopic Light Curve Fitting

We show the spectroscopic light curves obtained via the polynomial-based procedure for Transits 1, 2, and 3, respectively, with their best-fitting transit models and residuals in Figures 46. The median residual rms values for the spectroscopic light curves are $1.5\times $, $2.2\times $, and $3.3\times $ the photon noise limit for Transits 1, 2, and 3, respectively. The spectroscopic ${R}_{p}/{R}_{s}$ values for each transit are listed in Table 2 and shown in Figure 7.

Figure 4.

Figure 4. Detrended light curves and residuals from Transit 1 using the polynomial-based detrending procedure. The best-fitting transit models are plotted as black lines in the left panel. Black lines in the right panel show the correlated noise components identified by the wavelet analysis. Data from different wavelength bins are offset for clarity. Correlated noise levels are smaller than the Poisson noise for all wavelength bins.

Standard image High-resolution image
Figure 5.

Figure 5. Detrended light curves and residuals from Transit 2 using the polynomial-based detrending procedure. The figure components are the same as those for Figure 4.

Standard image High-resolution image
Figure 6.

Figure 6. Detrended light curves and residuals from Transit 3 using the polynomial-based detrending procedure. The figure components are the same as those for Figure 4. With longer exposures and shorter duty cycles, photon noise levels are lower for the Transit 3 data set, which utilized the IMACS f/2 camera. Time-correlated systematics are strongest for the bluest wavelength bins.

Standard image High-resolution image
Figure 7.

Figure 7. Transmission spectrum of GJ 1214b from Magellan/IMACS. Transits 1, 2, and 3 are shown in red, green, and blue, respectively, with horizontal offsets for clarity. The final, combined spectrum (see Section 5.4) is shown in black. The horizontal error bars on the final spectrum indicate the wavelength bins utilized for all transits.

Standard image High-resolution image

5.4. Combining Results

We take the weighted mean of the transmission spectra from the three transits to generate a final transmission spectrum (Table 2). For each wavelength bin, we average the ${R}_{p}/{R}_{s}$ measurements from each transit weighted by their inverse variances and calculate the uncertainty as the square root of the weighted sample variance. The final combined spectrum is shown in black circles in Figure 7.

6. DISCUSSION

6.1. Previous Measurements

Since its discovery, many groups have published measurements of GJ 1214b's transmission spectrum in the optical and near-infrared (Bean et al. 2010, 2011; Croll et al. 2011; Crossfield et al. 2011; Désert et al. 2011; Berta et al. 2012; de Mooij et al. 2012, 2013; Murgas et al. 2012; Colón & Gaidos 2013; Fraine et al. 2013; Narita et al. 2013; Teske et al. 2013; Cáceres et al. 2014; Gillon et al. 2014; Kreidberg et al. 2014; Wilson et al. 2014; Nascimbeni et al. 2015). A total of 58 measurements have been published for bandpasses with central wavelengths shorter than 9260 Å, the longest wavelength in our Magellan/IMACS spectrum. These measurements tend to lie above our data and have a mean of ${R}_{p}/{R}_{s}=0.1170\pm 2\times {10}^{-4}$, in which the quoted uncertainty is the standard error of the mean. However, they span a notable range, with a minimum of ${R}_{p}/{R}_{s}=0.1104\pm 0.0014$ at 8550 Å (Wilson et al. 2014), a maximum of ${R}_{p}/{R}_{s}=0.1217\pm 0.0025$ at 6560 Å (Murgas et al. 2012), and a standard deviation of $1.8\times {10}^{-3}$. The large body of work on GJ 1214b reflects a diverse set of approaches, which can complicate the combined analysis of data from different sources. A uniform reanalysis of all existing GJ 1214b observations would be fruitful but is outside the scope of this work.

Of the existing measurements, the 1.1–1.7 μm spectrum from Kreidberg et al. (2014), obtained during 12 transits with HST/WFC3, provides the most precise measurements and places the tightest constraints on the nature of GJ 1214b's atmosphere. We adopt the system parameters ($a/{R}_{s}=15.23$, $i=89\buildrel{\circ}\over{.} 1$, $P=1.58040464894$ days, e = 0) from Kreidberg et al. (2014) in this study to facilitate direct comparisons with those results.

While fitting model transmission spectra, we perform a simultaneous fit of the new 0.45–0.93 μm Magellan/IMACS spectrum and two highly constraining published data sets: the 1.1–1.7 μm spectrum from Kreidberg et al. (2014) and the Spitzer 3.6 and 4.5 μm bands from Fraine et al. (2013). We adopt the Fraine et al. (2013) results found using the Berta et al. (2012) system parameters, which are the most similar to the Kreidberg et al. (2014) parameters.

6.2. Comparison with Model Transmission Spectra

We compare the joint data set with model transmission spectra from Morley et al. (2015) that include either KCl and ZnS equilibrium condensate clouds or photochemically produced hydrocarbon hazes. In terms of equilibrium clouds, we consider a grid of 24 models with a range of metallicities (100–1000× solar) and cloud thicknesses, parameterized by the sedimentation efficiency fsed, which is the ratio of the sedimentation velocity to the convective velocity. We consider cloud-free models (no fsed value) and a range of models with thinner (${f}_{\mathrm{sed}}=1$) to thicker (${f}_{\mathrm{sed}}=0.01$) cloud layers. With respect to photochemical hazes, we consider a grid of 20 models with vertical eddy diffusion coefficients of ${K}_{{zz}}={10}^{10}\,{{\rm{cm}}}^{2}\,{{\rm{s}}}^{-1}$, and a range of mode particle radii r (0.01–1 μm) and haze-forming efficiencies fhaze (1%–30%), which represent the mass fraction of precursors that form soots. All models were calculated at 1 × GJ 1214b's incident stellar flux.

We investigate the goodness-of-fit of the data to the models using the ${\chi }^{2}$ statistic and represent the results as in Morley et al. (2015). In fitting the models to the data, we allow for a uniform offset in ${R}_{p}/{R}_{s}$ to minimize the ${\chi }^{2}$ value. We assume 37 degrees of freedom (dof; 38 data points − 1 fitted parameter) when calculating the reduced ${\chi }^{2}$ statistic (${\chi }_{\mathrm{red}}^{2}$). Tables 4 and 5 give the results for the cloud and haze models, respectively. For cloudy atmospheres, models with higher metallicities and thicker clouds (${f}_{\mathrm{sed}}=0.01\mbox{--}0.1$) tend to provide better fits (Figure 8). The best-fitting cloud model has the highest metallicity and relatively thick clouds ($1000\times $ solar, ${f}_{\mathrm{sed}}=0.1$, ${\chi }^{2}=81.3$, ${\chi }_{\mathrm{red}}^{2}=2.20$). For hazy atmospheres, the models with higher haze-forming efficiencies (${f}_{\mathrm{haze}}=10 \% \mbox{--}30 \% $) and larger mode particle sizes (r = 0.3–1 μm) provide the best fits (Figure 8). The best-fitting haze model includes a relatively large mode particle size and haze-forming efficiency (r = 0.3 μm, ${f}_{\mathrm{haze}}=10 \% $, ${\chi }^{2}=74.1$, ${\chi }_{\mathrm{red}}^{2}=2.00$). A flat line model (1 fitted parameter; $\mathrm{dof}=37$) provides a fit to the data comparable to that of the best-fitting cloud and haze models (${R}_{p}/{R}_{s}=0.11613$, ${\chi }^{2}=75.2$, ${\chi }_{\mathrm{red}}^{2}=2.03$).

Figure 8.

Figure 8. Reduced chi-squared maps for model fits considering only transmission through the exoplanet's limb. We find the best fits to the joint data set for cloudy models with high metallicity ($1000\times $ solar) and thick clouds (${f}_{\mathrm{sed}}\sim 0.01\mbox{--}0.1$) and hazy models with relatively large particles (0.1–1 μm) and high haze-forming efficiencies (10%–30%). The full results for the cloudy and hazy model grids (Morley et al. 2015) are provided in Tables 4 and 5, respectively.

Standard image High-resolution image

Table 4.  Model-Fitting Results for Cloudy Atmospheres Considering Only Transmission through the Exoplanet's Limb

Metallicity fseda ${\chi }^{2}$ dof ${\chi }_{\mathrm{red}}^{2}$
(× Solar)        
100 0.01 215.3 37 5.82
100 0.1 411.9 37 11.1
100 1 3963 37 107
100 cloud-free 6919 37 187
150 0.01 168.0 37 4.54
150 0.1 278.3 37 7.52
150 1 2627 37 71.0
150 cloud-free 4651 37 126
200 0.01 142.7 37 3.86
200 0.1 208.9 37 5.65
200 1 1881 37 50.9
200 cloud-free 3359 37 90.8
250 0.01 127.1 37 3.44
250 0.1 169.3 37 4.58
250 1 1422 37 38.4
250 cloud-free 2560 37 69.2
300 0.01 116.7 37 3.16
300 0.1 144.5 37 3.91
300 1 1120 37 30.3
300 cloud-free 2029 37 54.8
1000 0.01 87.77 37 2.37
1000 0.1 81.26 37 2.20
1000 1 246.8 37 6.67
1000 cloud-free 440.6 37 11.9

Note.

aCloud-free models do not have an fsed value.

Download table as:  ASCIITypeset image

Table 5.  Model-Fitting Results for Hazy Atmospheres Considering Only Transmission through the Exoplanet's Limb

r fhaze ${\chi }^{2}$ dof ${\chi }_{\mathrm{red}}^{2}$
(μm) (%)      
0.01 1 916.1 37 24.8
0.01 3 394.4 37 10.7
0.01 10 115.1 37 3.11
0.01 30 82.89 37 2.24
0.03 1 753.6 37 20.4
0.03 3 376.6 37 10.2
0.03 10 106.7 37 2.88
0.03 30 81.84 37 2.21
0.1 1 338.9 37 9.16
0.1 3 175.0 37 4.73
0.1 10 82.49 37 2.23
0.1 30 77.37 37 2.09
0.3 1 1119 37 30.2
0.3 3 90.66 37 2.45
0.3 10 74.13 37 2.00
0.3 30 75.58 37 2.04
1 1 5578 37 151
1 3 1734 37 46.9
1 10 117.0 37 3.16
1 30 75.77 37 2.05

Download table as:  ASCIITypeset image

Given the high precisions of the near-infrared measurements, the model fits are driven by these data. For both the equilibrium cloud and photochemical haze grids, the best-fitting models are those that could most effectively flatten the planet's near-infrared transmission spectrum in agreement with the observations. However, these models are inconsistent with the optical data (Figure 9). In effect, the large opacity source required to obscure spectral features in the near-infrared predicts an optical spectrum that is either flat and in line with the near-infrared measurements or slightly increasing with shorter wavelengths. Yet the optical data are in fact offset below the near-infrared data, with a mean value of ${R}_{p}/{R}_{s}=0.1146\pm 2\times {10}^{-4}$, compared with ${R}_{p}/{R}_{s}=0.11615\pm 3\times {10}^{-5}$ for the Kreidberg et al. (2014) data set, in which the quoted uncertainty is the standard error of the mean. Thus the mean of the optical data points is $8\sigma $ lower than that of the near-infrared data.

Figure 9.

Figure 9. Best-fitting model spectra only considering atmospheric transmission through the exoplanet's limb. Models assuming homogeneous stellar photospheres cannot reproduce both the near-infrared and optical measurements. From top to bottom, the panels show the full wavelength range of the best-fit cloudy (1000 × solar, ${f}_{\mathrm{sed}}=0.1$) and hazy (${K}_{{zz}}=10$, r = 0.3 μm, ${f}_{\mathrm{haze}}=10 \% $) models, and close-up views of the regions around the Magellan (current work), HST (Kreidberg et al. 2014), and Spitzer (Fraine et al. 2013) data sets.

Standard image High-resolution image

To evaluate the significance of the offset between our Magellan/IMACS data and the HST/WFC3 data, we calculate the ${\chi }^{2}$ fit of our optical data to a flat transmission spectrum given by the mean of the HST/WFC3 data. With 14 data points and 1 fitted parameter, we assume 13 dof when calculating the ${\chi }_{\mathrm{red}}^{2}$ statistic. We find the optical data are inconsistent with the mean ${R}_{p}/{R}_{s}$ of the HST/WFC3 data at high significance (${\chi }^{2}=54.2$, ${\chi }_{\mathrm{red}}^{2}=4.17$, $p\lt 1\times {10}^{-5}$). Since atmospheric models that are consistent with the flat near-infrared spectrum predict an optical spectrum that is in line with or slightly elevated with respect to the near-infrared, we conclude that by taking into account transmission through the exoplanet's atmosphere alone, none of the physically plausible models we considered can reproduce both the optical and near-infrared measurements. With that in mind, we consider in the following sections possible contributions from the star's photosphere to the observed transmission spectrum.

6.3. Effects of a Heterogenous Stellar Photosphere

6.3.1. Composite Photosphere and Atmospheric Transmission (CPAT) Model

We investigate how heterogeneities across the star's photosphere could affect the observed transmission spectrum of the exoplanet using a basic model to incorporate the effects of a composite photosphere and atmospheric transmission along the planet's limb (hereafter the "CPAT model"). We consider the simplest case in which the emergent spectrum of the star is composed of two distinct components: the spectrum typical of the occulted transit chord So (the "occulted" spectrum) and the unocculted spectrum Su, which is fully outside of the transit chord (Figure 10). It is important to note that the occulted spectrum includes the transit chord but is not limited to it. For example, the planet could transit a region with a spectrum that is typical of $80 \% $ of the photosphere, while another $20 \% $ of the photosphere produces a distinct spectrum that is not probed by the transit chord. Additionally, the unocculted region does not need to be continuous in this model. We note that the planet could transit multiple regions with distinct spectral characteristics (as is the case while crossing starspots; e.g., Sanchis-Ojeda & Winn 2011), or multiple unocculted regions with distinct spectra could exist, though we show here that this simple model is sufficient to reproduce the observations.

Figure 10.

Figure 10. Schematic of the composite photosphere and atmospheric transmission (CPAT) model. In this model, the exoplanet blocks a wavelength-dependent fraction of the stellar disk ${D}_{\lambda }$. The transit chord probes a region with a characteristic stellar spectrum (the "occulted" spectrum), while a fraction of the stellar disk F, either continuous or not, is described by another (the "unocculted" spectrum). During the transit, the difference between the stellar spectra is imprinted in the observed transmission spectrum.

Standard image High-resolution image

In cases where the stellar photosphere is composed of two distinct components, the observed transmission spectrum given by the CPAT model is

Equation (11)

in which ${({R}_{p}/{R}_{s})}_{\lambda ,\mathrm{obs}}$ is the observed, wavelength-dependent planet-to-star radius ratio; F is the fraction of the stellar disk covered by the unocculted spectrum Su; and ${D}_{\lambda }=({R}_{p}/{R}_{s}{)}_{\lambda }^{2}$ is the transit depth expected from the true, wavelength-dependent planet-to-star radius ratio. The numerator within the square root gives the in-transit flux, and the denominator the out-of-transit flux. In the following sections, we apply the CPAT model to three cases: (1) heterogeneous absorbers, (2) generalized temperature heterogeneities, and (3) cool starspots with parameters fixed to values inferred from long-term photometric monitoring of GJ 1214b.

6.3.2. CPAT Model for Absorber Heterogeneities

Significant chemical heterogeneities are known to exist for magnetically active hot (B and A-type) main sequence stars that show peculiar chemical abundances (e.g., Pyper 1969; Khokhlova 1985). While no strong correlation was observed between the line profile variations and magnetic field strengths, it has been proposed that the chemical abundance patterns emerge due to anisotropic diffusion of the elements in a strong magnetic field (e.g., Michaud 1970; Urpin 2016). Simultaneous Doppler imaging mapping of chemical heterogeneities and magnetic field geometry argue for highly complex configurations across the stellar disk (e.g., Piskunov & Kochukhov 2002). Similarly, both partially and fully convective mid-M dwarfs have been found to store the bulk of their magnetic flux in small scale components that are non-axisymmetric (Reiners & Basri 2009). Field strengths up to $4\times {10}^{3}$ G have been detected for very active M4.5 dwarfs (Johns-Krull & Valenti 1996) as well as rapidly rotating mid- to late-M dwarfs (Reiners & Basri 2010; Reiners 2012). If indeed anisotropic diffusion is the process that leads to chemically heterogeneous stellar photospheres, then it is expected that this effect will only be important for stars with strong magnetic fields ($\sim {10}^{3}$ G) and without fully convective atmospheres; however, weaker magnetic fields—not able to produce strong enough chemical heterogeneities to lead to varying absorption line profiles—may lead to low-level heterogeneities that could still potentially influence the optical transmission spectrum of transiting exoplanets. The detailed analysis of this effect is beyond the scope of this paper but may be important for future transit spectroscopy of planets orbiting stars with strong magnetic fields, including rapidly rotating mid- to late-M dwarfs.

We use Equation (11) to model the effects of a heterogeneous distribution of absorbers in the stellar photosphere on the observed transmission spectrum (the "CPAT-absorber model"). We utilize PHOENIX model stellar spectra (Husser et al. 2013) with ${T}_{\mathrm{eff}}=3300$ K and $\mathrm{log}g=5.0$ to generate the So and Su spectra for our model. As a proxy for the strength of absorbers, we employ models with a range of metallicities as defined by [Fe/H]. Higher metallicity models demonstrate deeper spectral features due to the larger absorber abundances, while lower metallicity models possess relatively muted absorption features. We do not consider alpha-enhanced or depleted models, so [Fe/H] is synonymous with the overall metallicity Z. The PHOENIX model grid includes spectra for metallicities of $-4.0\leqslant [\mathrm{Fe}/{\rm{H}}]\leqslant -2.0$ in steps of 1.0 and $-2.0\leqslant [\mathrm{Fe}/{\rm{H}}]\leqslant +1.0$ in steps of 0.5. We include each of these models in our analysis.

We employ an MCMC approach to find the best-fitting CPAT-absorber model. We conduct an MCMC optimization for each of the exoplanet atmosphere models in the grids of cloudy and hazy models described in Section 6.2. At each step in the Markov chain, the measured ${R}_{p}/{R}_{s}$ values from the joint data set are compared with the CPAT-absorber model produced by the combination of the heterogeneous stellar photosphere and the input exoplanet atmosphere. The spectra So and Su are generated using the values for ${[\mathrm{Fe}/{\rm{H}}]}_{o}$ and ${[\mathrm{Fe}/{\rm{H}}]}_{u}$ (the metallicities of the occulted and unocculted spectra, respectively) and linearly interpolating between the closest spectra in the PHOENIX model grid. Following the results of Doppler imaging studies of M dwarfs, we adopt $F=0.032$, which is the mean spot filling factor Barnes et al. (2015) found for the M4.5 dwarf GJ 791.2A over its rotation period. We fix the metallicity of the occulted spectrum to the best-fit value for the metallicity of GJ 1214, ${[\mathrm{Fe}/{\rm{H}}]}_{o}=0.20$ (Rojas-Ayala et al. 2012). The free parameters in the model are the metallicity contrast ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]$, which determines the metallicity of the unocculted spectrum relative to that of the occulted spectrum, and the uniform offset applied to the exoplanet's model transmission spectrum, ${({R}_{p}/{R}_{s})}_{o}$. We place a uniform prior on ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]$ to allow the algorithm to fully explore the parameter space, with interval boundaries $[-4.2,+0.8]$ to keep ${[\mathrm{Fe}/{\rm{H}}]}_{u}$ within the PHOENIX model range $[-4.0,+1.0]$. While allowing the metallicity of 3.2% of the stellar disk to vary will change the mean metallicity of the photosphere, we note that the mean will never vary more than $1\sigma $ from the measured metallicity, $[\mathrm{Fe}/{\rm{H}}]=0.20\pm 0.17$ (Rojas-Ayala et al. 2012). We place a Gaussian prior on ${({R}_{p}/{R}_{s})}_{o}$ using the mean and standard deviation of the residuals found by fitting the joint data set to the exoplanet's model transmission spectrum (Section 6.2) and bounded on the interval $[-1,+1]$. We run five chains of 105 steps with an additional 104 steps discarded as the burn-in. We consider the chains to be well-mixed if the Gelman–Rubin statistic $\hat{R}\leqslant 1.03$ for all parameters.

The results of the CPAT-absorber model fitting for the full grids of cloudy and hazy transmission spectra are illustrated in Figure 11. As can be seen by comparing Figures 8 and 11, the best-fitting parameters for the exoplanet's atmosphere are not substantially changed by incorporating the effect of a heterogeneous stellar photosphere, though the CPAT-absorber models can provide better fits to the data. The complete results from the fitting procedure are provided in Tables 6 and 7, for the equilibrium cloud and photochemical haze models, respectively. The values for the free parameters, ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]$ and ${({R}_{p}/{R}_{s})}_{o}$, that we report there and quote as follows are the median and 68% confidence intervals from the MCMC optimization procedure. In the case of equilibrium clouds, we find the best-fitting model to have a very high metallicity and thick clouds in the exoplanet's atmosphere and a relatively low metallicity in the unocculted region of the star's photosphere ($1000\times $ solar, ${f}_{\mathrm{sed}}=0.1$, ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]=-{1.58}_{-0.37}^{+0.28}$, ${({R}_{p}/{R}_{s})}_{o}={8594}_{-26}^{+28}$ ppm, ${\chi }^{2}=54.7$, $\mathrm{dof}=36$, ${\chi }_{\mathrm{red}}^{2}=1.52$). Considering photochemical haze models, we find the best-fitting model to include the same mode particle size and haze-forming efficiency as when considering only the exoplanetary contribution to the transmission spectrum (Section 6.2) and a low metallicity for the unocculted region similar to that found for the CPAT-absorber cloud model (r = 0.3 μm, ${f}_{\mathrm{haze}}=10 \% $, ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]=-{1.31}_{-0.39}^{+0.35}$, ${({R}_{p}/{R}_{s})}_{o}=-{5758}_{-26}^{+27}$ ppm, ${\chi }^{2}=53.3$, $\mathrm{dof}=36$, ${\chi }_{\mathrm{red}}^{2}\ =1.48$). Figure 12 shows the posterior distributions for the free parameters in each of the best-fitting cloud and haze models. A model using a constant, achromatic value of ${R}_{p}/{R}_{s}$ for the exoplanet's transmission spectrum together with a CPAT-absorber model for the star provides a fit to the data comparable to that of the best-fitting CPAT-absorber cloud and haze models (${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]=-{1.02}_{-0.24}^{+0.33}$, ${R}_{p}/{R}_{s}=0.11604\pm 3\times {10}^{-5}$, χ2 = 55.8, dof = 36, ${\chi }_{\mathrm{red}}^{2}=1.55$).

Figure 11.

Figure 11. Reduced chi-squared maps for CPAT-absorber model fits. We find the best fits to the joint data set for cloudy models with high metallicity ($1000\times $ solar) and thick clouds (${f}_{\mathrm{sed}}\sim 0.01\mbox{--}0.1$), and hazy models with relatively large particles (0.1–1 μm) and high haze-forming efficiencies (10%–30%). The free parameters in the fitting procedure are described in Section 6.3.2, and their optimized values are provided in Tables 6 and 7 for the cloudy and hazy model grids (Morley et al. 2015), respectively.

Standard image High-resolution image
Figure 12.

Figure 12. Posterior distributions for free parameters in CPAT-absorber models. The left panel illustrates the posterior distributions for the free parameters in the best-fitting cloud model ($1000\times $ solar, ${f}_{\mathrm{sed}}=0.1$), and the right those for the best-fitting haze model (r = 0.3 μm, ${f}_{\mathrm{haze}}=10 \% $). Vertical dashed lines indicate the medians and 68% confidence intervals.

Standard image High-resolution image

Table 6.  Results from MCMC Fits to CPAT-absorber Models for Cloudy Atmospheres

Metallicity fseda ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]$ ${({R}_{p}/{R}_{s})}_{o}$ ${\chi }^{2}$ dof ${\chi }_{\mathrm{red}}^{2}$
(× Solar)     (ppm)      
100 0.01 $-{3.38}_{-0.29}^{+0.55}$ $-{1364}_{-96}^{+155}$ 119.7 36 3.32
100 0.1 $-{2.23}_{-0.15}^{+0.14}$ $-{32}_{-35}^{+42}$ 354.6 36 9.85
100 1 $-{1.72}_{-0.14}^{+0.12}$ ${1956}_{-25}^{+27}$ 3914 36 109
100 cloud-free $-{1.53}_{-0.18}^{+0.11}$ ${2392}_{-25}^{+26}$ 6863 36 191
150 0.01 $-{2.80}_{-0.15}^{+0.18}$ ${601}_{-55}^{+64}$ 87.68 36 2.44
150 0.1 $-{2.19}_{-0.17}^{+0.16}$ ${1638}_{-33}^{+42}$ 227.9 36 6.33
150 1 $-{1.68}_{-0.14}^{+0.13}$ ${3288}_{-26}^{+25}$ 2580 36 71.7
150 cloud-free $-{1.46}_{-0.23}^{+0.13}$ ${3646}_{-25}^{+25}$ 4590 36 127
200 0.01 $-{2.71}_{-0.14}^{+0.15}$ ${1990}_{-54}^{+51}$ 73.65 36 2.05
200 0.1 $-{2.14}_{-0.19}^{+0.18}$ ${2900}_{-31}^{+40}$ 163.5 36 4.54
200 1 $-{1.65}_{-0.16}^{+0.14}$ ${4340}_{-25}^{+26}$ 1834 36 50.9
200 cloud-free $-{1.42}_{-0.16}^{+0.21}$ ${4645}_{-25}^{+25}$ 3306 36 91.8
250 0.01 $-{2.65}_{-0.15}^{+0.15}$ ${3075}_{-52}^{+52}$ 66.64 36 1.85
250 0.1 $-{2.09}_{-0.21}^{+0.19}$ ${3895}_{-31}^{+37}$ 127.1 36 3.53
250 1 $-{1.63}_{-0.16}^{+0.15}$ ${5180}_{-24}^{+27}$ 1378 36 38.3
250 cloud-free $-{1.41}_{-0.16}^{+0.21}$ ${5450}_{-25}^{+26}$ 2508 36 69.7
300 0.01 $-{2.60}_{-0.15}^{+0.15}$ ${3940}_{-52}^{+51}$ 62.5 36 1.74
300 0.1 $-{2.03}_{-0.23}^{+0.20}$ ${4688}_{-30}^{+36}$ 105.6 36 2.93
300 1 $-{1.62}_{-0.18}^{+0.15}$ ${5854}_{-26}^{+25}$ 1074 36 29.8
300 cloud-free $-{1.41}_{-0.17}^{+0.21}$ ${6098}_{-24}^{+26}$ 1981 36 55
1000 0.01 $-{1.98}_{-0.54}^{+0.23}$ ${8231}_{-37}^{+54}$ 61.43 36 1.71
1000 0.1 $-{1.58}_{-0.37}^{+0.28}$ ${8594}_{-26}^{+28}$ 54.67 36 1.52
1000 1 $-{1.54}_{-0.24}^{+0.20}$ ${9205}_{-24}^{+26}$ 212.9 36 5.91
1000 cloud-free $-{1.39}_{-0.22}^{+0.21}$ ${9339}_{-26}^{+25}$ 404.9 36 11.2

Note. The free parameters in the fitting procedure, ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]$ and ${({R}_{p}/{R}_{s})}_{o}$, are described in Section 6.3.2. We report the median and 68% confidence intervals from the MCMC optimization procedure for each free parameter. With 38 data points and 2 fitted parameters, we assume 36 DOF when calculating the ${\chi }_{\mathrm{red}}^{2}$ statistic.

aCloud-free models do not have an fsed value.

Download table as:  ASCIITypeset image

Table 7.  Results from MCMC Fits to CPAT-absorber Models for Hazy Atmospheres

r fhaze ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]$ ${({R}_{p}/{R}_{s})}_{o}$ ${\chi }^{2}$ dof ${\chi }_{\mathrm{red}}^{2}$
(μm) (%)   (ppm)      
0.01 1 $-{4.16}_{-0.04}^{+0.02}$ $-{4007}_{-28}^{+28}$ 512.4 36 14.2
0.01 3 $-{3.93}_{-0.17}^{+0.15}$ $-{4626}_{-54}^{+49}$ 175.6 36 4.88
0.01 10 $-{2.57}_{-0.15}^{+0.15}$ $-{4064}_{-54}^{+52}$ 76.34 36 2.12
0.01 30 $-{1.65}_{-0.43}^{+0.41}$ $-{3201}_{-28}^{+31}$ 61.98 36 1.72
0.03 1 $-{4.16}_{-0.04}^{+0.03}$ $-{4777}_{-31}^{+27}$ 333.8 36 9.27
0.03 3 $-{3.67}_{-0.17}^{+0.18}$ $-{4743}_{-56}^{+53}$ 215 36 5.97
0.03 10 $-{2.47}_{-0.18}^{+0.15}$ $-{3879}_{-60}^{+54}$ 74.38 36 2.07
0.03 30 $-{1.61}_{-0.44}^{+0.40}$ $-{2928}_{-27}^{+31}$ 61.51 36 1.71
0.1 1 $-{3.55}_{-0.18}^{+0.18}$ $-{5320}_{-57}^{+55}$ 184.2 36 5.12
0.1 3 $-{2.77}_{-0.15}^{+0.15}$ $-{5538}_{-52}^{+53}$ 115.4 36 3.21
0.1 10 $-{1.77}_{-0.35}^{+0.45}$ $-{3871}_{-28}^{+35}$ 61.18 36 1.7
0.1 30 $-{1.35}_{-0.44}^{+0.35}$ $-{2965}_{-26}^{+28}$ 56.68 36 1.57
0.3 1 $-{1.69}_{-0.17}^{+0.15}$ $-{3184}_{-26}^{+26}$ 1079 36 30
0.3 3 $-{1.76}_{-0.29}^{+0.26}$ $-{6049}_{-27}^{+29}$ 65.6 36 1.82
0.3 10 $-{1.31}_{-0.39}^{+0.35}$ $-{5758}_{-26}^{+27}$ 53.34 36 1.48
0.3 30 $-{1.24}_{-0.33}^{+0.36}$ $-{4403}_{-26}^{+26}$ 54.21 36 1.51
1 1 $-{1.61}_{-0.12}^{+0.10}$ $-{635}_{-25}^{+25}$ 5518 36 153
1 3 $-{1.54}_{-0.19}^{+0.15}$ $-{2976}_{-25}^{+26}$ 1691 36 47
1 10 $-{1.29}_{-0.28}^{+0.26}$ $-{6518}_{-25}^{+26}$ 90.08 36 2.5
1 30 $-{1.21}_{-0.31}^{+0.32}$ $-{7258}_{-26}^{+26}$ 53.73 36 1.49

Note. The free parameters in the fitting procedure, ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]$ and ${({R}_{p}/{R}_{s})}_{o}$, are described in Section 6.3.2. We report the median and 68% confidence intervals from the MCMC optimization procedure for each free parameter. With 38 data points and 2 fitted parameters, we assume 36 DOF when calculating the ${\chi }_{\mathrm{red}}^{2}$ statistic.

Download table as:  ASCIITypeset image

Figure 13 shows the best-fitting equilibrium cloud and photochemical haze CPAT-absorber models. Additionally shown is the constant-${R}_{p}/{R}_{s}$ CPAT-absorber model, which illustrates the effect of a heterogenous distribution of absorbers in the stellar photosphere on an otherwise flat transmission spectrum. It shows that variations in the strength of absorbers across the stellar disk can produce large deviations from a flat spectrum in the optical, while simultaneously preserving a flat transmission spectrum in the near-infrared.

Figure 13.

Figure 13. Best-fitting transmission spectra using the CPAT-absorber model. The CPAT-absorber models reproduce both the optical and near-infrared measurements better than models for the exoplanetary atmosphere alone. The composite photospheres for the best-fitting models all include 3.2% of the unocculted stellar disk that is described by a PHOENIX model with a lower metallicity, which imprints stellar absorption features on the observed transmission spectrum. The figure layout is the same as that of Figure 9.

Standard image High-resolution image

For both the cloud and haze cases, the best fits to the data using the CPAT-absorber model require 3.2% of the unocculted stellar disk to possess weaker absorption features than the transit chord. In effect, the unocculted stellar disk is brighter in optical absorption bands than the region of the photosphere typified by the transit chord, which decreases the observed ${R}_{p}/{R}_{s}$ within those absorption bands. This allows the CPAT-absorber model to provide better fits to the data than the flat line model or models for atmospheric transmission alone considered in Section 6.2, but still does not fully explain the observed spectrum.

6.3.3. CPAT Model for Temperature Heterogeneities

We also investigate the contribution of stellar temperature heterogeneities to the observed transmission spectrum using the CPAT model (the "CPAT-temperature model"). We employ PHOENIX models with a surface gravity ($\mathrm{log}g=5.0$) and a metallicity ($[\mathrm{Fe}/{\rm{H}}]=0.0$), the closest PHOENIX grid values to those identified for GJ 1214 by Rojas-Ayala et al. (2012), and temperatures in the range $2700\,{\rm{K}}\leqslant {T}_{\mathrm{eff}}\leqslant 4700\,{\rm{K}}$. We conduct the MCMC optimization procedure as described in Section 6.3.2, linearly interpolating in temperature space within the PHOENIX model grid to generate the So and Su spectra based on the effective temperatures of the occulted and unocculted regions (To and Tu, respectively) at each step. We fix ${T}_{o}=3252$ K using the effective temperature of GJ 1214 (Anglada-Escudé et al. 2013) and $F=0.032$ following the results of Barnes et al. (2015). The free parameters in the model are the temperature contrast ${\rm{\Delta }}T$, which determines Tu relative to To, and ${({R}_{p}/{R}_{s})}_{o}$. We place a Gaussian prior on ${\rm{\Delta }}T$ centered on 0 K, with an uncertainty equal to 10% of the Teff of GJ 1214 to thoroughly explore the parameter space and truncate it on the range $[-552\,{\rm{K}},+1448\,{\rm{K}}]$, to keep Tu within the PHOENIX model range we consider $[2700\,{\rm{K}},4700\,{\rm{K}}]$. As with the CPAT-absorber model, we conduct an MCMC optimization for each of the exoplanet atmosphere models in the grids of cloudy and hazy models described in Section 6.2.

Figure 14 illustrates the goodness-of-fit for the full grids of cloudy and hazy transmission spectra modulated by the CPAT-temperature model. As with the CPAT-absorber model, the best-fitting parameters for the exoplanet's atmosphere are similar to those found when considering the contribution of the exoplanet atmosphere alone to the transmission spectrum (Figure 8). However, allowing for temperature heterogeneities provides better fits to the data than either the exoplanet-alone or CPAT-absorber models. The complete results for the CPAT-temperature model fitting for the equilibrium cloud and photochemical haze model grids are provided in Tables 8 and 9, respectively. The best-fitting cloud model includes an exoplanet atmosphere with the highest metallicity and thickest clouds of the models we considered and a temperature contrast of ${\rm{\Delta }}T\simeq +0.1{T}_{\mathrm{eff}}$ for the unocculted region of the star's photosphere ($1000\times $ solar, ${f}_{\mathrm{sed}}=0.01$, ${\rm{\Delta }}T\ ={354}_{-47}^{+46}$ K, ${({R}_{p}/{R}_{s})}_{o}={9054}_{-94}^{+110}$ ppm, ${\chi }^{2}=45.2$, $\mathrm{dof}=36$, ${\chi }_{\mathrm{red}}^{2}=1.25$). The best-fitting haze model includes a smaller mode particle size and the same haze-forming efficiency as the best-fitting exoplanet-only model and ${\rm{\Delta }}T\simeq +0.1{T}_{\mathrm{eff}}$ as well (r = 0.1 μm, ${f}_{\mathrm{haze}}=10 \% $, ${\rm{\Delta }}T={354}_{-46}^{+46}$ K, ${({R}_{p}/{R}_{s})}_{o}\ =-{3064}_{-102}^{+101}$ ppm, ${\chi }^{2}=40.5$, $\mathrm{dof}=36$, ${\chi }_{\mathrm{red}}^{2}=1.13$). This model is the best-fitting of all those we considered in this analysis. The posterior distributions for the free parameters in each of the best-fitting cloud and haze CPAT-temperature models are shown in Figure 15. The ${\rm{\Delta }}T$ and ${({R}_{p}/{R}_{s})}_{o}$ parameters are positively correlated due to the fact that larger temperature contrasts depress model ${R}_{p}/{R}_{s}$ values at all wavelengths and thus require larger offsets to bring models in line with observations. A model using a constant ${R}_{p}/{R}_{s}$ value for the exoplanet's transmission spectrum together with a CPAT-temperature model for the star provides a fit to the data comparable to that of the best-fitting CPAT-temperature cloud model but not as good as that of the best-fitting CPAT-temperature haze model (${\rm{\Delta }}T={336}_{-45}^{+54}$ K, ${R}_{p}/{R}_{s}=0.11680\pm 1\times {10}^{-4}$, χ2 = 45.6, dof = 36, ${\chi }_{\mathrm{red}}^{2}\ =1.27$).

Figure 14.

Figure 14. Reduced chi-squared maps for CPAT-temperature model fits. As with the CPAT-absorber models, we find the best fits for cloudy models with high metallicity ($1000\times $ solar) and thick clouds (${f}_{\mathrm{sed}}\sim 0.01\mbox{--}0.1$) and hazy models with relatively large particles (0.1–1 μm) and high haze-forming efficiencies (10%–30%). The free parameters in the fitting procedure are described in Section 6.3.3, and their optimized values are provided in Tables 8 and 9 for the cloudy and hazy grids (Morley et al. 2015), respectively.

Standard image High-resolution image
Figure 15.

Figure 15. Posterior distributions for free parameters in CPAT-temperature models. The left panel illustrates the posterior distributions for the free parameters in the best-fitting cloud model ($1000\times $ solar, ${f}_{\mathrm{sed}}=0.01$), and the right illustrates those for the best-fitting haze model (r = 0.1 μm, ${f}_{\mathrm{haze}}=10 \% $). Vertical dashed lines indicate the medians and 68% confidence intervals.

Standard image High-resolution image

Table 8.  Results from MCMC Fits to CPAT-temperature Models for Cloudy Atmospheres

Metallicity fseda ${\rm{\Delta }}T$ ${({R}_{p}/{R}_{s})}_{o}$ ${\chi }^{2}$ dof ${\chi }_{\mathrm{red}}^{2}$
(×Solar)   (K) (ppm)      
100 0.01 ${417}_{-41}^{+39}$ $-{45}_{-89}^{+91}$ 142.9 36 3.97
100 0.1 ${328}_{-43}^{+43}$ ${759}_{-90}^{+99}$ 368.9 36 10.2
100 1 ${344}_{-32}^{+30}$ ${2735}_{-68}^{+73}$ 3887 36 108
100 cloud-free ${389}_{-40}^{+29}$ ${3266}_{-83}^{+75}$ 6785 36 188
150 0.01 ${408}_{-41}^{+43}$ ${1734}_{-90}^{+98}$ 102 36 2.83
150 0.1 ${329}_{-45}^{+42}$ ${2424}_{-96}^{+95}$ 235.7 36 6.55
150 1 ${331}_{-34}^{+34}$ ${4039}_{-71}^{+82}$ 2563 36 71.2
150 cloud-free ${359}_{-28}^{+30}$ ${4457}_{-66}^{+69}$ 4555 36 127
200 0.01 ${399}_{-41}^{+43}$ ${3073}_{-92}^{+97}$ 80.93 36 2.25
200 0.1 ${333}_{-45}^{+43}$ ${3689}_{-97}^{+95}$ 166.3 36 4.62
200 1 ${324}_{-35}^{+39}$ ${5076}_{-81}^{+84}$ 1827 36 50.8
200 cloud-free ${347}_{-30}^{+29}$ ${5426}_{-66}^{+69}$ 3274 36 90.9
250 0.01 ${392}_{-42}^{+44}$ ${4126}_{-98}^{+95}$ 69.26 36 1.92
250 0.1 ${332}_{-44}^{+44}$ ${4676}_{-98}^{+95}$ 128.5 36 3.57
250 1 ${317}_{-38}^{+40}$ ${5903}_{-84}^{+87}$ 1368 36 38
250 cloud-free ${338}_{-32}^{+32}$ ${6213}_{-72}^{+72}$ 2489 36 69.1
300 0.01 ${384}_{-42}^{+45}$ ${4957}_{-93}^{+102}$ 61.85 36 1.72
300 0.1 ${332}_{-42}^{+46}$ ${5465}_{-99}^{+94}$ 104.1 36 2.89
300 1 ${315}_{-39}^{+41}$ ${6572}_{-87}^{+91}$ 1070 36 29.7
300 cloud-free ${329}_{-33}^{+35}$ ${6842}_{-76}^{+78}$ 1967 36 54.6
1000 0.01 ${354}_{-47}^{+46}$ ${9054}_{-94}^{+110}$ 45.15 36 1.25
1000 0.1 ${324}_{-47}^{+47}$ ${9331}_{-103}^{+103}$ 46.13 36 1.28
1000 1 ${309}_{-42}^{+47}$ ${9910}_{-96}^{+99}$ 210.9 36 5.86
1000 cloud-free ${298}_{-42}^{+47}$ ${10019}_{-92}^{+100}$ 404.8 36 11.2

Note. The free parameters in the fitting procedure, ${\rm{\Delta }}T$ and ${({R}_{p}/{R}_{s})}_{o}$, are described in Section 6.3.3. We report the median and 68% confidence intervals from the MCMC optimization procedure for each free parameter. With 38 data points and 2 fitted parameters, we assume 36 DOF when calculating the ${\chi }_{\mathrm{red}}^{2}$ statistic.

aCloud-free models do not have an fsed value.

Download table as:  ASCIITypeset image

Table 9.  Results from MCMC Fits to CPAT-temperature Models for Hazy Atmospheres

r fhaze ${\rm{\Delta }}T$ ${({R}_{p}/{R}_{s})}_{o}$ ${\chi }^{2}$ dof ${\chi }_{\mathrm{red}}^{2}$
(μm) (%) (K) (ppm)      
0.01 1 ${630}_{-29}^{+31}$ $-{1974}_{-73}^{+82}$ 673 36 18.7
0.01 3 ${546}_{-36}^{+36}$ $-{2860}_{-89}^{+88}$ 255 36 7.08
0.01 10 ${404}_{-38}^{+46}$ $-{3014}_{-91}^{+98}$ 55.96 36 1.55
0.01 30 ${345}_{-45}^{+48}$ $-{2416}_{-98}^{+106}$ 42.93 36 1.19
0.03 1 ${616}_{-29}^{+34}$ $-{2780}_{-79}^{+83}$ 542.8 36 15.1
0.03 3 ${527}_{-37}^{+39}$ $-{3089}_{-93}^{+87}$ 252.2 36 7.01
0.03 10 ${392}_{-41}^{+46}$ $-{2883}_{-89}^{+108}$ 51.3 36 1.43
0.03 30 ${344}_{-47}^{+46}$ $-{2145}_{-103}^{+103}$ 42.46 36 1.18
0.1 1 ${487}_{-38}^{+38}$ $-{3791}_{-87}^{+91}$ 227.4 36 6.32
0.1 3 ${445}_{-39}^{+40}$ $-{4334}_{-91}^{+90}$ 96.09 36 2.67
0.1 10 ${354}_{-46}^{+46}$ $-{3064}_{-102}^{+101}$ 40.52 36 1.13
0.1 30 ${329}_{-45}^{+51}$ $-{2222}_{-101}^{+107}$ 42.54 36 1.18
0.3 1 ${318}_{-37}^{+41}$ $-{2459}_{-82}^{+89}$ 1071 36 29.7
0.3 3 ${325}_{-44}^{+47}$ $-{5304}_{-102}^{+97}$ 53.34 36 1.48
0.3 10 ${322}_{-48}^{+48}$ $-{5029}_{-102}^{+107}$ 41.24 36 1.15
0.3 30 ${314}_{-45}^{+52}$ $-{3691}_{-104}^{+105}$ 43.98 36 1.22
1 1 ${358}_{-28}^{+27}$ ${174}_{-68}^{+62}$ 5470 36 152
1 3 ${297}_{-39}^{+44}$ $-{2298}_{-87}^{+93}$ 1689 36 46.9
1 10 ${272}_{-51}^{+51}$ $-{5893}_{-109}^{+107}$ 91.88 36 2.55
1 30 ${302}_{-47}^{+51}$ $-{6572}_{-102}^{+110}$ 47.19 36 1.31

Note. The free parameters in the fitting procedure, ${\rm{\Delta }}T$ and ${({R}_{p}/{R}_{s})}_{o}$, are described in Section 6.3.3. We report the median and 68% confidence intervals from the MCMC optimization procedure for each free parameter. With 38 data points and 2 fitted parameters, we assume 36 DOF when calculating the ${\chi }_{\mathrm{red}}^{2}$ statistic.

Download table as:  ASCIITypeset image

The transmission spectra for the best-fitting equilibrium cloud and photochemical haze CPAT-temperature models are shown in Figure 16. The constant ${R}_{p}/{R}_{s}$ model, shown as a dashed black line, illustrates the effect of a temperature heterogeneity in the stellar photosphere on an otherwise flat transmission spectrum. Its similarity in the optical to the cloud and haze models owes to the fact that the best-fitting exoplanet transmission spectra are essentially flat in the optical, and the observed variations are due to features imprinted by the heterogeneous stellar photosphere. In each of these cases, a region of the unocculted stellar disk is brighter than that occulted by the transiting planet, which effectively decreases the observed ${R}_{p}/{R}_{s}$ during the transit. The effect is chromatic, producing the largest change in ${R}_{p}/{R}_{s}$ in the optical, where the difference in emergent flux between the stellar spectral models is most pronounced.

Figure 16.

Figure 16. Best-fitting transmission spectra using the CPAT-temperature model. Models that include a temperature heterogeneity in the stellar photosphere provide the best fits to the near-infrared and optical measurements. Bright regions covering $3.2 \% $ of the unocculted stellar disk with a temperature contrast of ∼350 K can effectively decrease the flat ${R}_{p}/{R}_{s}$ (black), cloud (blue), and haze (orange) models to the observed optical values, while only minimally altering the near-infrared spectrum. The effect of cool, unocculted starspots (gray dashed line; see Section 6.3.3), however, does not match the optical data. The figure layout is the same as that of Figure 9.

Standard image High-resolution image

Cool, unocculted spots, on the other hand, would increase the observed ${R}_{p}/{R}_{s}$ values. We use the CPAT-temperature model to investigate how cool, unocculted starspots reported in the literature could affect the transmission spectrum. Long-term monitoring shows that GJ 1214 demonstrates a $1 \% $ peak-to-peak variability in the MEarth (Nutzman & Charbonneau 2008) bandpass ($715\,{\rm{nm}}\lt \lambda \lt 1000\,{\rm{nm}}$) on a timescale that is an integer multiple of 53 days (Berta et al. 2011), which has been attributed to rotational modulation of cool starspots (Charbonneau et al. 2009; Berta et al. 2011; Fraine et al. 2013). We calculate the spot-covering fraction implied by the variability using PHOENIX model spectra with $\mathrm{log}g=5.0$ and $[\mathrm{Fe}/{\rm{H}}]=0.0$. Assuming the spots are 10% cooler than the unspotted surface, following results from Doppler imaging of M dwarfs (Barnes et al. 2015), we utilize models with $T=3252$ K and $T=2927$ K for the unspotted and spotted surfaces, respectively, by linearly interpolating in temperature space between models in the PHOENIX grid. Integrating over the MEarth bandpass,17 we find that a spot-covering fraction $F=0.03$ of the stellar disk can reproduce the reported variability in the MEarth bandpass. The effect of such a spot configuration on the observed transmission spectrum is shown in gray on Figure 16. In the near-infrared, the scale of the effect is smaller than the reported measurement uncertainties. The most pronounced change is present in the optical, where the observed transmission spectrum is increased as much as ${\rm{\Delta }}({R}_{p}/{R}_{s})=8\times {10}^{-4}$ above the near-infrared spectrum. The optical transmission spectrum would decrease by a similar amount if these starspots were instead occulted by the transiting exoplanet (Pont et al. 2013). However, the observed decrease in the optical transmission spectrum is $\sim 3\times $ larger than what would be caused by these spots. Additionally, no brightening events due to occulted starspots are evident in our data (Figure 3).

In summary, we find that cool starspots cannot account for the decreased optical ${R}_{p}/{R}_{s}$ values, though unocculted bright regions of the photosphere with ${\rm{\Delta }}T\simeq +0.1{T}_{\mathrm{eff}}$ can decrease a flat optical transmission spectrum to the observed values.

6.3.4. Physical Interpretation of CPAT Model

The results of our modeling efforts indicate that the deviations from a flat optical transmission spectrum for GJ 1214b could be introduced by heterogeneities in the stellar photosphere, either in terms of temperature or the distribution of absorbers (Figures 13 and 16). However, the near-infrared transmission spectrum is largely unaffected by the stellar photosphere. The combination of the optical and near-infrared data, then, allows us to probe the stellar and planetary contributions to the transmission spectrum simultaneously.

The optical spectrum of a M4.5 dwarf like GJ 1214 is largely driven by opacity from TiO molecular bands (Morgan et al. 1943). Using the CPAT-absorber model framework, we find that an unocculted region of the stellar photosphere with relatively weak absorption in these bands could imprint spectral features on an otherwise flat exoplanetary transmission spectrum. However, the best-fitting CPAT-absorber model requires a metallicity contrast of ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]=-1.31$, corresponding to a depletion in the abundance of absorbers by a factor of 20 in the unocculted region. Efforts to measure a latitudinal dependence of the solar spectrum have found elemental abundances to be within 0.005 dex across latitudes (Kiselman et al. 2011). Moreover, as stars with masses less than 0.35 M (corresponding to spectral types M3.5 and later) are fully convective (Chabrier & Baraffe 1997), a viable mechanism for maintaining significant sustained abundance differences in the stellar photosphere of a M4.5 dwarf like GJ 1214 is not immediately apparent.

Temperature heterogeneities, however, are known to exist in stellar photospheres. We find a temperature heterogeneity could produce the observed discrepancy in ${R}_{p}/{R}_{s}$ between the optical and near-infrared if $\sim 3.2 \% $ of the unocculted stellar disk is ∼350 K hotter than the remaining photosphere. GJ 1214 is known to host cool starspots from starspot-crossing events detected in transit light curves (Carter et al. 2011; Kreidberg et al. 2014). Mid- to late-M dwarfs have also been found to have abundant polar spots from Doppler imaging. Barnes et al. (2015) found the M4.5 dwarf GJ 791.2A to have a mean spot coverage of 3.2% and a maximum spot coverage of 82.3% during its rotation, assuming spots are 300 K cooler than the photosphere. On the Sun, spots are accompanied by bright plage, which include faculae with temperature contrasts of 300–500 K with their surroundings (Topka et al. 1997). Brightening from these faculae overpowers spot darkening, causing total solar irradiance to increase during solar maxima (Fröhlich & Lean 1998; Meunier et al. 2010). Faculae cover roughly 0.36% of the sky-projected solar disk during periods of low solar activity and 3% during high activity (Shapiro et al. 2014). Additionally, faculae are much more common than spots on the Sun, covering $100\times $ more disk-area during periods of low activity, and $10\times $ more area during periods of high activity (Shapiro et al. 2014).

Like the Sun, old, slowly rotating FGK stars are known to have activity cycles dominated by faculae, unlike their younger counterparts, which are spot-dominated (Radick et al. 1983; Lockwood et al. 2007). With an age of 3–10 Gyr (Charbonneau et al. 2009), the photometric variability of GJ 1214 may be faculae-dominated as well. Our data are most consistent with unocculted faculae in the photosphere of GJ 1214 producing the observed offset in the optical transmission spectrum of GJ 1214b.

6.3.5. Observational Considerations of Heterogeneous Stellar Photospheres

The results of our CPAT-absorber and CPAT-temperature modeling efforts demonstrate that, in principle, a heterogeneous stellar photosphere can provide a transmission spectroscopy signal larger than that introduced by the exoplanetary atmosphere in the optical, adding another degeneracy to the modeling and interpretation of exoplanet spectra. At longer wavelengths, however, the effect is less pronounced for a star with the Teff of GJ 1214, and differences between the atmospheric models are more apparent. We model the effect out to the maximum wavelength of the PHOENIX stellar models (5.5 μm), and find the largest differences between transmission spectra for cloudy and hazy atmospheres at 2–5 μm for both CPAT-absorber and CPAT-temperature cases. This would be a promising region to target in transmission spectroscopy studies with the James Webb Space Telescope (JWST), in order to distinguish between cloudy and hazy atmospheres for this planet. Optical spectra like those presented here can complement JWST investigations of GJ 1214 and other targets by constraining the contribution of heterogeneous stellar photospheres to observed transmission spectra.

This analysis suggests that facular brightening may contribute more to transmission spectra of exoplanets than has been previously recognized. If common for M dwarfs, this photospheric heterogeneity could complicate optical transmission spectroscopy studies of exoplanets around small stars, such as GJ 1132b (Berta-Thompson et al. 2015) and TRAPPIST-1b, c, and d (Gillon et al. 2016), including searches for Rayleigh scattering. While unocculted starspots can mimic the transmission signature of scattering in an exoplanetary atmosphere (e.g., McCullough et al. 2014), facular brightening has the potential to mask a scattering signature by reducing the Rayleigh scattering slope in a transmission spectrum. However, the method we provide here can be used to take unocculted faculae into account if high SNR transmission spectra are obtained. Additionally, the same exoplanets would provide a unique opportunity to gain spatial information about M dwarf surfaces if the exoplanetary and photospheric contributions can be uniquely identified. The Transiting Exoplanet Survey Satellite (TESS), which will utilize a long-pass filter from 0.6 to 1.0 μm, will measure transit depths for potentially hundreds of planets around M dwarfs; comparing the TESS transit depths with follow-up measurements in the near-infrared could be one way of probing for spectral features introduced by a heterogeneous stellar photosphere. Along a different avenue of research, future work can investigate the range of stellar temperatures and metallicities for which this effect will be important.

7. SUMMARY

We have presented an optical transmission spectrum of the sub-Neptune GJ 1214b measured during three transits with Magellan/IMACS. The spectrum, which covers 4500–9260 Å in 14 bins with a mean value of ${R}_{p}/{R}_{s}=0.1146\pm 2\times {10}^{-4}$ and mean uncertainty of $\sigma ({R}_{p}/{R}_{s})=8.7\times {10}^{-4}$, is offset below the near-infrared ${R}_{p}/{R}_{s}$ values previously reported and cannot be reproduced by cloud/haze models for this exoplanet. The key points from this study are the following:

  • 1.  
    We find consistent spectra from three different transits taken with two different instrument configurations and reduced with different approaches, resulting in one of the most robust ground-based transmission spectra of a sub-Jovian exoplanet.
  • 2.  
    We find that the optical transit depth is shallower than that measured in the near-infrared. The data hint at more variation in ${R}_{p}/{R}_{s}$ at optical wavelengths than has been observed in the near-infrared.
  • 3.  
    We describe a new model, CPAT, that can be used to evaluate the effect of heterogeneous stellar photospheres in the interpretation of exoplanet transmission spectra.
  • 4.  
    We use the CPAT model in the case of GJ 1214b. We find that the data are not consistent with a perfectly homogeneous stellar photosphere or a photosphere that is compositionally heterogeneous but isothermal.
  • 5.  
    The data are fit best by a model with thick haze (r = 0.1 μm, ${f}_{\mathrm{haze}}=10 \% $) in the exoplanet's atmosphere and hotter photospheric features covering 3.2% of the unocculted stellar disk with a temperature contrast ${\rm{\Delta }}T={354}_{-46}^{+46}$ K. The parameters of these features are consistent with those of solar faculae.
  • 6.  
    Our results highlight the importance of heterogeneous stellar photospheres for the correct interpretation of optical transmission spectra of transiting planets and show that transiting planets may be used as probes of stellar photospheric features.

We thank the anonymous referee for their suggestions and comments, which helped to greatly improve the manuscript. This paper includes data gathered with the 6.5 meter Magellan Telescopes located at Las Campanas Observatory, Chile. We thank the operations staff at Las Campanas Observatory for their support with the observations. We thank the University of Arizona, Harvard, Carnegie, Massachusetts Institute of Technology, and Chilean National TACs for allocating the telescope time for ACCESS. B.R. acknowledges support from the National Science Foundation Graduate Research Fellowship Program under Grant No. DGE-1143953. N.E. is supported by CONICYT-PCHA/Doctorado Nacional. A.J. acknowledges support from FONDECYT project 1130857, the Ministry of Economy, Development, and Tourism's Millennium Science Initiative through Grant IC120009, awarded to the Millennium Institute of Astrophysics, MAS, and BASAL CATA PFB06. The results reported herein benefited from collaborations and/or information exchange within NASA's Nexus for Exoplanet System Science (NExSS) research coordination network sponsored by NASA's Science Mission Directorate.

Facility: Magellan: Baade (IMACS). -

Footnotes

Please wait… references are loading.
10.3847/1538-4357/aa4f6c