ABSTRACT
The accelerating expansion of the universe is one of the most profound discoveries in modern cosmology, suggesting a universe in which 70% of the mass-energy density has an unknown form spread uniformly across the universe. This result has been well established using a combination of cosmological probes, resulting in a "standard model" of modern cosmology that is a combination of a cosmological constant with cold dark matter and baryons. The first compelling evidence for the acceleration came in the late 1990s, when two independent teams studying Type Ia supernovae discovered that distant SNe Ia were dimmer than expected. The combined analysis of modern cosmology experiments, including SNe Ia, the Hubble constant, baryon acoustic oscillations, and the cosmic microwave background, has now measured the contributions of matter and the cosmological constant to the energy density of the universe to better than 0.01, providing a secure measurement of acceleration. A recent study has claimed that the evidence for acceleration from SNe Ia is "marginal." Here we demonstrate errors in that analysis that reduce the acceleration significance from SNe Ia, and further demonstrate that conservative constraints on the curvature or matter density of the universe increase the significance even more. Analyzing the Joint Light-curve Analysis supernova sample, we find evidence for acceleration with SNe Ia alone, and in a flat universe. With our improved supernova analysis and not rejecting all other cosmological constraints, we find that acceleration is quite secure.
Export citation and abstract BibTeX RIS
1. INTRODUCTION
The discovery of the accelerating universe by two teams (Riess et al. 1998; Perlmutter et al. 1999) in the late 1990s was one of the major breakthroughs in cosmology. Using Type Ia supernovae (SNe Ia) as standard candles, both teams independently determined that high-redshift SNe were fainter than expected in a matter-dominated universe, implying the need for a cosmological constant (or more generally, dark energy) to accelerate the expansion of the universe, increasing cosmological distance as a function of redshift.
SNe Ia are not perfect standard candles, however. Work leading up to the discovery (Phillips 1993; Hamuy et al. 1996; Riess et al. 1996; Perlmutter et al. 1997) demonstrated the need for empirical standardization relations. Peak absolute magnitudes correlate with the width of the light curve (broader light-curve SNe are more luminous) and the color of the supernova (redder SNe are less luminous). In the years since, other empirical standardization relations have been noted, including one related to host galaxy stellar mass (Kelly et al. 2010; Sullivan et al. 2010); perhaps driven significantly by the local star formation rate; Rigault et al. 2013).
The Sloan Digital Sky Survey (SDSS) and SuperNova Legacy Survey (SNLS) SN teams have completed the Joint Light-curve Analysis (JLA; Betoule et al. 2014). This analysis incorporates a thorough recalibration of both surveys (Betoule et al. 2013), and the full set of spectroscopically confirmed SDSS SNe Ia (Sako et al. 2014); it represents the most up-to-date large SNe Ia compilation.4 A recent claim (Trøst Nielsen et al. 2016, hereafter N16) was made that this data set provides only "marginal evidence" for acceleration. We examine the statistical model N16 used to make this claim, and find it deficient for the task. In particular, a simple (and well-justified) update of the model to better account for changes in the observed SN light curve parameter distributions with redshift significantly increases the statistical strength of the acceleration evidence.
2. THE STATISTICAL MODEL
In the case of JLA, the standardization relations employed are light curve width (x1 in the framework of SALT2, Guy et al. 2007), color (c), and host galaxy stellar mass. The dependent variable is taken to be the rest-frame B-band magnitude (mB). The light curve parameters are determined by comparing a rest-frame spectral energy distribution model to the observer-frame photometry; similarly, the host stellar mass is estimated from broadband photometry. The cosmological results rely on the ability of the statistical framework to fit the standardization relations (in JLA, these are taken to be linear in x1 and c, and a step function in host mass), yet the uncertainties (a general term that we take to include unexplained dispersion around the model) in the dependent variable (mB) and independent variables (x1, c, host mass) are of similar size. The JLA analysis itself used a frequentist line-fitting procedure with only modest biases in its regime of applicability (Mosher et al. 2014).
In contrast, the statistical model from N16 uses a Bayesian Hierarchical Model (c.f., Gull 1989). In the N16 model, the latent ("true") parameters for each SN are modeled with nuisance parameters, which are marginalized over to obtain an inference of the global parameters. The distribution of the latent parameters must be adequately described by the prior. For example, flat priors on the latent variables cause a bias in the fit (Gull 1989). Making the parameters of the prior ("hyperparameters") part of the model avoids this bias (this multi-level nature gives rise to the name "Hierarchical").
The key shortcoming of the N16 model is that it assumes redshift-independent distributions for x1 and c. As shown in Figure 1, the observed distributions (plotted points) are far from redshift-independent. Two effects visible in the data—selection effects and the correlation between older host galaxies and narrower light-curve (lower x1) SNe (Hamuy et al. 1995)—result in a more luminous distribution of SNe as an increasing function of redshift. The selection effects are particularly evident in color, where only bluer SNe (more negative c) are above the completeness limit for the high-redshift end of each ground-based sample. By incorrectly treating these distributions as redshift-independent, N16 biased their latent x1 and c toward the global mean, effectively removing some of the standardization (Conley et al. 2007; Wood-Vasey et al. 2007; Karpenka 2015).5 The JLA sample was corrected for selection bias in Betoule et al. (2014), but only in the sense that SNe that are selected to be more luminous after standardization are adjusted to be less luminous. The bias correction cannot compensate for a deficient standardization, as provided by a constant-in-redshift model of the distributions.
2.1. Redshift-independent Distributions
As a starting point, we perform an analysis similar to that in N16, using Hamiltonian Monte Carlo to sample from the posterior (we describe the details in the
Download figure:
Standard image High-resolution image2.2. Redshift-dependent Distributions
Next, we introduce a simple model of the observed distributions as a function of redshift. We allow each source of SN discovery (Nearby, Sloan Digital Sky Survey, SuperNova Legacy Survey, Hubble Space Telescope) to have a linear variation in the mean with redshift (for the Hubble Space Telescope SNe, we use only a constant mean in redshift, as this sample is too small to constrain any variation). This model is shown in Figure 1; the variation with redshift is highly statistically significant. We also try a more flexible model in redshift (Rubin et al. 2015), and it makes only a small difference (the only requirement on the model is to be at least as flexible in redshift as the cosmological model under consideration; Rubin et al. 2015). The statistical significance of the acceleration increases to and assuming a flat universe (see Figures 2, 3, right panels). Again, the difference between this estimate and one derived from explicitly measuring the fraction of samples with is modest (only one posterior sample out of 60,000 has ).
Download figure:
Standard image High-resolution image2.3. Other Cosmological Models
For a result that relies only on kinematics, we also compute q0 constraints using the Visser (2004) series expansion of luminosity distance as a function of redshift. We take the first three terms (including q0 and ). The q0 constraints are illustrated on the third row of Figure 3. Even with flat priors on q0 and j0 (allowing the kinematics to venture into regions of parameter space that may be hard to realize dynamically), we find 3.7σ evidence for acceleration (2.8σ with the N16 model). SN data alone can be used to derive constraints on the joint posterior of and the dark energy equation of state parameter (), assuming a flat universe (Garnavich et al. 1998; Perlmutter et al. 1999); we compute constraints for this model as well. For simplicity, we take a flat prior on both and w, and find strong evidence for , as shown in the bottom row of Figure 3. The constraints on q0 are non-Gaussian, but in both the N16 model and ours, no samples (out of 60,000) have .
We next project the constraints from each of our four models to the q0– plane, shown in Figure 4 (both j0 and contribute linearly at the same order in luminosity distance, so we cannot distinguish them in this plane). The constraints for 2D models (models other than flat ΛCDM) are similar. However, the high q0/low region is disfavored by the dynamical models, as –w cannot reach , and even an empty universe ("Milne Model" Milne 1935) only has .
Download figure:
Standard image High-resolution image3. DISCUSSION
Our results (flat universe ) are similar to the frequentist JLA analysis (flat universe ). This is unsurprising; frequentist and Bayesian analyses will converge to exactly the same results under a set of assumptions not far from those made here (Rubin et al. 2015). We also note that more advanced analyses can better take into account statistical properties of the data (modeling selection effects, nonlinear standardization relations, a redshift-dependent host-mass relation, outliers, and a model of unexplained dispersion incorporating x1 and c) (Rubin et al. 2015). N16 did not include the host-mass standardization; excluding this only has a small impact on our results.6 However, we focus our attention on the N16 model of the x1 and c distributions, as it is this model that drives the difference from the JLA analysis.
While constraints derived from SNe Ia alone require a ∼30% flatness constraint to push the supernova measurement of acceleration above 5σ, current experiments have constrained curvature to much better precision than 1% (Planck Collaboration et al. 2016). Even constraints on (e.g., galaxy clusters, Allen et al. 2011), which imply , cut off the tail of the SN-only posterior extending down to a Milne universe and , allowing the acceleration to again reach 5σ confidence.7 With the combination of current experiments (SNe Ia, baryon acoustic oscillations, cosmic microwave background, and the Hubble constant), is constrained to be 0.6911 ± 0.0062 (Planck Collaboration et al. 2016). In order to claim that the evidence for acceleration is "marginal," it is necessary to fully reject all measurements of the curvature of the universe, the basic constraints on the matter density of the universe, and other cosmological data sets.
Even without external constraints, this work demonstrates that a more accurate model for the supernova analysis greatly increases the significance of acceleration. We conclude that the analysis in N16 is both incorrect in its method and unreasonable in its assumptions, leading the authors to question a result that is quite secure when addressed properly.
We appreciate the feedback we received from Greg Aldering, Peter Nugent, Saurabh Jha, Saul Perlmutter, Alex Kim, Peter Garnavich, and Mike Hobson. Support was provided by the Director, Office of Science, Office of High Energy Physics, of the U.S. Department of Energy under contract No. DE-AC02-05CH11231 and NASA ROSES-14 WFIRST Preparatory Science program 14-WPS14-0050.
APPENDIX: SAMPLING FROM THE POSTERIOR
We sample from the posterior using Stan (Carpenter et al. 2016) through PyStan (https://pystan.readthedocs.io). Following Trøst Nielsen et al. (2016), we assume flat priors on all parameters, but require . Our chains are 2500 samples each (after warmup), and show excellent convergence (the diagnostic of Gelman & Rubin 1992 is smaller than 1.01). We run 24 chains for the results with curvature, and 8 chains for the flat universe results.
As in Rubin et al. (2015), in order to speed up sampling, we decompose the light-curve fit covariance matrix into its eigenvectors and sample over the projections onto these (this results in a covariance matrix that has no correlations between SNe). In addition to a fully Bayesian analysis, the only other change we make from N16 is to include the host-mass standardization, as was done in JLA. We make our code available at https://zenodo.org/badge/latestdoi/71841571 (Rubin 2016).
Footnotes
- 4
Currently, JLA represents of the world sample of SNe Ia, so we expect constraints to continue to rapidly improve.
- 5
N16 claim their model is not Bayesian, and use frequentist inference for some global parameters, but the marginalization in their Equation (8) is a Bayesian approach. The priors assumed thus affect the inference.
- 6
For ΛCDM with curvature, the significance of the acceleration changes from with to without the host-mass relation.
- 7
N16 generalize the Milne model to mean any universe with q = 0 at all times, rather than specifically an empty universe. In the q0– plane (Figure 4), it is clear why the distinction is important; models that are curvature-dominated allow low , and thus high q0; even weak flatness constraints disfavor that portion of parameter space.