This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.

IS THE EXPANSION OF THE UNIVERSE ACCELERATING? ALL SIGNS POINT TO YES

and

Published 2016 December 20 © 2016. The American Astronomical Society. All rights reserved.
, , Citation D. Rubin and B. Hayden 2016 ApJL 833 L30 DOI 10.3847/2041-8213/833/2/L30

2041-8205/833/2/L30

ABSTRACT

The accelerating expansion of the universe is one of the most profound discoveries in modern cosmology, suggesting a universe in which 70% of the mass-energy density has an unknown form spread uniformly across the universe. This result has been well established using a combination of cosmological probes, resulting in a "standard model" of modern cosmology that is a combination of a cosmological constant with cold dark matter and baryons. The first compelling evidence for the acceleration came in the late 1990s, when two independent teams studying Type Ia supernovae discovered that distant SNe Ia were dimmer than expected. The combined analysis of modern cosmology experiments, including SNe Ia, the Hubble constant, baryon acoustic oscillations, and the cosmic microwave background, has now measured the contributions of matter and the cosmological constant to the energy density of the universe to better than 0.01, providing a secure measurement of acceleration. A recent study has claimed that the evidence for acceleration from SNe Ia is "marginal." Here we demonstrate errors in that analysis that reduce the acceleration significance from SNe Ia, and further demonstrate that conservative constraints on the curvature or matter density of the universe increase the significance even more. Analyzing the Joint Light-curve Analysis supernova sample, we find $4.2\sigma $ evidence for acceleration with SNe Ia alone, and $11.2\sigma $ in a flat universe. With our improved supernova analysis and  not rejecting all other cosmological constraints, we find that acceleration is quite secure.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

The discovery of the accelerating universe by two teams (Riess et al. 1998; Perlmutter et al. 1999) in the late 1990s was one of the major breakthroughs in cosmology. Using Type Ia supernovae (SNe Ia) as standard candles, both teams independently determined that high-redshift SNe were fainter than expected in a matter-dominated universe, implying the need for a cosmological constant (or more generally, dark energy) to accelerate the expansion of the universe, increasing cosmological distance as a function of redshift.

SNe Ia are not perfect standard candles, however. Work leading up to the discovery (Phillips 1993; Hamuy et al. 1996; Riess et al. 1996; Perlmutter et al. 1997) demonstrated the need for empirical standardization relations. Peak absolute magnitudes correlate with the width of the light curve (broader light-curve SNe are more luminous) and the color of the supernova (redder SNe are less luminous). In the years since, other empirical standardization relations have been noted, including one related to host galaxy stellar mass (Kelly et al. 2010; Sullivan et al. 2010); perhaps driven significantly by the local star formation rate; Rigault et al. 2013).

The Sloan Digital Sky Survey (SDSS) and SuperNova Legacy Survey (SNLS) SN teams have completed the Joint Light-curve Analysis (JLA; Betoule et al. 2014). This analysis incorporates a thorough recalibration of both surveys (Betoule et al. 2013), and the full set of spectroscopically confirmed SDSS SNe Ia (Sako et al. 2014); it represents the most up-to-date large SNe Ia compilation.4 A recent claim (Trøst Nielsen et al. 2016, hereafter N16) was made that this data set provides only "marginal evidence" for acceleration. We examine the statistical model N16 used to make this claim, and find it deficient for the task. In particular, a simple (and well-justified) update of the model to better account for changes in the observed SN light curve parameter distributions with redshift significantly increases the statistical strength of the acceleration evidence.

2. THE STATISTICAL MODEL

In the case of JLA, the standardization relations employed are light curve width (x1 in the framework of SALT2, Guy et al. 2007), color (c), and host galaxy stellar mass. The dependent variable is taken to be the rest-frame B-band magnitude (mB). The light curve parameters are determined by comparing a rest-frame spectral energy distribution model to the observer-frame photometry; similarly, the host stellar mass is estimated from broadband photometry. The cosmological results rely on the ability of the statistical framework to fit the standardization relations (in JLA, these are taken to be linear in x1 and c, and a step function in host mass), yet the uncertainties (a general term that we take to include unexplained dispersion around the model) in the dependent variable (mB) and independent variables (x1, c, host mass) are of similar size. The JLA analysis itself used a frequentist line-fitting procedure with only modest biases in its regime of applicability (Mosher et al. 2014).

In contrast, the statistical model from N16 uses a Bayesian Hierarchical Model (c.f., Gull 1989). In the N16 model, the latent ("true") parameters for each SN are modeled with nuisance parameters, which are marginalized over to obtain an inference of the global parameters. The distribution of the latent parameters must be adequately described by the prior. For example, flat priors on the latent variables cause a bias in the fit (Gull 1989). Making the parameters of the prior ("hyperparameters") part of the model avoids this bias (this multi-level nature gives rise to the name "Hierarchical").

The key shortcoming of the N16 model is that it assumes redshift-independent distributions for x1 and c. As shown in Figure 1, the observed distributions (plotted points) are far from redshift-independent. Two effects visible in the data—selection effects and the correlation between older host galaxies and narrower light-curve (lower x1) SNe (Hamuy et al. 1995)—result in a more luminous distribution of SNe as an increasing function of redshift. The selection effects are particularly evident in color, where only bluer SNe (more negative c) are above the completeness limit for the high-redshift end of each ground-based sample. By incorrectly treating these distributions as redshift-independent, N16 biased their latent x1 and c toward the global mean, effectively removing some of the standardization (Conley et al. 2007; Wood-Vasey et al. 2007; Karpenka 2015).5 The JLA sample was corrected for selection bias in Betoule et al. (2014), but only in the sense that SNe that are selected to be more luminous after standardization are adjusted to be less luminous. The bias correction cannot compensate for a deficient standardization, as provided by a constant-in-redshift model of the distributions.

Figure 1.

Figure 1. Binned x1 (left panels) and c (right panels) light curve parameters as a function of redshift for the JLA sample. The trend of color with redshift within each ground-based sample is expected due to the combination of the color-luminosity relation combined with redshift-dependent luminosity detection limits. The top panels show the 68% credible constraints on a constant-in-redshift model, as was used in N16. The bottom panels show our proposed revision. Failing to model the drift in the mean observed distributions demonstrated by the bottom panels will tend to cause high-redshift SNe to appear brighter on average, therefore reducing the significance of accelerating expansion.

Standard image High-resolution image

2.1. Redshift-independent Distributions

As a starting point, we perform an analysis similar to that in N16, using Hamiltonian Monte Carlo to sample from the posterior (we describe the details in the Appendix). We assume a cosmological model with cold matter and a cosmological constant (ΛCDM). We make four measurements: a ΛCDM universe allowed to have spatial curvature (i.e., ${{\rm{\Omega }}}_{m}+{{\rm{\Omega }}}_{{\rm{\Lambda }}}\ne 1$), and a flat ΛCDM universe (the assumption of flatness is discussed more in the discussion section), each with both sets of model assumptions. We compute the deceleration parameter q0, (${q}_{0}\equiv {-\tfrac{\ddot{a}}{a{H}^{2}}|}_{t={t}_{0}}$, equal to ${{\rm{\Omega }}}_{m}/2-{{\rm{\Omega }}}_{{\rm{\Lambda }}}$ for a ΛCDM cosmology). We evaluate the statistical significance of acceleration (${q}_{0}\lt 0$) by comparing the 50th percentile of the posterior with the difference of the 50th percentile and the 84th percentile (taken as an estimate of 1σ in the $+{q}_{0}$ direction), and then rounding to 0.1σ. The statistical significance of the acceleration is $3.1\sigma $ with no constraint on curvature, and $8.7\sigma $ assuming a flat universe (see Figures 2, 3, left panels). The difference between this estimate and one derived from explicitly measuring the fraction of samples with ${q}_{0}\gt 0$ is modest (66 posterior samples out of 60,000 have ${q}_{0}\gt 0$).

Figure 2.

Figure 2.  ${{\rm{\Omega }}}_{m}$${{\rm{\Omega }}}_{{\rm{\Lambda }}}$ constraints enclosing 68.3% and 95.4% of the samples from the posterior. Underneath, we plot all samples. The left panel shows the constraints obtained with x1 and c distributions that are constant in redshift, as in the N16 analysis; the right panel shows the constraints from our model. The red square and blue circle show the location of the median of the samples from the respective posteriors.

Standard image High-resolution image

2.2. Redshift-dependent Distributions

Next, we introduce a simple model of the observed distributions as a function of redshift. We allow each source of SN discovery (Nearby, Sloan Digital Sky Survey, SuperNova Legacy Survey, Hubble Space Telescope) to have a linear variation in the mean with redshift (for the Hubble Space Telescope SNe, we use only a constant mean in redshift, as this sample is too small to constrain any variation). This model is shown in Figure 1; the variation with redshift is highly statistically significant. We also try a more flexible model in redshift (Rubin et al. 2015), and it makes only a small difference (the only requirement on the model is to be at least as flexible in redshift as the cosmological model under consideration; Rubin et al. 2015). The statistical significance of the acceleration increases to $4.2\sigma $ and $11.2\sigma $ assuming a flat universe (see Figures 2, 3, right panels). Again, the difference between this estimate and one derived from explicitly measuring the fraction of samples with ${q}_{0}\gt 0$ is modest (only one posterior sample out of 60,000 has ${q}_{0}\gt 0$).

Figure 3.

Figure 3. q0 histograms, normalized to have an integral of unity. The left panels show the constraints for each cosmology with a constant-in-redshift model of the light curve parameter distributions, as in N16; the right panels show our model. In every case, the statistical significance of the acceleration is higher with our redshift-dependent distribution model. The top row shows the results for ΛCDM cosmologies with curvature allowed. The next row down shows ΛCDM cosmologies with a flat universe. The next row shows the constraints with the kinematic expansion in redshift. Finally, the bottom row shows the results for ${{\rm{\Omega }}}_{m}$w.

Standard image High-resolution image

2.3. Other Cosmological Models

For a result that relies only on kinematics, we also compute q0 constraints using the Visser (2004) series expansion of luminosity distance as a function of redshift. We take the first three terms (including q0 and ${j}_{0}\equiv {\tfrac{\dddot{a}}{a{H}^{3}}|}_{t={t}_{0}}$). The q0 constraints are illustrated on the third row of Figure 3. Even with flat priors on q0 and j0 (allowing the kinematics to venture into regions of parameter space that may be hard to realize dynamically), we find 3.7σ evidence for acceleration (2.8σ with the N16 model). SN data alone can be used to derive constraints on the joint posterior of ${{\rm{\Omega }}}_{m}$ and the dark energy equation of state parameter ($w={P}_{\mathrm{DE}}/{\rho }_{\mathrm{DE}}$), assuming a flat universe (Garnavich et al. 1998; Perlmutter et al. 1999); we compute constraints for this model as well. For simplicity, we take a flat prior on both ${{\rm{\Omega }}}_{m}$ and w, and find strong evidence for ${q}_{0}\lt 0$, as shown in the bottom row of Figure 3. The constraints on q0 are non-Gaussian, but in both the N16 model and ours, no samples (out of 60,000) have ${q}_{0}\gt 0$.

We next project the constraints from each of our four models to the q0$[{j}_{0}-{{\rm{\Omega }}}_{k}]$ plane, shown in Figure 4 (both j0 and ${{\rm{\Omega }}}_{k}$ contribute linearly at the same order in luminosity distance, so we cannot distinguish them in this plane). The constraints for 2D models (models other than flat ΛCDM) are similar. However, the high q0/low $[{j}_{0}-{{\rm{\Omega }}}_{k}]$ region is disfavored by the dynamical models, as ${{\rm{\Omega }}}_{m}$w cannot reach ${j}_{0}\lt -1/8$, and even an empty universe ("Milne Model" Milne 1935) only has ${j}_{0}-{{\rm{\Omega }}}_{k}=-1$.

Figure 4.

Figure 4. q0$[{j}_{0}-{{\rm{\Omega }}}_{k}]$ constraints enclosing 95.4% of the samples from the posterior. For the flat ΛCDM results, we show the 1D credible interval.

Standard image High-resolution image

3. DISCUSSION

Our results (flat universe ${{\rm{\Omega }}}_{m}={0.298}_{-0.031}^{+0.033}$) are similar to the frequentist JLA analysis (flat universe ${{\rm{\Omega }}}_{m}=0.295\pm 0.034$). This is unsurprising; frequentist and Bayesian analyses will converge to exactly the same results under a set of assumptions not far from those made here (Rubin et al. 2015). We also note that more advanced analyses can better take into account statistical properties of the data (modeling selection effects, nonlinear standardization relations, a redshift-dependent host-mass relation, outliers, and a model of unexplained dispersion incorporating x1 and c) (Rubin et al. 2015). N16 did not include the host-mass standardization; excluding this only has a small impact on our results.6 However, we focus our attention on the N16 model of the x1 and c distributions, as it is this model that drives the difference from the JLA analysis.

While constraints derived from SNe Ia alone require a ∼30% flatness constraint to push the supernova measurement of acceleration above 5σ, current experiments have constrained curvature to much better precision than 1% (Planck Collaboration et al. 2016). Even constraints on ${{\rm{\Omega }}}_{m}$ (e.g., galaxy clusters, Allen et al. 2011), which imply ${{\rm{\Omega }}}_{m}\gt 0.2$, cut off the tail of the SN-only posterior extending down to a Milne universe and ${q}_{0}\gt 0$, allowing the acceleration to again reach 5σ confidence.7 With the combination of current experiments (SNe Ia, baryon acoustic oscillations, cosmic microwave background, and the Hubble constant), ${{\rm{\Omega }}}_{{\rm{\Lambda }}}$ is constrained to be 0.6911 ± 0.0062 (Planck Collaboration et al. 2016). In order to claim that the evidence for acceleration is "marginal," it is necessary to fully reject all measurements of the curvature of the universe, the basic constraints on the matter density of the universe, and other cosmological data sets.

Even without external constraints, this work demonstrates that a more accurate model for the supernova analysis greatly increases the significance of acceleration. We conclude that the analysis in N16 is both incorrect in its method and unreasonable in its assumptions, leading the authors to question a result that is quite secure when addressed properly.

We appreciate the feedback we received from Greg Aldering, Peter Nugent, Saurabh Jha, Saul Perlmutter, Alex Kim, Peter Garnavich, and Mike Hobson. Support was provided by the Director, Office of Science, Office of High Energy Physics, of the U.S. Department of Energy under contract No. DE-AC02-05CH11231 and NASA ROSES-14 WFIRST Preparatory Science program 14-WPS14-0050.

APPENDIX: SAMPLING FROM THE POSTERIOR

We sample from the posterior using Stan (Carpenter et al. 2016) through PyStan (https://pystan.readthedocs.io). Following Trøst Nielsen et al. (2016), we assume flat priors on all parameters, but require ${{\rm{\Omega }}}_{m}\gt 0$. Our chains are 2500 samples each (after warmup), and show excellent convergence (the diagnostic of Gelman & Rubin 1992 is smaller than 1.01). We run 24 chains for the results with curvature, and 8 chains for the flat universe results.

As in Rubin et al. (2015), in order to speed up sampling, we decompose the light-curve fit covariance matrix into its eigenvectors and sample over the projections onto these (this results in a covariance matrix that has no correlations between SNe). In addition to a fully Bayesian analysis, the only other change we make from N16 is to include the host-mass standardization, as was done in JLA. We make our code available at https://zenodo.org/badge/latestdoi/71841571 (Rubin 2016).

Footnotes

  • Currently, JLA represents $\sim 60 \% $ of the world sample of SNe Ia, so we expect constraints to continue to rapidly improve.

  • N16 claim their model is not Bayesian, and use frequentist inference for some global parameters, but the marginalization in their Equation (8) is a Bayesian approach. The priors assumed thus affect the inference.

  • For ΛCDM with curvature, the significance of the acceleration changes from $4.2\sigma $ with to $4.3\sigma $ without the host-mass relation.

  • N16 generalize the Milne model to mean any universe with q = 0 at all times, rather than specifically an empty universe. In the q0$[{j}_{0}-{{\rm{\Omega }}}_{k}]$ plane (Figure 4), it is clear why the distinction is important; models that are curvature-dominated allow low ${j}_{0}-{{\rm{\Omega }}}_{k}$, and thus high q0; even weak flatness constraints disfavor that portion of parameter space.

Please wait… references are loading.
10.3847/2041-8213/833/2/L30