A publishing partnership

The following article is Open access

A Comprehensive Measurement of the Local Value of the Hubble Constant with 1 km s−1 Mpc−1 Uncertainty from the Hubble Space Telescope and the SH0ES Team

, , , , , , , , , , , , , , , , , and

Published 2022 July 22 © 2022. The Author(s). Published by the American Astronomical Society.
, , Citation Adam G. Riess et al 2022 ApJL 934 L7 DOI 10.3847/2041-8213/ac5c5b

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

2041-8205/934/1/L7

Abstract

We report observations from the Hubble Space Telescope (HST) of Cepheid variables in the host galaxies of 42 Type Ia supernovae (SNe Ia) used to calibrate the Hubble constant (H0). These include the complete sample of all suitable SNe Ia discovered in the last four decades at redshift z ≤ 0.01, collected and calibrated from ≥1000 HST orbits, more than doubling the sample whose size limits the precision of the direct determination of H0. The Cepheids are calibrated geometrically from Gaia EDR3 parallaxes, masers in NGC 4258 (here tripling that sample of Cepheids), and detached eclipsing binaries in the Large Magellanic Cloud. All Cepheids in these anchors and SN Ia hosts were measured with the same instrument (WFC3) and filters (F555W, F814W, F160W) to negate zero-point errors. We present multiple verifications of Cepheid photometry and six tests of background determinations that show Cepheid measurements are accurate in the presence of crowded backgrounds. The SNe Ia in these hosts calibrate the magnitude–redshift relation from the revised Pantheon+ compilation, accounting here for covariance between all SN data and with host properties and SN surveys matched throughout to negate systematics. We decrease the uncertainty in the local determination of H0 to 1 km s−1 Mpc−1 including systematics. We present results for a comprehensive set of nearly 70 analysis variants to explore the sensitivity of H0 to selections of anchors, SN surveys, redshift ranges, the treatment of Cepheid dust, metallicity, form of the period–luminosity relation, SN color, peculiar-velocity corrections, sample bifurcations, and simultaneous measurement of the expansion history. Our baseline result from the Cepheid–SN Ia sample is H0 = 73.04 ± 1.04 km s−1 Mpc−1, which includes systematic uncertainties and lies near the median of all analysis variants. We demonstrate consistency with measures from HST of the TRGB between SN Ia hosts and NGC 4258, and include them simultaneously to yield 72.53 ± 0.99 km s−1 Mpc−1. The inclusion of high-redshift SNe Ia yields H0 = 73.30 ± 1.04 km s−1 Mpc−1 and q0 = −0.51 ± 0.024. We find a 5σ difference with the prediction of H0 from Planck cosmic microwave background observations under ΛCDM, with no indication that the discrepancy arises from measurement uncertainties or analysis variations considered to date. The source of this now long-standing discrepancy between direct and cosmological routes to determining H0 remains unknown.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

The present expansion rate of the universe, the Hubble constant (H0), sets its size and age scale, relating redshift (the direct consequence of expansion) to distance and time. The value of H0 may be determined locally with measurements of distances and redshifts, and it can also be predicted from a cosmological model calibrated in the early universe (i.e., pre-recombination at redshift z ≥ 1100) with measurements of the cosmic microwave background (CMB). The comparison of measured and predicted values of H0 thus provides a crucial "end-to-end" test of the widest available range of the validity of cosmological models, from early times when the universe is dense and dominated by dark matter and radiation, to the present when it is dilute and dominated by dark energy.

1.1. The SH0ES Program

The Hubble constant is the most accessible parameter in the cosmological model. It can be estimated with a wide range of approaches and accuracies from limited knowledge of many types of astronomical sources, nearly all of which have been utilized in this endeavor over the past century. There have been >1000 estimates published since 1980, with one-third of those in the last five years and 20% in the last two years—a recent quadrupling of the effort indicating the accelerating interest in H0 (Steer 2020). However, past discrepancies internal to the body of local measurements reveal that systematic errors can dominate determinations of H0, and that there is no reason to believe all efforts will regress to the mean or that a more accurate result can be derived from their median (Chen & Ratra 2011). Rather, to keep systematic uncertainties in check, it is necessary to pursue the most powerful, simplest, and most reliable tools, with strict attention paid to understanding, mitigating, and accounting for sources of measurement error.

Since the launch of the Hubble Space Telescope (HST) with a design goal of achieving a 10% determination of H0, the leading approach to measuring it in the local universe (as indicated by the observing time competitively awarded by the community) has relied on imaging of Cepheid variable stars in the host galaxies of recent, nearby Type Ia supernovae (SNe Ia). Cepheids have been favored as primary distance indicators because they are very luminous (MV ≈ −6 mag at P ≈ 30 days), are easy to identify thanks to their periodicity (Leavitt & Pickering 1912), obey a tight period–luminosity relation (PL, the "Leavitt Law") that yields extremely precise distances (3% per source; Riess et al. 2019a, hereafter R19), and have well-understood physics (Eddington 1917; Bono et al. 1999). Other primary distance indicators that have been measured in the hosts of SNe Ia to determine H0 include the tip of the red giant branch (TRGB; Freedman et al. 2019, hereafter F19) and Mira variable stars (Huang et al. 2020).

Cepheids also offer the most opportunities for obtaining strictly differential flux measurements—i.e., the use of the same facility to measure calibrator and source, a key requirement for eliminating zero-point errors. This is feasible through the use of HST to directly observe Cepheids in a large set of SN hosts and in geometric calibrators of Cepheid luminosities: the megamaser host NGC 4258, the Milky Way (hereafter MW) with plentiful parallaxes, and the Large Magellanic Cloud (hereafter LMC) via detached eclipsing binaries (DEBs). SNe Ia are favored to measure the Hubble expansion owing to their high precision (5% in distance per source), ubiquity, and deep reach, which reduces the impact of local flows.

The SH0ES program (Supernovae and H0 for the Equation of State of dark energy) began in 2005 with a proposal in HST Cycle 15 to break the degeneracy among cosmological parameters used to model CMB data and the equation-of-state parameter w = P/(ρ c2), where P is the pressure and ρ is the mass density of dark energy. Its stated ambitious goal, based on the recommendation by Hu (2005), was to eventually reach a percent-level measurement of H0, a goal approached if not fully reached in this work. This project was a "second-generation" effort to measure H0 with HST from a distance ladder of Cepheids and SNe Ia using the then recently installed ACS (and later WFC3) instruments, following successful efforts during the 1990s by the "first-generation" HST Key Project on the Extragalactic Distance Scale (Freedman et al. 2001) and the SNe Ia Luminosity Calibration Program (Sandage et al. 2006), both of which primarily used WFPC2. The former searched for Cepheids in the hosts of numerous secondary distance indicators (excluding SNe Ia) while the latter focused on SN Ia hosts. The types of targets suitable for Cepheid searches and the observing sequences for use with HST were developed and first implemented by these ground-breaking programs.

The precision of the first-generation programs was ultimately limited by the lack of a precise geometric calibration of Cepheid variables, by the limited practical range of WFPC2 to measure Cepheids in SN Ia hosts (distance D ≲ 20–25 Mpc), by the impact of reddening in the optical, and by the limited characterization of the Cepheid metallicity dependence at those wavelengths. The ability of SNe Ia to measure individual distances with ∼5%–10% precision and to sharply delineate the Hubble flow began with the use of light-curve versus luminosity relations (Phillips 1993), SN Ia colors (Riess et al. 1996; Tripp 1998; Phillips et al. 1999), and modern, digital samples (Hamuy et al. 1996; Riess et al. 1999).

Unfortunately, SNe Ia at D ≤ 20 Mpc are rare, occurring about once per decade, with most of the few objects in this range observed up to a century ago using photographic technology. Such observations lacked the photometric precision, well-characterized bandpasses, and accurate determinations of host-galaxy backgrounds, SN light-curve shapes, and SN colors to take advantage of the new standardization methods. The tendency for intrinsically brighter SNe Ia with broader light curves to occur in (late-type) Cepheid hosts would also bias H0 lower without light-curve standardization. A number of systematic differences in the first-generation calibration of SNe Ia by Sandage et al. (2006) were quantified by Riess et al. (2005, Table 16). These differences, totaling about 20%, arose from several effects which were amplified by small sample statistics: problematic SN Ia data such as photographic photometry, highly extinguished objects, and poorly sampled light curves; from photometric anomalies in WFPC2, such as the "long versus short" effect (Holtzman et al. 1995) and charge transfer efficiency (CTE; e.g., Whitmore et al. 1999); and from limited knowledge of the slope of the Cepheid PL relation. The present geometric calibration of the distance to the LMC by Pietrzyński et al. (2019) using DEBs is also 7% smaller than the value assumed by Sandage et al. (2006) to calibrate Cepheids.

The SH0ES program has been designed to improve upon past determinations of H0 by (1) extending the range of Cepheid observations with ACS and WFC3 to reach the hosts of a large sample of "ideal" SNe Ia, free from the preceding problems; (2) using near-infrared (NIR) observations of all Cepheids in SN Ia hosts with NICMOS and WFC3 to reduce the systematic uncertainty associated with the reddening laws for Cepheids and their hosts and the Cepheid metallicity dependence; and (3) calibrating Cepheids with new, geometric distances tied directly with HST to the Cepheids in SN Ia hosts to nullify zero-point uncertainties. "Ideal" or suitable SNe Ia for calibrating H0 (given limited HST time) were defined by Riess et al. (2005) to be (1) observed before maximum light, (2) through low interstellar extinction (AV < 0.5 mag), (3) with the same instruments and filters as the SNe Ia in the Hubble flow (at that time obtained by the Calán/Tololo and CfA surveys), and (4) to have typical light-curve shapes. 11 These characteristics are necessary to provide low dispersion in the Hubble flow, but they applied to only three Cepheid-calibrated SNe Ia from the first-generation projects (SNe 1981B, 1990N, and 1998aq).

Unlike the first-generation programs, which were granted long-term status and a large initial allocation of observing time with HST, the SH0ES program was proposed to the STScI time-allocation committee year by year. The cumulative result, after 15 yr (cycles), has been to collect Cepheid observations in 37 hosts of 42 SNe Ia and calibrate them geometrically to Cepheids in the MW, the LMC, and NGC 4258, with a total of 18 individual HST proposals utilizing >1000 orbits. The source of the distance-ladder data is shown in Figure 1, and the source and sequence of all observations in SN Ia hosts are shown in Figure 2 and listed in Table 1. Observing Cepheids in hosts at D ≈ 40–50 Mpc, double the range of first-generation observations and a factor of 8 increase in targets, was feasible owing to two features of WFC3: significantly better sampling of the point-spread function (PSF); with pixel sizes a factor of 2.5 smaller than the wide channel of WFPC2, which greatly reduced the impact of crowded backgrounds; and a white-light filter (F350LP) that, combined with the better sensitivity of WFC3, reduced the observing time required to identify Cepheids and measure their periods by a factor of ∼4. Calibrating those Cepheids differentially to bright Cepheids in the MW and LMC became feasible through the development of spatial scanning of HST and a rapid slew and guiding mode under gyroscopic control.

Figure 1. Refer to the following caption and surrounding text.

Figure 1. Sources of data for the distance ladder. The red block shows data from this work.

Standard image High-resolution image
Figure 2. Refer to the following caption and surrounding text.

Figure 2. HST observations of Cepheids in 37 hosts of 42 "ideal" SNe Ia and NGC 4258, collected over 20 yr with four cameras and >1000 orbits of HST time. In most cases, 60 to 90 day campaigns in F555W and F814W or in F350LP were used to identify Cepheids from their light curves with occasional observations years later to identify longer-period Cepheids. NIR follow-up observations in F160W are used to reduce the effects of host-galaxy extinction, sensitivity to metallicity, and breaks in the PL relation. Data sources: (1) HST Key Project, Freedman et al. (2001); (2) HST SN Ia Luminosity Calibration Program, Sandage et al. (2006); (3) Riess et al. (2005); (4) Macri et al. (2006); (5) Mager et al. (2013); (6) Yuan et al. (2022).

Standard image High-resolution image

Table 1. Cepheid Observations with HST

  Exposure Time [s]  
GalaxySN(e) IaWFC3AllNIRUT
  NIR a UVIS b opt. c Prop ID(s)Date d
M1012011fe4846377653072128802013-03-03
Mrk 13372006D138232599425994156402020-04-15
N01052007A132703430234302162692020-10-23
N06912005W96473141331413151452017-11-10
N09761999dq154823431234312162692020-11-21
N10152009ig143643933639336128802013-06-30
N13092002fk6991300208928211570,128802010-07-24
N13652012fr36173180091200128802013-08-06
N14482001el, 2021pit60351756217562128802013-09-15
N15592005df100582224522245151452017-09-09
N24422015F60352097620976136462016-01-21
N25252018gv98212117721177151452018-02-14
N26082001bg96472694226942151452018-02-04
N30211995al4426296208872211570,128802010-06-03
N31471997bq, 2008fv, 2021hpr144703742637426151452017-10-28
N32542019np84412210622106156402019-03-11
N33701994ae4376298208822211570,128802010-04-04
N34472012ht45291911419114128802013-12-15
N35832015so96472700127001151452018-03-06
N39722011by66351993219932136472015-04-19
N39821998aq40171400090840115702009-08-04
N40382007sr67942064085684115772010-01-22
N4258Anchor4023410120103690115702020-01-02
N44242012cg36231778217782128802014-01-08
N45361981B25642600095000115702010-07-19
N46391990N53791600077480115702009-08-07
N46801997bp96472521725217156402020-04-24
N54681999cp,2002cr144703656636566151452017-12-22
N55842007af49297494074940115702010-04-04
N56432013aa,2017cbv90522474124741151452018-01-16
N57282009Y138232611126111156402019-05-05
N58612017erp100582179821798151452018-01-13
N59172005cf72352346923469128802013-05-20
N72502013dy54351815818158128802013-11-08
N73292006bh90442466524665156402020-05-06
N75411998dh96472676626766151452017-09-21
N76782002dp120583206032060156402019-05-25
U93912003du137113933639336128802012-12-14

Notes.

a Obtained with WFC3/NIR and F160W; b Obtained with WFC3/UVIS and F555W, F814W, or F350LP used to find and measure the flux of Cepheids; c Includes time-series data from an earlier program and a different camera—see Figure 2; d Date of first WFC3/NIR observation.

Download table as:  ASCIITypeset image

The first SH0ES results (Riess et al. 2009, hereafter R09) were based on Cepheids observed in the hosts of six ideal SN Ia calibrators using ACS (optical) and NICMOS (NIR), and one geometric anchor (NGC 4258) with a maser-based distance of 3% precision (Humphreys et al. 2013) and a large sample of Cepheids (Macri et al. 2006). The result was a 5% measurement of H0, 74.2 ± 3.6 km s−1 Mpc−1, which, combined with the 5 yr WMAP results (Komatsu et al. 2009), yielded w = −1.12 ± 0.12 and was consistent with a value of H0 = 71.9 ± 2.6 determined from WMAP and ΛCDM alone. 12 The second iteration (Riess et al. 2011, hereafter R11) increased the calibrator sample to eight SNe Ia, observed and measured all Cepheids with WFC3 (both optical and NIR), and expanded the geometric calibration of Cepheids beyond NGC 4258 by including two additional independent anchors: the LMC, through various DEB-based distances (e.g., Pietrzyński et al. 2009), and the MW via parallaxes measured with the Fine Guidance Sensor on HST (Benedict et al. 2007). This resulted in H0 = 73.8 ± 2.4, which, coupled with the 7 yr WMAP results (Komatsu et al. 2011), yielded w =−1.08 ±0.10, or an estimate of the effective number of relativistic species of Neff = 4.2 ± 0.7. This result was closely matched by a recalibration of the final HST Key Project results using the same MW parallaxes, different Cepheid measurements, and an updated Hubble diagram of SNe Ia, which yielded H0 = 74.4 ± 2.2 (Freedman et al. 2012, hereafter F12). Indications of any tension between the early and late universe at that time were <2σ in significance.

1.2. The Hubble Tension

The first release of CMB data from the ESA Planck mission (Planck Collaboration et al. 2014) yielded H0 = 67.2 ± 1.2 in the context of ΛCDM, a then 2σ reduction relative to WMAP and a difference of 3σ from the results of R11 and F12. Reanalyses of the R11 data (Fiorentino et al. 2013; Efstathiou 2014; Zhang et al. 2017) produced essentially the same results as R11, with H0 ranging from 72.5 to 76.0. The third iteration of SH0ES (Riess et al. 2016, hereafter R16) more than doubled the calibrator sample to 19 SNe Ia, used refined distance estimates to NGC 4258 and the LMC (Pietrzyński et al. 2013), and new Cepheid parallaxes measured by the SH0ES team using spatial scanning with WFC3 (Riess et al. 2014; Casertano et al. 2016) to reach a 2.4% determination of H0 = 73.2 ± 1.7, 3.4σ greater than the refined value from Planck+ΛCDM of 66.9 ± 0.6 (Planck Collaboration et al. 2016), a difference of ∼9% or 0.2 mag in units of 5 log H0. An extensive number of reanalyses of the R16 data with many variations were undertaken (Cardona et al. 2017; Feeney et al. 2019; Follin & Knox 2018; Bovy 2018; Burns et al. 2018; Dhawan et al. 2018; Avelino et al. 2019), resulting in H0 values ranging from 73 to 74 and uncertainties from 2% to 2.5%. These analyses explored varying reddening laws, use of NIR SN data, use of alternative SN light-curve fitting, and hierarchical Bayesian statistics for data fitting, resulting in little change in H0 or its uncertainty. Javanmardi et al. (2021) selected the Cepheid host from R16 with the largest sample of Cepheids (NGC 5584, also near the median sample host distance) and remeasured these starting from the archived HST pixels and using different methods to photometry, finding agreement with the R16 measurement to 1% precision and ruling out a significant methodological error in these measurements.

Since then, additional sources of MW parallax calibration of Cepheid luminosities have come from further use of HST WFC3 spatial scanning (Riess et al. 2018b), from Gaia DR2 Cepheid parallaxes with HST photometry (Riess et al. 2018a), from Gaia DR2 Cepheid binary companions and cluster hosts (Breuval et al. 2020), and from Gaia EDR3 parallaxes coupled with additional HST photometry (Riess et al. 2021, hereafter R21). Likewise, HST observations of 70 long-period Cepheids in the LMC (Riess et al. 2019b) and improved distance estimates to the LMC (Pietrzyński et al. 2019) and NGC 4258 (Reid et al. 2019) resulted in H0 = 73.2 ± 1.3, raising the difference with Planck+ΛCDM to 4.2σ. Other precise measures of H0 in the local universe from the distance–redshift relation generally range from 70 to 75, and those grounded in the pre-recombination version of ΛCDM range from 67 to 68, and have been extensively reviewed (Verde et al. 2019; Di Valentino et al. 2021; Shah et al. 2021). Because the tension is seen between different routes, which are comparable only via an accurate cosmological model, numerous possible theoretical explanations for the emergent "Hubble tension" have been proposed but no consensus has yet emerged (Di Valentino et al. 2021). Indeed, theoretical priors weigh heavily on these proposals or whether it may be considered "extraordinary" for the ΛCDM model to fail or pass this cosmic test. The test itself, however, is empirical and few would conclude it has yet been satisfactorily passed.

1.3. This Work

In this publication, we more than double the sample of SN Ia calibrators from 19 in R16 to reach 42 objects in 37 hosts. This increase is a milestone for a sample whose size has limited the precision to which H0 can be locally measured. It provides the largest increase in size we can anticipate in the remaining lifetime of HST as it now includes all suitable SNe Ia (of which we are aware) observed between 1980 and 2021 at z < 0.011 and which slowly accrue at ∼1 yr−1. We have also reprocessed and reanalyzed the NIR observations of Cepheids in the previous 19 hosts reported by R16 for consistency with the new sample. We make use of an automated pipeline (W. Yuan et al. 2022, in preparation, hereafter Y22b) to find the Cepheids in 18 new hosts (and reanalyze past hosts), which follows the steps developed manually for the first 19 hosts (Hoffmann et al. 2016, hereafter H16). We benefit from a factor of 3 increase in the sample of Cepheids within NGC 4258 discovered by observing four new fields with HST, fully reanalyzed using the same pipeline by (Yuan et al. 2022, hereafter Y22a). Extensive details concerning the analysis of SN Ia data are given by Scolnic et al. (2021) and Brout et al. (2021).

We present the formalism for measuring H0 from the distance ladder in Section 2; new Cepheid data in Section 3 and in Y22a+Y22b; ancillary data used to measure H0 in Section 4; our baseline, local determination of H0 and a simultaneous measurement of H0 and the expansion history with high-redshift SNe Ia in Section 5; extensions to the baseline and variants of the local measurement of H0 in Section 6; discussion in Section 7; and conclusions in Section 8. Appendices provide further details on characterizing the spectral and photometric properties of the 42 calibrator SNe Ia (A), independent tests of the accuracy of Cepheid photometry (B), the Cepheid metallicity scale (C), and alternative applications of "Wesenheit" magnitudes (D).

2. Measuring the Hubble Constant

2.1. Distance-ladder Formalism

The Hubble constant is the present relation between redshift and distance, cz = H0 D, measured at cosmological distances where expansion is the dominant source of redshift. Here it is measured via a three-step (or three-rung) distance ladder employing a single, simultaneous fit between (1) geometric distance measurements to standardized Cepheid variables, (2) standardized Cepheids and colocated SNe Ia in nearby galaxies, and (3) SNe Ia in the Hubble flow. The fit is accomplished simultaneously by optimizing a χ2 statistic to determine the most likely values of the parameters in the relevant relations. The data include measurements, their uncertainties, and their covariances as described in the next section. The parameters are the distances of all hosts and five additional parameters: the fiducial luminosity of SNe Ia and Cepheids, two parameters standardizing Cepheid luminosities (their dependence on period and metallicity), and H0. This parameterization of the distance ladder can be expressed as a simple system of linear equations in a compact set of matrices as given below, useful for transmission, and for which the maximum-likelihood solution is easily found. The distance-ladder data and scheme are displayed in approximate form in Figure 1.

To briefly summarize the relevant relations, the distance modulus of a source is ${\mu }_{0}={m}_{0}-{M}_{0}=5\,\mathrm{log}\,D+25$, with D the luminosity distance in Mpc, m the apparent magnitude (flux), M the absolute magnitude (luminosity), and the subscript 0 denoting a magnitude free of (or corrected for) intervening absorption by interstellar dust. The form of the dependence of Cepheid or SN Ia luminosity on observed characteristics (i.e., "standardization") has been well determined by prior work and is only briefly reviewed here. The dereddened Cepheid apparent magnitudes (also called "Wesenheit" magnitudes; Madore 1982) at mean phase will be described in Section 3.3 and are identified here as ${m}_{H}^{W}$ (in a specific host like NGC 4258, those Cepheids would have magnitudes given as ${m}_{H,N4258}^{W}$). For the jth such Cepheid magnitude in the ith host given the period Pi, j in days and metallicity [O/H]i,j relative to the Sun, we have

Equation (1)

where ${M}_{H,1}^{W}$ is the fiducial absolute magnitude (in the Wesenheit magnitude system of 3.4) of a Cepheid with $\mathrm{log}\,P=1$ (P = 10 days) and solar metallicity, and the parameters bW and ZW (sometimes called γ in the literature) define the empirical relation between Cepheid period, metallicity, and luminosity. The [O/H]i,j is inferred at its galactocentric radial position as described in Section 3.5.

Any number of Cepheid hosts may have an independent geometric distance that contributes to the calibration of the Cepheid luminosity; e.g., for variables observed in the maser host NGC 4258 we adopt a distance modulus μ0,N4258, the best estimate of the distance, with formal uncertainty σ(μ0,N4258) from Reid et al. (2019). In this case the individual host distance parameter μ0,i above is replaced with the external constraint, converting the apparent magnitudes to absolute,

Equation (2)

and introducing a new parameter ΔμN4258 as the difference from the measured and true distance with the additional, simultaneous constraint equation 0 = ΔμN4258 ± σ(μ0,N4258). This definition allows the simultaneous use of multiple geometric "anchors" to calibrate the distance ladder; as we later show, it also allows the use of additional distance indicators for the same anchor, such as the tip of the red giant branch (TRGB) or Mira variable stars, while keeping track of their mutual dependence on the same geometric constraint.

A set of hosts of both SNe Ia and Cepheids connects the two distance indicators. Thus, for an SN Ia in the ith Cepheid host,

Equation (3)

where ${m}_{B,i}^{0}$ is its maximum-light apparent magnitude that has been standardized (i.e., corrected for variations around the fiducial color, luminosity, and any host dependence; see Scolnic et al. 2021), ${M}_{B}^{0}$ is the fiducial SN Ia luminosity, and μ0,i is the same parameter as in Equation (1). For SNe Ia, unlike Cepheids, the convention for keeping track of covariance in the standardization, as described by Scolnic et al. (2021), is to employ a set of standardized ${m}_{B,i}^{0}$, their uncertainties, and the covariance between any pair. This is an equally mathematically valid approach as keeping track of covariance through the standardizing relation for Cepheids in Equation (1).

The ladder is completed with a set of SNe Ia that measure the expansion rate quantified as the intercept, aB , of the distance (or magnitude)–redshift relation. This is simply ${a}_{B}=\mathrm{log}\,{cz}-0.2{m}_{B}^{0}$ in the low-redshift limit (z ≈ 0) but given for an arbitrary expansion history and for z > 0 as

Equation (4)

measured from a set of SNe Ia ($z,{m}_{B}^{0}$), where z is the redshift due to expansion, q0 is the deceleration parameter, and j0 is the jerk (see Visser 2004 for definitions). The determination of H0 follows from

Equation (5)

If the set of standardized SN Ia magnitudes in the hosts of Cepheids, which serve to calibrate ${M}_{B}^{0}$ (hereafter "calibrators" or CC SNe Ia), and those in the Hubble flow used to measure aB (hereafter HF SNe Ia) have no common sources of uncertainty (i.e., no covariance), then Equations (3) and (4) and the ladder parameters they provide (${M}_{B}^{0}$ and aB ) can be determined independently. This was the approach taken by R16. However, an increasingly thorough quantification of systematic uncertainties in SN Ia measurements, and the standardization undertaken as common practice for the determination of w (Scolnic et al. 2018), has demonstrated nontrivial covariance of SN Ia data, quantified following the approach of Conley et al. (2011) and Dhawan et al. (2020). We therefore undertake the optimization of Equations (3) and (5) simultaneously.

It is useful to expand the same set of Equations (1)–(5) in the form of matrices that organize the data into a vector of magnitude measurements y, the covariance matrix of standard errors of the magnitude measurements C, the equation matrix L, and the vector of free parameters q (hence the model, Lq) as follows:

where ${\sigma }_{\mathrm{tot},1}^{2}$ representing the n × n covariance matrix for the Cepheids in the first host is expanded as

The number of Cepheid hosts is nh, the number of SNe Ia in these hosts is ncc, and the number of SNe Ia in the Hubble flow is nhf. Period uncertainties are comparatively negligible (Yuan et al. 2021). σHST and σGaia denote the uncertainties in ${M}_{H,1,\mathrm{Gaia}}^{W}$ and ${M}_{H,1,\mathrm{HST}}^{W}$ as derived from parallaxes, respectively, while σgrnd denotes the uncertainty in ground-based photometry. The term SNcov is the covariance between SNe, Zcov is the metallicity covariance given later in Equation (9), and Ci,j,k,bkgd is the background covariance given later in Equation (8). The χ2 statistic is given as

Equation (6)

and maximum-likelihood parameters are given as ${q}_{\mathrm{best}}\,={({L}^{T}{C}^{-1}L)}^{-1}{L}^{T}{C}^{-1}y$, while the standard errors and the covariance matrix of the parameters come from the matrix ${cq}={({L}^{T}{C}^{-1}L)}^{-1}$. The value of H0 is derived from the final entry of q, $5\,\mathrm{log}\,$ H0, and its error from the square root of the corner entry of cq. This provides a compact form for storing and transmitting the full data set used to determine H0, to edit or augment it, and to enable others to determine its value. The above formalism is the same as used by R16 with the additions of SN and Cepheid covariance. We will also derive the parameters independently of the analytical solution in Section 5.1 by sampling the χ2 statistic using a Markov Chain Monte Carlo (MCMC) approach to verify the analytical result with a different methodology.

3. Cepheid Observations in SN Ia Hosts and the Maser Host NGC 4258

3.1. Optical Cepheid Discovery

The SH0ES program has been selecting the SNe Ia that are most suitable for calibrating their fiducial luminosity (with selection criteria given in Section 1.1) through observations of Cepheids in their hosts. The results here include a complete sample of all such SNe Ia of which we are aware within z < 0.011 (40 objects), with the addition of two beyond this limit that are useful for testing the range of Cepheid distance measurements, for a total of 42 SN Ia calibrators.

Figure 2 and Table 1 show the sources of the HST data obtained for every SN Ia and host we measured, gathered from the indicated HST cameras, filters, and time periods. All of these publicly available data are readily obtained from the Mikulski Archive for Space Telescopes (MAST). The imaging data are used for both Cepheid discovery and their flux measurement. For the former, a campaign using a filter with a central wavelength in the visual band and ∼11–12 epochs with nonredundant spacings spanning ∼60–100 days is optimal to identify Cepheid variables by their unique light curves and large amplitudes (∼1 mag peak to trough) and accurately measure their periods (Madore & Freedman 1991; Saha et al. 1996; Stetson 1996). Image subtraction may find additional Cepheids (Bonanos et al. 2003), but these objects will be subject to greater photometric biases owing to blends which suppress their amplitudes and chances of discovery in time-series data (Ferrarese et al. 2000). The data were collected from ∼150 HST orbits with WFC3-NIR and ∼700 orbits obtained for the optical identification (350 from WFC3, 170 from ACS, and 180 from WFPC2), including ∼200 orbits from NICMOS superseded by WFC3-NIR, for a total of ∼1050 HST orbits. Additional observations of Cepheids in the MW and LMC anchors utilized ∼200 orbits or snapshots.

Earlier efforts contribute ∼200 orbits of imaging of these hosts: 35 orbits from the HST Key Project, 105 from the SNe Ia Luminosity Program, 36 from Mager et al. (2013), and 50 from Macri et al. (2006), with the remaining ∼800 from SH0ES.

The procedure for identifying Cepheids from time-series optical data in visual or white-light bands has been described extensively (Saha et al. 1996; Stetson 1996; Riess et al. 2005; Macri et al. 2006; Hoffmann et al. 2016); details of the procedures followed for this sample are presented by W. Yuan et al. (2022, in preparation). These procedures utilize the DAO suite of software tools (Stetson 1987, 1994) for crowded-field PSF photometry and are similar to those used previously by the SH0ES team (Hoffmann et al. 2016) and to a large extent by the first generation of HST-based H0 measurements. Past work has demonstrated that the use of different photometry algorithms yields a largely overlapping list of Cepheids with similar periods and photometry (Ferrarese et al. 1998).

The end result is a set of high-confidence Cepheids, which have passed the selection and quality cuts described by Y22b, with periods and mean colors (F555W – F814W) measured in the HST WFC3 photometric system. For each Cepheid, we estimate a precise position in the WFC3/NIR F160W images using a geometric transformation derived from the optical images using bright and isolated stars, with resulting mean position uncertainties for variables <0.03 pix. The positions of the Cepheids are indicated in Figure 3. Figure 4 shows color images of each Cepheid host.

Figure 3. Refer to the following caption and surrounding text.

Figure 3. Images of 18 newly observed hosts of 23 SNe Ia and NGC 4258. Each image is of the Cepheid host indicated. The magenta outline shows the 2farcm7 × 2farcm7 field of the WFC3/NIR observations. Red dots indicate the positions of the Cepheids. Compass indicates north (long axis) and east (short axis). The NGC 4258 image shows Cepheids from Macri et al. (2006) in white and new ones from Yuan et al. (2022) in red.

Standard image High-resolution image
Figure 4. Refer to the following caption and surrounding text.

Figure 4. Pseudocolor images of all Cepheid-bearing galaxies analyzed in this work. From top left, 37 hosts of 42 SNe Ia presented in the same order as Table 1. The last row includes our three anchors (NGC 4258, MW, and LMC) and two supporting galaxies (SMC and M31). Galaxies are presented at arbitrary plate scales, though in most cases the panels encompass the entire ACS or WFC3/UVIS field of view. Credits: SN hosts and NGC 4258—ESA Hubble site; MW, LMC, and SMC—ESA Gaia site; M31—STScI.

Standard image High-resolution image

We present composite light curves of all Cepheids with 10 < P < 80 days in Figure 5 based on ∼60,000 individual F555W or F350LP photometric measurements, which identify these as bona fide classical Cepheids with the characteristic sawtooth light-curve shape of fundamental-mode pulsators. There are more subtle Cepheid light-curve features that are not apparent in individual examples at these distances. However, we can leverage the statistical power of the sample to look for these features as a strong validation test of the universality of Cepheids, as presented below.

Figure 5. Refer to the following caption and surrounding text.

Figure 5. Composite visual (F555W) or white-light (F350LP) Cepheid light curves. Each HST Cepheid light curve with 10 < P < 80 days is plotted after subtracting the mean magnitude and determining the phase of the observation.

Standard image High-resolution image

3.2. Cepheid Validation: The Hertzsprung Progression

The shapes of Cepheid light curves change in subtle but characteristic ways (often referred to as the "Hertzsprung progression"; Hertzsprung 1926) as a function of period, visible only with high signal-to-noise-ratio (S/N) observations. Hertzsprung (1926) noted the characteristic "quick rise and slow decrease" for most periods but also a small range of periods (9–13 days) where the light curves are quite symmetric, and the presence of a "bump" or small local maximum whose phase occurs earlier with increasing period, appearing on the descending phase at P ≈ 6 days, merging with the main peak at P ≈ 10 days, and appearing on the rising phase at P ≈ 10–15 days before disappearing (variables where the bump is apparent are sometimes referred to as "bump" Cepheids). While the general fast rise and slow decline are characteristics of a star pulsating in the fundamental mode, the "bump" is understood through modeling to arise from a 2:1 resonance between the fundamental and second overtone (Cox 1993; Bono et al. 2002) and is a unique feature of classical Cepheids. Cepheids with P > 40 days occur at the high-mass end of the luminosity function and are thus rare, with the MW hosting only a few examples that show a broadening of the light curve and a decrease in amplitude.

By binning all of the extragalactic light curves for the 37 SN Ia hosts and NGC 4258 by period as shown in Figure 6, a striking display of this more subtle light-curve structure emerges. The resonance-induced bump is indicated on the descending phase at P = 5–9 days with the MW Cepheid DL Cas (P = 7.6 days) shown for comparison, the narrow-peaked symmetric curve at P = 9–14 days as in SS CMa (P = 12.4 days), and the resonance bump on the rising phase in the P = 14–19 days as in XZ Car (P = 16.6 days), before transitioning to sawtoothed curves in the next 3 bins (P = 19–35 days). Beyond this range, we see the gradual transition to flatter and more sinusoidal curves matching the MW sequence of SV Vul (P = 45 days), GY Sge (P = 52 days), and S Vul (P = 69 days).

Figure 6. Refer to the following caption and surrounding text.

Figure 6. Composite visual (F555W) or white-light (F350LP) Cepheid light curves binned by period (upper) and compared with individual MW Cepheids near the middle period of the bin. The "Hertzsprung progression" (relation between light-curve shape and period) is apparent, including subtle features like the progression in phase of a resonance "bump" between the second overtone and fundamental pulsation for P < 20 days. The red line is a cubic spline constrained by the averages of bins in phase.

Standard image High-resolution image

While our pipeline selection requires candidate Cepheids to have amplitudes in the characteristic range of 0.2–1 mag, we impose no requirement regarding the presence of these more subtle light-curve features, which are not detectable at the data quality typical of single extragalactic Cepheids (Hoffmann et al. 2016). The presence of these features in aggregate light curves at the appropriate periods offers an additional level of scrutiny and validation of these sources as classical Cepheids, as well as the means for a more detailed comparison to those in the MW. The impressive similarity of these light-curve sequences demonstrates the universality of the physics producing these pulsating standard candles and provides an important test of their consistency along the distance ladder. The appearance of the Hertzsprung progression among the extragalactic sample demonstrates these objects are bona fide Cepheid variables.

3.3. NIR Photometry and Validation in NGC 4258

Following the same procedures described by R16, we NIR photometry of the Cepheids using their positions derived and fixed from the higher-resolution optical data. The NIR images of all Cepheids were reprocessed for this analysis using revised calibrations by the STScI pipeline (including new flat fields and geometric-distortion tables), and we were able to predict the expected NIR positions of the Cepheids found in the optical data with enhanced precision compared to our previous reduction (Hoffmann et al. 2016). Additional sources found in the NIR images are fit simultaneously with the Cepheids using an empirical model of the PSF derived from the solar-analog star P330E. The background calibration 13 and photometric uncertainties are determined from the retrieval of 100 locally placed artificial stars per Cepheid as described by R11 and R16.

In Figure 7 we show the distributions of the artificial-star measurements divided by their standard deviation, which demonstrates that they are well approximated by Gaussian distributions in magnitude space around their mean out to 3σ, a consequence of the log-normal distribution of underlying surface brightness fluctuations (primarily red giants, with larger and brighter collections increasingly rare) in the NIR. The mean difference between the mean and median of the artificial-star distributions, which vanishes for a true Gaussian, is 0.03σ. Averaged across all SN hosts, the difference between the mean and median of the magnitude distribution drops to 0.01σ or 4 mmag, showing no apparent correlation between its third moment (skewness) and the host distance. The distribution for the geometric calibrator, NGC 4258, appears similar to that of the others, with a difference between the mean and median of 0.01 mag. These measurements justify the use of Gaussian statistics in magnitude space in the calculations to follow.

Figure 7. Refer to the following caption and surrounding text.

Figure 7. Distributions of recovered artificial stars in each host. For each real Cepheid, 100 artificial stars of the expected Cepheid magnitude (based on its period and the PL relation) are added to the host images within an annulus with an outer radius of 2farcs4 and measured following the same procedures as for the Cepheids to produce a distribution in magnitudes (log flux). The distributions for each host are combined after subtracting their mean and dividing by their standard deviation. They compare well to a Gaussian with a difference in median and mean (dashed and dotted lines) of ∼0.03σ.

Standard image High-resolution image

The accuracy of the background estimates owes to Cepheids being randomly superimposed on scenes, a consequence of our perspective whose local levels can be measured statistically. A caveat to this approach would be the presence of associated flux (colocated with the Cepheid), which becomes important only if it is then resolved for nearby Cepheids but not for distant ones. The level of such "associated flux" has been measured statistically from hundreds of Cepheids in M31 by Anderson & Riess (2018) to be ∼7 mmag at distances beyond a few Mpc, and is due to associated open clusters that would not be resolved at those distances. This term is explicitly included here in the background estimates in order to compensate for this effect. An additional and direct consequence of a potential miscalibration of the background, independent of the Cepheid mean flux, would be a change in the apparent light-curve amplitude. Riess et al. (2020) determined that the NIR amplitudes of Cepheids in three SN hosts are fully consistent with those in the Milky Way, yielding an independent upper limit of 0.03 mag for the possible misestimation of the background if representative. The sensitivity of H0 to the unresolved background can be further mitigated by the use of a distant anchor with similar background as the SN Ia hosts such as NGC 4258; this will be addressed in Section 4.

In Figure 8 we provide a strong test of the background estimation by comparing the Cepheid photometry in dense (inner) and sparse (outer) regions of NGC 4258. Because the Cepheids in both fields are at the same distance from us and the metallicity gradient in NGC 4258 is small (Bresolin et al. 2016), an apparent difference of the dereddened magnitudes would be a consequence of misestimating the background in the dense field. The difference in intercept (i.e., distance) is 0.01 mag and well within the indicated errors of the means, demonstrating that Cepheid PSF photometry is accurate in the presence of the same level of crowded backgrounds seen in the SN hosts.

Figure 8. Refer to the following caption and surrounding text.

Figure 8. Comparison of Cepheids measured in a dense (inner) field (in red) and sparse (outer) field (in blue) of NGC 4258. Because these Cepheids are at the same distance, the comparison shows the accuracy of the background estimates, which differ in the mean over the same sampled range, $0.7\lt \mathrm{log}(P/\mathrm{day})\lt 1.2$, by 0.45 mag (relative to the Cepheids) yet yield a consistent intercept with σ ≈ 0.05 mag. The difference in metallicity between the samples, Δ[O/H] = 0.08 dex, corresponds to a difference of 0.02 mag, smaller than the precision of this comparison.

Standard image High-resolution image

In Appendix B we provide an independent validation of the PSF photometry using aperture photometry, a method that is accurate when the background is measured from the mean of pixels in concentric annuli and uncomplicated to apply, albeit less precise than the standard approach of using PSFs to model photometry. This test validates the mean PSF photometry in SN hosts to σ ≈ 0.02 mag. In that appendix we further perform an additional "null test" of the background estimates by regressing them against the distance-ladder fit residuals, finding a dependence of 0.010 ± 0.014 mag per magnitude of source background (in the sense of overestimating the background but with no significance).

In Section 7.1 we review multiple tests of Cepheid PSF photometry in addition to six strong tests of background estimates in the presence of crowded backgrounds, all of which indicate that the Cepheid measurements are accurate.

3.4. Dereddened Magnitudes

The SH0ES program uses observations in HST filters at known Cepheid phases in optical (F555W, F814W) and NIR (F160W) bands to correct for the effects of interstellar dust and the finite width in temperature of the Cepheid instability strip. We employ NIR "Wesenheit" magnitudes (Madore 1982) to deredden Cepheids throughout, defined as

Equation (7)

where mH = F160W, mV = F555W, mI = F814W and VI =mV mI in the HST system, and RAH /(AV AI ). Wesenheit magnitudes are not conventional magnitudes, which compare the brightness of one star to another; rather, they are used to compare the brightness of one standardized candle to another through the removal of their unequal extinction, as reviewed in Appendix D. While the value of R obtained from well-characterized extinction laws for these bands is ∼0.4, we note that the correlation between Cepheid intrinsic color and luminosity at a fixed period has the same sense as extinction (cooler is fainter), and is similar in size with an intrinsic value of ∼0.6 (as discussed in Section 6.2). Therefore, the value of R derived for extinction effectively also reduces the intrinsic scatter caused by the breadth of the instability strip. We analyze the sensitivity of H0 to values of R derived from different extinction laws in Section 6.3. In Appendix D, we discuss pitfalls associated with varying R in Equation (7) between galaxies if the intrinsic color is not first subtracted from the observed color 14 (Follin & Knox 2018; Mortsell et al. 2021).

To avoid a magnitude bias, we include only Cepheids with periods above the completeness limit of detection in our primary fit for each host (Y22b). The measurements of Cepheids in SN Ia hosts are provided in Table 2, while Table 3 summarizes the properties of the resulting NIR PL relations. We identify a number of improvements realized here since our previous Cepheid measurements in SN hosts ∼6 yr ago (R16, H16) and for NGC 4258, ∼16 yr ago (Macri et al. 2006).

  • 1.  
    Sky annuli: The size of the annulus used to estimate the level of the sky value of the region around the Cepheid (after subtracting all detected sources) in F160W was reduced in size to inner and outer radii of 0farcs24 and 0farcs8 (from 0farcs96 and 1farcs44 in R16) based on extensive simulations. This determination of the sky level is more precise, although it requires taking into account the contribution to the sky from the wings of the PSF; the resulting offset of 0.008 mag is robustly determined from bright stars and corrected in the final photometry. The random sky error is propagated to the photometry error through the sampling of sky values in the artificial-star analyses).
  • 2.  
    Determination of the background and covariance: As in R16, artificial stars are added in the F160W images in the vicinity of each Cepheid at the apparent magnitude expected from its period and trial fits of the PL relation. The difference between the input and recovered magnitudes is used to refine the initial estimate of the sky background and revise the Cepheid photometry. This process is necessarily iterative as the PL relation used to predict the magnitude of a given Cepheid from its period is determined after correcting the photometry based on the retrieval of the artificial stars. Thanks to greater processing power, we increased the number of iterations since R16 and found that in some cases the iterations in our previous work were not adequate (as they had not fully converged). The new process fully converges with improved determination of the trial intercepts and slopes, and we use the final iteration to estimate a systematic uncertainty of 20% of the background (in units of the Cepheid magnitude correction) as the covariance "error floor" of any pair of Cepheids, j, k in the same ith host, given by
    Equation (8)
    This provides a systematic uncertainty in the range of 0.03–0.06 mag for all Cepheids within each host, with the term "bkgd" representing the change in Cepheid magnitude due to the addition of the mean level of the crowded background from unresolved sources derived from the artificial stars. The artificial-star magnitudes are determined from the trial P–L relations of each host independently from every other host so there is no source of background covariance between different hosts.
  • 3.  
    Reference Files: The photometry benefits from the latest STScI data pipelines, including new flat fields (with better "blob" mapping), bias frames, long-history dark frames, pixel-based CTE corrections for optical data, and geometric-distortion corrections, yielding improved alignment between the optical and NIR frames.
  • 4.  
    Count Rate Nonlinearity (CRNL): We adopt a new calibration of the WFC3/NIR CRNL (Riess et al. 2019b). By convention, this is applied to Cepheids in anchors between their flux and the background level of Cepheids in SN hosts.
  • 5.  
    M101 long-period Cepheids: A reexamination of the Cepheids in M101, together with simulations, revealed that the baseline of the original monitoring campaign (carried out in 2006 and reported by Mager et al. 2013) was too short to provide reliable periods for apparent Cepheids with P > 35 days (equivalent to 1.2× the time span of the observations). Two additional epochs obtained 7 yr later and separated by a week are insufficient to resolve the issue because at these periods the two epochs provide effectively only one phase measurement, and the prevailing period uncertainty of >0.5 days makes the phasing of the two sets unreliable. For that reason we exclude M101 Cepheids with P > 35 days (about 10% of the sample used by R16).
  • 6.  
    We correct Cepheid periods to rest-frame values owing to (1+z) time dilation (Anderson 2019), a small (given our typical z ≈ 0.005) but one-sided effect.

Table 2. WFC3/NIR Cepheids

Field α δ ID P Color a σcol F160W σtot [O/H]Note
 (J2000) [days][mag][dex]
M101210.9114854.357230546726.8530.910.1124.040.440.05HST
M101210.8846454.3389801108306.8731.050.1923.360.620.09HST

Note.

a F555W – F814W.

Only a portion of this table is shown here to demonstrate its form and content. A machine-readable version of the full table is available.

Download table as:  DataTypeset image

Table 3. Properties of NIR PL Relations

GalaxyNumberP〈[O/H]〉 a
 FoVmeas. b fit c [days][dex]
M10131126025915.80.10
Mrk 133721201552.9−0.18
N0105328841.5−0.13
N069131282846.50.09
N097657353340.20.02
N101526201852.5−0.03
N130957535354.1−0.08
N136566474529.0−0.14
N144890777335.2−0.11
N155913611011034.40.00
N244223817717736.00.00
N252585737340.40.10
N260825222245.40.11
N302126161632.20.06
N314729282752.30.17
N325454484841.5−0.18
N337082737342.5−0.12
N344711610210136.0−0.16
N358362545441.6−0.06
N397266545232.30.03
N398231272729.9−0.14
N403838292953.60.03
N44241710931.10.06
N453645414036.0−0.15
N463936303038.7−0.01
N468018111155.1−0.06
N5468118939354.9−0.10
N558419616716536.8−0.10
N564329425125131.80.13
N572825202044.30.15
N586160414143.80.06
N591717141437.9−0.30
N725030212137.3−0.28
N732938313154.60.17
N754150333349.1−0.12
N767821161642.80.02
U939136333339.6−0.22
SN Total26802173215036.5−0.01
N425855545144314.4−0.10
M31555519.1−0.11
LMC d 34233913.3−0.29
SMC14514313.3−0.72
Total All31653129

Notes.

a Solar value: 12 + log [O/H] = 8.69, Asplund et al. (2009). b Good-quality measurement, within the allowed color range, period above completeness limit. c After 3.3σ outlier rejection (1.2% of sample). d 69 of these are from HST and 270 from the ground.

Download table as:  ASCIITypeset image

We note that the Cepheid color measurements, VI, employed in Equation (7) to determine the baseline value of H0 here and in R16 (as well as most variants) are relatively insensitive to the previously noted improvements to the calibration of the Cepheid optical measurements realized in the last 6 or 16 years and included in Y22a and Y22b. These include the use of artificial stars to estimate the crowded backgrounds, revised flat fields, archival dark frames, updated geometric-distortion maps to rectify frames, and pixel-based CTE corrections, most of which were not implemented by H16 or by Macri et al. (2006), from which the N4258 data in H16 were derived. These improvements cancel to first order in the difference, VI, as do changes in CCD sensitivity, which can decline on orbit or change abruptly when the electronics are refurbished as occurred for ACS in 2009. However, the use of optical Wesenheit magnitudes to determine H0 without bias (as we explore in Section 6.13) requires fully calibrated optical magnitudes rather than only accurate optical colors (as in H16 and provided in R16). The fully calibrated optical magnitudes available in Y22a and Y22b are suitable for this purpose.

We do not attempt to quantify the impact of each of these improvements individually; however, in the aggregate, matching Cepheids within 1'' of those in R16, the net change to Cepheid photometry in F160W is that 63% (37%) of Cepheids are fainter (brighter). We provide additional details of this comparison in Appendix B.

3.5. Cepheid Metallicities

As in R16 and H16, we measured radial gradients of the strong-line abundance ratios (R23) of oxygen to hydrogen in H ii regions in the Cepheid hosts. The optical spectra were obtained with the Low-Resolution Imaging Spectrometer (LRIS; Oke et al. 1995)) on the Keck I 10 m telescope on Maunakea, Hawaii. See R16 and H16 for details regarding the observations and data reduction.

We define the metallicity of each Cepheid to be the value of this linear function at its galactocentric radius. We have revised the calibration of these strong-line abundance measurements relative to those from Zaritsky et al. (1994, hereafter Z94) used by R16, taking advantage of more recent calibrations between R23 and the metallicity 12 + log [O/H]. We adopt the average of nine recent literature calibrations, with the transformations between the Z94 system and the newer ones given by Teimoorinia et al. (2021). Further details about the metallicity measurements and their uncertainties are given in Appendix C.

We also use direct abundance measurements derived from high-resolution spectra of Cepheids in the MW, LMC, and Small Magellanic Cloud (SMC). In Appendix C, we evaluate the consistency of the direct and radial (strong-line) abundance measures by comparing H ii regions and Cepheid spectral abundances in the MW. While we use the average of nine strong-line abundance calibrations for our baseline results, we select the Pettini & Pagel (2004, hereafter PP04) calibration for comparison to the mean, as it is known to provide a good match to measured extragalactic stellar abundances (Bresolin et al. 2016, and F. Bresolin, 2021, private communication), and to estimate the uncertainty of the strong-line metallicity estimates. Specifically, we propagate a systematic uncertainty in the strong-line abundance scale as well as its covariance by including in our fit covariance matrix (given in Section 3) the product of the difference between the mean calibration and the PP04 calibration for the ith and jth Cepheids,

Equation (9)

This approach requires an iteration to use the same value of ZW in Equation (9) that is determined from the optimization of the global χ2. The mean difference of ∼0.05 dex between the average and PP04 scale represents a systematic uncertainty in the abundance scale, propagated here in the covariance matrix, which is consistent with the empirical assessment in Appendix C

4. Anchor Constraints and Ancillary Data

The strongest constraints on the Cepheid PL and P–L–C relations come, not surprisingly, from the nearest star-forming galaxies whose samples of Cepheids have better temporal sampling, higher resolution, wider wavelength coverage, and far greater S/N than we can expect to achieve from the distant Cepheids presented above that occupy the second rung of the distance ladder. An accurate and precise determination of H0 requires leveraging such data to empirically constrain Cepheid properties. Ignoring such data naturally reduces the precision in H0 and as a consequence the significance of any Hubble tension, but this is not a reasonable approach to determine the source of any tension. We adopt uncertainty estimates from the indicated, external sources as provided and without alteration, and will test their internal consistency within the distance ladder in the following sections.

Here we describe the data we use, in addition to those presented in Section 3. Some of these data, namely those described in Section 4.1 through 4.3, are anchor constraints—they provide direct information on the zero-point of the Cepheid PL relation. For this purpose, we require (a) that Cepheids be observed directly in our three-filter HST photometric system, most importantly in the NIR with the WFC3 F160W filter, and (b) that the distance determination, to individual Cepheids or to their host, be purely geometric. Additionally, we make use of several data sets, to which we refer as ancillary data, which do not directly constrain the zero-point of the PL relation but provide useful information on other characteristics of Cepheids; these include Cepheid measurements in nearby hosts with high-precision photometry (in similar filters but not in our standard HST system), Cepheids in hosts without a precision geometric distance, and information on SNe Ia in the Hubble flow.

4.1. Milky Way Cepheids

Trigonometric parallaxes to MW Cepheids offer a direct source of geometric calibration of their luminosities. We employ two samples with precise parallaxes and fluxes measured on the same HST system as the extragalactic Cepheids and with direct spectroscopic metallicity measurements. The first is a sample of eight MW Cepheids from Riess et al. (2018a) with parallaxes measured with HST WFC3 spatial scanning. The other contains 75 MW Cepheids with parallaxes from Gaia EDR3 as given by Riess et al. (2021). We do not use the sample from Benedict et al. (2007) that was part of R16 because their fluxes are too bright (0 < mH < 3 mag) to be measured directly with HST nor have they been measured with good accuracy from the ground; the parallax sample from Gaia EDR3 provides superior information on MW Cepheids.

Because the distance uncertainties are measured (and Gaussian) in parallax space jointly and are not insignificant (>5%), the direct transformation of parallax to distance modulus (i.e., distance to magnitudes) would yield a bias (often called the "Lutz–Kelker" bias, following Lutz & Kelker 1973). One can compensate for it by estimating approximate statistical corrections between the parallax and distance moduli for individual Cepheids, assuming the form of their spatial distribution in the MW using Bayesian inference (Bailer-Jones et al. 2021). However, it is simpler and more reliable to analyze the MW Cepheid data directly in parallax space, in order to retain the Gaussian parallax errors, and to derive their joint constraint on the Cepheid absolute-magnitude zero-point (${M}_{H,1}^{W}$) following Riess et al. (2021), or similarly through the use of "astrometric-based luminosity" (Arenou & Luri 1999).

Given that the constraints from the parallaxes across a Cepheid sample are related by the PL relation, the combined constraint is evaluated in parallax space and the resulting error in the mean is greatly reduced (for our two samples to 1% and 3%) so that the resulting mean constraint on the zero-point in magnitudes can then be approximated as Gaussian to better than 0.1%. We note that the 1% calibration from Gaia EDR3 includes marginalization over the Gaia parallax offset term as described in Riess et al. (2021) and that the measured σ = 6 μas uncertainty in the offset is the source of 0.9% uncertainty for the MW Cepheid sample due to its mean 650 μas parallax. These constraints are given in Table 4 for the PL relation determined here. The constraints are not identical to those provided in Riess et al. (2021) because they employ the same PL parameters, b and ZW , that optimizes the global χ2 for all Cepheids, not just MW Cepheids. Riess et al. (2021) showed that the global PL parameters, including slope and metallicity dependence, are good fits for the MW Cepheids on their own.

Table 4. Ancillary Cepheid Data

SampleReference N P〈[O/H]〉PhotometrySelectionNotes
   [days][dex]   
MW Gaia EDR3Riess et al. (2021)6612.50.13HSTmH , mV , mI see ref. ${M}_{H,1,\mathrm{Gaia}}^{W}=-5.903,$ σGaia = 0.024 a ,
+HST       ZW = −0.20 ± 0.12
MW WFC3 SSRiess et al. (2018a)822.60.05HST mH , mV , mI see ref. ${M}_{H,1,\mathrm{HST}}^{W}=-5.810,$ σHST = 0.054 a
LMC HSTRiess et al. (2019b)7016.0−0.29 b HST mH , mV , mI see ref. 
LMC groundMacri et al. (2015)27212.6−0.29 b ground mH , mV , mI P > 5 days mH transformed to 2MASS5
SMC groundKato et al. (2007)1459.9−0.70 b ground mH , mV , mI P > 5 days, mH transformed to 2MASS5
       r < 0.6° 
M31 SH0ESLi et al. (2021)5519.1−0.11HST mH , mV , mI P > 4 d 
M31 PHATKodric et al. (2018)46310.50.12HST mH , mJ P > 4 days mH , mJ transformed to mH , mV , mI

Notes.

a Measured following Riess et al. (2021) with global fit PL parameters. b From Romaniello et al. (2021) and Romaniello et al. (2008) and 〈[Fe/H]〉~〈[O/H]〉−0.06.

Download table as:  ASCIITypeset image

4.2. Cepheids in NGC 4258

The R16 analysis used 143 Cepheids observed with HST in the maser host NGC 4258 with a geometric distance from Reid et al. (2019). With new HST imaging campaigns in four fields shown in Figure 3, we have more than tripled that sample to 443 Cepheids; details of their identification are given by Yuan et al. (2022). These Cepheids are included in Table 2.

4.3. Cepheids in the LMC

We include in the joint constraint the sample of 70 Cepheids in the LMC measured on the same HST three-band photometric system given by R19. These have also been corrected with the best-fit planar geometry model of the LMC to be at a single distance as described by R19. The distance to the center of the LMC is given by Pietrzyński et al. (2019) from a sample of 20 DEBs as μ0 = 18.477 ± 0.0263 mag. Romaniello et al. (2021) has obtained spectra for 68 of these variables, demonstrating that they are consistent within the errors of having a single, common abundance of [Fe/H] = −0.40 dex. Sixty percent of the variables have sufficiently measured lines to further provide a mean spectroscopic abundance of [O/H] = −0.29 ± 0.02 dex, which we adopt for the full LMC sample. LMC Cepheids are corrected for the WFC3 CRNL across the 5 dex between the Cepheid flux and the SN-host background (Riess et al. 2019a). The LMC Cepheids from HST also set the intrinsic scatter in ${m}_{H}^{W}$ owing to the finite width of the instability strip to be 0.07 mag, which we include for all Cepheids.

The LMC also provides ancillary data, as defined above, that can be used independently of their inferred zero-point to refine the characterization of the PL relation. For this purpose we use the ground-based sample of 785 LMC Cepheids from Macri et al. (2015) with mH , mV , and mI photometry limited to 270 with P > 5 days. This photometry has been transformed to the HST system as described by Equations (10)–(12) in R16; however, we assign a common, systematic uncertainty of σgrnd = 0.10 mag to the transformed magnitudes, hence the simultaneous constraint takes the form 0 = Δzp ± σgrnd, where Δzp is a parameter describing the difference between the ground and HST zero-points to account for possible systematics associated with ground-based observations. Because the assigned systematic uncertainty is a factor of ∼10 larger than the mean of the smaller HST LMC sample (which has no such photometric system difference), the consequence is that the ground-based sample has negligible (100× less) weight in the tie to the LMC distance; therefore, the ground sample only helps constrain the slope of the PL relation, still an invaluable contribution.

4.4. Cepheids in the SMC

An important development in support of the distance ladder is the recent measurement of a geometric distance to the SMC from 10 DEBs (Graczyk et al. 2020) with a precision of better than 2%. While this precision and method would make the SMC suitable as an additional anchor, two issues present limitations. The first is the considerable line-of-sight depth of the SMC, which can cause a 10% dispersion in star distances across the full SMC structure and an offset between the DEBs and the Cepheids in the SMC. Here we follow the approach of Breuval et al. (2021) to make use of a sample of SMC Cepheids by (1) correcting for the depth by adopting the same geometry of the SMC as used by Graczyk et al. (2020) to characterize the DEBs and (2) limiting the Cepheids to the inner core of the SMC, a radius of 0fdg6. This combination yields a Cepheid sample that, based on the SMC geometric model, is offset in depth by 2 mmag from the mean DEB distance (with an uncertainty of a small fraction of this) and has a modeled dispersion in depth of σ = 0.024 mag, while still providing a sample of 145 Cepheids (with P > 5 days) that can yield valuable constraints on the Cepheid metallicity term. Earlier studies (Gieren et al. 2018) were unable to employ the DEB-based distance from Graczyk et al. (2020) and did not focus on the core of the SMC (Wielgorski et al. 2017).

The other limitation is the lack of HST photometry for SMC Cepheids. Elsewhere we have been able to negate zero-point uncertainties in the distance scale by using Cepheids observed with the same instruments. However, the SMC Cepheids can still provide a powerful constraint on the Cepheid metallicity term, which is well-constrained using only differential, cross-calibrated ground-based photometry and the differential DEB distance measurement between the SMC and LMC, the latter of which is known better than the simple difference in the individual DEB cloud distances.

As discussed by Graczyk et al. (2020), most of the uncertainty in the DEB distance estimates to the LMC and SMC is systematic and propagates from the uncertainty in the surface brightness versus color calibration of red giants, the zero-point of the V-band and K-band photometry, and the uncertainty in the extinction law. Because the aforementioned SMC and LMC DEB measurements utilized the same relations, observational setup, and reduction methodologies, their differential distance from DEBs as given by Graczyk et al. (2020) is far better constrained to 0.500 ± 0.017 mag, independent of the absolute calibrations and their uncertainties in the DEB method. The use of consistently calibrated LMC and SMC ground photometry and this differential distance, even without reference to the absolute DEB distances, helps constrain the metallicity term as we will show in Section 6.2. Here we use the 2MASS Point Source Catalog (Cutri et al. 2003) to produce a consistent calibration 15 of the H-band LMC Cepheid data from Macri et al. (2015) and the H-band SMC data from Kato et al. (2007), and mV and mI data from OGLE III (Soszynski et al. 2008), with data provided in Table 2. The precision of these transformations allows us to fully leverage the constraint on the distance difference from Graczyk et al. (2020).

We take the mean SMC Cepheid metallicity to be Δ[Fe/H] = −0.41 dex relative to the LMC or [O/H] = −0.70 dex (Romaniello et al. 2021, 2008) ([Fe/H] = −0.76). We add to this a systematic uncertainty (common covariance) in the mean LMC and SMC metallicity of 0.05 dex, as in Equation (9), each, separately, and relative to other hosts.

4.5. Cepheids in M31

In Li et al. (2021) we presented measurements of 55 Cepheids in M31 using the same three-filter HST system adopted elsewhere. Owing to their low dispersion and broad range in period, these Cepheids provide additional constraints on the slope of the PL relation, independent of any prior knowledge of the distance to M31; this is how we employ these data for our baseline analysis. Given the M31 metallicity gradient of −0.023 dex kpc−1 (Zurita & Bresolin 2012), these Cepheids span only a narrow range of abundances with a mean [O/H] = −0.1 dex and a standard deviation of 0.05 dex.

A much larger sample of M31 Cepheids is available from the HST PHAT Treasury program (Dalcanton et al. 2012), but the filters used to observe it do not correspond to the three used here, limiting its utility. The PHAT program observed these Cepheids with WFC3 F160W and a "wide-J" filter (F110W; Riess et al. 2012) defined a transformation to the ${m}_{H}^{W}$ system. In R16 we included measurements for 375 PHAT Cepheids before the availability of those from Li et al. (2021). An expanded compilation from Kodric et al. (2018) includes 522 Cepheids from the PHAT program with 3 < P < 78 days. We use the latter sample as an alternative to Li et al. (2021) in some variants of our baseline analysis in Section 6.5 because of its powerful leverage to examine evidence of a possible break in the PL relation near P ≈ 10 days.

4.6. Period–Luminosity Relations

Figure 9 shows the 40 individual Cepheid host ${m}_{H}^{W}$ PL relations (not including the MW as explained above) for a period range of 5 < P < 120 days fitted with a common slope. Before combining the results of many hosts in a global analysis, we examine in Figure 10 the independently fitted slopes of the ${m}_{H}^{W}$ PL relations across all Cepheid hosts.

Figure 9. Refer to the following caption and surrounding text.

Figure 9. HST NIR Wesenheit Cepheid PL relations. The Cepheid magnitudes are shown for 37 SN Ia hosts, M31, and 2 of the 3 possible distance-scale anchors (LMC and NGC 4258). The uniformity of the photometry and metallicity reduces systematic errors along the distance ladder. A single slope is shown and used for the baseline, but we also allow for a break (two slopes) as well as limited period ranges in some analysis variants.

Standard image High-resolution image
Figure 10. Refer to the following caption and surrounding text.

Figure 10. Independent determinations of the slope of the PL relation in each host plotted in order of precision. Black (red) points are after (before) outlier rejection. We find that the slopes are consistent with a single value of −3.27 to −3.30 as indicated by the blue line. The most metal poor (SMC) and metal rich (MW) also have consistent slopes. Error bars are for the fit error only and do not include uneven period sampling or background covariance. The large change in slope for Mrk 1337 was due to the relatively small sample size and rejection of a few Cepheids far from its center.

Standard image High-resolution image

The slopes are all consistent (at the ∼2σ level) with a mean in the range of −3.27 to −3.30 mag dex−1. The most tightly constrained slope comes from the LMC with −3.284 ±0.017 mag dex−1, mostly from the ground sample. Overall, we see no evidence to reject the null hypothesis of a single slope. There have been claims in the past of a break at P ≈ 10 days, which we will consider as a variant of the primary analysis in the next section. However, the mean slopes from these data below and above P = 10 days are −3.33 ± 0.02 and −3.21 ± 0.06 mag dex−1 (respectively), a difference of 1.8σ; individual hosts with the strongest constraint (LMC and the M31 PHAT sample) show slope changes at P = 10 days in opposite directions. Furthermore, formal uncertainties in these slopes are somewhat underestimated because of the uneven sampling of periods between hosts. A Monte Carlo analysis (bootstrap resampling with replacement) from hosts with the largest samples of Cepheids shows variations owing to uneven period sampling increases the formal slope uncertainty typically by 10% and up to 35% for the LMC. We find that the metallicity dependence has a negligible effect on the mean slope. Therefore, in the following we will consider a single slope for 5 < P < 120 days in our baseline analysis, but we analyze the impact of a break or limited period range on the determination of H0 as variants of the baseline analysis.

4.7. Geometric Distance Priors

We make use of the geometric distances to NGC 4258 from masers (Reid et al. 2019) of μ n4258 = 29.398 ± 0.032 mag, the LMC from DEBs (Pietrzyński et al. 2019) of μ lmc =18.477 ± 0.0263 mag, and the SMC from DEBs (Graczyk et al. 2020) in the form of Δμ lmcsmc = 0.500 ±0.017 mag, as well as the MW Gaia EDR3 and HST spatial-scan parallaxes discussed in Section 4.1. These are all given in Table 4.

4.8. SN Magnitudes

We adopt standardized SN Ia magnitudes ${m}_{B}^{0}$ from the Pantheon+ analysis (Scolnic et al. 2021; Brout et al. 2021), where the value ${m}_{B,i}^{0}$ is a measure of the maximum-light apparent B-band brightness of an SN Ia in the ith host at the time of B-band peak, corrected to the fiducial color and luminosity determined for each SN Ia from its multiband light curves and a light-curve-fitting algorithm. We use the uncertainties and covariance of ${m}_{B}^{0}$ as given by the Pantheon+ analysis. The SN Ia covariance matrix has substantial off-diagonal terms and is displayed in Figure 11. An improvement in the current analysis of calibrator SNe Ia over R16 is our use of multiple SN light-curve data sets for most calibrators, 77 sets in all for 42 SNe Ia, a mean of ∼2 independent sets per SN, reducing measurement errors (not intrinsic scatter, which is covariant among multiple samples of the same SN) by a mean factor of 1.4. These data sets are given in Scolnic et al. (2021).

Figure 11. Refer to the following caption and surrounding text.

Figure 11. Distance-ladder covariance matrix. Graphical representation of the covariance matrix of measurements with values applicable to different sources indicated. The covariance matrix contains past sources of systematic error that propagate here as error in the fit. Among Cepheid hosts, square blocks along the diagonal indicate the background covariance within a host (Equation (8)) while the covariance pattern between hosts farther off the diagonals comes from the covariance of the metallicity scale (Equation (9)). The SN–SN covariance is characterized in the Pantheon+ analysis (Brout et al. 2021; Scolnic et al. 2021).

Standard image High-resolution image

5. The Local Value of H0, the Baseline

Our calibrator sample contains 42 SNe Ia in the 37 Cepheid hosts presented in the previous section and 277 SNe Ia in the Hubble flow, all the objects at 0.0233 < z < 0.15 from the Pantheon+ sample that pass the same quality cuts and are in late-type hosts like the Cepheid calibrators. Criteria for inclusion in the sample used to measure the Hubble flow and variations are further considered in Section 6.7.

Our baseline (best) analysis includes the full array of constraints on our model and what we assess to be the optimal choices (considering a wide range of alternatives in Section 6) in the treatment of the available data. Its results, derived from the optimization of χ2 in Equation (6), are given in the first line of Table 5, which also provides the best-fit parameters. This fit gives a ${\chi }_{\mathrm{dof}}^{2}$ = 1.03 with N = 3445 degrees of freedom, with Cepheid slope, metallicity, and luminosity parameters bW =−3.299 ± 0.015 mag/dex, ZW = −0.217 ± 0.046 mag/dex, and ${M}_{H,1}^{W}$ = −5.894 ± 0.017 mag. These parameters are similar to those found by R16 and updated by R19, with small increases in the absolute values of the slope of the PL relation (from −3.26 ± 0.03 in R16) and the metallicity dependence (from −0.17 ± 0.06 in R19). Additional "nuisance" parameters in the baseline fit are ΔμN4258 = −0.013 ± 0.022 mag and ΔμLMC = 0.010 ± 0.019 mag. Table 6 provides individual host distances and SN parameters. Figure 12 displays the baseline data, fit, and residuals, while Figure 13 provides more details about the central panel of the preceding figure.

Figure 12. Refer to the following caption and surrounding text.

Figure 12. Complete distance ladder. The simultaneous agreement of distance pairs: geometric and Cepheid-based (lower left), Cepheid- and SN-based (middle), and SN- and redshift-based (top right) provides the measurement of H0. For each step, geometric or calibrated distances on the abscissa serve to calibrate a relative distance indicator on the ordinate through the determination of MB or H0. Results shown are an approximation to the global fit as discussed in the text. Red SN points are at 0.0233 < z < 0.15, with the lower-redshift bound producing the appearance of asymmetric residuals when plotted against distance.

Standard image High-resolution image
Figure 13. Refer to the following caption and surrounding text.

Figure 13. Comparison of SN and Cepheid distance measures on the second rung of the distance ladder, and residuals (middle). Because the residual panel includes SN measures on both axes, residuals are convolved by the SN error arrow as indicated. An alternate plot with Cepheids on the abscissa is shown in Figure 12. SNe that share a common host are identified with various colors. We find no significant trend (<1.5σ) in this space. Results shown are an approximation to the global fit as discussed in the text. The lower panel replaces the NIR-based Cepheid magnitudes with the optical only from Fit 60.

Standard image High-resolution image

Table 5. Fits for H0

FitVariant χ2 dof N H0 b γ ${M}_{W}^{0}$ ${M}_{B}^{0}$ ab
1Baseline1.03344573.04 1.01−3.299 0.015−0.217 0.046−5.894−19.2530.714158
Cepheid-clipping Variants: Section 6.1
2global σclip = 3.31.03344673.19 1.01−3.298 0.015−0.216 0.046−5.891−19.2490.714174
3individual P–L σclip = 3.30.99337073.25 1.02−3.296 0.015−0.201 0.045−5.893−19.2480.714315
4tight:one-by-one MAD σclip = 3.00.99342973.12 1.01−3.299 0.015−0.220 0.045−5.893−19.2510.714175
5tight:global σclip = 3.00.99343273.22 1.01−3.298 0.015−0.215 0.045−5.891−19.2480.714194
6loose:global σclip = 5.01.16347573.34 1.01−3.294 0.016−0.221 0.049−5.888−19.2440.714183
7loose:one-by-one MAD σclip = 5.01.15347473.42 1.01−3.295 0.016−0.222 0.048−5.888−19.2420.714178
8loose:individual P–L σclip = 5.01.05339773.35 1.01−3.296 0.015−0.202 0.046−5.892−19.2440.714257
9none1.23348173.41 1.01−3.290 0.016−0.206 0.050−5.885−19.2420.714248
Geometric Anchors Variants: Section 6.2
10N42581.06345472.51 1.54−3.294 0.015−0.204 0.051−5.905−19.2690.714174
11Milky Way1.03344673.02 1.19−3.298 0.015−0.208 0.051−5.895−19.2540.714179
12LMC1.03344673.59 1.36−3.298 0.015−0.208 0.051−5.878−19.2370.714178
13N4258+MW1.03344673.00 1.09−3.298 0.015−0.207 0.050−5.895−19.2540.714171
14N4258+LMC1.03344673.35 1.17−3.299 0.015−0.211 0.050−5.885−19.2440.714179
15MW+LMC1.03344673.25 1.05−3.298 0.015−0.216 0.046−5.889−19.2470.714169
Cepheid Dust-color Treatment Variants: Section 6.3
16Fitzpatrick 99 law RV = 2.51.03344673.24 1.00−3.291 0.015−0.209 0.046−5.850−19.2470.714171
17CCM law RV = 3.11.04344573.09 1.00−3.310 0.015−0.226 0.046−5.957−19.2520.714183
18Nataf law RV = 3.31.03344573.32 0.99−3.276 0.015−0.204 0.046−5.804−19.2450.714141
19 RW free global1.03344673.40 1.02−3.286 0.016−0.204 0.046−5.835−19.2420.714167
20intrin. col. subtr. F99 RV = 3.31.03344673.13 1.01−3.201 0.015−0.218 0.046−5.604−19.2500.714170
21intrin. col. subtr. F99 RV = free1.03344673.34 1.01−3.201 0.015−0.206 0.046−5.582−19.2440.714170
22intrin. col. subtr. RV (host mass-SFR)1.05344673.85 1.02−3.202 0.015−0.228 0.046−5.605−19.2290.714230
23None (AH values assumed to cancel)1.11343774.78 1.03−3.188 0.016−0.115 0.048−5.434−19.2010.714086
P–L Break/Span Variants: Section 6.4
24Break at P = 10 days1.03344472.68 1.04−0.10 0.05−0.222 0.046−5.887−19.2640.714151
25Only use P > 10 days1.06300473.15 1.11−3.337 0.023−0.169 0.051−5.861−19.2500.714158
26Only use P < 60 days0.93369973.99 1.04−3.261 0.011−0.236 0.044−5.898−19.2260.714276
M31 Cepheid Sample Variants: Section 6.5
27M31 PHAT sample1.02385473.21 1.00−3.297 0.013−0.234 0.044−5.893−19.2480.714184
28M31 PHAT sample +Break at P = 10 days1.02385272.74 1.03−0.07 0.04−0.240 0.044−5.891−19.2620.714156
Metallicity Variants: Section 6.6
29no metallicity dependence1.04344673.52 1.01−3.296 0.015...−5.860−19.2390.714194
30 PP04 metallicity scale1.04344672.84 1.00−3.297 0.015−0.166 0.042−5.883−19.2590.714162
TRGB Inclusion Variants: Section 6.7
31adds EDD TRGB+N4258tip1.03345772.76 0.95−3.301 0.015−0.197 0.045−5.891−19.2620.714253
32adds CCHP TRGB+N4258tip1.03345772.29 0.94−3.304 0.015−0.208 0.046−5.894−19.2760.714254
Hubble-flow Sample Variants: Section 6.8
33all host types 0.0233 < z < 0.151.03365273.32 0.99−3.298 0.015−0.216 0.046−5.891−19.2460.714479
34highz:all host types 0.0233 < z < 0.801.00448373.68 0.98−3.298 0.015−0.216 0.045−5.891−19.2440.716225
35skip local alltypes 0.06 < z < 0.151.04331873.35 1.06−3.298 0.015−0.217 0.046−5.891−19.2450.714311
36highz:skip local alltypes 0.06 < z < 0.81.00414973.90 1.01−3.298 0.015−0.217 0.045−5.891−19.2420.716991
37highmass:hubble flow host logmass >101.04330472.97 1.04−3.298 0.015−0.217 0.046−5.891−19.2510.713297
Calibrator Sample Variants: Section 6.9
38complete calibrator sample z < 0.0111.03344673.30 1.02−3.298 0.015−0.217 0.046−5.891−19.2450.714191
39complete sample z < 0.011+TRGB1.04345872.88 0.95−3.301 0.015−0.196 0.045−5.888−19.2580.714297
40highmass:calibrator host logmass >101.03344573.54 1.10−3.296 0.015−0.214 0.046−5.891−19.2380.714201
41use least-crowded half1.02344673.34 1.16−3.297 0.015−0.223 0.046−5.892−19.2460.714597
42use most-crowded half hosts1.02344573.35 1.37−3.293 0.015−0.215 0.046−5.891−19.2430.714063
43only 19 hosts from R16 1.02344673.47 1.17−3.297 0.015−0.229 0.046−5.893−19.2420.714505
44only hosts since R16 1.02344573.07 1.31−3.293 0.015−0.210 0.046−5.890−19.2520.714178
45closer half hosts mb < 131.03344673.07 1.16−3.295 0.015−0.219 0.046−5.891−19.2520.714191
Excluded SN Surveys Variants: Section 6.10
46No SDSS SNe1.03344672.90 1.02−3.298 0.015−0.217 0.046−5.891−19.2490.712428
47No CSP SNe1.02344573.43 1.06−3.298 0.015−0.217 0.046−5.891−19.2460.715130
48No literature SNe1.03344673.47 1.05−3.298 0.015−0.213 0.046−5.890−19.2400.714072
49No LOSS SNe1.02344773.26 1.04−3.297 0.015−0.213 0.046−5.891−19.2510.715022
50No Swift SNe1.03344573.09 1.02−3.297 0.015−0.217 0.046−5.891−19.2480.713551
51No CfA1/2 SNe1.03344673.03 1.03−3.298 0.015−0.218 0.046−5.891−19.2500.713464
52No CfA3/4 SNe1.03344773.31 1.02−3.298 0.015−0.214 0.046−5.891−19.2450.714257
53No foundation SNe1.01344673.46 1.03−3.298 0.015−0.217 0.045−5.891−19.2410.714196
54No pre-2000 SNe1.02344673.20 1.09−3.297 0.015−0.218 0.046−5.891−19.2450.713530
SN-fitting Variants: Section 6.11
55SN scatter monochromatic1.00344473.54 1.08−3.296 0.015−0.223 0.045−5.892−19.2380.714067
Peculiar-velocity Variants: Section 6.12
562MRS1.03344673.12 1.01−3.298 0.015−0.216 0.046−5.891−19.2490.713775
57CMB frame z1.04344672.70 1.00−3.298 0.015−0.216 0.046−5.891−19.2490.711260
58 q0 = −0.521.03344673.19 1.01−3.298 0.015−0.216 0.046−5.891−19.2490.714149
59 q0 = −0.52 all highz1.00448373.65 0.98−3.298 0.015−0.216 0.045−5.891−19.2440.715950
Optical Wesenheit Variants: Section 6.13
60optical Wesenheit clipping=one-by-one0.94362672.70 1.03−3.299 0.010−0.248 0.041−5.858−19.2640.714413
61optical Wesenheit clipping=global0.94362672.90 1.03−3.291 0.010−0.247 0.041−5.857−19.2590.714417
62optical Wesenheit F99 RV = 2.50.92361873.20 1.03−3.230 0.010−0.202 0.041−5.623−19.2490.714359
63optical Wesenheit CCM RV = 3.10.97362672.46 1.03−3.335 0.010−0.270 0.042−6.020−19.2720.714469
64optical Wesenheit N4258 only0.92362374.85 2.31−3.291 0.010−0.211 0.045−5.797−19.2010.714429
65optical Wesenheit MW only0.93362371.93 1.15−3.291 0.010−0.211 0.045−5.883−19.2880.714421
66optical Wesenheit LMC only0.92362374.26 1.39−3.291 0.010−0.211 0.045−5.814−19.2180.714416
67optical Wesenheit+TRGB0.94363872.15 0.94−3.301 0.010−0.239 0.041−5.858−19.2810.714385

Download table as:  ASCIITypeset images: 1 2

Table 6. Approximations for Distance Parameters

NHostSN ${m}_{B,i}^{0}$ σ μCeph a σ ${M}_{B,i}^{0}$ σ μhost b σ Rc
   (mag)
1M1012011fe9.78000.11529.1940.039−19.4140.12129.1780.0410.44
2M13372006D13.6550.10632.9220.124−19.2670.16332.9200.1230.43
3N01052007A15.2500.13334.5310.253−19.2810.28634.5270.2500.37
4N06912005W13.6020.13932.8470.109−19.2450.17732.8300.1090.42
5N09761999dq14.2500.10333.7190.151−19.4680.18333.7090.1490.30
6N10152009ig13.3500.09432.5700.075−19.2200.12032.5630.0740.46
7N13092002fk13.2090.08232.5460.060−19.3370.10232.5410.0590.38
8N13652012fr11.9000.09231.3790.057−19.4790.10831.3780.0560.46
9N14482001el12.2540.13631.2900.037−19.0360.14131.2870.0370.41
10N14482021pit11.7520.20031.2900.037−19.5380.20331.2870.0370.41
11N15592005df12.1410.08631.5010.062−19.3610.10631.4910.0610.35
12N24422015F12.2340.08231.4570.065−19.2230.10531.4500.0640.36
13N25252018gv12.7280.07432.0670.100−19.3390.12432.0510.0990.39
14N26082001bg13.4430.16632.6290.155−19.1860.22732.6120.1540.39
15N30211995al13.1140.11632.4750.160−19.3610.19832.4640.1580.34
16N31472021hpr13.8430.15933.0440.165−19.2010.22933.0140.1650.45
17N31471997bq13.8210.14133.0440.165−19.2230.21733.0140.1650.45
18N31472008fv13.9360.20033.0440.165−19.1080.25933.0140.1650.45
19N32542019np13.2010.07432.3320.077−19.1300.10732.3310.0760.52
20N33701994ae12.9370.08232.1230.052−19.1860.09732.1200.0510.33
21N34472012ht12.7360.08931.9390.035−19.2030.09631.9360.0340.40
22N3583ASASSN-15so13.5090.09332.8080.081−19.2990.12332.8040.0800.34
23N39722011by12.5480.09431.6440.090−19.0960.13031.6350.0890.46
24N39821998aq12.2520.07831.7240.072−19.4720.10631.7220.0710.36
25N40382007sr12.4090.10631.6150.117−19.2070.15831.6030.1160.57
26N44242012cg11.4870.19230.8560.130−19.3690.23230.8440.1280.87
27N45361981B11.5510.13330.8380.051−19.2870.14230.8350.0500.49
28N46391990N12.4540.12431.8180.085−19.3640.15031.8120.0840.48
29N46801997bp13.1730.20532.6060.208−19.4330.29232.5990.2050.48
30N54681999cp13.8800.08033.1200.075−19.2400.11033.1160.0740.39
31N54682002cr13.9930.07233.1200.075−19.1270.10433.1160.0740.39
32N55842007af12.8040.07931.7750.053−18.9710.09531.7720.0520.36
33N56432013aa11.2520.07930.5700.050−19.3180.09330.5460.0520.39
34N56432017cbv11.2080.07430.5700.050−19.3620.08930.5460.0520.39
35N57282009Y13.5140.11533.1170.206−19.6020.23633.0940.2050.58
36N58612017erp12.9450.10732.2320.101−19.2870.14732.2230.0990.33
37N59172005cf13.0790.09532.3620.122−19.2840.15432.3630.1200.47
38N72502013dy12.2830.17831.6280.126−19.3440.21831.6280.1250.49
39N73292006bh14.0300.07933.2740.116−19.2440.14033.2460.1170.39
40N75411998dh13.4180.12832.5040.120−19.0860.17632.5000.1190.30
41N76782002dp14.0900.09333.1960.155−19.1060.18133.1870.1530.39
42U93912003du13.5250.08432.8490.068−19.3240.10832.8480.0670.44

Notes. ${m}_{B,i}^{0}$ and error given is an approximation, the simple average from 1 or more SN surveys.

a Approximate, Cepheid-based distances derived without inclusion of the SNe in each host. b Cepheid-based distances derived without inclusion of any SNe in any host. c Empirical host reddening ratio derived from mass, SFR, using EDA, see Section 6.3.

Download table as:  ASCIITypeset image

The fiducial SN absolute-magnitude parameter applicable to the Pantheon+ standardization (Scolnic et al. 2021; Brout et al. 2021) is ${M}_{B}^{0}$ = −19.253 ± 0.027 mag. The value of the Hubble constant derived from the baseline fit is H0 = 73.04 ±1.01 km s−1 Mpc−1. Including an additional systematic uncertainty from the analysis variants as discussed in Section 6.14 yields

Equation (10)

The determination of H0 in the measured space is $5\,\mathrm{log}$ H0 = 9.318 ± 0.031 (with systematics), where the errors are symmetric in log space and slightly asymmetric for H0 with errors of (−0.98 and +1.03 for the fit). The full difference from the Planck+ΛCDM prediction of H0 = 67.4 ± 0.5 (Planck Collaboration et al. 2020) in units of ${\rm{\Delta }}5\,\mathrm{log}$ H0 is 0.176 ±0.035 mag (errors in quadrature), a difference of 5.0σ (one in 3.5 million).

The dispersion (weighted by the measurement errors) between the 42 calibrator SNe Ia and Cepheids is σ = 0.130 mag, which is equivalent to (albeit slightly lower than) the 0.135 mag dispersion between the SN magnitudes and redshifts of the Hubble-flow sample. We would expect these dispersions to be comparable (as they are) because the additional sources of scatter independent of SNe and applicable to the two comparisons (Cepheid distance errors and peculiar velocities) are both subdominant to SN scatter. From these dispersions and the good global χ2, we conclude that there is no unexplained variance in the baseline fit beyond the intrinsic scatter of SNe Ia and the intrinsic width of the Cepheid instability strip.

5.1. Markov Chain Monte Carlo Sampling

To check the preceding calculation using a different methodology, we performed Markov Chain Monte Carlo (MCMC) sampling. Our likelihood function is

Equation (11)

where the p(L, C, yq) is a Gaussian likelihood based on the covariance matrix C; if C is diagonal, ${C}_{{ij}}={\sigma }_{i}^{2}{\delta }_{{ij}}$, it has the form

Equation (12)

Because the covariance matrix C is independent of q in our setup, Equation (11) can be simplified to

Equation (13)

For the calculation of χ2, we followed the method described in Equation (6). For numerical accuracy, we substituted C−1 with an inverse of Cholesky-decomposed lower triangular matrix ${\tilde{C}}_{L}^{-1}$ (which satisfies ${\tilde{C}}_{L}{\tilde{C}}_{L}^{-1}=I$ and ${\tilde{C}}_{L}{\tilde{C}}_{L}^{T}=C$):

Equation (14)

As a "neutral" prior, we chose the uniform distribution

Equation (15)

which is centered at the results from the analytical result. Its width is set to be sufficiently (20×) larger than the estimated standard deviation from linear regression σ. We find that this range is more than enough to allow the determination of median and standard deviation of the resulting population from μ and σ.

We used the Python package emcee (Foreman-Mackey et al. 2013) to perform sampling under conditions mentioned above. The initial states are uniformly distributed in the prior range (Equation (14)) for a total of 100 walkers. The convergence is monitored using emcee's recommended method to estimate the autocorrelation time τ, which employs a method originally proposed by Goodman & Weare (2010). The burn-in time is set to be 5τ to allow chains to fully converge. 16

The convergence of the sampled distribution is checked visually and by the estimated autocorrelation time value. We required the total number of chains after burn-in N to satisfy N > 100τ before any visual inspection for convergence is performed. The samples used in the final analysis are contained well within the inner region of the initial states for all parameters, indicating that our choice of prior size did not affect the final result.

The results for selected parameters are shown in Figure 14. All parameters exhibit Gaussian-like posterior probability density functions (PDFs), which indicates that the results from the linear-regression method are a good representation of the results from this project. The locations of Gaussian-equivalent percentiles (i.e., the locations at which the same relative volume is met) in our sample suggest that the samples in H0 are more broadly distributed toward larger values (right-hand side; RHS) than smaller ones (left-hand side; LHS), a consequence of Gaussian errors in magnitudes of 5 log H0, with the mean distances between each line ${\bar{\sigma }}_{R}=1.06$ on the RHS and ${\bar{\sigma }}_{L}=0.98$ on the LHS (toward lower H0), similar to the analytical results.

Figure 14. Refer to the following caption and surrounding text.

Figure 14. MCMC sampling of the χ2 statistic of the global fit showing all free parameters except the individual host distances. Contours are 1σ, 2σ, and 3σ confidence regions. We find that the means and uncertainties agree very well with the analytical minimization of χ2.

Standard image High-resolution image

More than 120 million samples were used for the baseline fit to delineate the position of the 5σ confidence interval for H0, as shown in Figure 15. The calculated values of σ, whose mean is ${\bar{\sigma }}_{\mathrm{all}}=1.02$, contains the estimated uncertainty from the linear-regression method σreg = 1.01, hence showing a full consistency between two methods. Using these three values (${\bar{\sigma }}_{L}$, ${\bar{\sigma }}_{\mathrm{all}}$, and σreg), and regarding the 50th percentile value as the most probable value for the first two cases, our results are H0,MCMC−L = 73.04 ± 0.98, H0,MCMC−all = 73.42 ± 1.02, and H0,reg = 73.04 ± 1.01 km s−1 Mpc−1.

Figure 15. Refer to the following caption and surrounding text.

Figure 15. Extended MCMC sampling of the posterior for H0 to measure out to the ±5σ confidence level. The upper panel shows the probability density for the baseline from SH0ES and from the Planck Collaboration et al. (2020) chains. The bottom panel shows the log of the probability density to improve the ability to see the tails. We note some asymmetry to the distribution, with intervals on the low-H0 side a little smaller than that on the high side, as the measurements are Gaussian in magnitude and in 5 log H0, hence slightly skewed in H0.

Standard image High-resolution image

The significance of the discrepancy between our results and those of Planck Collaboration et al. (2020), H0,Planck =67.4 ±0.5, are 5.1σ, 4.9σ, and 5.0σ, respectively.

5.2. Simultaneous Constraints on H0 and the Expansion History

This section illustrates an approach to simultaneously measure H0 and the expansion history for an arbitrary form of H(z), using the full covariance of the data sets.

The "near-field" determination of H0 in the preceding section is quite insensitive to specific knowledge of the recent expansion history because the mean redshift at which the measurement is made, 〈z〉 = 0.055, is low. Yet to make the measurement more precise at even these small redshifts, in the prior section we accounted for the derivative of the expansion history in Equation (4) through the empirical derivative of H(z) measured from higher-redshift SNe Ia, q0. We set q0 = −0.55 as this is historically a good fit to high-redshift SNe Ia. 17 This empirical approach is fully independent of the CMB, so an independent comparison of H0 to the CMB with ΛCDM is appropriate.

However, to consider less-conventional expansion histories such as a rapid change in H(z) together with the measured value of H0, as may be undertaken in the effort to find a resolution of the Hubble tension, it might be necessary and it is certainly more reliable to explicitly account for the dependence of H0 on the form of the expansion history, H(z), at low redshifts (Camarena & Marra 2021; Efstathiou 2021). Furthermore, there is covariance in the measurements of SNe Ia, whether in Cepheid hosts or at moderate redshifts (Scolnic et al. 2018; Dhawan et al. 2020), and it is necessary to account for this when SNe Ia are used simultaneously to measure H0 and the recent expansion history.

It is important to recognize that standardized SN Ia data can provide only relative distance measurements between all SNe measured within the same standardization scheme, with SN parameters, uncertainties, and covariance with values relevant within the context of the algorithm used to standardize the SNe. Therefore, to avoid inconsistencies between SN standardization schemes or loss of knowledge of measurement covariance, one would ideally make full use of all relevant SN data simultaneously to determine absolute quantities. A straightforward and reliable path is to use the set of absolute distances to SN hosts (their uncertainties and covariance) derived from only the first two rungs without the use of any SN data, together with a consistently standardized set of SNe (in these hosts and in the Hubble flow), to determine H0 and H(z) simultaneously, the so-called "forward" direction. Or, one could alternatively follow an "inverse" approach, starting with CMB data and using the best-fit ΛCDM model to calibrate the distance–redshift relation of SNe including the predicted distances of nearby hosts of SNe Ia and Cepheids (or TRGB). 18 Here we follow the forward approach.

For the set of SN hosts with calibrated distances (using only the first two rungs of the distance ladder), the dereddened absolute distance modulus μ0 is given by

Equation (16)

with terms given in Equation (3). We then have a second set of SNe with cosmological redshifts z > 0,

Equation (17)

The two leading terms on the RHS of Equation (17) can be replaced with any empirical or cosmological model for H(z), such as the example of ΛCDM with the dark energy equation-of-state parameter w and mass density ΩM ,

Equation (18)

where

Equation (19)

The LHS of Equations (16) and (17) (or 16 and 18) are measured quantities, either μ0,host or mB , and the free parameters MB , H0, and q0 (or w and ΩM for Equation (18) instead of Equation (17)) are determined by simultaneously optimizing these.

Following this approach, the set of 37 values of μ0,host for 42 SNe Ia given in Table 6 (or 40 values for 46 SNe Ia including the TRGB data as in Section 6.7) and their covariance matrix, derived from the first two rungs (i.e., without the use of any SN data), are taken as input. The covariance matrix of μ0,host is shown in Figure 16. In addition, the values of mB for the SNe in these hosts and in the Hubble flow (including their redshifts) and the covariance matrix between all SN measurements are taken as additional data and the two constraining relations are solved simultaneously.

Figure 16. Refer to the following caption and surrounding text.

Figure 16. Host distance (μhost) covariance matrix. This is the covariance for the baseline set of 37 hosts derived only from the first two rungs and without the use of any SN Ia data. The nonzero off-diagonal terms result from common anchors, common parameters of the Cepheid PL relation, and covariance of the metallicity scale. The systematic uncertainty in all host distances may be measured from the square root of the level of the off-diagonal elements and is 0.019 mag, or 0.9% in distance.

Standard image High-resolution image

As an example, in Figure 17 we follow this approach, marginalizing over the SN standardization parameter, MB , and simultaneously measuring H0 and q0. As expected, H0 will be quite uncorrelated with q0 (or other cosmological parameters) unless a cosmological model produces a much more rapid change in H(z) at z ≪ 1 than the polynomial in Equation (17). For this case we add SNe from Pantheon+ at 0.15 < z < 0.8 to the prior analysis, finding q0 = −0.51 ±0.024 and H0 =73.30 ± 1.04 km s−1 Mpc−1. The result is very similar to our baseline result, and the added freedom in the expansion history has had little impact on the uncertainty. Brout et al. (2021) give results for a Friedman–Robertson–Walker (FRW) expansion history governed by w and ΩM . We provide the SN-independent host distances and their covariance matrix at pantheonplussh0es.github.io to allow for other forms of H(z).

Figure 17. Refer to the following caption and surrounding text.

Figure 17. Simultaneous measurement of H0 and expansion history. Here the expansion history is fit with a single free parameter, q0, though as discussed in the text, it can be fit with any form of H(z) using the set of μhost and a set of consistently standardized SNe Ia in these hosts and in the Hubble flow.

Standard image High-resolution image

6. Extensions or Variants of the Baseline Analysis

The baseline analysis was identified as providing the most accurate model of the data that is also the most economical in terms of the number of free parameters. Here we review 12 categories of alternatives or extensions to the baseline analysis, selected to explore the sensitivity of the results to additional considerations and systematic uncertainties. These are given as 67 fits listed in Table 5, shown graphically in Figure 18, and summarized in Section 6.14.

Figure 18. Refer to the following caption and surrounding text.

Figure 18. Display of 67 fits in 12 categories of alternatives or extensions to the baseline as shown in Table 5.

Standard image High-resolution image

6.1. Cepheid Clipping

The optical selection of our Cepheid sample is discussed by H16 and Y22b. As in R16, we include only Cepheids with colors F814W–F160W within 0.8 mag of the median color in each host to remove blends with unresolved sources of comparable luminosity and different color (e.g., red giants, blue supergiants, unresolved star clusters). This is a useful criterion as it is independent of distance and period, insensitive to reddening, and anchored to the physical properties of Cepheids (i.e., stars with spectral types F–K). We still may expect a small number of outliers owing to fully blended yellow supergiants, or the sample may include a small number of objects erroneously identified as Cepheids.

Our baseline analysis removes Cepheids that deviate from the global fit at >3.3σ (Chauvenet's criterion), iteratively discarding the single largest outlier (i.e., MAD algorithm) until none remain above the threshold. The fraction of such objects is 1.2% (these outliers are available upon reasonable request). Fits 2–9 explore other rejection approaches: global (i.e., all objects above the threshold are removed as opposed to the most deviant followed by recalculation of the fit), from the individual PL relations, and with tighter or looser thresholds (3σ or 5σ), as well as no rejection. The median H0 of these alternatives to the baseline clipping is larger by 0.2 km s−1 Mpc−1. Because the global outlier removal (Fit 2) is faster to calculate and gives results within 0.15 km s−1 Mpc−1 of the MAD baseline, we use this approach for most other fits unless we explicitly state the use of MAD. For the number of degrees of freedom here, fits with ${\chi }_{{dof}}^{2}\gt 1.07$ are considered not good (probability to exceed >3σ), which applies to Fits 6, 7, and 9 that remove few or no Cepheid outliers.

6.2. Geometric Anchors: Consistency with Metallicity

Fits 10–15 provide the results of including the geometric distance measurement(s) of only one or at most two anchors, rather than all three. The goal here is to explore the possibility of an unexpected error in one of the external geometric distance constraints. In Figure 19 we compare the geometric distance estimate to each anchor with the value modeled using only its Cepheids and the geometric distance of the other two anchors. As shown, the external measured distances are consistent with their Cepheids at −0.2σ for NGC 4258, +0.4σ for the LMC, and −0.5σ for the MW. Their internal consistency (despite having different mean abundances) is a consequence of the Cepheid metallicity dependence.

Figure 19. Refer to the following caption and surrounding text.

Figure 19. Comparison of each geometric anchor distance with its expected value based only on the Cepheids it hosts and the distance ladder calibrated by the geometric distance of the other two anchors. The green histograms are MCMC samples from Fits 13, 14, and 15 showing the expected distance of an anchor whose independent distance was excluded from the analysis. The independent measurement (blue curve) and prediction (green) are in good agreement in all cases to <1σ.

Standard image High-resolution image

The consistency of the anchors is most readily seen by comparing their intercepts and metallicity as shown in Figures 20 and 21. Here we determine the intercepts from the Cepheid data table for each host, NGC 4258, LMC, SMC, and the MW using their geometric distances for an absolute measure; see also Breuval et al. (2021) for a similar analysis. It is clear that the consistency of the anchors is a direct consequence of the Cepheid metallicity dependence, which has been greatly refined since R16. In our primary analysis we find a metallicity dependence on [O/H] of −0.217 ± 0.046 mag/dex. In R16 our primary result was γ = −0.13 ± 0.07 (with variants values ranging from −0.24 to −0.08) and in R19 it was –0.17 ± 0.06. The total χ2 for the three anchors in 21 is 0.18, which for a line fit leaves one remaining degree of freedom, and the likelihood to find agreement this good or better is 33% and thus nominal but not surprising.

Figure 20. Refer to the following caption and surrounding text.

Figure 20. Marginalized posterior covariance between H0 and the metallicity term, ZW , for the three two-anchor cases and the baseline fit. The metallicity term is well constrained, with a substantial tightening owing to the differential DEB distance between the two clouds (LMC and SMC) from Graczyk et al. (2020). The term has little correlation with H0 because the anchor abundances span the range in the SN hosts, but this term provides for the consistency between the anchors.

Standard image High-resolution image

A number of recent developments have tightened this constraint considerably while broadening its range. The DEB distance for the SMC from Graczyk et al. (2020) as discussed in Section 4.4 provides a differential measurement between the Cepheids in the LMC and SMC, which constrains the metallicity dependence. As shown in Figure 21, this constraint alone gives γ = −0.22 ± 0.05, similar to the values and uncertainties found by Breuval et al. (2021) for [Fe/H], and falling along the line that joins the other two anchors. It is one of the strongest constraints available because it comes from the difference in the DEB distances to each Cloud, a measure that has a small uncertainty owing to calibration cancellation and that does not depend on the comparison between the LMC and the other two anchors (Graczyk et al. 2020). In addition, the constraint internal to the MW Gaia EDR3 parallaxes alone (−0.22 ± 0.09) indicates a similar value. 19 The global fit also makes use of the internal metallicity gradients in the SN hosts to constrain ZW . Individually, these are not constraining, with a median uncertainty greater than 1 and a minimum of 0.3. However, combined, these are supportive of the results from the local galaxies albeit less constraining, yielding −0.13 ±0.11 mag dex−1 by combining their independent fits with uncertainties in both axes.

Figure 21. Refer to the following caption and surrounding text.

Figure 21. The metallicity term and consistency of the geometric anchors. The mean slope of four hosts with geometric distances is plotted against the intercept of the Wesenheit magnitude PL relation. The DEB distance difference between the SMC and LMC from Graczyk et al. (2020) as discussed is independent of the calibration uncertainties of the DEB method, making this link, the dashed line, robust and independent of the other anchors (a linear fit to the four points is very good, though not unexpectedly so with P = 10% to be better). The gray constraint on the MW comes from the breadth of metallicities and individual Gaia EDR3 parallaxes. The metallicities of the Cepheids in SN hosts span the range of the anchors, making the value of H0 insensitive to the value of the metallicity dependence, with a change in H0 of 0.2 units for a change in ZW of 0.1 mag dex−1.

Standard image High-resolution image

Abundance measurements for 68 of the 70 LMC Cepheids used here (Romaniello et al. 2021) show that they are consistent with a single value, and the lack of any measurable breadth in metallicities negates the ability to measure an abundance dependence internal to the LMC as claimed in prior analyses (Freedman & Madore 2011) and further discussed by Romaniello et al. (2021). Together, these developments provide a consistent result of a ∼−0.2 mag dex−1 metallicity dependence in the NIR that also provides accord among the anchors and is little changed by excluding any anchor as shown in Figure 20.

It is important to note that excluding knowledge of the geometric distance to an anchor (e.g., the DEB distance to the LMC) as we do in Fits 10–15 does not exclude the Cepheids in that anchor, which remain extremely valuable (at any distance) for constraining the global properties of Cepheids. Rather, when excluding knowledge of an anchor distance, we allow that distance to become a free parameter which may subsequently be compared to the external geometric estimate. In this case, a parameter such as the slope of the PL remains constrained by the excluded anchor Cepheids because these Cepheids are only consistent with a single distance to the host with an accurate value of their slope.

6.3. Variants with Color and the Reddening Ratio, R

Here we explore variants in how the Cepheid color, VI, is used in their distance determination. Our baseline analysis derives the Wesenheit parameter R (sometimes refereed to as RH to avoid confusion with the optical reddening law parameter, RV ) from the Fitzpatrick (1999) law with reddening parameter RV = 3.3, assuming MW-like reddening for the sample of late-type hosts and thus R = 0.386. Fits 16, 17, and 18 (respectively) change the reddening parameter to RV = 2.5 or adopt different laws—Cardelli et al. (1989, with RV = 3.1) or Nataf et al. (2016, appropriate for the inner halo). The relationship between R, RV , and the reddening laws is shown in the upper-left panel of Figure 22. Fit 19 allows the value of R for all hosts to be a common but free parameter. The value derived from MW Cepheids for R is 0.36 ± 0.04 (Riess et al. 2021) and for the full sample of Cepheids here is 0.34 ± 0.02. These fits yield a very similar value for H0; including R as a free parameter has little impact on the uncertainty.

Figure 22. Refer to the following caption and surrounding text.

Figure 22. Reddening ratios and impact on H0. The upper-left panel shows various reddening laws and their free parameter RV used to estimate the appropriate ratio RH for dereddening the Cepheids and the range over which this is applicable. The upper-right panel shows the empirical determination of RH using the MW Cepheids from R21 and individual estimates parameterized by host, type, mass, and SFR from SDSS galaxies in Hahn et al. (2022). The lower panel shows the color excess, E(VI), for Cepheid samples. The net change to H0 due to differential Cepheid reddening on the ladder is 0.08 RH (0.03) mag, with realistic differences in RH producing changes <0.01 mag or <0.5%.

Standard image High-resolution image

If one wishes to allow for differences in the reddening law or the value of R across different hosts, it is necessary to first subtract the intrinsic color of Cepheids using their empirical period–color (PC) relation in order to separate the component of the color that results from dust reddening; see Appendix D for details. Sandage et al. (2004) dereddened MW Cepheids to determine $\langle {(V-I)}_{0}\rangle =0.256\mathrm{log}P+0.497$. We used the LMC reddening maps from Skowron et al. (2021) to deredden the Cepheids in the catalog of Macri et al. (2015) to derive a PC relation of $\langle {(V-I)}_{0}\rangle =0.238\mathrm{log}P+0.513$, very similar to the MW one. From the same data we derived the intrinsic PC relation from the LMC Cepheids as above to be ΔmH =0.635(±0.021) Δ(VI).

We use a mean relation of $\langle {(V-I)}_{0}\rangle =0.25\mathrm{log}P+0.50$ to first subtract the intrinsic color from all Cepheids using their periods, and then in Fit 20 we substitute the Wesenheit magnitudes in Equation (7) for mH R E(VI) with R =0.386, which reduces H0 by 0.1 km s−1 Mpc−1. In Fit 21 we allow this reddening ratio to be a free parameter with little change from the result given in Fit 19.

We can also consider different values for the reddening ratio, R, for different hosts. One might derive these as the value that optimizes the relation between colors and magnitudes within each host. However, after subtracting the intrinsic PC relation, the small residual color span, coupled with relatively large color measurement uncertainties, does not provide any meaningful constraint on the individual values of R for a given host beyond the few nearest galaxies as demonstrated in Appendix D. In addition, Appendix D shows that determining an unbiased estimate of R requires accounting for uncertainties in both axes (in this case color and brightness with the statistical issue discussed by Tremaine et al. 2002), and failing to do so leads to large underestimates of R and its uncertainty, as seen (for example) in Perivolaropoulos & Skara (2021) and Mortsell et al. (2021). We find typical uncertainties in individual values of R to be ∼1 and thus uninformative. Likewise, Follin & Knox (2018) also concluded that such color data are "insufficient to make a completely data-driven inference on [individual] R," and they used a "wide prior" of 0.39 ± 0.1, finding distant hosts consistent with that prior and little impact on H0. It is also crucial to recognize that the empirical reddening law is only valid for use at RV > 2.5, as stated by Fitzpatrick et al. (2019), because sight lines with RV < 2 are not seen among the hundreds of massive MW stars used to determine the law as shown in Figure 22 (indeed, the Rayleigh-scattering limit for absorption corresponds to RV ≥ 1.5). Thus, there is no empirical support for R < 0.3 from these laws or from the stars (significantly, of the same B type that later become Cepheids) that inform them.

An alternative approach with better grounding is to derive individual values of R for each host based on its specific properties and the empirically determined correlation of these properties with dust attenuation. We use the empirical dust attenuation (EDA) framework from Hahn et al. (2022), which derives individual extinction laws for hosts as a function of their mass, type, and star formation rate (SFR) as determined to best match colors to SDSS galaxy observations. These individual estimates of R are given in Table 6 and shown in Figure 22, and have a mean of 0.42 and dispersion of 0.10, matching well the prior used by Follin & Knox (2018). The smallest value for any host is 0.30 for NGC 7541, demonstrating that values lower than this become unrealistic as discussed further in Appendix D. Fit 22 uses these modeled values of R from Hahn et al. (2022) and results in an increase in H0 of 0.8 km s−1 Mpc−1.

It is clear why varying the reddening ratio in hosts is not effective at changing H0. The mean value of E(VI) in the SN hosts is 0.35 mag, as shown in Figure 22. For the Cepheids in the anchors MW (Gaia EDR3), LMC, and NGC 4258, it is 0.58, 0.23, and 0.24 mag (respectively), for a mean anchor value (weighted by the precision of the anchor distance) of 0.43 mag. Thus, the net difference in E(VI) between anchors and SN hosts is 0.08 mag, and hence the full impact on H0 of correcting for Cepheid reddening is 0.03 mag or ∼1.5%, with perturbations to this procedure changing the result by a smaller amount. As shown in Figure 22, the strongest empirical evidence has the characteristic value of R less than 0.1 from the baseline; hence, e.g., ΔH0 = 0.08ΔR (mag) <0.01 mag. In Fit 23 we discard the use of color altogether, representing a reasonable assumption that the Cepheid extinction at 1.6 μm is modest and to first approximation cancels along the distance ladder, and find that H0 goes up by 2%.

6.4. Form of the P–L Relation

In Section 4.6 and Figure 10 we measure the slopes of the PL relations of each host as well as the mean slope above and below P = 10 days, finding no clear evidence of a break. It is important to note that we have not included Cepheids with P < 5 days (either in the anchors or SN hosts) in our analyses. Such Cepheids could provide additional support for a break; however, Cepheids with overtone pulsations become much more common below this period, obey a different PL relation, and may be confused with fundamental Cepheids.

However, in R16 the baseline was to allow for a break at P = 10 days, in accordance with prior claims of a break in the optical in the LMC (Sandage et al. 2004; although no evidence for one was found by R16). To allow for additional comparison to R16, in Fit 24 we allow for a break at P = 10 days with independent slopes above and below this pivot. The two slopes have a slope difference of 0.10 ± 0.05 as given in Table 5, which lowers H0 by 0.45 km s−1 Mpc−1, with the difference largely driven by the ground-based samples from the LMC and SMC, whose Cepheids at lower periods have a shallower slope by 0.12 ± 0.08 and 0.34 ± 0.20 (respectively). To further explore the evidence for a break, we expand the sample of Cepheids in M31 by including those from Kodric et al. (2018, sample III) in Fit 28. Because in M31 the opposite occurs (shorter-period Cepheids have a steeper slope by 0.20 ± 0.12), the inclusion of both samples reduces the difference in slope to 0.07 ± 0.04 and H0 is lower than the baseline by only 0.3 km s−1 Mpc−1. A free-form analysis of a slope change (allowing each host to have two individual slopes) reduces the sample evidence of a break to <1σ, with only three hosts (the LMC, SMC, and M31) providing any significant weight at P < 10 days (MW Cepheids with Gaia parallaxes provide no indication of a change in slope). The lack of significant evidence of a break (or change in slope) is why the baseline analysis uses the single-slope parameterization. Another option here is to exclude Cepheids with P < 10 days, which we do in Fit 25, resulting in an 0.1 unit increase in H0.

6.5. M31 Variants

Fits 27 and 28 exchange the Li et al. (2021) sample of M31 Cepheids for the PHAT sample from Kodric et al. (2018) with a 10 fold increase in the number of variables, but with filters that are transformed to the set of three used elsewhere rather than directly observed with them. The main value of this change is to gain further traction of a possible break at P = 10 days as discussed in the prior section.

6.6. Variants of Metallicity

In Fit 29 we ignore the metallicity term and find that it has little impact, with H0 rising by 0.3 km s−1 Mpc−1. The metallicity term has little impact on H0, as shown in the marginal confidence region of Figure 20. The reason can be seen in Figure 21, where the distribution of Cepheid metallicities in SN hosts matches well the range of the three anchors; thus, the metallicity term does not move the SN hosts with respect to the mean of the anchors. It does, however, provide for consistency of the Cepheid and anchor distances.

Appendix C gives further details about the Cepheid metallicity scale we adopt, the average of nine recent and well-characterized relations (Teimoorinia et al. 2021). In Fit 30, we replace the use of the mean metallicity scale with one of these systems (Pettini & Pagel 2004) 20 based on [O iii] and [N ii] lines, which we show in Appendix C provides consistent metallicities for MW H ii regions with those at the same radius derived from spectra of MW Cepheids.

6.7. Inclusion of TRGB

The TRGB offers an additional, independent rung between anchors and SN hosts (Freedman et al. 2019; Anand et al. 2021, and references therein). The inclusion of TRGB distances can in principle improve the calibration of SN Ia hosts and add to the number of such objects on the second rung. However, before including TRGB distances, it is important to determine if they are consistent with those employed here for Cepheids. 21 We use the TRGB samples measured by the CCHP (Freedman 2021, hereafter F21) and EDD (Anand et al. 2021) groups.

To determine the consistency of the two distance indicators, we calibrate both using the same geometric anchor, NGC 4258, whose Cepheids and TRGB were both observed with HST, making this a purely differential comparison to a set of the same SN hosts. In Figure 23 we compare the distances from each method for all seven SN Ia hosts with Cepheid and TRGB measurements, which are available from both CCHP and EDD. The mean difference between Cepheids and TRGB is −0.014 ± 0.030 and 0.002 ± 0.030 mag for EDD and CCHP, respectively (we have not included the four more-distant SN Ia hosts measured by Jang & Lee 2017 and included in F19 because Anand et al. 2021 could not identify any reliable TRGB for these, but we plot them in Figure 23), and conclude these are consistent (see Section 7.2 for further discussion). For consistency with the preceding Cepheid-only analyses, we do not include SNe Ia with TRGB measures that do not pass the SN quality cuts employed above (∣c∣ < 0.15 and ∣x1∣ < 2), which affects four objects (SNe 1989B and 1998bu with c = 0.3 corresponding to AV ≈ 1 mag, and SN 1981D with c = 0.2, also with high reddening; SN 2007on with x1 = −2.2). Ten TRGB distances for 11 SNe Ia are thus added: the 7 hosts with Cepheid distances (and 8 SNe) in Figure 23 (M101 and NGC 1365, 1448, 4038, 4424, 4536, and 5643) and 3 additional hosts without Cepheids with 4 additional SN calibrators (SNe 1980N and 2006dd in NGC 1316, SN 2011iv in NGC 1404, and SN 1994D in NGC 4526).

Figure 23. Refer to the following caption and surrounding text.

Figure 23. Cepheid distances from this work and TRGB distances from the CCHP (F21) and the EDD (Anand et al. 2021) groups to the same SN hosts calibrated by the same geometric anchor distance to NGC 4258, all measured with HST. The seven hosts indicated are all of those measured by these three groups. In blue we show four more-distant hosts measured by Jang & Lee (2017) for which evidence of a TRGB was not detected by Anand et al. (2021). The Cepheid and TRGB distances are consistent, allowing us to combine them (including for three additional TRGB hosts: NGC 1316, 1404, and 4526) simultaneously in Fits 31 and 32.

Standard image High-resolution image

As we did for the Cepheids, we use TRGB distances where the anchor and SN hosts made use of the same telescope and instrument to negate telescope zero-points. In this case we first use Table 2 from Anand et al. (2021), which includes the calibration based on new observations of NGC 4258 from our program (GO 16198) that also employ the same filters (F606W and F814W), ACS electronics, and similar level of CTE to the SN hosts. We do not include TRGB zero-points in the Magellanic Clouds or the MW because these have not been measured directly on the HST system (owing to the impracticable area). For the joint analysis we include an additional parameter for the TRGB luminosity, MI , which is optimized in the fit. We note that the TRGB constraint is included as available for an SN host simultaneously with the Cepheid constraint through the addition of the relation for the ith SN host,

Equation (20)

and the calibrating relation

Equation (21)

which adds a single free parameter, MI,TRGB.

Including TRGB (EDD), Fit 31 lowers H0 by 0.3 units to 72.76 ± 0.95 km s−1 Mpc−1 and reduces the overall error by 5%, yielding a value of MI,TRGB = −4.003 ± 0.025 mag, similar to that found by Anand et al. (2021). The TRGB SN host and Pantheon SN data yield 71.5 km s−1 Mpc−1 before the sample is tripled by combining with the SN-Cepheid hosts with the increase in H0 of ∼1 km s−1 Mpc−1, consistent with the shot noise of each subsample as discussed in detail in Section 7.2. We note that because the EDD TRGB parameterization includes a color dependence, the value of MI,TRGB quoted here corresponds to their fiducial, blue TRGB with VI = 1.23 mag. For Fit 32 we replace the EDD TRGB measurements with the CCHP set as given by F19 and F21 for the same SN hosts and by Jang et al. (2021) for NGC 4258 TRGB 22 and without a color dependence to match the CCHP implementation. This reduces H0 by 0.45 units from the EDD-based result (0.75 units below the baseline). The difference between the EDD- and CCHP-based result is a direct consequence of the 0.04 mag brighter measurement of the tip in NGC 4258 by Jang et al. (2021) compared to EDD, the significance of which is 1.9σ as further discussed in Section 7.2. The Jang et al. (2021) measurement in NGC 4258 in concert with the other constraints yields MI,TRGB = −4.025 ± 0.023 mag for the above parameter. (We note that a direct comparison of the MI,TRGB parameter to that observed for NGC 4258 requires adding to this ΔμN4258 in Equation (2), which for the baseline is −0.013 mag.) The mean result from the two TRGB implementations is 72.53 ±0.95 km s−1 Mpc−1, which we adopt as a reference value for TRGB inclusion.

6.8. Hubble-flow SN Sample Variants

Here we explore fits utilizing different selection criteria for inclusion in the Hubble-flow sample. The goal of the selection of SNe and their hosts for the Hubble-flow sample is to match as well as possible the same criteria used to collect the calibrator sample to guard against the presence and imbalance between samples of additional host or SN characteristics currently known or unknown that may correlate with Hubble residuals. The enhanced size of the calibrator sample, now with 42 SNe Ia, reduces the likelihood of a chance imbalance of such properties.

The host selection for the baseline Hubble-flow samples requires visual identification from the best available optical imaging to be a spiral type in the full range of Sa–Sd, making them likely hosts of massive star formation in the last 0.1 Gyr—the primary criterion for our targeting them for Cepheid observations. This excludes highly inclined hosts (>75°) as these are complex targets and were not considered for finding Cepheids nearby. The host requirements are in addition to the SN Ia quality cuts, which are relatively tight in order to match the calibrators (∣c∣ ≤ 0.15, ∣x1∣ < 2), with the first observation earlier than 5 days after maximum light, distance measurement error of ≤0.2 mag, with outliers >3.5σ from the Hubble flow removed. The redshift range, as in R16, is 0.0233 < z < 0.15. We refer to this sample as "LZSPI" (low-z, spiral). In Section 7.2 we will further analyze the host properties of these samples to confirm their balance in mass, SFR, and specific star formation rate. In Appendix A we compare the colors and light-curve shapes of the SN samples, confirming the similarity of the calibrator and selected Hubble-flow sample.

In Fit 33 we expand the Hubble-flow sample by removing any limitation on the host and including all SNe that pass the Pantheon+ quality cuts (which allow ∣c∣ ≤ 0.3 and ∣x1∣ < 3), nearly doubling the sample to 482 SNe. It is noteworthy that there is no change in H0, suggesting that after Pantheon+ standardization, SNe in different host types provide consistent distances. In Fit 34, we expand the sample to high redshift (z < 0.8) and more than 1300 SNe, which raises H0 by 0.5 km s−1 Mpc−1. The use of such high-redshift SNe requires knowledge of q0 as discussed in Sections 5.2 and 6.12. Fits 35 and 36 use these two expanded samples but exclude z < 0.06 to circumvent concerns about the presence of a hypothesized local void structure. These raise H0 by ∼0.3 km s−1 Mpc−1.

6.9. Calibrator SN Sample Variants

Here we explore variations of the calibrator sample. A complete, volume-limited sample is desirable, as it has the simplest and most unbiased selection function. Such a sample is limited by the volume z ≤ 0.011 of suitable calibrators between 1980 and 2021 and can be selected by excluding two SNe Ia from the baseline sample at higher redshifts (SNe 1999dq and 2007A), with results given in Fit 38. Fit 39 contains the same with the added TRGB distance measures. Fit 40 limits the hosts to only high-mass galaxies, $\mathrm{log}(M/{M}_{\odot })\gt 10$.

In Section 7 we discuss tests of the Cepheid background. Here we undertake another test of the background with Fits 41 and 42, which divide the calibrator sample into halves with lower-than and higher-than-median background. The difference in H0 is 0.0 km s−1 Mpc−1, which is less than the independent shot noise of each half (1.3% or 0.9 units for each), and thus there is no indication of a difference in H0 for high and low backgrounds. Fits 43 and 44 respectively use only the same 19 SNe from R16 or only the 23 SNe Ia added here, yielding a difference of 0.4 units. Fit 45 includes only the nearer half, defined from SNe as having mB < 13 mag (the median of the sample, corresponding to D < 28 Mpc), as nearer hosts offer greater spatial resolution, lowering H0 by 0.1 units.

6.10. Excluded SN Survey Variants

The SN standardized magnitudes have been drawn from the Pantheon+ sample, which is based on more than a dozen past SN surveys. The Pantheon+ sample recalibrates each survey photometrically to a common reference using the standard-star measurements in the fields of each SN to negate the impact of survey calibration errors. In addition, most SNe observed in the Cepheid host sample are matched by SNe in the Hubble-flow sample observed by the same SN survey with no one survey having a dominant share. As demonstrated by Brownsberger et al. (2021), "gray" photometric survey errors strongly and beneficially cancel by populating both samples with SNe from the same surveys. Brownsberger et al. (2021) find that survey miscalibration and incomplete cancellation would affect the H0 measured from the survey mix used here at σ = 0.15 km s−1 Mpc−1 even for extremely large survey zero-point errors of ∼0.1 mag, and more likely 0.06 units for realistic survey calibration errors of 0.025 mag.

To further explore the sensitivity of H0 to survey errors, Fits 46–54 present the results excluding each of the major surveys contributing to the sample. While none of these exclusions change the baseline H0 by more than ∼0.3 units, we note that excluding the CSP sample raises H0 by 0.3 units, with most of this (0.2) seen in the change in the Hubble intercept (aB ). Using a Hubble-flow sample exclusively from the CSP (while using a mix of surveys for the calibrator sample) yields a lower value of H0 by 0.5 units. While this is consistent with the sample shot noise, it will reduce H0 for the CCHP results, which use only the CSP sample for the Hubble flow. Brownsberger et al. (2021) find that this sample asymmetry is expected to produce errors of 0.8 units for realistic calibration errors of 0.025 mag; it is discussed further in Section 7.2.

6.11. SN-fitting Variants

Fit 55 changes the way the intrinsic scatter of SN colors is modeled from an empirical approach that includes both a component intrinsic to SNe Ia and another due to host dust as given by Brout & Scolnic (2021) and is further described in the Pantheon+ sample (Brout et al. 2021; Scolnic et al. 2021). It is a simpler description, where the intrinsic scatter of SNe Ia is monochromatic (i.e., dispersion only in the luminosity, not the color) from Guy et al. (2010) and used in the JLA analysis (Betoule et al. 2014) and in the first Pantheon compilation (Scolnic et al. 2018). This fit raises H0 by 0.3 units. The calibrator sample has no preference for either method, yielding similar dispersion between SN and Cepheid distances.

6.12. Velocity Variants

Here we provide variations related to values of the redshifts as implemented in Equation (4), which benefit from the combined and improved values from Pantheon+ (Carr et al. 2021). Peterson et al. (2021) provide a comprehensive overview and comparative analyses of various predictions of empirical cosmic flows (or peculiar velocities). They found important improvements to the Hubble-flow residuals by (1) replacing SN-host redshifts with their host-galaxy group redshift (when available), and (2) using local density maps to account for motions induced by local gravity. The latter are provided by constrained realizations of the peculiar-velocity field by Carrick et al. (2015), Said et al. (2020), and Lilow & Nusser (2021) based on 2M++ (Lavaux & Hudson 2011) and 2MRS (Huchra et al. 2012; Macri et al. 2019), respectively. The baseline included both the group-redshift replacement and the 2M++ corrections from Peterson et al. (2021). Fit 56 exchanges the 2M++ values for those from 2MRS, which provide a comparable improvement in residuals and lower H0 by 0.05 units. A noteworthy change is seen in Fit 57, which forgoes the flow corrections and leaves the redshifts in the CMB frame, reducing H0 by 0.5 units. However, as shown by Peterson et al. (2021) for 585 SNe Ia with z < 0.08, the tightening of the Hubble diagram (from σ = 0.17 mag to <0.15 mag, or a decrease in χ2 of 100) gives evidence in favor of these corrections which is too strong to ignore. Further, the increase in H0 that comes with the decrease in residuals runs counter to the hypothesis that the SN sample lives inside a large-scale void that artificially raises the local value of H0 (Kenworthy et al. 2019).

While peculiar flows cause a small perturbation in H0 measured from SNe Ia at 0.0233 < z < 0.15, they would produce a greater uncertainty if we forgo the use of SNe and measure H0 directly from only the first two rungs—that is, from the Cepheid-host redshifts (which are not used in the three-rung distance ladder). The baseline sample host redshifts have a mean of cz = 2000 km s−1, with many <1000 km s−1. Elsewhere (W. D. Kenworthy et al. 2022, in preparation), we present an analysis of H0 from this two-rung ladder, which importantly accounts for the spatial covariance of the local peculiar flows, largely limiting the available precision from this route to 3%–4% and demonstrating the value of SNe Ia for the third rung for measuring H0.

Fits 58 and 59 raise q0 from −0.55 to −0.52 (equivalent to raising ΩM in flat ΛCDM from 0.30 to 0.32) for either the local sample of spiral hosts or the sample with all hosts and for z < 0.8, with little impact on H0 relative to these samples at q0 = −0.55. In Section 5 we considered a free-form fit for H(z) using q0 as a free parameter simultaneous to the determination of H0.

6.13. Optical Wesenheit Variants

Fits 60–67 use an optical-only Wesenheit magnitude, substituting for Equation (7), mW I = mI R(VI), and thus discarding the NIR observations. The Fitzpatrick (1999) reddening law with RV = 3.3 yields R = 1.19 in the HST passband system (mV = F555W, mI = F814W). The optical Wesenheit has the advantage of lower "sky" backgrounds (and their fluctuations) but the disadvantage of higher reddening (and sensitivity to the form of the reddening law). The baseline fit with the optical Wesenheit yields 72.70 ± 1.07 km s−1 Mpc−1, similar to the baseline fit. However, the optical Wesenheit is somewhat noisier when compared to the SN distances with a relative dispersion of 0.16 mag (versus ∼0.13 mag with the NIR data). We also see larger variations in the anchors and the color variants as seen in Figure 18, with the scatter among optical-based variants that is three times greater than the NIR-based results and comparable to the statistical uncertainties. This illustrates the rationale by the SH0ES program for pursuing NIR observations for Cepheids.

Both of these differences are expected consequences of variations in the reddening law in the optical. For example, for hosts whose Cepheids have a mean E(VI) = 0.4 mag, a difference between a Fitzpatrick (1999) reddening law with RV = 2.5 and RV = 3.3 causes a difference in distance of only 0.01 mag for the NIR Wesenheit but 0.09 mag for the optical one, which can explain the aforementioned noise. In R16 we concluded that future improvements must rely on NIR data until additional studies of variations in reddening laws in the optical were available. The situation has not improved in that regard. While optical-only Wesenheit data have yielded similar values of H0 (Freedman et al. 2012; Riess et al. 2016), their larger systematic uncertainties make them unsuitable to pursue the percent-level determination of H0 we approach, and their further analysis is not pursued here. Nevertheless, none of the optical Wesenheit fits result in a noteworthy change to H0.

6.14. Summary of Variants and Error Budget

Our baseline determination of H0 lies 0.2 km s−1 Mpc−1 (20% of the uncertainty) below the median of all analysis variants, indicating that it is a good proxy for the set. In R16 we measured the dispersion of 23 variants and identified that as a systematic error. In this analysis we have moved previous sources of systematic uncertainty into the covariance matrix to include them formally, and thus most of the variants presented here were intended to gauge sensitivities in the analysis (e.g., excluding a data source) rather than true uncertainties. Nevertheless, we measure the dispersion of the NIR variants (see Figure 18) around a 3σ-clipped mean to be 0.3 km s−1 Mpc−1 and conservatively add this in quadrature as characterizing additional systematic uncertainties to yield a full uncertainty in H0 of 1.04 km s−1 Mpc−1, or 1.4%.

None of the variants appear to offer a particularly promising route to solving the Hubble tension, with none shifting the value of H0 much below the full error interval. The lowest value of H0 comes from Fit 65, 71.93 ± 1.19 km s−1 Mpc−1, from discarding the NIR Cepheid data and two anchors (leaving only the Milky Way); the highest is from Fit 22, 74.85 ± 2.33 km s−1 Mpc−1 and comes from discarding the use of Cepheid colors to account for extinction. Both of these fits represent suboptimal accounting of dust.

Table 7 and Figure 24 present the full error budget derived (in approximate form) from the global fit and in comparison to prior analyses from the SH0ES team. The fractional reduction presented here is the largest seen since that between Riess et al. (2011) and Riess et al. (2016).

Figure 24. Refer to the following caption and surrounding text.

Figure 24. Comprehensive error budgets for six iterations of the SH0ES measurement of H0 including this one (2022) and the Key Project from 2001 (Freedman et al. 2001). The greatest improvement here is realized by the increase in the number of SN calibrators, to decrease all terms to <1%. The combined error is indicated by horizontal dashed lines.

Standard image High-resolution image

Table 7.  H0 Error Budgets (%), Terms Approximated from the Global Fit

TermDescriptionRiess+ (2016)Riess+ (2019)This work
  LMCMW4258LMCMW4258LMCMW4258
σμ,anchor Anchor distance2.12.12.61.21.52.61.21.0 a 1.5 b
σPL,anchor Mean of PL in anchor0.1...1.50.4...1.50.4...1.0
λ,1,2 zero-points, anchor-to-hosts1.41.40.00.10.70.00.10.1 a 0.0
σZ Cepheid metallicity, anchor-hosts0.80.20.20.90.20.20.50.150.15
 subtotal per anchor2.62.53.01.51.73.01.41.01.8
   $\underbrace{\,}$ $\underbrace{\,}$ $\underbrace{\,}$
All Anchor subtotal 1.6  1.0  0.7
${\sigma }_{\mathrm{PL}}/\sqrt{n}$ Mean of PL in SN Ia hosts 0.4  0.4  0.4
${\sigma }_{\mathrm{SN}}/\sqrt{n}$ Mean of SN Ia calibrators (# SN)1.3 (19)1.3 (19)0.9 (42−46)
σmz SN Ia mz relation 0.4  0.4  0.4 
σPL PL slope, Δ log P, anchor-hosts 0.6  0.3  0.3 
statistical error, ${\sigma }_{{{\rm{H}}}_{0}}$  2.2  1.8  1.3 
Analysis systematics c  0.8  0.6  0.3 
Total uncertainty on ${\sigma }_{{{\rm{H}}}_{0}}$ [%] 2.4  1.9  1.35 

Notes.

a Riess et al. (2021). b Reid et al. (2019). c Uncertainties labeled in past analyses as "systematics" related to the metallicity scale, Cepheid background/crowding corrections, and SN systematics are formally included here in the covariance matrix in Figure 11 and thus propagate there as part of the complete uncertainty. Following past work, we measure the remaining systematic errors as the standard deviation of analysis variants presented in each work as in the dispersion in Figure 18 and as discussed in Section 6.14. All terms here are approximations derived from the global fit.

Download table as:  ASCIITypeset image

7. Discussion

Here we address a number of considerations using the preceding fits.

7.1. Accuracy of Cepheid Photometry

The accuracy of Cepheid photometry is important to the measurement of H0. The dependence of photometry on calibration has been negated by the use of the same photometric system throughout the above measurements. Here we review a number of tests of the relative accuracy for extragalactic Cepheids.

  • 1.  
    Replication of PSF photometry by others: Measurements of NIR PSF Cepheid photometry using the same raw pixels but different software and methods have been directly compared to those used here by Javanmardi et al. (2021) in NGC 5584 and by Yuan et al. (2021) in NGC 4051, both finding good agreement with reported differences in distance moduli of 0.024 ± 0.046 and 0.00 ± 0.04 mag (respectively). 23 While this is short of a full replication of all hosts, it is sufficient to exclude a large methodological error in Cepheid photometry as a primary source of the ∼0.2 mag H0 tension.
  • 2.  
    Replication of PSF photometry with apertures: In Appendix B we use aperture photometry, an independent method that is simple, highly reproducible, and accurate, albeit less precise than the standard approach of using PSFs to model photometry in crowded fields. For this validation, we employ aperture photometry for which the background is measured from the mean pixel value (not the mode of the background pixels) in an annulus centered on the Cepheids. This approach does not depend on artificial-star measurements to determine the variable background because the pixels in the annulus include the mean source contribution to the background. We compare the photometry and find good agreement in their means with a difference (PSF minus aperture) of 0.008 ±0.010 mag for the Cepheids in SN Ia hosts and 0.002 ±0.030 mag for the Cepheids in NGC 4258.

There are a number of strong tests of the accuracy of the background estimates presented here.

  • 1.  
    The Cepheids in NGC 4258 have a similar mean level of crowded backgrounds as in the SN Ia hosts (see Figure B2), almost fully negating a systematic underestimate or overestimate of the background on the determination of H0 when NGC 4258 is the sole anchor. The crowded background is similar because although NGC 4258 is three to four times closer than the mean SN host, its Cepheids have been mined from fields that are closer to the dense center by a similar factor. This results in H0 = 72.51 ±1.54 km s−1 Mpc−1, similar to the baseline result.
  • 2.  
    In Figure 8, we compare the PL relations of Cepheids in the dense, inner (high background) and sparse, outer region (low background) of NGC 4258, finding a negligible difference of 0.01 mag.
  • 3.  
    In Appendix B we regress the background with the distance-ladder fit residuals and find a dependence of 0.010 ± 0.014 mag per magnitude of source background (in the sense of overestimating the background and consistent with no misestimate). The background misestimate trends required to explain the tension are strongly excluded as shown in Figure B2.
  • 4.  
    The optical background is nearly an order of magnitude smaller than that in the NIR owing to the higher resolution, smaller pixels, and lower flux from red giants. The baseline results are consistent with those from the optical Wesenheit, which do not include the NIR data, H0 = 72.70 ± 1.30 km s−1 Mpc−1.
  • 5.  
    There is no significant trend with distance and distance difference between Cepheids and SNe Ia (zero at <1.5σ); see Figures 12 and 13.
  • 6.  
    Splitting the calibrator sample by background (Fits 41 and 42) yields no difference in H0. Splitting in distance (determined by the SN Ia, Fit 45) yields a difference of 0.1 units. Both are consistent with no trend based on the shot noise of half the sample (1.0 units).

Finally, external to this paper, an additional and unavoidable consequence of the miscalibration of the background, independent of Cepheid mean flux, would be a change in apparent light-curve amplitude. Riess et al. (2020) compared the NIR amplitudes of Cepheids in three SN hosts and in the MW, found them to be consistent, and provided a quantitative limit of any misestimate of background to be 0.03 mag.

7.2. Consistency of TRGB and Cepheid Distance Scales

In Figure 23 we presented a comparison of distances measured with Cepheids and TRGB to seven SN Ia hosts—the set that allows for a purely differential and direct comparison by employing the same geometric calibration source (NGC 4258), with data in both host and calibrator obtained with the same telescope (HST) and setup to negate zero-point and geometric calibration errors. This comparison further employed the two most widely used methods for measuring the TRGB, edge-detection (F19) and luminosity-function fit (EDD). These Cepheids and TRGB measures are consistent with each other, with a mean difference of −0.002 ± 0.03 mag (CCHP; Freedman 2021) and −0.014 ±0.03 mag (EDD; Anand et al. 2021). With no mean difference between methods as a starting point, we explore the broader question of the sources of the difference in the value of H0 from distance ladders measured through the use of either method.

In Table 8 we expand on the results from Anand et al. (2021), who compared the value of H0 from EDD, 71.5 km s−1 Mpc−1, and CCHP, 69.8 km s−1 Mpc−1, derived using the same images to measure the TRGB and the same geometric calibration source, NGC 4258. For consistency with the comparison presented by Anand et al. (2021) of EDD and CCHP, which was provided in units of magnitudes (for differences in 5 log ΔH0), we retain this unit below and in Table 8 in which a difference of 0.03 mag corresponds to a difference of ∼1.0 km s−1 Mpc−1.

Table 8. Sources of Differences in H0 between EDD, CCHP, and SH0ES (in Magnitudes)

TermΔF19ΔF21ΔEDD
EDD versus CCHP TRGB
1. Zero-point (NGC 4258)0.060.06
2. No TRGB detected by EDD (four hosts)0.000.00
3. NGC 14040.010.01
4. NGC 5643*0.01
5. ΔmTRGB*−0.03−0.02
TRGB subtotal0.050.05
SH0ES Cepheids versus EDD TRGB
6. +SN 2021pit−0.02−0.02−0.02
7. −SN 2007on−0.01−0.01−0.01
8. −3 red SNe Ia0.000.000.00
9. +33 SNe Ia in 30 hosts0.060.060.06
Calibrator set change subtotal0.030.030.03
10. LMC+MW anchors0.020.020.02
11. Flows0–0.020–0.020–0.02
12. Hubble-flow surveys0–0.020–0.020–0.02
Total∼0.10∼0.10∼0.05

Notes. "*" weighted by SN, not by host. ΔF19, ΔF21 = differences between Freedman et al. (2019) or Freedman (2021) and EDD, respectively. ΔEDD =differences between EDD and SH0ES. Descriptions of individual entries: (1) measured zero-point calibration of the TRGB in NGC 4258; (2) Anand et al. (2021) did not detect the TRGB in four SN-host galaxies; (3) inclusion of NGC 1404 (not directly measured by F19; EDD include SN 2007on which F19 included and F21 excluded); (4) inclusion of NGC 5643 (not available in F19); (5) mean difference in measured values for the TRGB of the remaining hosts; (6) SN 2021pit not available in EDD; (7) exclude SN 2007on in NGC 1404 owing to x1 < −2; (8) exclude 3 SNe with c > 0.15; (9) addition of 33 SNe Ia; (10) addition of LMC and MW anchors; (11) corrections for cosmic flows in Pantheon+ raises H0 by 0.02 mag relative to CSP SNe (see Peterson et al. 2021); (12) EDD used both Pantheon or CSP, the latter lowers H0 by 0.02 mag relative to Pantheon+ average of surveys. The uncertainties in all terms above are 0.01 mag.

Download table as:  ASCIITypeset image

The largest contribution (0.04 mag of the full 0.05 mag difference) arises from the difference in the calibration of the TRGB measured in NGC 4258 applicable to (i.e., in relation to) the mean SN host, where CCHP derived MI,TRGB = −4.05 mag for a blue TRGB and EDD derived MI,TRGB = −4.01 mag, color-corrected to the fiducial, blue TRGB with F606WF814W =1.2 mag. Although additional sources in the LMC and MW have been used to support the TRGB calibration by F21, we use NGC 4258 as the reference here because it is the only source where the TRGB calibration is available directly on the HST system measured in a manner consistent with the TRGB in SN hosts. 24 The difference in the EDD and CCHP measurement of the tip in NGC 4258 is persistent, having been found for two different fields of NGC 4258, being significant at the ∼2σ level and is readily evident in Figure 3 of Anand et al. (2021), where the apparent location of the TRGB edge of mF814W =25.372 ± 0.014 mag given by Jang et al. (2021) appears much brighter than the edge highlighted at 25.43 ± 0.025 mag. A comparison of each group's color–magnitude diagrams (i.e., photometry) might identify the cause of the difference, but these data are only available from Anand et al. (2021). Nevertheless, we offer the option for either TRGB result in Fits 31 and 32.

The second half of Table 8 provides the sources of difference between the EDD TRGB analysis (based on 16 SNe Ia) for which H0 = 71.5 km s−1 Mpc−1 and our baseline analysis with TRGB (based on 46 SNe Ia), starting with the same geometric calibrator (NGC 4258). The primary change is in the makeup of the Cepheid and TRGB study calibrator samples with 34 SNe Ia added here and 4 subtracted, a net change of 30 objects producing a net increase in H0 of 0.03 mag or 1 km s−1 Mpc−1. The first three rows break out changes related to calibrators available from both TRGB analyses, which together lower H0 by 0.03 mag. The first change is due to the addition of SN 2021pit, which recently appeared in NGC 1448 and is added here. SN 2007on has a light-curve shape (x1 = −2.2), which falls outside the quality range of ∣x1∣ < 2 imposed in our baseline analysis and is excluded. There are three SNe Ia (SNe 1981D, 1989B, and 1998bu) used by CCHP and EDD that are redder (c > 0.15 and AV > 0.5 mag) than our baseline quality range cut and have not been included in this or any past SH0ES analyses. The next row adds 33 SNe Ia in 30 Cepheid hosts, tripling the sample from the preceding line's 13 SNe Ia in 10 hosts (with EDD and CCHP TRGB measurements and not excluded by SN quality cuts) to 46 SNe Ia in 40 hosts (for Fit 30, which includes TRGB-only hosts). This step raises H0 by 0.06 mag or 2 km s−1 Mpc−1. This difference is fully seen internal to the set of Cepheid-only SN Ia calibrations (i.e., independent of absolute anchors or TRGB distances) in the baseline fit as shown in Figure 25, which compares the two sets of SNe as measured only by Cepheids. The 9 SNe in 7 EDD TRGB hosts are brighter than the 33 SNe Ia in 30 hosts without EDD and CCHP TRGB by 0.08 ± 0.05 mag as measured by Cepheids, which is consistent at 1.6σ with the sampling noise of the two sets (0.04 mag and 0.02 mag random error, respectively). As seen in Fits 38–45, this difference does not correlate with potentially relevant changes to the calibrator sample including splitting in distance, background level, or newness of the measurements, so we conclude that this difference of 1.6σ is a not-unexpected result of the increase in sample statistics.

Figure 25. Refer to the following caption and surrounding text.

Figure 25. Histogram of the 42 SN Ia absolute magnitudes of SNe Ia calibrated from Cepheids in 37 hosts. The 9 SNe Ia in 7 hosts for which a TRGB distance is available from both EDD and CCHP are shown in green (calibrated here with Cepheids), and the 33 SNe Ia in 30 hosts without a TRGB measure from both in blue (calibrated here with Cepheids). The difference in their means of 0.08 ± 0.05 mag is consistent with the shot noise of the SN samples as discussed in Section 7.2, and combined with other differences between the TRGB calibrator set (lines 6–8) produce a net difference in H0 of 1 km s−1 Mpc−1 (0.03 ± 0.05 mag) as shown in Table 8, which is less than 1σ from the shot noise of the two samples.

Standard image High-resolution image

Combining all of the additions and subtractions to the calibrator sample, the net change between the calibrator samples with both TRGB measures versus the baseline+TRGB here is 0.03 mag and the combined result of all membership changes is less than 1σ from the shot noise of the two samples. We note that if we use only the (average) TRGB results of the 13 good-quality SNe, discarding 33 good-quality SNe with only Cepheid measurements, we get 71.0 ± 2.5 km s−1 Mpc−1 as expected (near the mean of CCHP and EDD), but this uses less than a third of the SN sample and has a higher shot noise, while still consistent with the baseline.

As the sample size from both methods increases, we would expect them to regress to an increasingly similar mean. Including the two other anchors (MW and LMC) raises H0 by 0.02 mag (and results in MI,TRGB = −4.00 ± 0.024 mag), though each of the three anchors is consistent with the others at the 0.5σ level, as shown in Figure 19.

Finally, there are two developments based on recent analyses of SNe in the Hubble flow, which yield an increase in the value of H0 relative to the CCHP TRGB analysis in F19 and F21, independent of the calibrator sample. The CCHP measurement relies on a calibrator SN sample from several SN surveys, but selects a Hubble-flow SN sample from only the CSP SN survey. Regardless of whether the CSP survey is better calibrated than others in an absolute sense, Brownsberger et al. (2021) show that the use of similar surveys for both SN samples reduces errors in H0 arising from survey miscalibration to 0.1 km s−1 Mpc−1 owing to error cancellation, but to only 0.8 km s−1 Mpc−1 for the CCHP reliance on one survey compared to many. Scolnic et al. (2021) find that the CSP sample measures fainter compared to the mean of all other surveys by 0.025 mag, which matches what we find when we remove CSP as shown by Fit 47 in Table 5, where H0 then increases by 0.4 km s−1 Mpc−1. With only 5 of 19 calibrators in F19 observed with the CSP survey, the incomplete cancellation of this survey difference would be 0.6 km s−1 Mpc−1. In addition, the CCHP analysis of SNe Ia does not account for cosmic flows expected from local density maps, the best of which (2M++ or 2MRS) as shown by Peterson et al. (2021) reduces the Hubble diagram χ2 for ∼500 SNe Ia by >50 units, decreases the scatter, and raises H0 by ∼0.5 km s−1 Mpc−1. These two terms at the bottom of Table 8 should be included in a direct comparison between the results here and those from CCHP. The combination of both balancing SN survey errors in both samples and accounting for peculiar velocities would raise the CCHP value by 1.1 units and is hard to ignore. As expected, if we use only CSP for the Hubble flow and also do not account for cosmic flows, the baseline H0 goes down by 1.5 km s−1 Mpc−1.

To summarize, we find that TRGB and Cepheids give consistent distances when using the same anchors and consistent procedures, so that comparisons are meaningful. There are two main sources of the difference in H0 between the CCHP implementation in F21 and the baseline here. (1) As given by Anand et al. (2021), a (net) difference of 0.04 mag between the EDD and CCHP implementation of TRGB in F21 arises from the difference in the apparent location of the tip in NGC 4258 and is the source of the difference between H0 = 69.8 and 71.5 km s−1 Mpc−1 (i.e., TRGB-only results) in these two studies. (2) The other, corresponding to a net increase of 1 km s−1 Mpc−1 (0.03 mag) between the EDD TRGB and the baseline, is due to differences in the SN Ia calibrator samples including a reduction of 1 km s−1 Mpc−1 owing to our exclusion of 4 SNe Ia that fail quality cuts and an increase of 2 km s−1 Mpc−1 owing to the fainter mean seen for 30 SNe in 33 Cepheid-only hosts compared to 9 SNe in 7 TRGB+Cepheid hosts, a difference seen internal to Cepheid measurements and consistent with a statistical fluctuation due to the combined changes at the 1σ level (see Figure 25). We expect that additional TRGB data will result in these distributions agreeing, as we see no reason for a difference besides shot noise. Relative to F21, the sample difference change is 1.3 km s−1 Mpc−1 (0.04 mag) with other differences from the baseline owing to the absence of an accounting for peculiar flows as described by Peterson et al. (2021) and not including SNe from multiple surveys in the Hubble flow to cancel zero-point differences among the calibrator surveys as described by Brownsberger et al. (2021). Combined, these raise H0 by ∼1.1 km s−1 Mpc−1, and we see no reason not to include these in an SN Ia–derived measurement of H0 given their strong empirical support. The combined Fit 30 yields 72.76 ± 0.95 km s−1 Mpc−1 for EDD and 72.29 ±0.94 km s−1 Mpc−1 for CCHP, and we cite the mean of the two (72.53 ± 0.95 km s−1 Mpc−1) as representative of the combination of Cepheids and TRGB.

7.3. Consistency of SNe Ia on Second and Third Rungs

In Appendix A we show that the calibrator SNe Ia are spectroscopically all prototypical and photometrically have a distribution of light-curve shapes and colors that are well matched to the selected Hubble-flow sample. The use of tighter-than-typical quality cuts (∣c∣ < 0.15, ∣x1∣ < 2) ensures that the sample comparison is insensitive to the standardization method, as it makes little difference in the sample means.

In R11 the Hubble-flow sample consisted of SNe Ia without limitations placed on the properties of their hosts (with about two-thirds coming from spirals). Rigault et al. (2015) suggested that the calibrator sample, all with spiral hosts and thus greater mean SFR, could introduce a bias in H0 if Hubble-flow residuals (even after accounting for an empirical host-mass dependence as done by R11 and R16) presented a residual correlation with host SFR (either the global rate of the host or local to the SN). R16 addressed this sample difference by including as an analysis variant a Hubble-flow sample composed of only spiral or globally star-forming hosts, a precaution (regardless of whether such a correlation exists) conservatively adopted here as the baseline. Because the size of the Hubble-flow sample is so much larger than the calibrator sample, it is sensible to cull the former to match the selection of the latter to control even yet-undiscovered systematics with little cost to the precision of H0. It is also important to recognize that the size of a correlation of Hubble residuals with a host property depends on the method of SN standardization and the SN sample host selection and that it is quite possible for a specific combination of method and sample to show a significant correlation that does not exist for different methods and samples.

Jones et al. (2018) measured the correlation of the local and global SFR and specific SFR (sSFR) with the Hubble residuals from Pantheon SNe (Scolnic et al. 2018) used by R16 and found little or no significant correlation with implied corrections (if significant) at the 0.3 km s−1 Mpc−1 level. Here we use two SN standardization methodologies from Pantheon+, one of which has no correlation with host mass (Brout & Scolnic 2021) in the baseline, and Fit 55, which has a 0.045 mag step at $\mathrm{log}(M/{M}_{\odot })=10$, increasing H0 by 0.3 km s−1 Mpc−1.

In Table 9 we compare the global properties of galaxies in the calibrator and Hubble-flow samples including mean mass, SFR, and SSFR. The baseline Hubble-flow and calibrator samples have the same mean SFR (0.3), consistent with their matched selection (the mean SFR of the early-type hosts in Pantheon+ is −0.6). The mean mass is also similar, with the calibrators higher by 0.5 dex—a difference smaller than the dispersion of either sample. In Appendix A we compare the distributions of host masses for various samples. To produce a late-type Hubble-flow sample with mass exceeding the calibrator sample, Fit 37 limits the LZSPI to $\mathrm{log}(M/{M}_{\odot })\gt 10$ and lowers H0 by 0.3 km s−1 Mpc−1. We conclude that the calibrator and baseline Hubble-flow samples are well matched in mass and SFR.

Table 9. Host Properties of SN Ia Samples

Sample N Log(M/M)Log SFRLog sSFR
Baseline Calibrator3710.3 SD = 0.60.3 SD = 0.5−9.8 SD = 0.7
LZSPI HF2769.8 SD = 0.80.3 SD = 0.7−9.3 SD = 0.4
LZBRD HF48210.1 SD = 0.80.1 SD = 0.9−9.6 SD = 0.7
HZBRD HF1354N/AN/AN/A
LZSPI HF $\mathrm{log}(M/{M}_{\odot })\gt 10$ 13210.4 SD = 0.50.4 SD = 0.7−9.7 SD = 0.6

Download table as:  ASCIITypeset image

We would not expect any other host properties, especially any local to the SN that was not a selection criterion, to significantly differ between samples. For example, Anderson et al. (2015) measured the relative strength of Hα at the sites of 98 SNe Ia in exclusively late-type, star-forming hosts, including by chance 20 of the 38 selected for Cepheid measurements, and the fraction with detected local Hα is similar for calibrators and Hubble-flow hosts (30% versus 45%). Indeed, it would be very hard to understand how a difference in local SN-host properties could occur between the two sets of hosts with matched selection. While additional host or SN properties beyond the ones we have used to measure H0 may be used to improve SN distance estimates now or in the future, the matching of SN and host samples employed here and the large sample size for each would mitigate any significant effect on the determination of H0.

7.4. State of the Hubble Tension

Our baseline determination of H0 is 73.04 ±1.04 km s−1 Mpc−1 (with systematics), which exceeds the Planck+ΛCDM result by 5σ. It may be of interest and it is straightforward to calculate a combined value of H0 that is free of measurement interdependencies and has lower uncertainty using additional redshift–magnitude relation data. We start with the combined Cepheid and TRGB result, 72.53 ±0.99 km s−1 Mpc−1 (or Fits 31 and 32), and to this we add two recent measures that provide enough information that allow us to have a consistent calibration and redshift frame while avoiding double use of data. Pesce et al. (2020) measured six masers in the Hubble flow; excluding the nearest from that set, NGC 4258 (because its maser distance is used here), and quoting their result in the 2M++ frame (same as our baseline) gives H0 = 72.1 ± 2.7 km s−1 Mpc−1. Blakeslee et al. (2021) measured the IR surface brightness fluctuation distances with HST in 63 galaxies calibrated by the TRGB in the 2M++ frame. Accounting for the small difference in TRGB zero-point (MI = −4.014 ± 0.025 mag for the mean of CCHP and EDD) and used there (MI = −4.03 mag) yields H0 = 74.0 ±3.0 km s−1 Mpc−1. The combination of these independent and independently consistent measures gives H0 = 72.61 ±0.89 km s−1 Mpc−1 (or H0 = 72.42 ± 0.89 km s−1 Mpc−1 with the CCHP TRGB in Fit 32 and H0 = 72.80 ±0.89 km s−1 Mpc−1 from the EDD TRGB with Fit 31), a local determination with 1.2% precision, which is also 5.2σ greater than the Planck+ΛCDM result. Other combinations may be determined but require care to avoid measurement inconsistencies in calibration, redshift frame, or double use of any data.

There has been a wide variety of ideas proposed to resolve the Hubble tension, including (but not limited to) an episode of scalar-field dark energy before recombination, the presence of additional species of neutrinos (perhaps with interactions), decaying dark matter, the presence of primordial magnetic fields, a changing electron mass, decaying or interacting dark matter, a breakdown of the Friedmann–Lemaître–Robertson–Walker metric or general relativity, and so on; we direct the reader to recent reviews, such as Di Valentino et al. (2021) and Schöneberg et al. (2021). These proposals range from moderately successful to unsuccessful with no clear resolution. Some of the more successful ideas mitigate the tension through a similar mechanism, such as increasing H(z) in the early universe so that recombination occurs earlier, thereby shrinking the sound horizon which is the fundamental scale of the CMB (and also of baryon acoustic oscillations). In many scenarios this will produce additional features in the CMB, which are either incompatible with the data or make them appear more plausible (Hill et al. 2021; Poulin et al. 2021). The presence of unaccounted systematics in early- or late-universe measurements have also been suggested, but in Section 6 we comprehensively reviewed those pertaining to the route presented here, with none showing indications of validity.

Both the late- and early-universe data present formidable obstacles to hypotheses involving "new physics" or new systematics owing to the rigor and redundancy of the measurements; any proposals require specificity to see which may be viable. However, opportunities for progress on this problem exist on many fronts. We anticipate gains from improved characterization of H(z), the use of new facilities to refine the local measurements (e.g., LIGO and JWST) and the early-universe measurements (e.g., CMB Stage 4 and the Simons Observatory), neutrino experiments, as well as from new theoretical insights.

8. Conclusions

  • 1.  
    Our baseline determination of H0 is 73.04 ±1.04 km s−1 Mpc−1 (with systematics) from a Cepheid-only calibration of 42 SNe Ia with good SN data quality, or 72.53 ± 0.99 km s−1 Mpc−1 combining Cepheid and TRGB for a total of 46 SN Ia calibrators with good SN data quality.
  • 2.  
    The measurement exceeds the Planck+ΛCDM result by 5σ (one in 3.5 million), making it implausible to reconcile the two by chance.
  • 3.  
    An exhaustive study of variations in the analysis and systematic uncertainties including 67 variants of analyses reveals no indication of significant inconsistencies within the measurement or promising sources of unrecognized error. The dispersion of the 59 NIR variants is 0.3 km s−1 Mpc−1 and is conservatively adopted as an additional systematic uncertainty. The mean of the variants is 73.25 km s−1 Mpc−1, which is higher than the baseline by 0.2 km s−1 Mpc−1.
  • 4.  
    We find the dispersion between 42 SN Ia and Cepheid relative distance measures is σ = 0.130 mag, similar (albeit lower) than the σ = 0.135 mag dispersion of SNe Ia in the Hubble-flow sample and yielding no evidence of excess noise in Cepheid distance measurements.
  • 5.  
    We find that Cepheid and TRGB distance measures are consistent when starting and ending from the same hosts (i.e., between rung one and two). We highlight a net difference of 1.3 km s−1 Mpc−1 (or 0.04 ± 0.02 mag) between measurements by two groups of the location of the tip in NGC 4258 and the resulting calibration of the TRGB (we use the mean of both), and a net 1 km s−1 Mpc−1 (0.03 mag) higher value of H0 from the change (tripling) of the SN calibrator sample, which is consistent with the ±0.05 mag shot noise of the subsamples.
  • 6.  
    We find that each of the three independent geometric anchors is consistent with the distance predicted by its Cepheids and the other two anchors. Improvements in the calibration of the Cepheid metallicity dependence have tightened this conclusion.
  • 7.  
    The SNe between the second and third rungs of the ladder are hosted by galaxies of the same late type (i.e., spiral) with the same or similar mean SFR and mass. Their color and shape distributions are also highly consistent. The calibrator set contains a complete sample of all suitable SNe Ia (i.e., with good data quality) at z < 0.011 in the last four decades. We see no indication of differences between the mean properties of the samples' hosts, nor a reason with matched selection that such would exist and impact H0.
  • 8.  
    Extragalactic Cepheids appear to have a uniform relation between period and luminosity consistent with a single slope and fine structure in their light curves that resembles those in the MW at the same period (i.e., the Hertzsprung progression).
  • 9.  
    The constraint provided by the distance ladder presented here is well approximated by the derived value of H0, except in the case of models that introduce rapid, unexpected, late-time changes in H(z) (relative to either ΛCDM or low-order fits to H(z)), or perhaps some forms of new physics. For such models, we advise replacing H0 with the absolute SN Ia host distances derived from the first two rungs, the set of SN Ia standardized magnitudes in these hosts, SN Ia magnitudes in the Hubble flow, and their covariance, and provide an example that yields a joint constraint of H0 = 73.30 ±1.04 km s−1 Mpc−1 and q0 = −0.51 ± 0.024.
  • 10.  
    The source of this long-standing, significant discrepancy between the local and cosmological routes to determining the Hubble constant remains unknown.

We thank Graeme Addison, Fabio Bresolin, George Efstathiou, and Doron Kushnir for helpful conversations related to this work. We are grateful to Peter Challis for sharing an unpublished spectrum of SN 2008fv. The ESA Hubble office provided most of the color composites shown in Figure 4. An anonymous referee provided a thoughtful report that led to improvements in this paper.

This research was supported by NASA/HST grants GO-12879, GO-12880, GO-13334, GO-13335, GO-13344, GO-15145, GO-15146, and GO-15640 from the Space Telescope Science Institute (STScI), which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS5-26555. Some of the data presented in this paper were obtained from the Mikulski Archive for Space Telescopes (MAST) at the STScI. The specific observations analyzed can be accessed at doi:10.17909/fkay-8z97.

L.M.M. acknowledges additional support from the Mitchell Institute for Fundamental Physics & Astronomy at Texas A&M University. D.O.J. acknowledges support from NASA Hubble Fellowship grant HF2-51462.001 awarded by the Space Telescope Science Institute. A.V.F.'s group at UC Berkeley is also grateful for financial assistance from NSF grant AST-1211916, the TABASGO Foundation, the Christopher R. Redlich Fund, the Miller Institute for Basic Research in Science (in which A.V.F. is a Miller Senior Fellow), and numerous individual donors.

Some of the data presented herein were obtained at the W. M. Keck Observatory, which is operated as a scientific partnership among the California Institute of Technology, the University of California, and NASA; the observatory was made possible by the generous financial support of the W. M. Keck Foundation. We thank Brad Tucker for designing the multislit masks used for the Keck LRIS spectroscopy of H ii regions in the host galaxies.

Appendix A: Properties of Calibrator SNe Ia

The members of the SN Ia calibrator sample were first identified when selecting hosts for observing Cepheids with HST. We have attempted to follow the criteria for selection given by Riess et al. (2005) to provide reliable SN Ia distances: objects observed before maximum light, through low interstellar extinction (AV < 0.5 mag), with modern (i.e., nonphotographic) photometry, and with typical light-curve shapes. Here we analyze the spectra of the calibrators to confirm their initial selection.

A.1. Spectral Properties Using DeepSIP

We consider two samples of calibrators. The first contains 40 SNe Ia and is a complete, volume-limited (z < 0.011) sample of all suitable SNe Ia seen over the years 1980–2021 in spiral hosts (a requirement for finding Cepheids, with results for H0 provided for these alone in Fit 38). A second sample contains just two SNe Ia (SN 1999dq and SN 2007A) from a program targeting hosts that are located at greater redshifts and are more luminous, in an effort to reach the Hubble flow directly with Cepheids. Using the following quantitative spectroscopic analysis, we find that all 42 SNe Ia are in the normal range.

We obtain spectra directly from the Open Supernova Catalog 25 (OSC; Guillochon et al. 2017), which aggregates data from numerous sources (including notable low-redshift SN Ia spectroscopy releases; e.g., Silverman et al. 2012; Blondin et al. 2012; Folatelli et al. 2013; Stahl et al. 2020b). From the OSC-retrieved spectra, we select—per SN Ia—the nearest-to-maximum-light spectrum having (i) full coverage 26 of the characteristic Si ii λ 6355 absorption feature, and (ii) an S/N of at least 10 per pixel. 27 If these criteria cannot be met by the available spectra, we reduce the S/N threshold to 5 pixel−1; if still no spectra satisfy the criteria, we remove the Si ii λ 6355 coverage requirement. If no spectra are available even after this relaxation, our automated algorithm flags the SN in question for manual intervention. Following this approach, 37 spectra are obtained with our full criteria satisfied, one is obtained with our most relaxed criteria, and two fail.

We resolve these two failures as follows.

  • 1.  
    SN 2008fv was spectroscopically classified as a normal SN Ia in CBET 1522 (Challis 2008). Though never published, we have obtained this spectrum from the author.
  • 2.  
    SN 2021hpr has three spectra available on the Transient Name Server. 28 We take the spectrum that best satisfies our full criteria as stated above.

We also override successful OSC acquisitions in several cases where more suitable spectra (i.e., those that better match our criteria) are available elsewhere. The spectra in our final set have a median phase of −0.1 days, with the earliest at −11.7 days and the latest at 14.5 days. A full accounting of relevant metadata is available upon reasonable request.

A key concern in studies that utilize SN Ia distances (such as this one) is ensuring that the objects used are indeed "normal" SNe Ia in the sense that they can be standardized using width–luminosity relations (Phillips 1993). Here we take the conservative approach of excluding subluminous SN 1991bg–like (Filippenko et al. 1992b; Leibundgut et al. 1993) and overluminous SN 1991T–like objects (Filippenko et al. 1992a; Phillips et al. 1992), thereby defining "normal" SNe Ia as those that fall within the central, most well-studied and precise region of the Phillips relation. Moreover, we confirm the normalcy of our selected objects spectroscopically by employing the deepSIP package (Stahl et al. 2020a), which provides highly effective, trained convolution neural networks that, among other things, can (i) classify if a spectrum belongs to an SN Ia with a rest-frame phase between −10 days and 18 days and light-curve shape (parameterized by SNooPy's Δm15 parameter; see Burns et al. 2011 for more details) between 0.85 and 1.55 mag, a conservatively narrow window corresponding to "normal" objects, and (ii) predict (with uncertainties) quantitative Δm15 values. Because we know the rest-frame phase of each spectrum in our sample from the light-curve-derived times of maximum brightness, distinctions made by the aforementioned classifier provide a direct probe if the SNe Ia in our sample are spectroscopically normal. The calibrator SNe Ia have DeepSIP Δm15 values between 0.84 and 1.37 mag.

We find that only two spectra do not satisfy this test, one of which (SN 2008fv) is expected owing to it being the sole case in our sample where the spectrum is at a phase earlier than −10 days. The other case (SN 1997bq) is most certainly spectroscopically normal (see Blondin et al. 2012). The fact that this single false negative is obtained is not unexpected because in developing deepSIP, Stahl et al. (2020a) tuned the decision threshold of the model with an eye to their subsequent scientific use case in which false positives (i.e., classifying an SN Ia as normal when it actually is not) represent a far worse error than false negatives (i.e., failing to classify an SN Ia as normal when it actually is). We visualize the entire spectral sample along with representative near-maximum-light spectra of SN 1991bg and SN 1991T for reference, color-coded by deepSIP-predicted Δm15 value, in Figure A1.

Figure A1. Refer to the following caption and surrounding text.

Figure A1. Spectra (scaled fλ ) of all 42 SNe Ia in the calibrator sample, color-coded by their deepSIP score calibrated to the photometric parameter, Δm15. As discussed in the text, we find all calibrator SNe to be in the normal range. Peculiar objects SN 1991T and SN 1991bg bracket the extremes and are shown for comparison.

Standard image High-resolution image

A.2. Photometric Properties

In Figure A2 we show the distributions of the SALT II color (c) and shape (x1) parameters, as well as the host masses, for the calibrator sample and for three Hubble-flow samples: (1) the baseline 0.0233 < z < 0.15 spiral sample and the tighter quality cuts ∣c∣ < 0.15, ∣x1∣ < 2; (2) the same redshift range for all host types and the Pantheon+ standard quality cuts ∣c∣ < 0.3, ∣x1∣ < 3; and (3) a sample of all types to z < 0.8 and standard quality cuts. It can be seen that the samples are well matched in the mean, with the baseline sample better matched in breadth to the calibrator sample owing to the tighter quality cuts. As a result of this investigation, we conclude that each SN in the calibrator sample is unambiguously normal and thus can be reliably standardized to match well the baseline Hubble-flow sample.

Figure A2. Refer to the following caption and surrounding text.

Figure A2. SN Ia and host properties for the calibrator sample (green) and three Hubble-flow samples. The color (left panel) and light-curve shape (middle) distributions are in good agreement as discussed in the text, particularly for the baseline sample (Lo-z Spiral, orange) whose quality cuts limits are ∣c∣ < 0.15 and ∣x1∣ < 2. The right panel shows the host mass including a truncation of the Lo-z sample (dashed line) with values for this and other host properties in Table 9.

Standard image High-resolution image

Appendix B: Independent Tests of Cepheid Photometry

B.1. Aperture Photometry

The photometry of Cepheids presented here is derived from a set of standard procedures referred to in the astronomical literature as "scene modeling" or "crowded-field photometry." These methods use an empirical description of the PSF to model unresolved sources (i.e., stellar profiles) by comparing image pixels to a model constructed from the superposition of PSFs, each with its own X and Y coordinate and amplitude as well as the constant level of the background. The initial set of sources and their positions may be derived from the image or a catalog.

This approach offers greater precision than fixed-aperture photometry (i.e., summing flux in an aperture), as it can separate the blended flux of distinct sources and additionally improves the S/N by the optimal weighting of source pixels. Bias resulting from the inability to resolve nearly coincident background sources from the Cepheid can be determined statistically by adding artificial stars of known flux to the scene. This bias is really just the mean level of the fluctuating background. Uncertainties are measured from the distributions of recovered artificial stars. Frequently used software packages that enable this approach include DAOphot, DoPHOT, DolPhot, HSTphot, and Romaphot. In general, the use of these different packages has been shown to yield similar results subject to the settings for which these packages are employed.

However, it is valuable to have a robust cross-check of the accuracy of the photometry measured with these techniques using a simpler approach that is easily replicated by others without reference to any specific piece of software. Such a method is aperture photometry, which, though less precise, provides a strong test of the accuracy the Cepheid photometry reported above.

Since aperture photometry cannot readily separate superimposed sources in dense regions, a few considerations described here are necessary to produce accurate aperture photometry. (1) Small apertures are required, here set to 1.4 drizzled pixels (0farcs11) in radius, along with an aperture correction derived from our PSF model to estimate the flux outside the aperture. (2) We limit the comparison to one-third of the Cepheid sample with the lowest surface brightness, a sample large enough to measure H0 but which still has less background and thus greater precision. (3) We use a mean of the pixels in an annulus around each Cepheid to determine the sky, rather than the mode or median, which is more commonly used in sparse regions. This last step is very important for deriving accurate aperture photometry in the presence of a fluctuating background. In dense regions, the mean of the sky is an unbiased estimator of the level beneath a source, as it includes both the spatially constant background as well as the mean level of superimposed sources. In contrast, the mode or median of the crowded sky or any statistic calculated after fitting and subtracting visible stars necessarily underestimates the sky under the target source where we cannot resolve or subtract blended sources. We assume the position of the Cepheid was previously identified from sparse, high-contrast optical images, as is the case for the SH0ES program.

We measured aperture photometry of the Cepheids using the same images (before removing any background level) used for the PSF-based photometry and the same, fixed Cepheid positions. The sky value was set to the simple mean of the sky pixels in an annulus between radii of 15 and 25 pixels from each Cepheid. 29 Figure B1 displays a comparison of the aperture and PSF photometry for the Cepheids as a function of PSF magnitude, showing agreement within the sample mean errors.

Figure B1. Refer to the following caption and surrounding text.

Figure B1. Comparison of PSF photometry (with backgrounds from artificial stars) for Cepheids in SN hosts (black) and NGC 4258 (blue) used here and basic small-aperture photometry with backgrounds from the mean pixel in a local annulus as discussed in the text.

Standard image High-resolution image

We can make a few additional observations about the comparison that clarifies the relationship between the statistics used to measure the sky in aperture photometry and the crowded background corrections used to account for bias in PSF photometry. Because PSF photometry sets the sky as a "floor" level added to modeled sources, the PSF sky level will be similar to the mode (i.e., the most common value or peak of the distribution of sky annulus pixels) of the sky pixels used in (uncrowded) aperture photometry, the uniform level of pixels without apparent sources. Therefore, the difference in aperture photometry calculated from a sky level using the mode or mean of the sky pixels will be similar to the crowded background correction in PSF photometry. In sparse fields, the mode and mean are equivalent, and the crowded background bias is negligible.

The asymmetric distribution of pixel levels in the sky annulus also explains the useful feature that in log-normal or magnitudes, the Cepheid uncertainties that are dominated by the asymmetric distribution of sky pixels are relatively Gaussian to a few standard deviations, as shown in Section 3.3.

B.2. Impacts of Improvements in Photometry

As discussed in Section 3.4, we itemize half a dozen improvements in Cepheid photometry from the last time these Cepheids were measured between ∼6 and ∼16 yr ago. Matching Cepheids by position, the error-weighted mean of the matched Cepheids are fainter in SN hosts by 0.06 mag and in NGC 4258 by 0.04 mag (a net difference of 0.02 mag when NGC 4258 calibrates the Cepheids and H0). For the LMC, Cepheids became fainter by 0.03 mag between the ground sample used by R16 and their replacement by direct HST observations in R19. The mean change in F555W – F814W was below 0.01 mag for Cepheids in SN hosts and in the outer field of NGC 4258 but 0.06 mag in the inner field relative to the previous calibration undertaken by Macri et al. (2006; and provided in H16), which lacked accounting for the crowded background level, a larger effect in F814W than in F555W, and pixel-based CTE rectification. We cannot report the change in photometry for MW Cepheids because we use different MW parallax samples here than those in R16.

For context, R16 propagated a zero-point uncertainty between ground and HST system photometry of σ = 0.03 mag, which is the same size as the changed observed for the LMC sample upon direct observation with HST. The net change between the Cepheids in the SN hosts and NGC 4258 or the LMC, the quantity that determines H0, is 0.02, comparable to the systematic uncertainty in R16 of 0.026 mag for the NGC 4258 anchor and smaller than the overall uncertainty in H0 of 0.052 mag. A more noteworthy change appears between the MW-only anchor results where the replacement with the Gaia EDR3 results plus HST photometry with the sample from Benedict et al. (2007) and their ground-based photometry reduced H0 by 0.086 mag, with some of this change related to the aforementioned update of the Cepheid photometry in SN hosts, though the quadrature sum error for only the parallax samples is ∼0.06 mag, making this change not surprising. For additional context for the LMC, we note that the DEB distance also decreased by ∼0.015 mag and the metallicity term by 0.033 mag (same sense) since R16. These factors explain the net increase in H0 for the LMC.

B.3. Background Dependence of Fit Residuals

An accurate assessment of the background flux is critical to the use of standard candles. The Cepheid backgrounds are determined statistically by adding and measuring artificial stars in random positions local to the Cepheid scenes as described above. Here we provide an additional null test of the background estimates.

Because the individual backgrounds are determined locally as part of measuring the photometry, they are not part of the model. A null test of the level of background is to analyze the correlation of background with the model residual of each Cepheid. If backgrounds are significantly systematically overestimated or underestimated, we would expect a correlation in this space as the residual would be a function of the (misestimated) background. We show the results of this test in Figure B2 for the baseline fit, binning the residuals in small ranges of background values. These bins show no significant trend with background. A linear fit gives a relation with slope 0.015 ± 0.014 mag per mag of background in the sense of overestimating the background by 0.006 ± 0.005 mag at the mean background level in the SN hosts and in NGC 4258 (as displayed in that figure). The figure also shows the expected trend if the background were underestimated enough to produce a 0.2 mag change in H0 for the mean host (i.e., to solve the tension), which is very far from the data (and would also apply to the Cepheids in NGC 4258 and thus not address the discrepancy for that anchor in Fit 10).

Figure B2. Refer to the following caption and surrounding text.

Figure B2. Mean differences between Cepheid photometry and distance-ladder fit versus the local background measured by artificial stars (in units of the Cepheid magnitudes), binned in equal numbers of Cepheids. Red is for Cepheids in SN Ia hosts and green for Cepheids in the anchor NGC 4258, with vertical lines indicating the mean background of each set. An underestimate of Cepheid backgrounds in SN Ia hosts at a level of 0.2 mag for the mean Cepheid background or an overestimate of Cepheid backgrounds in NGC 4258 by 0.2 mag for the mean Cepheid (indicated), or half that level for both simultaneously, would be required to solve the tension, but these possibilities are inconsistent with the data, which are consistent with no overestimates or underestimates for either set. Furthermore, it is hard to imagine how an overestimate or underestimate would not similarly affect (and therefore cancel) Cepheids in both NGC 4258 and SN Ia hosts.

Standard image High-resolution image

For those who would like to check the accuracy of independently-measured extragalactic Cepheid PSF photometry, we can provide (upon reasonable request) images of hosts with artificial Cepheids of known brightness and position added to the frames. Reproducing the known photometry would be an important test of any photometry algorithm that can be completed blindly and in advance of an independent determination of H0.

Appendix C: Cepheid Metallicity

Cepheid abundances beyond the Magellanic Clouds are generally derived from radial metallicity gradients measured from the ratios of strong emission lines in H ii regions, such as the R23 strong-line diagnostic based on oxygen and hydrogen (Zaritsky et al. 1994, hereafter Z94). The conversion of R23 to 12 + log [O/H] in the Z94 formula was based on photoionization models that themselves were calibrated at a time when the standard solar metallicity was believed to be 12 + log [O/H] = 8.93 (Anders & Grevesse 1989). Since Asplund et al. (2005, 2009), the solar abundance was revised downward by 0.24 dex to 8.69 ± 0.05. Over the same period, empirical transformations between strong-line measures like R23 were also revised downward by ∼0.2–0.3 dex, making them more consistent with stellar measures referenced to the Sun (Pettini & Pagel 2004; Kewley & Ellison 2008; Tremonti et al. 2004; Dopita et al. 2016; Curti et al. 2017). Bresolin et al. (2016) have shown good agreement between extragalactic oxygen abundance measurements from young stars and from H ii regions using these more modern systems (particularly for Pettini & Pagel 2004, hereafter, PP04 O3N2; F. Bresolin, private communication). Collisionally excited lines generally underestimate stellar abundances by 0.2 dex for reasons discussed by Carigi et al. (2019). Using 30,000–50,000 galaxy spectra from SDSS DR7, Teimoorinia et al. (2021) derived third-order polynomial conversions between the Z94 scale and nine different strong-line metallicity calibrations developed since 2004.

For our measurement of H0, we employ two strong-line abundance systems. The first is the simple mean of all nine recent (since 2004) diagnostics of Teimoorinia et al. (2021). This revises the Z94 scale down by a mean of 0.28 dex. The change is not quite constant, as the mean produces flatter galaxy gradients than Z94, revising the Z94 values down an additional 0.05 dex at >9.0 and less at <8.8 by 0.04 dex.

The second is the PP04 O3N2 diagnostic, which uses the dereddened abundances of [O iii] λ5007 relative to Hβ λ4861 and [N ii] λ6583 relative to Hα λ6563 to measure the log of the ratio of [O iii] to [N ii], O3N2, with the calibration 12 +log [O/H] = 8.73 − 0.32 × O3N2. The difference between the Z94 diagnostic and the PP04 O3N2 diagnostic is very nearly constant, reducing the oxygen abundance scale by 0.23 dex (0.235 dex at Z94 >9.0 and 0.222 at <8.8). The PP04 conversions of Teimoorinia et al. (2021) are valid for oxygen abundance on the Z94 scale >8.35, encompassing all Cepheid hosts, with a typical stated precision of 0.03 dex. This change not surprisingly mirrors the drop in solar abundance of 0.24 dex from the revisions of Asplund et al. (2005, 2009) and the related recalibration of strong-line diagnostics. For this reason, it has made little difference for Cepheid measurements whether to use the prior Z94 calibration referenced to the Anders & Grevesse (1989) solar abundance or the revised (mean of) the recent systems referenced to the Asplund et al. (2005, 2009) solar abundance.

When calibrating Cepheids in SN Ia hosts to those in NGC 4258, both sets of metallicities are derived using the same strong-line abundance scale so that differences in abundance calibration do not affect the difference in abundance. However, LMC and MW Cepheid abundances are generally measured through direct spectroscopy of the stars, so when these calibrators are employed it is important to test the consistency of direct and indirect methods for measuring Cepheid abundances. This is most readily accomplished in the MW where both measures are available. Luck & Lambert (2011) provide oxygen abundances for 219 MW Cepheids on the Asplund et al. (2005) solar calibration of 8.69 from which they measured a gradient of 12 + log [O/H] = 9.303 (±0.028) −0.056(±0.003) dex kpc−1 as shown in Figure C1.

Figure C1. Refer to the following caption and surrounding text.

Figure C1. Comparison of oxygen abundance gradients in the MW inferred by spectra of Cepheids (blue) and from the PP04 calibration of H ii regions (black) with data sources indicated in the plot. The mean levels are found to be consistent to σ < 0.1 dex, and we propagate systematic differences in scales in the covariance matrix (as discussed in the text).

Standard image High-resolution image

To compare to these, we derived the strong-line abundances from the three largest samples of MW H ii regions from Esteban et al. (2017), Esteban & García-Rojas (2018), and Arellano-Córdova et al. (2021). We used the tabulated values of [O iii] λ5007, Hβ, Hα, and [N ii] λ6583 in those references to calculate the ratio O3N2 and from that the PP04 calibration to determine 12 + log [O/H] as shown in Figure C1. The abundance gradient from the Cepheids as seen in that figure is a good match to the H ii regions. A formal fit gives the PP04 stellar scale = 0.035 ± 0.06 dex. From the Teimoorinia et al. (2021) transformations we find the mean difference between the PP04 and the nine recent systems, PP04−ave = 0.047 dex, demonstrating that the average of the nine systems is right on the stellar scale to a precision of ∼0.06 dex. In Section 3.5, Equation (9), we employ a metallicity covariance in the fit covariance matrix measured as the difference between the mean of the metallicity diagnostics and the PP04 O3N2 diagnostic, which propagates both a 0.05 dex scale uncertainty, and also the differences that depend on the range of metallicity and rise to 0.1 dex at 12 + log [O/H] = 8.8 (i.e., [O/H] = +0.11) and up to 0.15 dex at 9.0. We also present the determination of H0 on both reference systems as well for the exclusion of any dependence of Cepheid luminosity on metallicity to bracket the full range.

Appendix D: Use and Misuse of Wesenheit Magnitudes

The Wesenheit magnitude used to measure distances in Equation (7) is not a conventional magnitude (i.e., not a ratio of flux to a standard star), but rather a "dereddened magnitude" (see Appendix C of Madore & Freedman 1991 for a review), the ratio of two standardized candles with the same fiducial luminosity would be expected to have in the absence of reddening. The Wesenheit formalism takes advantage of the candles having the same mean luminosity and color (before being reddened by dust) to preserve their difference in distance (in magnitudes) without requiring knowledge of either the (absolute) intrinsic luminosity or color.

This method, and potential pitfalls with alternative uses of these magnitudes, can be better understood by decomposing the Cepheid color into its constituent parts. We can define the apparent Cepheid color,

Equation (22)

where (VI)0 is the color (at a given period) in the middle of the Cepheid instability strip, Δ(VI) is the displacement in color from the midline of the instability strip (with an accompanying displacement in mean luminosity owing to Stefan's law projected into the observed bandpasses, redder is fainter) and is quantified empirically in the Cepheid P–L–C relation. For a statistical sample of Cepheids beyond period completeness (which provides Cepheids uniformly distributed over the instability strip, mixed by the comparatively small evolution time to cross the strip compared to the larger differences in formation times), the mean 〈Δ(VI)〉 = 0 mag. Hence, the relative difference in distance (in magnitudes) between samples of Cepheids is statistically preserved by subtracting from the apparent magnitudes of each any fixed number, R, which multiplies the (same) intrinsic color of the midlines of their instability strips (i.e., R(VI)0). If the value of R is chosen to be the reddening ratio derived from an accurate reddening law of dust, then the subtracted term will also remove the absorption by dust (i.e., R E(VI)) from each Cepheid for dissimilar values of E(VI). This is the function of a Wesenheit magnitude. A "happy coincidence" is that Stefan's law is quite parallel to the reddening law (redder is fainter or dustier), so this operation will also reduce the small dispersion due to the width of the instability strip (Madore & Freedman 1991). As given in Section 6.3, we derived the intrinsic P–L–C (Period-Luminosity-Color) relation from the LMC Cepheids as above to be ΔH = 0.635 (±0.021) Δ(VI), the same sense and similar in scale to reddening by dust for which R ≈ 0.4.

It is not valid to use a different reddening ratio (law) in different hosts by simply varying the value of R for each host using the same Wesenheit system. Using a different value of R for different Cepheids (or for Cepheids in different hosts) in Equation (7) without first removing the intrinsic color from the apparent color produces a change in the measured differences in the distance of Cepheid samples of size R1(VI)0 −R2(VI)0, where R1R2, which is unrelated to dust but is rather a large, ad hoc change in the standard candle luminosity itself as a fraction of its color. Because the typical (VI)0 color is a factor of a few times larger than the typical E(VI), this change in Cepheid luminosity will dominate the dust correction and produce a bias that is a rather arbitrary fraction of a Cepheid's color. This is a shortcoming in the analysis presented by Mortsell et al. (2021).

It is also important to recognize that Cepheids are well-understood yellow supergiant stars that will only pulsate over a very narrow range in surface temperature of log T = 3.7 (T = 5000 K), which sets their intrinsic color of (VI)0 ≈ 0.8 mag (at P = 10 days) at the instability strip of the temperature–magnitude diagram of stars. This is the surface temperature where its internal structure supports pulsations by the kappa mechanism (change in internal opacity in a deep partial ionization zone from ionization and recombination of He+) so that the value of (VI)0 is well constrained by the physics that causes a Cepheid to be formed.

Thus, varying R in the Wesenheit definition does not appear to follow any specific hypothesis. To consider the hypothesis of different reddening laws in different hosts, it is necessary to first subtract the intrinsic color as described in Section 6.3 and by Follin & Knox (2018).

D.1. Errors in Color and Fitting

Because the value of R is tightly constrained by the sample of MW Cepheids to 0.363 ± 0.038 (and better together with the Cepheids in other nearby hosts such as M31 and M101), the error in color (VI) can be propagated to an error in H through HR(VI) ignoring the small uncertainty in R and added in quadrature in the covariance matrix of errors. However, this is no longer true if R is poorly constrained by the data and the errors in color are large, as would be the case if we tried to determine R independently for each individual SN host after subtracting the intrinsic color. Because the measurement errors in color are significant, to determine individual host values of R without bias, it is necessary to consider errors in both axes (H-band magnitude and VI color) to optimize R as discussed for a similar problem by Tremaine et al. (2002) or follow the constrained approach of Follin & Knox (2018).

Unfortunately, beyond the nearest few galaxies, the data are quite inadequate for constraining R owing to the combination of a small range of color and comparatively large color uncertainties. Figure D1 shows the available Cepheid data for a nearby host with considerable measurable extinction (M31), one of the two nearest SN Ia hosts (NGC 5643) for which the data are just sufficient, and for a typical SN Ia host (UGC 9391) where the data are insufficient. Accounting only for uncertainty in the dependent variable produces a biased result and underestimates the uncertainty when the data quality is low, as is readily seen in Monte Carlo simulations compared to the input value of R.

Figure D1. Refer to the following caption and surrounding text.

Figure D1. Measuring the reddening ratio R in each host. For each host we subtract the intrinsic period–color relation as given in the text to provide E(VI) and correct the H-band magnitudes for the PL relation (period and metallicity). The ideal case, M31, has a large span of extinction (because it is highly inclined) and small measurement errors (because it is nearby) to produce a good constraint on the slope, R, whether we consider errors in only the dependent variable (red line and dotted for uncertainty) or both axes (green line). R = 0 is shown as a black line. NGC 5643 is one of the two nearest SN Ia hosts and has the most Cepheids; it yields some constraint on R along with M101. In contrast, the rest of the SN hosts have data similar to UGC 9391, which produces an uninformative constraint on R and shows a large difference in its value and uncertainty by ignoring uncertainties in both axes. The lower right shows the bootstrap resampling for all hosts, with a different color for each, showing that a meaningful constraint is only seen for the few nearest cases, with the others relatively flat. The measurable cases are consistent with the MW R shown as a black dashed line, as are the sum of all SN hosts. Ignoring errors in E(VI) underestimates R and its uncertainty, as discussed in the text.

Standard image High-resolution image

In these cases, the breadth of colors is largely attributed to the measurement errors rather than simply extinction. Accounting for uncertainty in both axes, we find that the value of R is quite unconstrained beyond the nearest few hosts. A bootstrap resampling of the data shows that the constraints on R become uninformative beyond the nearest two SN Ia hosts with a mean uncertainty for the rest of ∼1.5. The lower-right panel of Figure D1 shows the likelihoods for different host R values. We find that the nearest few large spirals have R consistent with the MW, but there is no meaningful constraint or well-defined peak for the rest. We can combine all such constraints as shown in the figure with a result that is unsurprisingly not well defined. Mortsell et al. (2021) and Perivolaropoulos & Skara (2021) did not include the color errors, which our analysis shows leads to an underestimate of the values of R and their uncertainties for nearly all SN hosts. From a Bayesian perspective, the addition of a free parameter, R, for each host (∼40 free parameters) is not justified by the improvement in the fit over the global R, as also suggested in the analysis by Mortsell et al. (2021) and Follin & Knox (2018). An informed approach to modeling individual host R values without so many poorly constrained free parameters can be found in Section 6.3, Fit 22, and Hahn et al. (2022).

The EDA framework also provides estimates of the mean extinction seen midway through the SN hosts (i.e., the average location of a Cepheid) in our NIR bandpass from the same host properties used to determine R above but now including the host inclination. These values range from AH = 0.07 mag (NGC 4424, which at type Sa is the earliest spiral in our sample) to 0.30 mag with a mean of 0.18 mag and a dispersion of 0.05 mag. This mean is similar to the empirical result of the product of the SN-host mean E(VI) and R, which yields〈AH 〉= 0.14 mag, indicating that the level of extinction inferred from the Cepheid reddening is the expected amount. 30

Footnotes

  • 11  

    These color and shape requirements translate in the Pantheon SN standardization (Scolnic et al. 2018) as ∣c∣ < 0.2 and ∣x1∣ < 2.

  • 12  

    To improve readability, the conventional units of H0 will frequently be omitted in the rest of this paper. ΛCDM refers to the standard cosmological model with the cosmological constant (Λ) and cold dark matter (CDM).

  • 13  

    Sometimes called a "crowding correction" because it adds the mean level of randomly superimposed sources to the initial background estimate derived from the unresolved or constant sky level.

  • 14  

    This might be pursued to allow the extinction law to vary in every host, but if the intrinsic color is not first subtracted, it has the unintended consequence of producing a large variation in the luminosity of the standard candle itself, which is unrelated to dust, is inconsistent with the premise of a distance ladder where stars (once standardized) have luminosities independent of the rung they live on, and most importantly is not supported by the data as shown in Appendix D.

  • 15  

    Macri et al. (2015): LMC, H 2mass = mH + 0.0116 − 0.0054(JK − 0.4) − 0.0189(JK − 0.4)2, rms = 0.038 mag (N ≈ 34,000). Kato et al. (2007): SMC, H 2 m ass = mH − 0.0246 − 0.0228(JK − 0.4) + 0.0106(JK − 0.4)2, rms = 0.033 mag (N ≈ 14,000).

  • 16  

    We used the final estimation of τ(N = Nmax), where the relative change of estimated value becomes small enough: $d\tau /{dN}/\tau \ll 0.001 \% $.

  • 17  

    It is also the expectation for a consensus ΛCDM with ΩM = 0.3 and ΩΛ = 0.7.

  • 18  

    A variation of the inverse approach is to predict MB and compare that with local hosts as suggested by Efstathiou (2021), an approach that is equivalent in principle but the dependence of MB on the method of SN standardization makes it less widely applicable.

  • 19  

    R21 derived −0.20 ± 0.13 from 66 Cepheids with HST photometry. Ripepi et al. (2021) used a larger ground-based sample of N = 317 fundamental and first-overtone pulsators to derive −0.37 ± 0.09 on the ground system with similar filters, steeper by ∼1.5σ than R21 but less applicable here owing to the presence of overtones and objects with low accuracy. Groenewegen (2018) used 205 MW Cepheids with Gaia DR2 parallaxes, somewhat less precise than those in EDR3, to derive a NIR Wesenheit abundance term of −0.204 ± 0.14. Here we identify 211 Cepheids with basic quality cuts, fundamental-mode only, VI < 2 mag, mG > 6 mag, P > 3 days, and Gaia GOF (goodness of fit) <10, transforming the ground magnitudes to the HST filter system to obtain −0.22 ± 0.09 with a slope of −3.29 as indicated in Figure 21.

  • 20  

    The PP04 calibration was suggested by F. Bresolin (private communication, 2021) and indicated in Bresolin et al. (2016) to provide a good match to extragalactic-star spectral abundances, making it a good reference system.

  • 21  

    It is not possible to make an absolute statement of whether TRGB and Cepheid distances are consistent because this will depend on how each is calibrated and measured. Here we focus on a specific implementation of each.

  • 22  

    The Jang et al. (2021) measurement in NGC 4258 used observations with a different filter, F555W, and different electronics than the F606W and the refurbished electronics used for all SN hosts and by Anand et al. (2021). More significantly, the state of CTE degradation on ACS was also markedly different between these observations of NGC 4258. In 2003–2005, a typical TRGB star in the halo would have suffered CTE losses of ∼0.04 mag. At the time of the SN-host observations in 2015–2019, such stars would have lost ∼0.14 mag, more similar to the observations analyzed by Anand et al. (2021). While pixel-based CTE rectification in the STScI pipeline attempts to account for such losses, the differential loss over the 15 yr would be ∼0.1 mag and the uncertainty in the correction would be a sizable fraction of that, likely a few hundredths of a magnitude. For this reason it is preferred to use data more closely spaced in time.

  • 23  

    Comparing the Cepheids in Table 3 in NGC 5584 in common with Javanmardi et al. (2021) but revised here since R16, we find they are consistent in the mean with a difference in the Wesenheit of 0.07 ± 0.05 mag; their results are brighter, which if applied to all SN hosts would raise H0 by 2 km s−1 Mpc−1.

  • 24  

    Although we do not recommend mixing photometric systems, to demonstrate the consequence of including the ground-based zero-points of TRGB as well as the Jang et al. (2021) measure of the tip of NGC 4258 we can add a strong prior of MI,TRGB = −4.05 ± 0.02 mag to Fit 31, in which case we find H0 = 72.00 ± 0.86 km s−1 Mpc−1, very close to the mean of the baseline here and the F21 value of H0, simply weighted by the size of each SN sample that passes quality cuts in each set, and a posterior result of MI,TRGB =−4.040 ± 0.015 mag.

  • 25  
  • 26  

    As in Stahl et al. (2020b), we define "full coverage" as having a wavelength minimum below 5750 Å and a maximum above 6600 Å.

  • 27  

    Ties when two spectra have exactly the same phase are broken by taking the one with broader wavelength coverage.

  • 28  
  • 29  

    If anomalously bright Cepheids are removed as outliers from the final sample it is necessary to apply the same threshold to the sky pixels before calculating their mean to avoid a measurement bias. In practice, it is common to exclude Cepheids brighter than the PL by ≤3σ or >1 mag to provide a more robust PL relation. Thus, we remove sky pixels from the determination of the mean, which would cause an outlier of this size, i.e., those ≥2.5× the level of the central Cepheid pixel. For the images of Cepheids studied here this fraction is typically a few percent of the brightest Cepheids and sky pixels. To avoid a similar bias in PSF photometry, it is necessary to apply this same threshold when measuring the simple mean of artificial stars to determine the mean background correction.

  • 30  

    A more detailed comparison would require modeling of the Cepheid selection in the presence of variable depth of dust and is beyond the scope here.

Please wait… references are loading.
10.3847/2041-8213/ac5c5b