CGM2 I: The Extent of the Circumgalactic Medium Traced by Neutral Hydrogen

, , , , , , , , , , and

Published 2021 April 29 © 2021. The American Astronomical Society. All rights reserved.
, , Citation Matthew C. Wilde et al 2021 ApJ 912 9 DOI 10.3847/1538-4357/abea14

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/912/1/9

Abstract

We present initial results from the Cosmic Origins Spectrograph (COS) and Gemini Mapping the Circumgalactic Medium (CGMCGM ≡ CGM2) survey. The CGM2 survey consists of 1689 galaxies, all with high-quality Gemini-GMOS spectra, within 1 Mpc of 22 z ≲ 1 quasars, all with a signal-to-noise ratio of ∼10 Hubble Space Telescope/COS G130M+G160M spectra. For 572 of these galaxies with stellar masses 107 M < M < 1011 M and z ≲ 0.5, we show that the H i covering fraction above a threshold of NHI > 1014cm−2 is ≳0.5 within 1.5 virial radii (RvirR200m). We examine the H i kinematics and find that the majority of absorption lies within ±250 km s−1 of the galaxy systemic velocity. We examine H i covering fractions over a range of impact parameters to infer a characteristic size of the CGM, ${R}_{\mathrm{CGM}}^{14}$, as a function of galaxy mass. ${R}_{\mathrm{CGM}}^{14}$ is the impact parameter at which the probability of observing an absorber with NHI >1014 cm−2 is >50%. In this framework, the radial extent of the CGM of M > 109.9 M galaxies is ${R}_{\mathrm{CGM}}^{14}={346}_{-53}^{+57}$ kpc or ${R}_{\mathrm{CGM}}^{14}\simeq 1.2{R}_{\mathrm{vir}}$. Intermediate-mass galaxies with 109.2 < M/M < 109.9 have an extent of ${R}_{\mathrm{CGM}}^{14}={353}_{-50}^{+64}$ kpc or ${R}_{\mathrm{CGM}}^{14}\simeq 2.4{R}_{\mathrm{vir}}$. Low-mass galaxies, M < 109.2 M, show a smaller physical scale of ${R}_{\mathrm{CGM}}^{14}={177}_{-65}^{+70}$ kpc and extend to ${R}_{\mathrm{CGM}}^{14}\simeq 1.6{R}_{\mathrm{vir}}$. Our analysis suggests that using Rvir as a proxy for the characteristic radius of the CGM likely underestimates its extent.

Export citation and abstract BibTeX RIS

1. Introduction

The circumgalactic medium (CGM) represents the complex interface between stellar evolution, feedback from supermassive black holes, and the cosmic web, dictated by the large-scale cosmology of the universe (e.g., Lambda cold dark matter; Tumlinson et al. 2017). These phenomena span many orders of magnitude in relevant spatial scales and involve several different subdisciplines of astrophysics. In the canonical cosmological picture of galaxy evolution of White & Rees (1978), gas accretes onto galactic halos from the intergalactic medium (IGM), cools and condenses to form the interstellar medium, and then eventually stars. At the same time, mass and energy is returned to the CGM via supernovae and active galactic nucleus (AGN) feedback, with some fraction potentially deposited back into the IGM.

In spite of its role as the harbor for a significant fraction of galactic baryons (e.g., Davé et al. 2010; Shull et al. 2012; Werk et al. 2014), the CGM is predicted to have low gas densities (n ≪ 0.01 cm−3) due to its vast volume. Such low-density gas is most efficiently observed using absorption lines found in the spectra of bright background objects, usually quasars (quasi-stellar objects (QSOs)). Generally, QSO absorption spectroscopy provides very high sensitivity to extremely low column densities in various ions, down to N ∼ 1012 cm−2, and access to a broad range of ionized metal transitions that trace gas at both high and low densities in the CGM.

Bahcall & Spitzer (1969) first posited that galaxies are responsible for the high-column density absorption lines in QSO spectra. In the 40 yr since the prediction of Bahcall & Spitzer (1969), there has been much effort made to link QSO absorption lines to both the IGM and CGM. In particular, the IGM traces the large-scale structure of the universe, and is observed mainly through the Lyα forest (see Rauch 1998 and McQuinn 2016 for reviews). Previous works have found that the H i Lyα forest is associated with two distinct populations based on the strength of absorption systems; a low-density population that traces the cosmic web and a high-column density population that lies within dark matter halos (e.g., Morris et al. 1993; Tripp et al. 1998; Chen et al. 2005; Tejos et al. 2014; Burchett et al. 2020). Linking sets of absorption lines arising at the same redshift (absorber systems) with redshifts of foreground galaxies in close proximity to the QSO sightline has essentially defined the last 20 yr of CGM work (e.g., Chen & Tinker 2008; Prochaska et al. 2011; Tumlinson et al. 2013; Rudie et al. 2019). Boksenberg & Sargent (1978) and Bergeron & Stasińska (1986) reported some of the first identifications of intervening galaxies responsible for intermediate redshift absorption-line systems in quasar spectra. Subsequently, Lanzetta et al. (1995) found that all the luminous galaxies in their sample exhibit extended gaseous envelopes (not yet called CGM) out to at least ∼160 kpc.

Most of the early studies either searched for galaxies associated with known absorbers or they conducted blind surveys that started with no previously known information about either the absorbers or the galaxies at the outset of the survey. However, Bowen et al. (1991, 1995, 2002) used an inverted approach in which they searched for specific absorption features (e.g., Ca ii, Mg ii, or H i) affiliated with galaxies with redshifts and properties that were known prior to the absorption survey. The advantage offered by this approach is that the CGM of galaxies with specific properties of interest could be accumulated more efficiently than in blind or absorption-selected surveys. Unfortunately, while many QSOs can be found behind low-z galaxies at interesting impact parameters, the first generation of Hubble Space Telescope (HST) spectrographs could not go deep enough to access most of those QSOs with a practical allocation of telescope time.

In 2009, the installation of the Cosmic Origins Spectrograph (COS) on HST, 20−30 times more sensitive than its UV spectrograph predecessors, allowed for targeted CGM studies of statistical samples of galaxies nearby in projection to background QSOs. The studies carried out with HST/COS over the last decade have revealed complex connections between the bulk properties of galaxies and their CGM. For example, the presence of O vi in the CGM, 10−150 kpc from the star-forming (SF) host galaxy's disk, is somehow linked to the present-day SF properties of the disk (Tumlinson et al. 2011). The H i content of the CGM scales with that of the host galaxies' H i disks, perhaps implying that the disk is being fed by accretion from the IGM and CGM (Borthakur et al. 2015). In addition, large quiescent galaxies that have long since ceased star formation were observed to contain rich reservoirs of cool (∼104 K) gas (e.g., Thom et al. 2012; Chen et al. 2018), which should provide fuel for new star formation.

Thus far, the empirical picture of the CGM remains somewhat piecemeal, assembled from a variety of targeted observations of small-moderate sized samples carried out by different teams. Generally, various surveys of the CGM at low-z have revealed a multiphase medium extending as far as 0.5–1 Rvir (R200m), which exhibits absorption from H i and a range of metal ions with ionization potential energies ranging from 10–239 eV (e.g., Lanzetta et al. 1995; Muzahid et al. 2011; Tripp et al. 2011; Stocke et al. 2013; Tumlinson et al. 2013; Werk et al. 2013; Burchett et al. 2019). Absorption signatures of low-ionization state metals like Mg ii and Si ii are rarely detected greater than ∼100 kpc away from their host galaxies (Bordoloi et al. 2011; Nielsen et al. 2013; Werk et al. 2013; Bordoloi et al. 2014). By comparison, highly ionized metal lines such as O vi are detected out to ∼200 kpc (Tumlinson et al. 2011; Johnson et al. 2015, 2017; Prochaska et al. 2019) and in M31, out to ∼500 kpc (Lehner et al. 2020). This progress at low-z has been mirrored at z ∼ 2−3 with the Keck Baryonic Structure Survey (KBSS; Rudie et al. 2012), which found typical scales of the CGM to be ∼300 kpc for L* galaxies.

Observational progress enabled by HST/COS has presented several challenges to our current models of galaxy evolution. Photoionization modeling of the 104 K cool CGM gas around L* galaxies has revealed it to be an order of magnitude under-pressurized with respect to the envisioned 106 K virialized, ambient hot halo (e.g., Werk et al. 2014; Prochaska et al. 2017; Haislmaier et al. 2021), and thus is not likely in thermal or pressure equilibrium as expected from simulations or simple theoretical arguments. Complex, nonequilibrium physical processes may include precipitation (e.g., Voit 2019), shocks (e.g., McQuinn & Werk 2018), cosmic rays (e.g., Ji et al. 2020), kiloparsec-scale fountain cycles (e.g., Fraternali & Binney 2008; Kim & Ostriker 2018), multifilament gas flows fueling star formation (Martin et al. 2019), galactic winds (Bordoloi et al. 2011; Burchett et al. 2021; Huang et al. 2020), and turbulent mixing layers (Fielding et al. 2020). Furthermore, simulations have generally underpredicted observed column densities of various ions, of both high- and low-ionization potential, compared to observations of galactic halos (e.g., Stinson et al. 2012; Hummels et al. 2013; Oppenheimer et al. 2016; Liang et al. 2016; Fielding et al. 2017). Newer simulations (e.g., Oppenheimer et al. 2018; Lehner et al. 2020; Nelson et al. 2020) have more faithfully reproduced observations.

Recent simulations have shown that the CGM content, thermal structure, and kinematic properties are highly dependent on the mass and spatial resolution of simulations (e.g., van de Voort et al. 2019). High-resolution simulations that focus explicitly on the CGM, with resolved spatial scales of ∼100 pc and mass scales of ∼103 M out to 250 kpc, have helped resolve the discrepancies with the observed column densities (Peeples et al. 2019; Hummels et al. 2019), more naturally producing cool, denser clumps. However, it remains important to characterize the full extent of the CGM, as simulations still struggle to recreate all key aspects of the extended CGM.

For the above reasons, it has been challenging to put quantitative, physically motivated bounds on the radial extent of the CGM. Generally, statistical QSO galaxy samples are limited to single sightlines per galaxy due to the rarity of QSOs (but see Chen et al. 2014; Bowen et al. 2016) and thus assembling large enough samples to place constraints on the extent of the CGM is challenging work. A comprehensive z < 1 CGM survey requires both space- and ground-based spectroscopy. The former are required to measure key far-UV (FUV) transitions such as Lyα, Lyβ and metal species (e.g., C ii, Si ii, C iv, O vi). At low redshift, only Ca ii, Na i, Fe ii, and Mg i and Mg ii can be observed from the ground. However, these species are confused by ionization and dust depletion and are hard to interpret by themselves. On the other hand, large samples of galaxies are accessible to ground-based spectrographs to measure precise (zerr < 10−4) redshifts of potentially associated galaxies.

To fill this need for a comprehensive, uniform CGM study, we have completed a deep spectroscopic survey of >1000 galaxies around z > 0.5 QSOs, all within a few arcminutes of the background QSO sightline. The galaxies and absorption systems are blindly selected such that no preference to galaxy type or absorber optical depth is explicitly imposed. Our survey, COS and Gemini Mapping the Circumgalactic Medium (CGMCGM ≡ CGM2), was designed to address many puzzles related to the CGM, such as: (i) What is the physical state of the gaseous halo when galaxies quench their star formation? (ii) What effects do environment and merger history have on the physical state and content gaseous halo? (iii) How do galaxy properties relate to the metal content and kinematic structure of their gaseous halos and vice versa? (iv) What is the physical extent of the CGM? In this first presentation of the CGM2 survey, we address the last question and in particular, how its spatial extent may depend upon halo mass and galaxy type. Ultimately, this study will inform how galaxy evolution and the large-scale structure of the universe are connected.

CGM2 includes a significant investment of effort from a team of University of Washington Undergraduate Astronomy majors led by CGM2 PI: Jessica Werk, the Student Quasar Absorption Diagnosticians (aka the Werk SQuAD (Student Quasar Absorption Diagnosticians) whose work is described in detail in the relevant sections. The outline of this paper is as follows. Section 2 describes the observations and data reduction for the CGM2 spectroscopic galaxy survey. Section 3 describes an analysis of the galaxies' spectroscopic and photometric properties. In Section 4, we discuss our initial empirical results from associating galaxies with absorbers. We then derive a characteristic physical scale for the CGM in Section 5 by examining the H i galaxy clustering. Section 6 compares our results to previous surveys of similar design and hydrodynamical cosmological simulations. We summarize and conclude this work in Section 7. Throughout this analysis we adopt the Planck15 (Planck Collaboration et al. 2016) cosmology as encoded in the ASTROPY package (Astropy Collaboration et al. 2013; Price-Whelan et al. 2018). All distances are in physical space (not co-moving) unless otherwise noted.

2. Observations and Data Reduction

2.1. QSO Sample Selection

The CGM2 survey is built upon the COS-Halos (GO11598, GO13033; Tumlinson et al. 2013) and COS-Dwarfs (GO12248; Bordoloi et al. 2014) surveys, which used 263 orbits of COS time to observe ∼80 QSOs with z = 0.2−1.0. These surveys were designed to probe the halo(s) of one or two foreground galaxies well inside Rvir using suitable QSO galaxy pairs. The COS-Halos QSO catalog was selected from the Sloan Digital Sky Survey (SDSS) DR5 quasar catalog for QSOs that are UV bright (Galaxy Evolution Explorer FUV ≲18) and lie at z ≲ 1. The target QSOs were further selected to avoid Mg II absorbers at z > 0.4 to avoid losing a large range of the FUV QSO spectrum to Lyman Limit Systems. While this criterion did not affect the original surveys, which typically targeted specific galaxies at z < 0.4, it does affect the CGM2 sample because it selects against high-column density systems at z ≳ 0.4.

COS-Halos galaxies were selected via photometric redshifts to target stellar masses of M ≃ 1010−11 M at 0.1 < z < 0.3 with projected separations from a nearby QSO of ρ < 150 kpc. The precise redshifts (and other galaxy properties) were subsequently constrained with follow-up spectroscopy (Werk et al. 2012). The COS-Dwarfs survey was designed to probe the CGM of galaxies with $\mathrm{log}{M}_{\star }/{M}_{\odot }\lesssim {10}^{10}$ at z ≃ 0.01–0.05 with masses and redshifts derived via SDSS photometry and spectroscopy. Both surveys yield galaxies that are the closest spectroscopically identified galaxy to each QSO sightline and were designed to avoid biases with respect to galaxy neighbors, large-scale environment, or status as a satellite of a larger halo (in cases where neighbors were known).

For the CGM2 multislit spectroscopic follow-up, we targeted a subset of "high-value" QSO fields with our Gemini program, either those with zQSO > 0.6 or those with HST imaging available, which allowed us to obtain the morphology of any absorption-hosting galaxies with z < 0.5. To date, we have obtained multislit galaxy spectra in 22 QSO fields as part of CGM2. The fields we surveyed, and some basic properties of the background QSOs, are tabulated in Table 1.

Table 1. QSO Fields in CGM2

QSOQSO LongR.A.Decl. zQSO mg
J0226+0015J022614.46+001529.736.560280.258270.61517.15
J0809+4619J080908.13+461925.6122.283946.32380.65716.54
J0843+4117J084349.49+411741.6130.956241.29490.9917.31
J0914+2823J091440.38+282330.6138.6682928.391840.73517.79
J0935+0204J093518.19+020415.5143.825812.070980.64916.99
J0943+0531J094331.61+053131.4145.881735.525410.56417.16
J1001+5944J100102.55+594414.3150.260659.73730.74616.08
J1016+4706J101622.60+470643.3154.0941847.112040.82217.12
J1022+0132J102218.99+013218.8155.579131.538560.78916.75
J1059+1441J105945.23+144142.9164.938514.69530.63116.93
J1059+2517J105958.82+251708.8164.995125.28580.66217.39
J1112+3539J111239.11+353928.2168.1629635.657840.63617.73
J1133+0327J113327.78+032719.1173.365783.455330.52517.54
J1134+2555J113457.62+255527.9173.740125.92440.7116.8
J1233-0031J123304.05-003134.1188.26688−0.526160.47117.76
J1241+5721J124154.02+572107.3190.475157.352050.58317.58
J1342-0053J134251.60-005345.3205.71503−0.895920.32616.92
J1419+4207J141910.20+420746.9214.7925142.12970.87317.04
J1437+5045J143726.14+504555.8219.3589250.765510.78317.57
J1553+3548J155304.92+354828.6238.2705235.807950.72216.46
J1555+3628J155504.39+362848.0238.7683336.480010.71417.76
J2345-0059J234500.43-005936.0356.2518−0.993350.78916.8

Note. The full sample of QSOs included in the CGM2 Survey: (1) QSO short name; (2) QSO long name, R.A. in hms and decl. in dms, all J2000; (3 and 4) R.A., decl. in decimal degrees; (5) QSO redshift; (6) SDSS g-band magnitude of QSO.

Download table as:  ASCIITypeset image

2.1.1. COS Data Reduction

The COS spectra were taken using both the G130M and G160M gratings. The balance between the time allocated to G130M and G160M was designed to achieve a signal-to-noise ratio (S/N) of ≃ 10–12 per resolution element (FWHM ≃ 16–18 km s−1) or better over 1150–1800 Å. The reduction of the COS spectra is explained in detail in Tumlinson et al. (2013) and Bordoloi et al. (2014) and follows the same method used by Tripp et al. (2011), Meiring et al. (2011), Tumlinson et al. (2011), and Thom et al. (2012) but we provide a brief description here. The COS data were obtained from MAST 11 and processed via CALCOS v2.12 (Hirschauer 2021) with standard parameters and reference files. First, exposures taken at the same grating and CENWAVE were combined. These co-added spectra were then co-added with exposures in the same grating at different CENWAVEs. This was followed by a combination of the two grating spectra to produce a single one-dimensional (1D) trace from 1150–1800 Å. At each co-add, exposures were combined by aligning common Milky Way (MW) interstellar absorption lines. Features related to the design of COS show up in the raw data and must be removed. The photocathode grid wires cast shadows on the detector as well as other fixed-pattern noise features and were removed with a flat field and moreover are mitigated by the use of FP SPLITS. Flat-field reference files, prepared and communicated to us by D. Massa at STScI and filtered for high-frequency noise by E. Jenkins, were used to correct these fixed-pattern features. These flats do not however, correct for gain-sag depressions in the spectra created by prolonged exposure to bright geocoronal emission lines (Sahnow et al. 2011). The affected regions are flagged by the CALCOS pipeline and are rejected in our co-addition process. The 1D spectra are binned to Nyquist sampling resulting in a 1D, flat-corrected spectra with two bins per COS resolution element (FWHM ≃ 18 km s−1) and an S/N of ∼8–10. Errors arising from counting statistics (Poisson) are propagated through each step in the calibration.

2.2. Spectroscopic Galaxy Survey

2.2.1. Survey Design

All galaxy spectra were obtained at the Gemini North and South Observatories using the GMOS instrument (Hook et al. 2004; Gimeno et al. 2016) in multi-object spectrograph mode. The GMOS observations enabled us to obtain accurate, precise redshifts for low-redshift (0.1 ≲ z ≲ 1) galaxies as faint as g ∼ 24.5 for SF galaxies and g ∼ 24 for early-type galaxies. Over three observing semesters from 2014–2015 at both GMOS-N and GMOS-S, we surveyed galaxies in the high-quality QSO fields listed in Table 1.

GMOS uses slit masks that must be cut for each field based on highly accurate astrometry from either a "pre-image" taken in GMOS imaging mode or derived from an accurate catalog. For most of our fields, we used Gemini-GMOS pre-imaging to create the masks. However, for fields observed in 2014A, specifically J0809 + 4619, J0943 + 0531, J1133 + 0327, J1134 + 2555, J1241 + 5721, and J1555 + 3628, we used a combination of HST Wide Field Camera 3 (WFC3) imaging for angular separations less than 1' from the QSO along with SDSS imaging for galaxies further from the sightline to construct a slit mask target catalog. We refer to the observational program that observed these fields as the "Rollup" program and the observations are differentiated in Table 2 with a "Q" in the Mask Name column. Subsequent observations used Gemini-GMOS g- and i-band pre-imaging to create the slit mask target lists and are referred to as "Large Program" (LP) observations. From the pre-imaging, we derived astrometry as well as magnitudes from SExtractor (Bertin & Arnouts 1996) using Gemini-calibrated photometric zero-points and color corrections. We then did an optimization to get as many g < 24 galaxies into each slit mask as possible (see below).

Table 2. CGM2 Multislit Mask Observations

QSOMask NameDateInstrumentN(slits)N(z)
J0809 + 4619GN2014AQ001-012014-11-19GMOS-N3124
J0809 + 4619GN2014AQ001-022014-11-20GMOS-N2721
J0809 + 4619GN2014AQ001-032014-11-21GMOS-N2921
J0809 + 4619GN2014AQ001-042014-11-23GMOS-N1715
J1134 + 2555GN2014AQ001-052014-06-21GMOS-N2926
J1134 + 2555GN2014AQ001-062014-06-22GMOS-N2926
J1134 + 2555GN2014AQ001-072014-06-25GMOS-N2420
J1134 + 2555GN2014AQ001-082014-12-21GMOS-N1613
J1241 + 5721GN2014AQ001-092014-06-25GMOS-N2718
J1241 + 5721GN2014AQ001-102014-06-30GMOS-N2423
J1555 + 3628GN2014AQ001-112014-06-24GMOS-N3329
J1555 + 3628GN2014AQ001-122014-06-24GMOS-N2624
J0914 + 2823GN2014BLP003-012015-01-17GMOS-N4529
J0914 + 2823GN2014BLP003-022015-01-17GMOS-N4427
J0914 + 2823GN2014BLP003-032015-01-17GMOS-N4526
J0843 + 4117GN2014BLP003-042015-01-16GMOS-N3928
J0843 + 4117GN2014BLP003-052015-01-16GMOS-N3931
J0843 + 4117GN2014BLP003-062015-01-16GMOS-N3931
J1059 + 1441GN2014BLP003-082015-01-16GMOS-N4538
J1001 + 5944GN2014BLP003-102015-01-18GMOS-N4536
J1001 + 5944GN2014BLP003-112015-01-18GMOS-N4529
J1001 + 5944GN2014BLP003-122015-01-18GMOS-N4133
J1016 + 4706GN2014BLP003-132015-01-19GMOS-N4234
J1016 + 4706GN2014BLP003-142015-01-19GMOS-N4030
J1016 + 4706GN2014BLP003-152015-01-19GMOS-N4233
J1112 + 3539GN2014BLP003-192015-01-17GMOS-N4933
J1112 + 3539GN2014BLP003-202015-01-17GMOS-N4631
J1112 + 3539GN2014BLP003-212015-01-17GMOS-N4337
J1059 + 2517GN2015ALP003-012015-05-18GMOS-N4731
J1059 + 2517GN2015ALP003-022015-05-20GMOS-N4329
J1059 + 2517GN2015ALP003-032015-06-08GMOS-N4225
J1419 + 4207GN2015ALP003-042015-05-17GMOS-N4527
J1419 + 4207GN2015ALP003-052015-05-17GMOS-N4633
J1419 + 4207GN2015ALP003-062015-05-17GMOS-N4332
J1419 + 4207GN2015ALP003-072015-05-17GMOS-N4125
J1437 + 5045GN2015ALP003-082015-05-24GMOS-N4033
J1437 + 5045GN2015ALP003-092015-05-22GMOS-N4331
J1437 + 5045GN2015ALP003-102015-06-19GMOS-N4230
J1553 + 3548GN2015ALP003-112015-05-17GMOS-N4532
J1553 + 3548GN2015ALP003-122015-05-18GMOS-N4535
J1553 + 3548GN2015ALP003-132015-05-18GMOS-N4128
J0943 + 0531GS2014AQ002-012015-04-19GMOS-S1913
J0943 + 0531GS2014AQ002-022015-04-18GMOS-S73
J0943 + 0531GS2014AQ002-032015-02-18GMOS-S2822
J0943 + 0531GS2014AQ002-042015-02-19GMOS-S189
J1133 + 0327GS2014AQ002-052015-03-15GMOS-S3227
J1133 + 0327GS2014AQ002-062015-04-25GMOS-S2418
J1133 + 0327GS2014AQ002-072015-04-25GMOS-S2621
J1133 + 0327GS2014AQ002-082015-05-14GMOS-S2217
J1233-0031GS2014AQ002-092015-04-19GMOS-S3019
J1233-0031GS2014AQ002-102015-05-14GMOS-S2924
J1342-0053GS2014AQ002-112015-01-04GMOS-S3118
J1342-0053GS2014AQ002-122015-06-06GMOS-S2922
J0226 + 0015GS2014BLP004-012014-10-26GMOS-S4740
J0226 + 0015GS2014BLP004-022014-10-26GMOS-S4432
J0226 + 0015GS2014BLP004-032014-10-26GMOS-S4329
J2345-0059GS2014BLP004-042015-06-19GMOS-S4125
J0935 + 0204GS2014BLP004-072015-02-13GMOS-S4526
J0935 + 0204GS2014BLP004-082015-02-13GMOS-S3220
J0935 + 0204GS2014BLP004-092015-02-17GMOS-S3825
J1022 + 0132GS2015ALP004-012015-02-17GMOS-S4026
J1022 + 0132GS2015ALP004-022015-02-17GMOS-S3830
J1022 + 0132GS2015ALP004-032015-02-17GMOS-S3627
J1022 + 0132GS2015ALP004-042015-02-17GMOS-S3423

Note. Multislit observation with GMOS taken as part of the CGM2 Survey: (1) QSO short name; (2) unique program mask name with project ID; (3) date of mask observation; (4) instrument, either GMOS-N or GMOS-S; (5) number of slits placed on each mask; (6) number of slits that yielded a reliable redshift.

Download table as:  ASCIITypeset image

In our GMOS programs, we used the R400 grating with 1'' slits to balance wavelength coverage with spectral resolution. The fields from the Rollup programs listed above were dithered across two exposures per mask at central wavelengths of λ λ6000 and 6900 Å in order to avoid losing information to the chip gaps. The LP fields have spectra consisting of three exposures per mask at three separate central wavelengths, λ λ6900, 7000, and 7100 Å. Despite these differences, both programs yielded similar depths with total exposure times of 1 hr per mask. Our final spectra achieved an S/N of at least a few per pixel at λobs = 4800 Å, the approximate wavelength of the 4000 Å break at z ∼ 0.2.

Because precise redshift determination from our spectra was the primary goal, we chose a grating to cover 4400–9000 Å in order to detect emission from z ≲ 0.5 [O ii], Hβ (for both redshifts and star formation metrics) and Hα as well as absorption from Ca H + K and Na D in passively evolving galaxies. The R400 grating provided a spectral resolving power of R = λλ ≃ 950, which corresponds to a velocity resolution of 300 km s−1 per resolution element. Ultimately, this setup allowed for determining the velocity centroid to ∼50 km s−1 for precise redshift determination. We note that every spectrum does not exhibit uniform wavelength coverage given the design of GMOS. Galaxies that are not placed near the center of the field have redder or bluer coverage than the range quoted above.

The slit masks were designed such that slits were placed on ∼85% of objects within 1' of the QSO, with additional slits placed to fill the 5.5'' × 5.5'' area of the detector. Slit placement constraints meant that ∼40–50 slits could be placed on one mask. We aimed to obtain spectra of ∼80–120 unique galaxies per QSO field. Some fields in the earlier Rollup programs (those observed in 2014A) had as few as two and as many as four masks per field. In the later observing campaigns, all fields (except J1059 + 1441, which was only observed with one mask due to problems in the mask making) have three masks per field. This is shown in Table 2. An example of the targeting strategy can be seen in Figure 1 for the field J0843 + 4117. The red circles highlight the galaxies with z < zQSO for which we obtained reliable redshifts, while the white circles were targeted for spectra but for which we did not recover a reliable redshift.

Figure 1.

Figure 1. An example of the survey design and targeting strategy of CGM2 showing the slit placements centered around QSO J0843 + 4117 (blue diamond) overlaid on the g-band pre-image from the Gemini-GMOS detector. The circles denote where slits were placed on the slit masks. White circles indicate galaxies with slits whose final spectra did not yield a reliable redshift, while the red circles indicate galaxies that produced reliable redshifts. Large blue dashed circles show the one and two arcminute radii from the QSO. The purple circle just off the left of the detector is the COS-Dwarfs (Bordoloi et al. 2014) galaxy target for this field whose spectra was obtained by SDSS with an impact parameter of ρ = 113 kpc at z = 0.0300.

Standard image High-resolution image

All masks were observed between 2014 June and 2015 June on either GMOS-N or GMOS-S. Table 2 provides details of each multislit mask observed. Columns 5 and 6 list the number of slits placed and the number of slits that yielded reliable redshifts, respectively. A summary of the completeness of our galaxy spectroscopic survey is shown Table 3 for each field as a function of angular separation from the central QSO and limiting magnitude. The completeness is given as the fraction of reliable, redshift-yielding spectra, with "ZQ" > 2 of galaxies (CLASSSTAR < 0.5) by SExtractor within some angular separation to the QSO. The fields J2345-0059 and J1059 + 1441 have only one mask per field and thus have a lower completeness relative to the rest of the sample. In general, we are nearly 100% complete out to 2' from the QSO for galaxies with an i-band magnitude of <22. The completeness drops to ∼50% for a limiting magnitude of i < 23. 68% of the galaxies used in this analysis lie within 2' of the central quasar. Details of the survey design, completeness and their impact on scientific results will be discussed in the full presentation of the CGM2 survey, currently in preparation (Werk et al. 2021).

Table 3. Completeness

QSO ${C}_{\mathrm{22,1}^{\prime} }$ ${C}_{\mathrm{22,2}^{\prime} }$ C22,FOV ${C}_{\mathrm{23,1}^{\prime} }$ ${C}_{\mathrm{23,2}^{\prime} }$ C23,FOV
J0226 + 00150.570.790.470.340.410.23
J0809 + 46191.001.001.000.770.700.37
J0843 + 41171.001.000.630.700.510.31
J0914 + 28231.001.001.000.800.630.40
J0935 + 02040.930.860.420.330.400.20
J0943 + 05311.001.000.540.610.420.20
J1001 + 59441.001.000.650.870.530.32
J1016 + 47061.001.000.550.810.620.31
J1022 + 01321.000.970.790.590.510.35
J1059 + 14410.600.470.320.240.240.15
J1059 + 25171.001.000.680.720.690.36
J1112 + 35391.001.000.490.540.430.22
J1133 + 03271.000.950.480.680.460.23
J1134 + 25551.000.990.660.700.570.33
J1233-00311.000.440.250.430.190.11
J1241 + 57211.000.790.350.900.360.14
J1342-00531.000.580.290.540.300.14
J1419 + 42071.001.001.001.001.000.76
J1437 + 50451.001.000.661.000.630.33
J1553 + 35481.001.000.821.000.740.40
J1555 + 36281.001.000.620.670.530.24
J2345-00590.280.400.160.160.190.09
Median1.001.000.570.690.510.27

Note. Completeness of the CGM2 galaxy catalog along with the median in each radial and magnitude limited bin. The completeness is defined as the fraction of reliable spectra, with "ZQ" > 2, to objects designated as a galaxy (CLASSSTAR < 0.5) by SExtractor (1) QSO field; (2) completeness within 1' of the QSO at a limiting i-band magnitude of 22; (3) completeness within 2' of the QSO at a limiting i-band magnitude of 22; (4) completeness within the GMOS field of view (FOV) of the QSO at a limiting i-band magnitude of 22; (5) completeness within 1' of the QSO at a limiting i-band magnitude of 23; (6) completeness within 2' of the QSO at a limiting i-band magnitude of 23; (7) completeness within the GMOS FOV of the QSO at a limiting i-band magnitude of 23. The fields J2345-0059 and J1059 + 1441 have only one mask per field and thus have a low completeness relative to the rest of the sample.

Download table as:  ASCIITypeset image

2.2.2. Gemini Data Reduction

The spectra were reduced using a combination of Gemini's PyRAF package and PypeIt 12 (Prochaska et al. 2020). The initial reduction closely follows the GMOS Data Reduction Cookbook. 13 After obtaining the raw spectra from Gemini Observatory Archive, the data were organized according to field and mask. Biases were created by downloading all bias exposures of the same 2 × 2 binning taken within ∼1 month from the observations if they were from the same detector. Flat-field and NeAr wavelength calibration exposures were taken along with each mask and were prepared using the standard methods in the cookbook. This procedure performs the bias subtraction, performs automatic slit edge finding, cuts the slits out of the image and isolates them, flat fields each slit with the flats taken at the same central wavelength, and then performs a wavelength transformation for each slit. This results in multiple 2D spectra. In general, the typical rms of the wavelength solution is 0.1 pixel, which corresponds to 0.04 Å given the dispersion of GMOS of 0.4 Å per pixel. We then turned to PypeIt to perform sky subtraction, spectral continuum tracing, and a combination of the wavelength dithered. The spectra were then flux calibrated using a sensitivity function based on a selection of spectrophotometric flux calibration standards, BD28 + 4211, EG21, EG131, and Wolf1346, choosing whichever was closest on the sky. The final 1D spectra were co-added in vacuum wavelength space weighted by the inverse variance of the individual exposures.

3. Analysis

3.1. Galaxy Redshift Determination

The process of determining galaxy redshifts was done in two stages. First, each 1D extracted spectrum was passed through an automated redshift fitting code, REDROCK 14 (v0.14). REDROCK was developed by the DESI team and uses a template fitting algorithm to generate a set of ranked best-fitting models, identifying the object's type (QSO, star, galaxy) and corresponding redshift.

However, much of our galaxy sample has a moderate to poor S/N, and the automated REDROCK redshift guesses can thus fail catastrophically by fixating on spurious features. We constructed a method of manually vetting the REDROCK redshifts by eye using a custom GUI, VETRR. 15 Each redshift was visually assessed and assigned a quality flag, ZQ , by one of the authors, and by at least two members of the Werk SQuAD. A ZQ of either 0, 1, 3, or 4 were assigned to each spectrum, where a ZQ = 0 indicates the spectrum has a S/N that is too low to be useful. ZQ = 1 denotes a good spectrum but we are not confident in identifying a redshift (i.e., has no clear absorption or emission lines). ZQ = 3 are spectra with one absorption or emission line that was confidently identified. These are usually strong [OII], Hα emission or weak CaII absorption. A solitary Hα emission only falls into this category if Hβ is off the detector and the strong emission line is too narrow to be [OII], which is a marginally unresolved doublet in these data. ZQ = 4 represents a spectrum for which we are most confident in the redshift, with at least two absorption or emission lines identified. ZQ = 2 was not used. The fully vetted galaxy survey database that we use for our analysis contains only spectra with ZQ > 2. The statistical uncertainly of our redshift identification was determined by computing the standard deviation of redshift identifications by at least three humans for a sample of 50 galaxies and is typically in the range of σz ∼ 50–100 km s−1 (z ∼ 0.00016–0.00030). The redshift distribution of the vetted galaxy database containing 971 galaxies with redshifts less than that of the field quasar is shown in Figure 2.

Figure 2.

Figure 2. The redshift distribution of the CGM2 galaxy catalog for galaxies with z < zQSO. The redshift reliability is encoded in yellow and red. Red represents our most reliable redshift quality flag of "4" with spectra displaying more than one strong absorption or emission line. A quality flag of "3" was reserved for spectra with only one strong emission line and thus a less reliable redshift designation. Approximately 85% (820 out of 971) of our spectra were given the highest reliability flag. The typical statistical uncertainly of our redshifts is σz ∼ 50–100 km s−1 (z ≃ 0.00016–0.00030).

Standard image High-resolution image

In addition to manually vetting the galaxy redshifts, we also visually identified a galaxy spectral type during the process of examining each 1D spectrum. If strong emission lines and weak continuum were present, we classified the galaxy as "SF." If only absorption lines along a detected continuum were found, we classified the galaxy as "quiescent" or "E." If both emission and absorption lines were identified, we classified the galaxy as a combination of the two, "SF+E." In several cases, stars were mistakenly targeted in our slit masks, and they are identified as star. Stars are never included as part of our vetted galaxy database. Examples of galaxy spectra of each type are shown in the three panels of Figure 3. These spectra contain some poorly subtracted sky lines, and telluric absorption lines at ∼7600 Å, which are the sorts of spurious spectral features that can cause REDROCK to fail. The top panel displays an emission-line SF galaxy that shows emission from several highlighted strong emission lines, the middle panel shows a combination-type, SF+E spectrum with both [OII] emission and strong CaII absorption against a bright stellar continuum, and the bottom panel shows an example of an absorption-only spectrum, type E.

Figure 3.

Figure 3. Examples of typical Gemini-GMOS spectra with a quality flag of ZQ = 4, along with the error shown in blue. These spectra highlight our method of visual galaxy spectral typing. The cutout insert in each panel shows an example of the key spectral features used in redshift determination. The top spectrum is classified as an SF galaxy, the middle panel shows a galaxy with both emission lines and absorption lines, and the bottom spectrum is an example of galaxy with an older stellar population with strong Ca H + K absorption.

Standard image High-resolution image

3.2. Galaxy Photometry and Spectral Energy Distribution Fitting

In addition to the galaxy spectroscopic catalog, we constructed a photometric galaxy catalog to derive stellar masses and galaxy star formation rates (SFRs). Our spectra are generally insufficient in signal and flux calibration to analyze them via spectral fitting codes (e.g., Cappellari 2017). To estimate stellar masses, we used Code Investigating GALaxy Emission (CIGALE; Noll et al. 2009; Boquien et al. 2019) to fit the spectral energy distribution (SED) and retrieve stellar mass and SFRs. We note that while there are a myriad other SED fitting codes available (as well as direct color–mass relations) we chose CIGALE for its options for stellar population models, dust models, etc. across large swaths of parameter space. We also chose it to compare directly to complementary surveys such as COS Absorption Survey of Baryon Harbors (Burchett et al. 2019; Prochaska et al. 2019).

3.2.1. Galaxy Photometry

One major challenge in constructing a photometric catalog for the CGM2 survey was that the spectroscopic target catalog is generally fainter than available public all-sky surveys and thus the photometric coverage is not uniform in all fields. We chose to gather photometric data for every spectroscopic target in the galaxy catalog, totaling 2310 unique targets.

We created the photometric catalog by crossmatching our target to the Dark Energy Spectroscopic Instrument (DESI) Legacy Imaging Surveys DR8 (Dey et al. 2019). The imaging survey for DESI is composed of data from three telescopes covering ∼14,000 deg2 over − 18° < δ < + 84° (∣b∣ > 18°). These three programs include the Beijing-Arizona Sky Survey (BASS), the DECam Legacy Survey (DECaLS), and the Mayall z-band Legacy Survey (MzLS), which provide g-, r-, and z-band photometry to ∼23.3 mag.

The photometry is corrected for Galactic extinction. This is the most extensive publicly available optical survey and provided the bulk of the photometry. In cases of overlap between the north and south catalogs, we chose the southern DECaLS observations. We limited matches to objects with an S/N > 2 and chose the closest match within 1.3'' of our targets in order to limit mismatches between our faint sources and the Legacy Survey catalogs. This gave us 1985 targets with at least one band of photometry. In addition to the g, r, and z bands, DR8 provides crossmatched Wide-field Infrared Survey Explorer (Cutri et al. 2013) observations in 3.4, 4.6, 12, and 22 μm.

In order to cover a larger wavelength range to better estimate the SED, we also crossmatched our catalog with the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS) DR 2 (Chambers et al. 2016) with coverage of the grizy bands, utilizing the MAST crossmatch service, using a 1.3'' threshold. We limited the photometry to those marked as extended objects with good stack photometry and grizy < 23.3, 23.2, 23.1, 22.3, 21.3, giving 393 objects with photometry in at least one band.

In addition we also queried the SDSS DR14 (Abolfathi et al. 2018) survey with ugriz coverage where we restricted matches to less than ugriz < 22.15, 23.13, 22.70, 22.20, 20.71, totaling 331 targets with photometry in at least one band.

To make sure we included photometry for every object in the CGM2 survey that may be too faint or in crowded areas for the public surveys, we included photometry from the GMOS imaging. The target selection and slit mask design was based on Gemini-GMOS i-band and g-band imaging except in the case of the Rollup programs, GN-2014A-Q1 and GS-2014A-Q2, which involved a combination of HST Wide Field Channel-Advanced Camera for Surveys and Gemini imaging. This allowed us to use the Gemini images to get photometry for each field, the details of which are presented in Table 4. Two exposures in each band were combined and processed by Gemini. We obtained g- and i-band magnitudes of the Gemini sources using the SExtractor software. We used pre-calibrated photometric zero-points and color corrections for GMOS-N and GMOS-S for an initial pass on the Gemini photometry. In order to further calibrate the Gemini photometry, we crossmatched our m < 21 sources to the g- and i -band SDSS photometric sources, and then bootstrapped the Gemini photometry below the SDSS magnitude limit within each field. This consisted of applying a constant magnitude offset to the Gemini sources to match the SDSS photometry in the g and i bands.

Table 4. CGM2 Gemini-GMOS Imaging

Project ID g${t}_{\exp }$ (s) i${t}_{\exp }$ (s)
GN-2014A-Q1X150
GS-2014A-Q2X150
GN-2014B-LP-4450200
GS-2014B-LP-3450200
GN-2015A-LP-3450200
GS-2015A-LP-4450200

Note. Imaging observations with GMOS-N and GMOS-S taken as part of the CGM2 Survey: (1) Project ID; (2) g-band exposure time in seconds; (3) i-band exposure time in seconds

Download table as:  ASCIITypeset image

All photometry from Gemini, SDSS, and Pan-STARRS was corrected for Galactic reddening based on the values in Schlafly & Finkbeiner (2011) provided by the NASA/IPAC Extragalactic Database (NED) 16 by querying our targets coordinates via ASTROQUERY. 17 We employed the SVO Filter Profile Service (Rodrigo et al. 2012) to obtain our filter transmission curves for the telescopes used in these surveys as required input to CIGALE.

CIGALE includes many models as options to include in fitting. For stellar populations, we used the Bruzual & Charlot (2003) models, assuming a Chabrier (2003) initial mass function. We chose a grid of metallicities ranging from 0.001–2.5 Z. We used a delayed star formation history model with an exponential burst. The e-folding time of the main stellar population models ranged from 0.1–8 Gyr. We varied the age of the oldest stars in the galaxy from 2–12 Gyr. We included an optional late burst with an e-folding time of 50 Myr and an age of 20 Myr. We varied the burst mass fraction from 0.0 or 0.1 to turn this feature on or off. Nebular emission and reprocessed dust models (Dale et al. 2014) were also included with the default values. The dust models have slopes ranging from 1–2.5 and the nebular models include no active galactic nuclei.

We employed the Calzetti et al. (1994) dust attenuation law, but we also included a "bump" in the UV (see discussion in Prochaska et al. 2019) at 217.5 nm with an FWHM of 35.6 nm. The bump amplitude was set at 1.3 and the power-law slope was −0.13 (Lo Faro et al. 2017). We varied the color excess of the stellar continuum from the young population, E(B V), from 0.12–1.98. Finally, we used a reduction factor of 0.44 to the color excess for the old population compared to the young stars.

3.3. Galaxy Stellar Masses and Derived Properties

After fitting each galaxy's SED, CIGALE then outputs several useful parameters, including stellar mass, SFR, and the rest-frame absolute luminosity in each band. Figures 4 and 5 show the resultant mass distributions with masses spanning M ≈ 106–1011 M and ${\bar{M}}_{\star }={10}^{9.3}{M}_{\odot }$ at $\bar{z}=0.44$. In order to calculate the virial radius of the galaxies, we first calculate the halo mass using the abundance matching method of Moster et al. (2013) with the modifications used in Burchett et al. (2016). We adopt R200m, the radius within which the average mass density is 200 times the mean matter density of the universe, as the virial radius (Rvir) of a galaxy halo.

Figure 4.

Figure 4. Star formation rate (SFR) vs. stellar mass for the CGM2 galaxy sample as estimated by CIGALE. The locus of galaxies tracks a monotonic increase of SFR with stellar mass, known informally as the "star-forming main sequence" (SFMS). The density of galaxies in this space is indicated via shading of the hexagonal bins. The gray shaded region corresponds to the redshift dependent fit of the SFMS from Schreiber et al. (2015) of galaxies spanning z = [0.23, 0.63]. This range represents the 16th and 84th percentiles of the galaxy catalogs redshift distribution.

Standard image High-resolution image

Figure 4 shows the CIGALE-derived SFRs versus stellar masses, which exhibit, as expected, a positive correlation known as the SFMS. We compare our models to the redshift dependent fit from Schreiber et al. (2015) over a range of galaxies spanning z = [0.23, 0.63]. This range represents the 16th and 84th percentiles of the CGM2 galaxy catalogs redshift distribution. Typical uncertainties on photometry-derived stellar masses range from a factor of 3–5 (e.g., Blanton & Roweis 2007; Werk et al. 2012) corresponding to ∼0.5 dex, and errors on CIGALE-derived SFRs are similar.

Figure 5 shows galaxy stellar masses versus galaxy systemic redshift, and differentiates among our three galaxy spectral types visually derived from the GMOS spectra. Above z ∼ 0.5, we are no longer detecting galaxies with M ≲ 108 M. Both SF and SF+E galaxy spectral types are preferentially distributed among lower stellar masses as seen in the marginal distributions. We find that, as expected, there are more E type galaxies at higher galaxy stellar masses than a random distribution would predict. As shown in Figure 6, the CIGALE-derived color–mass diagram of our galaxy sample shows the bimodality of the SF and non-SF galaxies found in large galaxy surveys such as SDSS (e.g., Chang et al. 2015).

Figure 5.

Figure 5. The distribution of galaxy stellar mass as a function of galaxy systemic redshift with marginal distributions on the right and top. The red, green, and blue colors correspond to the galaxy spectral type determined from visual inspection of GMOS spectra. Red circles show absorption line only, or elliptical (E) type galaxies, green circles show galaxies displaying a combination of absorption and emission lines associated with star formation (SF+E), and blue circles show emission-line only, or SF galaxies.

Standard image High-resolution image
Figure 6.

Figure 6. CGM2 sample in a color–mass diagram using the gi color and the galaxy stellar mass, M. Multiband photometry was not available for all of the galaxy targets, only objects with both bands are shown here. The bimodal populations of the SF and passive galaxies are evident. Due to the nature of our survey, we are slightly biased against faint, passive galaxies since retrieving a redshift in the case of absorption lines requires large continuum flux.

Standard image High-resolution image

3.4. Absorber Catalog

To identify absorption features in the QSO spectra, the HST/COS UV spectra were visually inspected by members of the Werk SQuAD in a multistep process that closely follows the procedure described in Tejos et al. (2014) and is designed to leave no QSO absorption feature unidentified. This process includes identifying all of the most ubiquitous metal ions present in QSO spectra at the wavelengths covered by the HST/COS spectra, not just the H i Lyman series. These other metal ions will be explored in future papers on CGM2. To identify the absorption features, the Werk SQuAD used a module from the PYIGM 18 software package, IGMGUESSES . The software allows for a straightforward comparison of multiple transitions from different elements, as multiple lines are displayed with their expected relative intensities given by their atomic parameters simultaneously for a given redshift.

During the line identification process, the absorption lines are assigned a reliability score. "a" signifies a certain feature. For example, the always-present MW interstellar medium lines at z = 0 fall into this reliability category. Other examples include absorption lines observed in two or more transitions of H i, or multiple metal lines that align with hydrogen lines within ± 30 km s−1, and metal doublets or multiplets that show the expected relative strengths as derived from their oscillator strengths and wavelengths (Morton 2003) and similar velocity profiles. A reliability score of "b," or possible, includes single H i lines with no associated metal lines, and metal ions having only one transition within the observed wavelength range. Other cases of assigning "b" values involve messy blends from absorption lines at different redshifts, weak or uncommon metal lines in an otherwise strong absorption system, and velocity offsets >30 km s−1 from other "a" lines. If a line did not fall into either of the previous categories we gave it a reliability score of "c," or unreliable. In our analysis, we did not include any absorbers in the unreliable "c" category. The full catalog of identified absorbers will be presented in detail in future work.

The end result of this line identification process for all 22 QSOs is a catalog of 2914 distinct absorption components, 2071 of which have a reliability rating of certain or possible. An absorption component is defined by an absorption line or lines with a distinct central velocity (or redshift). In practice, individual components offset by <20 km s−1 may not be separable in the HST/COS spectra. An absorber, or absorption system, is a set of absorption-line components within ∣δ v∣ ≈ 1000 km s−1. For example, Lyα and Lyβ are distinct lines but would be part of the same H i component if aligned within the COS resolution velocity (redshift). This component may be grouped with other H i or metal ion components to form an absorption system.

Different absorbers lie at distinct redshifts and may physically correspond to clouds or layers of gas in the CGM of galaxies at their respective redshifts. Absorbers may also be clouds or filaments of gas in the IGM referred to as the Lyα forest, not directly associated with a nearby galaxy. The term absorber is often used interchangeably with the term system. However, absorber distinctly does not imply an association with a galaxy and is a more empirical term. Absorption systems may have multiple components at different velocities within their assigned redshift ranges. The vast majority of our H i absorption systems cover >3 Lyman series lines, many with multiple absorption components.

In order to retrieve more physical quantities such as column density, NHI, we used the apparent optical depth method (AODM) from Savage & Sembach (1991) as encoded in the linetools 19 package. Because we have column density measurements from several Lyman Series lines in most cases, the mean 1σ uncertainties on column density is 0.17 dex for unsaturated H i lines to column densities ≃1017.5 cm−2.

4. Connecting Galaxies and Absorbers: The CGM2 H i Survey

4.1. Defining CGM H i Absorption Systems

With our separately completed galaxy and absorber databases, we can now begin to connect the two as a study of the CGM. In order to construct CGM systems, we first group the individual absorption component identifications into absorption systems, or absorbers. The grouping of absorption components was done using a clustering algorithm from SKLEARN (Pedregosa et al. 2011), MEANSHIFT. This algorithm groups individual absorption components together within a window function of 1000 km s−1. The resultant absorber catalog consists of groups of components we call absorption systems.

We then crossmatched the galaxies and absorption systems if the relative velocity difference of the galaxy and the velocity centroid of at least one of the components of an absorption system exhibits ∣δ v∣ < 500 km s−1. We chose this velocity threshold to include absorption systems that could be at or above the escape velocity of the most massive galaxies in our sample. If no absorption system is found at the redshift of a galaxy, we measure the 2σ upper limit within δ v = ± 30 km s−1 of the galaxies redshift using the normalized error of the quasar flux. If there was an interloping line at this redshift, we measured the AODM column density as a conservative upper limit. Our results are not sensitive to the choice in the velocity window. We find 181 systems consisting of 416 distinct components that exhibit H i column densities above our 2σ detection threshold, giving an average of 2.3 detected components per galaxy (absorption system) within our stated velocity window. We find 2.4 average components per galaxy for a smaller window of ∣δ v∣ < 250 km s−1, while for a larger velocity window, we find a small decrease to 2.2 detected components per galaxy for a window of ∣δ v∣ < 1000 km s−1. The average column density of detected, but not saturated, components remains at 1014.9 cm−2 in each case.

Thus, we are left with a galaxy-centric CGM survey that consists of absorption-line column density measurements (or limits) around 971 galaxies with reliable redshifts that lie <1000 km s−2 from the quasar. We denote these galaxy-absorber pairs as CGM systems. Figure 7 shows the AODM H i column densities for all 971 CGM systems as a function of galaxy systemic redshift. Saturated lines provide only lower limits to the H i column density and are shown as upward-facing triangles. Non-detections are shown as 2σ upper limits, and as downward-facing triangles. There is an obvious "knee" of amplitude 0.5 dex in the lowest H i column densities at a redshift of z ∼ 0.5. This decreased sensitivity to weak H i absorption features is driven by the redshifting of Lyα out of the wavelength range of the COS G160M grating. The column density measurements at z > 0.481 are derived from measurements of the Lyβ absorption line and/or weaker Lyman series transitions, leading to a decrease in sensitivity for a fixed S/N. This shift in sensitivity motivates us to limit our H i analysis to z < 0.481, corresponding to λLyα (1 + z) = 1800Å, or the reddest end of the COS G160M grating. By imposing this limit, we ensure nearly uniform sensitivity to H i column density for our CGM sample.

Figure 7.

Figure 7. Scatter plot and marginal distributions of column densities vs. redshift for the H i systems detected in the CGM2 survey. The mean 1σ uncertainties on column density is 0.17 dex for unsaturated H i lines to column densities ≃1017.5 cm−2, which is of order the size of the symbol (see Figure 9 for the size of our uncertainties). The measurements designated with upward-facing triangles are saturated absorption lines and are thus lower limits while circles represent detections. Smaller gray downward-facing triangles are 2σ upper limits for galaxies where no corresponding absorption was measured. The visible break in the minimum NHI at ≳0.48 is shown by the vertical dashed line, and marks the redshift at which Lyα shifts out of the COS G160M bandpass and thus becomes inaccessible. There can be multiple systems at the same column density and redshift, which appear as darker points in this plot. This occurs when multiple galaxies lie within ∣δ v∣ < 500 km s−1 of the absorption systems.

Standard image High-resolution image

After limiting our CGM system survey to the aforementioned redshift, we construct the CGM2 H i survey, consisting of 572 total galaxy-absorber pairs. The survey includes all galaxies with reliable redshifts and H i absorption systems (including the non-detections within ∣δ v∣ < 500 km s−1), collectively referred to as CGM systems. In the H i covering fraction analysis that follows, it is possible to have multiple galaxies at similar redshifts but at differing impact parameters that match with individual absorption systems (74 systems total). For understanding the H i extent of the CGM, we want to understand the correlation of galaxies and absorbers and thus do not limit our matches to the closest or most massive galaxy. However, in our complementary H i velocity analysis, we limit the survey to the galaxy with the smallest impact parameter, thus leaving 522 CGM systems. Future studies, depending on their specific aims, will make independent choices about how to include galaxy-absorber pairs.

In the following analyses, we examine trends with the impact parameter, ρ, which quantifies the projected distance between the QSO and the galaxy in the rest frame of the galaxy. Figure 8 shows the impact parameter–redshift distribution of CGM systems. The gray curve approximates the distance to the edge of the detector in the 5.5' GMOS FOV, assuming the QSO is in the center, in order to highlight the survey coverage as a function of redshift. The QSO was slightly offset from the center in certain fields, either to better place guide stars or to avoid bright foreground stars; thus, a few galaxies fall on or near this approximate field-size limit.

Figure 8.

Figure 8. The distribution of impact parameters as a function of redshift. The gray curve approximates the distance to the edge of the detector in the 5.5' GMOS FOV, assuming the QSO is in the center to highlight the survey coverage as a function of redshift. Although most of the QSOs are centered in the FOV, in a few cases they had to be offset to avoid bright stars.

Standard image High-resolution image

4.2. H i Covering Fraction: Definition and Threshold

To quantify the radial profile and extent of the CGM, we use the covering fraction fc as a measure of the probability of the presence of H i. The covering fraction is the comparison of "hits" (H) and "misses" (M), with a hit being defined as a galaxy with a corresponding absorber at or above the detection threshold for the full ensemble (see below), while a miss occurs when the 2σ upper limit on a detection is below the threshold at the redshift of the galaxy. A system with a 2σ upper limit above the threshold is ignored altogether because this indicates that the S/N of the spectrum is not adequate for detection of the H i lines. The covering fraction is the fraction of hits versus the total number of CGM systems above the threshold in a given bin, fc = H/(H + M).

We choose a threshold of NHI ≥ 1014 cm−2 for the covering fraction calculations, which is supported by previous survey work. In particular, Chen et al. (2005) found that H i column densities below 1013.6 cm−2 are consistent with being randomly distributed with respect to known galaxies. They also showed that the correlation of galaxies and absorbers does not depend sensitively on NHI for strong absorbers, NHI> 1013.6 cm−2. Furthermore, a value of NHI ≥ 1014 cm−2 was shown in Tejos et al. (2014) to be more highly correlated with galaxies than gas at lower column densities. Adopting a threshold of NHI ≥ 1014 cm−2 additionally provides us with a sample against which we can compare to the cumulative column density distributions of H i systems found in Danforth et al. (2016), who also used this value.

4.3. The Empirical H i Galaxy Connection

This section presents an empirical analysis of the H i galaxy connection. In the following analysis we generally avoid differentiating the systems via their spectroscopic galaxy classifications due in part to the fact that we do not expect to see differences in covering fractions of H i in star-forming and quiescent galaxies (Thom et al. 2012; Keeney et al. 2017) but include it in Figure 9 to illustrate this. Additionally, Figures 6 and 5 show that the galaxy spectral classification is correlated with the galaxy stellar mass, and thus we have relatively few galaxies with comparable masses but with differing SF classification. Future analyses that examine the metal ions present in the CGM will focus on galaxy spectroscopic type.

Figure 9.

Figure 9. Column density as a function of mass for ρ < 300 kpc (top) and R < 1.5 Rvir (bottom). CGM systems with saturated absorption are marked with upward-facing triangles, while non-detections are displayed as lighter, downward-facing triangles at their corresponding 2σ upper limits. Circles represent CGM systems with measured NHI. Measured 1σ uncertainties in the column density of the detected CGM systems are shown as lines inside the markers. Red markers indicate a spectroscopically determined quiescent galaxy classification E, while blue corresponds to those galaxies with emission lines present in their spectra, classified as SF and SF+E. Our sample of quiescent galaxies predominately reside in the highest-mass bin. Covering fractions fc are plotted with respect to the right axes and are calculated without differentiating spectroscopic galaxy categories. The gray boxes correspond to the binomial confidence interval of the covering fraction (NHI > 1014 cm−2), with the mean fc in each bin denoted with a dotted line. The column density increases as a function of mass, while the covering fraction remains greater than fc > 0.5 for galaxies with masses of M = 108−11 M.

Standard image High-resolution image

In Figure 9, we present the H i column density (left axis) and covering fraction (right axis) as a function of stellar mass for galaxies within 300 kpc (top panel) and 1.5 Rvir (bottom panel). The covering fraction for each bin is shown as a dotted line with the gray boxes corresponding to the 68% binomial confidence intervals. At R < 1.5 Rvir, fc remains consistent with fc ≳ 0.5 at all ranges in mass. We also notice a trend of increasing covering fraction and column density as a function of galaxy stellar mass.

The H i column densities and impact parameters are shown in Figure 10 as an anticorrelation of column density with increasing separation between the galaxy and QSO sightline. Figure 10 separates our galaxy sample into three mass bins, each containing approximately equal numbers of galaxy-absorber systems. The top panel contains 191 systems with M > 109.9 M. The middle panel also contains 191 systems with 109.2 < 1010 M/M < 109.9, while the bottom panel corresponds to a sample of 190 systems with M < 109.2 M.

Figure 10.

Figure 10. Column density NHI as a function of impact parameter, ρ (left) and Rvir (right). The panels are arranged vertically in order of decreasing mass. Galaxy-absorber systems with dark circles represent galaxies with NHI detections in the corresponding QSO spectrum. Systems with dark, upward-facing triangles show our inferred lower limits due to saturation in NHI. Similarly, systems with gray, downward-facing triangles denote the 2σ upper limit on NHI(non-detections). Measured 1σ uncertainties in the column density of the detected CGM systems are shown as gray lines but are smaller than the marker size in every case. NHI covering fraction, fc , corresponds to the right axes in bins of 100 kpc (left) and 1 Rvir (right). The dotted lines represent fc , assuming a detection threshold of NHI = 1014 cm−2 with the 68% binomial confidence interval shown as gray shaded regions about the mean fc .

Standard image High-resolution image

We find that for the highest- and intermediate-mass galaxy samples, fc drops off monotonically but remains ≥50% out to an impact parameter of ρ ≃ 400 kpc, corresponding to R ≃ 2 Rvir for these masses, before flattening out in the case of the intermediate-mass sample. In the lowest-mass regime, we find lower (fc ∼ 60%) covering fractions at the smallest separations. We also observe a shallower profile in covering fraction out to R ≃ 2 Rvir, beyond which the covering fraction remains elevated and consistent with ∼50% out to 600 kpc, or 5 Rvir.

The extended distribution seen in the lower two panels, where the covering fraction remains elevated, could imply that the galaxies are located inside (or close to) the halos of other (massive) galaxies (e.g., Burchett et al. 2016). In an upcoming CGM2 paper, we plan to explore the impact of galaxies' environments on the properties of their CGM. Briefly, we find that 56% (243/435) of galaxies with M < 1010 M have M > 1010 M neighbors within 300 kpc and 1000 km s−1 of the sightline. We observe 75% (86/121) of galaxies with M > 1010 M have neighbors above this mass threshold within the same physical window, consistent with the fact that larger galaxies are more highly clustered. Furthermore, we generally see that low-mass galaxies with nearby massive neighbors tend to have elevated H i covering fractions out to ∼2–3 Rvir compared to galaxies with no detected massive neighbors. In tandem, these effects support our conclusion that the elevated covering fractions out to large impact parameters for our M < 1010 M sample are consistent with environmental effects.

Our results are consistent (within 2σ) with those presented by Wakker & Savage (2009) who found that Lyα absorbers at z < 0.017 with an equivalent width >50 mÅ (NHI ≃ 1013 cm−2) have covering fractions of 100% at ρ < 400 kpc. Similarly, Prochaska et al. 2011 found high covering fractions out to $\rho =300{h}_{72}^{-1}$ kpc for absorbers with equivalent widths >30 mÅ. This observed enhancement of covering fraction in the low-mass sample could also be due in part to the fact that our CGM systems were defined to have separations of ∣δ v∣ <500 km s−1, which is around the expected escape velocities of high-mass galaxies but is more likely to encompass unassociated absorbers in the low-mass sample that trace the cosmic web. However, low-mass galaxies have smaller escape velocities and shallower potential wells, and thus would likely exhibit gas being ejected at larger velocities. There are clearly many competing effects in this mass range.

Figure 11 shows the covering fraction and confidence intervals for the total sample at NHI ∈ 1013−15 cm−2 (blue, green, yellow). For the lowest column density threshold in the low-mass sample, the H i covering fractions remain elevated at ∼80% out to at least 4 Rvir. This signal must be dominated by galaxy–galaxy clustering as the fc of random incidence in a velocity window at the mean redshift is fc = 0.14 for NHI > 1013 cm−2. (see Section 5). In contrast, stronger absorbers preferentially occur at smaller impact parameters. For example, it is at log NHI ≳ 14.5 that the covering fraction drops below 50% by R = Rvir.

Figure 11.

Figure 11. Covering fraction of H i as a function of ρ/Rvir for column density thresholds of NHI > 1013 cm−2 (blue), NHI > 1014 cm−2 (green), NHI > 1015 cm−2 (yellow), NHI > 1016 cm−2 (purple), and NHI > 1017 cm−2 (pink). Shaded regions represent the 1σ (68%) binomial confidence intervals. Here, we connect the center of the radial bins to highlight the difference in the distributions. We see that for column densities less than 1014 cm−2 show little correlation with galaxies. The covering fraction at R < Rvir in for the highest column densities (NHI > 1015 cm−2) never gets higher than 0.7.

Standard image High-resolution image

4.4. H i Velocity Offsets

Here, we examine the kinematics of the H i absorption in our CGM2 sample of z < 0.481 absorbers that are associated with galaxies. Velocity distributions of CGM absorbers quantify the amount of material gravitationally associated with the assumed host halo. For simplicity and clarity, we trim the sample to those galaxies closest to the absorber impact parameter (ρ), thus each absorber is associated with only one galaxy. We are left with a sample of 171 unique galaxy-absorber pairs at z < 0.48 that were detected with a signal >2σ.

In Figure 12, we show the distribution of velocity centroids of the 416 detected absorption components associated with these 181 CGM systems in the rest frame of the galaxy systemic redshift as a function of column density. The boxes show the quartiles of distribution in velocity while the whiskers show the extent of the distribution. Outliers beyond 1.5 times the innerquartile range are displayed as points. We see that systems with NHI < 1014.0 cm−2 have a higher median and larger spread in velocity. The component velocity centroids do not exhibit any clear trends with impact parameter. Relevant to this discussion, we recall that absorption components are required to lie within ±1000 km s−1 of each other. The only velocity constraint imposed with respect to the galaxy is that at least one of these absorption components must lie within ±500 km s−1 of a CGM2 galaxy systemic redshift in order to be classified as a CGM absorption system.

Figure 12.

Figure 12. The distribution of absolute velocities as a function of NHI displayed in a box and whisker plot for each mass sample. The boxes display the quartiles of the distribution centered at each bin spanning 1 decade in column density, while the whiskers extend to show the rest of the distributions of the bins. Outliers are defined as points that lie outside 1.5 times the innerquartile range and are displayed as small diamonds. Each bin is split into a high (blue), intermediate (green), and low-mass (red) sample. We see a strong anticorrelation between velocity spread and column density. The single line seen at higher-column densities in the low-mass sample indicate there are is only one point in each column density bin. This highlights how rare high-column density systems are around low-mass galaxies.

Standard image High-resolution image

We find that >53% of the detected H i components are located within ±250 km s−1 of the galaxy systemic redshift, independent of H i column density. Furthermore, only 27% of all absorption components lie at ∣δ v∣ > 500 km s−1. Finally, approximately 83% of the total H i column density in the CGM2 survey lies within ±250 km s−1 of the associated host galaxy, consistent with the results of COS-Halos (Tumlinson et al. 2013). Some absorption systems show a total extent >500 km s−1, which likely captures both bulk motions of galaxies and possible "missassociations" due to incompleteness our galaxy survey, especially at magnitudes >23.

Were absorption components distributed uniformly in velocity space relative to the galaxies, limited only by the 1000 km s−1 window selection function, we would not expect to find such a high concentration of absorbers at low δ v. To first order, Figure 12 shows that our CGM survey is capturing gas that is mostly not exceeding the escape velocity of its host halo, where the escape velocity of an M ≈ 109.5 M galaxy at Rvir is ∼250 km s−1. There is very clearly a gravitational association between the galaxies and the absorption at higher-column densities. The component velocity centroids do not exhibit a clear trend with impact parameter but we find that the median velocity for components at ρ > Rvir to be 233 km s−1 with a standard deviation of 453 km s−1, while components at ρ < Rvir have a median velocity of 185 km s−1 with a standard deviation of 354 km s−1. In this unique galaxy-absorber pairing framework, 80% of the absorption components with NHI> 1015 cm−2 lie within 1 Rvir. We caution that for this kinematic analysis we paired specific galaxies with absorbers on the criterion that they were the closest in physical separation from the absorption sightline. Without this bias, the trend remains, however.

Lastly, Figure 12 also shows the column-density dependence of the absorption component velocity distribution. Here, we see that absorption systems with NHI > 1014 cm−2 concentrate more strongly at low ∣δ v∣ than weaker systems, and there is a clear column density dependence to the overall velocity concentration at ∣δv < 250 km s−1. Counting by column density we find that 53% of the total fitted column density lies within ±250 km s−1. In contrast, we find that only 44% of the NHI < 1014 cm−2 absorption components lie at ∣δ v∣ < 250 km s−1, while 67% of the NHI > 1014.0 cm−2 absorption components lie at ∣δ v∣ < 250 km s−1. This concentration increases with increasing column density. For a limit of NHI = 1015 cm−2, the percentages of components within ∣δ v∣ < 250 km s−1 shift to 50% and 86% for the low and high-column density thresholds, respectively. Similarly, the high velocity components tend to have low column density. 27% of all absorption components lie at ∣δ v∣ > 500 km s−1, but less than 5% of those components have NHI > 1014 cm−2. Generally, there is no systematic effect preventing high-column density components from appearing at higher velocities, so this trend captures an important characteristic of the CGM.

5. H i Galaxy Clustering: ${R}_{\mathrm{CGM}}^{14}$

5.1. Setup

In order to quantitatively describe the radial dependence of the CGM and estimate its extent, we perform an absorber-galaxy cross-correlation analysis. We aim to measure the excess probability of detecting an absorber given the proximity of a galaxy over the proximity-agnostic average rate. Our analysis of the H i galaxy clustering is similar to the one developed by Hennawi & Prochaska (2007) and follows more closely the analysis by Prochaska et al. (2019). We define the 3D cross-correlation function, ξag (r) as

Equation (1)

In order to determine the best-fitting parameters, r0 and γ, we define a likelihood function as

Equation (2)

where Phit is defined to be the probability of detecting one or more H i systems and Pmiss is the probability of detecting none. This probability has both a radial and redshift dependence. An absorber is considered a "hit" if it falls within our window of δ v = ± 500 km s−1 of a galaxy and we measure a column density above a threshold ${N}_{\mathrm{HI}}^{\mathrm{thresh}}$. We define Pmiss to be the probability of observing zero events from a Poisson distribution where the rate is the number of events expected from the average density of absorbers and our clustering term:

Equation (3)

Here, $\langle d{ \mathcal N }/{dz}\rangle \delta z$ is the mean number of absorbers in a window of redshift δ z. [1 + χ(r)] represents an excess in the number of absorbers due to clustering. This boost due to clustering can be expressed in terms of the three-dimensional correlation function as

Equation (4)

where we are integrating Equation (1) along the length of a cylinder of length 2δ v/aH(z). Here a = 1/(1 + z) and H(z) is the Hubble parameter. The probability of a hit, Phit, is the complement of the probability of a miss: Phit = 1 − Pmiss. Phit is equivalent to the covering fraction fc . The covering fraction of a random sightline is then

Equation (5)

We take $\langle d{ \mathcal N }/{dz}\rangle $ from Danforth et al. (2016), who measured the occurrence of 5138 individual extragalactic absorption lines of H i in 82 QSO/AGN spectra at redshifts zAGN < 0.85. The occurrence rate of H i absorbers $d{ \mathcal N }/{dz}$ is expressed with a functional form as follows:

Equation (6)

with C0 = 16 and γ = 2.3. This measurement is valid only for absorbers with ${N}_{\mathrm{HI}}^{\mathrm{thresh}}\geqslant {10}^{14}$ cm−2, which coincides with the threshold in our definition of hits and misses.

In order to estimate r0 and γ in our 3D correlation function, Equation (1), we follow a Bayesian approach. The posterior probability function can be defined as

Equation (7)

where ki ∈ {0, 1} specifies whether system i is a hit or a miss and p( k r , z , r0, γ) is the likelihood function ${ \mathcal L }$ defined in Equation (2). We define the priors as follows:

Equation (8)

and

Equation (9)

where r0 is measured in ${h}_{68}^{-1}$ co-moving Mpc and ${ \mathcal N }(\mu ,\sigma )$ is the normal distribution. These priors were chosen based on physical arguments and previous results on absorber-galaxy clustering (e.g., Tejos et al. 2014). We used the Markov Chain Monte Carlo (MCMC) sampler emcee (Foreman-Mackey et al. 2013) to generate samples from the posterior probability distribution function over r0, γ.

The data were cut as in the previous analysis with a redshift cut of z < 0.481 and subdivided into the three equal sized mass samples as before with identical priors used for each sample.

5.2. H i Galaxy Clustering Results

Figure 13 illustrates the results of the MCMC parameter estimation. The plots on the left show the covering fraction as a function of the perpendicular separation in co-moving megaparsecs.

Figure 13.

Figure 13. Left: covering fraction of H i gas with NHI > 1014 cm−2 as a function of the physical impact parameter in co-moving megaparsecs for galaxies in order of decreasing mass. The black crosses are binned evaluations of the covering fraction in each bin with 95% binomial confidence limits on covering fraction. The small gray ticks at the top and bottom of the figures indicate impact parameter for those systems that were hits (top) and misses (bottom). Beneath the black crosses are samples drawn from the posterior distribution showing the range in the γr0 parameter space. The gray boxes on the bottom of the plots on the left indicate the covering fraction for sightlines taken at random, $\langle d{ \mathcal N }/{dz}\rangle \delta z$. The plots on the right further illustrate the one and two-dimensional projections of the posterior probability distributions of our parameters as well as indicate the median values for each. The highest-mass sample contains 191 CGM systems at a mean redshift of $\bar{z}=0.34$ and a random sightline covering fraction of fc,rand = 0.14. The intermediate-mass sample contains 191 CGM systems at a mean redshift of $\bar{z}=0.33$ and a random covering fraction of fc,rand = 0.14. The lowest-mass sample contains 190 galaxies at a mean redshift of $\bar{z}=0.26$ and a random covering fraction of fc,rand = 0.14. Using the samples from the posterior distributions in r0 and γ, we calculate ${R}_{\mathrm{CGM},{\rm{p}}}^{14}={346}_{-53}^{+57}$ kpc, ${R}_{\mathrm{CGM},\ {\rm{p}}}^{14}={353}_{-50}^{+64}$ kpc, and ${R}_{\mathrm{CGM},\ {\rm{p}}}^{14}={177}_{-65}^{+70}$ kpc in order of decreasing mass samples. The extent of the CGM drops off in the lowest-mass sample.

Standard image High-resolution image

The gray boxes on the bottom of the plots on the left indicate the covering fraction for sightlines taken at random, ${f}_{c}^{\mathrm{rand}}=1-\exp [ \langle d{ \mathcal N }/{dz} \rangle \delta z]$ at the mean redshift of the sample, $\bar{z}=0.34$, $\bar{z}=0.34$ and $\bar{z}=0.26$ for the high-, intermediate-, and low-mass samples, respectively. The plots on the right further illustrate the marginal distributions of the posteriors of our parameters as well as indicate the median values for each. The covering fraction due to a random sightline is fc,rand = 0.14.

5.3. Estimating ${R}_{\mathrm{CGM}}^{14}$

In order to estimate a characteristic size of the NHI > 1014 cm−2 CGM, ${R}_{\mathrm{CGM}}^{14}$, we devise a method in which we use the parameters to calculate the impact parameter at which the covering fraction (fc = Phit) exceeds 0.5. Within this impact parameter, a sightline has a greater than 50% chance of exhibiting a H i column with NHI > 1014 cm−2. We can then estimate the posterior distribution of RCGM by calculating it for each sample taken from the posterior distributions of γ and r0. Using the samples from the posterior distributions in r0 and γ, we calculate ${R}_{\mathrm{CGM},{\rm{p}}}^{14}={346}_{-53}^{+57}$ kpc (${R}_{\mathrm{CGM},{\rm{c}}}^{14}={463}_{-71}^{+76}$ co-moving kpc), ${R}_{\mathrm{CGM},\ {\rm{p}}}^{14}={353}_{-50}^{+64}$ kpc (${R}_{\mathrm{CGM},{\rm{c}}}^{14}={469}_{-66}^{+85}$ co-moving kpc), and ${R}_{\mathrm{CGM},\ {\rm{p}}}^{14}={177}_{-65}^{+70}$ kpc (${R}_{\mathrm{CGM},{\rm{c}}}^{14}={222}_{-81}^{+88}$ co-moving kiloparsecs) in order of decreasing mass samples. The extent of the CGM remains relatively similar in size (∼350 physical kiloparsecs) except for the lowest-mass sample. These correspond to ${R}_{\mathrm{CGM}}^{14}={1.2}_{-0.2}^{+0.2}$ Rvir, ${R}_{\mathrm{CGM}}^{14}={2.4}_{-0.3}^{+0.4}$ Rvir, and ${R}_{\mathrm{CGM}}^{14}={1.6}_{-0.6}^{+0.6}$ Rvir for the mass samples in order of decreasing mass, respectively, where Rvir was calculated using the mean redshift and mass of each sample. These estimates are in agreement with our qualitative empirical estimates from the previous analysis. These results are summarized in Table 5.

Table 5. Results

M ${R}_{\mathrm{CGM},{\rm{p}}}^{14}$ ${R}_{\mathrm{CGM},{\rm{c}}}^{14}$ ${R}_{\mathrm{CGM}}^{14}$ $\bar{z}$
(M)(kpc)(kpc)(Rvir) 
M > 109.9 ${346}_{-53}^{+57}$ ${463}_{-71}^{+76}$ ${1.3}_{-0.2}^{+0.2}$ 0.34
109.2 < M < 109.9 ${353}_{-50}^{+64}$ ${469}_{-66}^{+85}$ ${2.4}_{-0.4}^{+0.4}$ 0.33
M < 109.2 ${177}_{-65}^{+70}$ ${222}_{-81}^{+88}$ ${1.6}_{-0.6}^{+0.6}$ 0.26

Note. Results of H i galaxy clustering analysis and ${R}_{\mathrm{CGM}}^{14}$: (1) stellar mass limits of each sample; (2) ${R}_{\mathrm{CGM}}^{14}$ in physical kiloparsecs; (3) ${R}_{\mathrm{CGM}}^{14}$ in physical kiloparsecs; (4) ${R}_{\mathrm{CGM}}^{14}$ normalized to the average virial radius of the sample; (5) average redshift of each sample

Download table as:  ASCIITypeset image

6. Comparison with Other Work

6.1. Previous Surveys

CGM2 contains the largest sample by a factor of 5 of CGM absorption systems within ∼600 kpc of putative host galaxies. Our increased coverage is partly due to the fact that we probe a larger volume than previous surveys by confirming sub-L* galaxies out to higher redshifts. For this reason, CGM2 presents an excellent opportunity to study the H i traced CGM at galaxy-absorber separations that exceed Rvir for a wide range of stellar masses. Here, we compare our results to those from other observational surveys of the CGM.

Figures 13 and 14 and Table 5 provide a summary of our covering fraction results for our mass segregated galaxy samples. The red, green, and blue shaded curves in Figure 14 correspond to the samples cut at masses of M⋆,cut = 109.204, 109.888 M as described above. This figure allows us to better compare the covering fraction profiles of each mass sample to each other. As we saw in the last section, we find a mass dependence on the extent of the CGM; the covering fraction remains higher for larger galaxy mass when looking at the physical impact parameter. When we consider the impact parameter normalized to the average virial radii of each mass sample, we find the covering fraction remains elevated to larger radii (≳2 Rvir) for the intermediate-mass sample, while for the other mass samples the extent of the CGM is similar (≳1 Rvir). We also find that the covering fraction of the lowest-mass sample never exceeds 80%, while for both higher mass samples, the covering fraction is consistent with ∼100%.

Figure 14.

Figure 14. Covering fraction of H i as a function of impact parameter ρ (left) and ρ/Rvir (right) highlighting NHI > 1014 cm−2 broken down into the three mass samples. The solid lines correspond to the covering fraction in bins of 100 physical kiloparsecs (left) and 1 Rvir/ (right). The shaded region encodes the 1σ error in the covering fraction measurement. The covering fraction decreases monotonically in either measure to ∼300 kpc or 2 Rvir. We find that the covering fraction in the lowest radial bins is consistent with ∼100% in each case except for the lowest-mass sample. We note that the intermediate-mass sample extends to further radius when looking at the normalized impact parameter (right) as compared to either of the other mass samples.

Standard image High-resolution image

Many previous studies have examined the covering fraction of H i as a function of galaxy projected separation, although most of these surveys are limited to ρ ≲ 300 kpc. These surveys are all generally constructed in the same manner, with HST UV spectroscopy with COS and/or Space Telescope Imaging Spectrograph and a spectroscopic galaxy catalog to connect galaxies and absorbers.

One of the largest such surveys was carried out by Tejos et al. (2014) who attempted to explore the connection between the IGM and galaxies by measuring the H i galaxy cross correlation at z < 1 at distances between ∼1 and 10 Mpc. They used multiple ground-based instruments to build a new spectroscopic survey of 2143 galaxies in eight QSO fields with HST spectroscopy. They combined their catalog with existing catalogs to build a survey of ∼17,500 galaxies. They did not limit their galaxy survey to only the nearest galaxies but used a statistical approach to examine the cross correlation of galaxies and absorption systems. They found that the H i Lyα forest to be divided into two main categories: a population of low column density absorbers tracing the cosmic web and a higher-column density population that traces the dark matter halos in which galaxies reside. CGM2 has the benefit of being more sensitive to fainter galaxies closer to the QSO itself allowing us to better constrain the extent of the CGM. However, their larger galaxy sample at larger separations (which explores different scales) presents a great opportunity to compare our cross-correlation analysis. Using the same 3D cross-correlation power law, Equation (1), they found r0 = 3.8 ± 0.2 ${h}_{70}^{-1}$ Mpc and γ = 1.7 ± 0.1 in their sample of SF galaxies. This sample is similar to our high-mass sample since we are dominated by SF galaxies and indeed, we find agreement consistent within 1σ.

A mass-dependent CGM was examined in Bordoloi et al. (2018) who used the 85 galaxies in the COS-Halos and COS-Dwarfs surveys at z ∼ 0 with M ranging from 8–11.6 $\mathrm{log}{M}_{\star }/{M}_{\odot }$. This sample was limited to impact parameters (ρ < 160 kpc). They found a mass and radius dependence of the strength of H i absorption, where the equivalent width of H i increases with M and with a decreasing impact parameter. Here, we are primarily focused on the mass and radius dependence of the covering fraction, but find trends generally consistent with those observed by Bordoloi et al. (2018).

Chen et al. (2001) found Lyα with column densities NHI ≳ 1014 cm−2 in 34/47 galaxies (fc ≈ 0.7) out to ρ ≃ 330 kpc. Their sample consists of ∼L* H galaxy pairs with ∣δ v∣ < 500 km s−1 spanning 0.1 < z < 0.9 ($\bar{z}=0.36$). They also found a sharp decline thereafter. We find close agreement ${f}_{c}={0.78}_{-0.89}^{+0.06}$ (162/209) when applying the same criterion to our sample also seeing a sharp decline around 400 kpc. (Figure 14).

Prochaska et al. (2011) used 14 QSO sightlines with previously published equivalent width (W0) measurements of Lyα to carry out a galaxy survey that targeted 37 L > 0.01 L galaxies at $\bar{z}=0.18$. They connected absorbers and galaxies with ∣δ v∣ < 400 km s−1, although they showed that using a larger velocity window of ∣δ v∣ < 600 km s−1 made no qualitative difference in their results. They found covering fractions of order unity (≈90%) for NHI > 1013 cm−2 gas out to ρ = 300 kpc. Comparing our covering fractions with NHI > 1013 cm−2 and for galaxies with M > 108.55 M to approximate their L > 0.01 L sample, we find ${f}_{c}={0.85}_{-0.05}^{+0.04}$ (128/150), which is roughly consistent with their value.

Wakker & Savage (2009) conducted a large survey of the H i galaxy connection at z ≲ 0.017, consisting of 76 QSO sightlines and ∼20,000 local galaxies. They found covering fractions of 77% for Lyα absorbers >50 mÅ within ρ < 400 kpc and ∣δ v∣ < 400 km s−1 of L > 0.1 L galaxies. If we limit our sample to L > 0.1 L galaxies and use a Lyα absorber threshold of ∼50 mÅ (NHI ∼ 1013 cm−2), we find covering fractions, ${f}_{c}={0.89}_{-0.05}^{+0.04}$ (99/111) for galaxies within 400 kpc. This discrepancy could imply that covering fractions increase with redshift.

Most recently, a large survey of H i was carried out by Keeney et al. (2018, hereafter K18). Their survey consisted of 47 COS sightlines (COS GTO) with a higher S/N (S/N ∼ 15–50) compared to our ∼10) QSO spectra. Using ground-based telescopes, they constructed a spectroscopic galaxy database of ∼9,000 galaxies with the aim of >90% completeness to 1 Mpc down to 0.1 L* at z ≲ 0.1. Due to their higher S/N, they could consistently measure weaker absorption lines, down to NHI ≥ 1012.8 cm−2. Leveraging the high completeness of K18 at low redshifts, they were able to measure the H i column densities out to 4 Rvir with enough galaxies in this range (243) to make precise statements about the radial profile of H i. They found the covering fraction for L < L* galaxies (corresponding to our low-mass sample) to feature a shallower decline than that of their L > L* sample. Qualitatively, we show similar results. This difference in covering fraction behavior between high- and low-mass samples can readily be seen in Figures 11 and 14. The elevated covering fractions at large radii in the L < 0.1 L* sample imply that contributions to NHI > 1014 cm−2 CGM gas are dominated by low-mass galaxies. This result is consistent with Prochaska et al. (2011) in relation to H i, and with high-ionization metals such as O vi (Tumlinson & Fang 2005; Pratt et al. 2018); Prochaska et al. 2019).

We note that the high-mass sample drops to a very low covering fraction fc ≃ 0.20 at 4 Rvir while in the lower mass sample, fc remains elevated. At radii greater than 3 Rvir, we may be limited to only higher-z high-mass galaxies due to the detector size but this detector size bias should not affect the lower mass samples.

In addition, previous low-redshift studies have found that low column density gas (NHI = 1013−14 cm−2) is likely uncorrelated with galaxy halos (Chen et al. 2005; Prochaska et al. 2011; Danforth et al. 2016; but, see Tejos et al. 2014 who found that 50% of weak lines can still be correlated with galaxies on 1–10 Mpc and Tripp et al. 1998 who showed that the weakest absorbers are not randomly distributed). First, low column density material exhibits high covering fractions out to 1 Mpc, which we can be seen in Figure 11. Second, low column density material exhibits less velocity correspondence with the systemic velocities of galaxies nearby in projection (Tumlinson et al. 2013), as we find a median velocity difference of 233 km s−1 with a standard deviation of 453 km s−1 for ρ > 1 Rvir, while we find a median velocity of 185 km s−1 with a standard deviation of 354 km s−1 for ρ < 1 Rvir. Traditionally, such low column density material is attributed to the Lyα forest, or to gas in a filament like structure (Tejos et al. 2014), physically distinct from the CGM. This was examined in greater depth by Burchett et al. (2020) who conclusively tied the diffuse IGM to the cosmic web. They found that the H i absorption signature decreases past ρ > Rvir and settles to the cosmic mean matter density.

Turning to higher redshifts, Rudie et al. (2012) used the ground-based KBSS to investigate the z ∼ 2−3 CGM surrounding 886 galaxies. Their sample contained 48 galaxies at $\bar{z}=2.3$ within ρ ≲ 300 kpc for which they measured a covering fraction of fc = 0.81 ± 0.06 for absorbers with NHI > 1014 cm−2. By comparison, if we choose a mass range of 1010.4 < M/M < 1011 to approximate their mass distribution (Erb et al. 2006) we find ${f}_{c}={0.86}_{-0.14}^{+0.08}$ (18/22), which is good agreement with their results. However, looking at the extended CGM ρ < 1 Mpc, we find a discrepancy: we measure ${f}_{c}={0.49}_{-0.08}^{+0.08}$ (40/83) versus their 0.70 ± 0.03. These numbers are in agreement at the 2σ level, however. Interestingly, the extended CGM may show a decrease in covering fraction as the universe evolves. This phenomenon may be due to the development of virial shocks that ionize the gas, as suggested by Burchett et al. (2018). At z = 2.3, 300 kpc ∼ 2 Rvir for a 7 × 1010 M galaxy while at z = 0.3 (CGM2) 300 kpc ∼ 0.8 Rvir. Alternatively, this could simply be due to the fact that a column density of NHI > 1014 cm−2 traces lower density peaks at high-z.

6.2. Comparison with Hydrodynamical Simulations

We now turn to a brief comparison with hydrodynamical simulations. van de Voort et al. (2019) simulated a roughly z ∼ 0 L* galaxy using a new refinement technique to better resolve the CGM. They found an increase in the H i column density and resultant covering fraction when the resolution is increased to resolve 1 kpc scales. We find their model to be in good agreement with our NHI ≥ 1014 cm−2 covering fraction measurements when we limit our sample to M ∼ 1010.5 M (see their Figure 3). Comparing our column density measurements to van de Voort et al. (2019) and Hummels et al. (2019), we also see good agreement out to their limiting distances of 200 and 100 kpc, respectively (compare Figure 10 to Figure 2 in van de Voort et al. (2019)). Our high H i covering fractions and column densities at R < Rvir are in conflict with earlier simulations that consistently underpredict the column density of low ions in the CGM (e.g., Shen et al. 2012; Stinson et al. 2012; Ford et al. 2013; Hummels et al. 2013; Liang et al. 2016), validating the work that has gone into creating these new high-resolution techniques. To understand the extent of the CGM around a diverse sample of galaxies, we encourage future simulations extending to at least 4 Rvir, and covering a larger range of galaxy masses, down to 0.01 L*.

7. Summary and Conclusions

We have reported the first results from the CGM2 survey, a comprehensive survey of the z < 1 CGM at least five times larger than previous surveys such as COS-Halos at comparable redshifts. This paper has presented the detailed properties of the survey design and the procedures followed in the collection and processing of the data. We present an H i study that combines high-resolution HST/COS UV spectra of 22 background QSOs with Gemini/GMOS spectra of 572 foreground galaxies having stellar masses 106 M < M < 1011.5 M and z < 0.481. The S/N of ∼10–12 of these COS spectra and access to the H i Lyman limit enables us to constrain the H i column densities and kinematics of associated CGM absorption, and to ultimately examine the extent of the CGM as a function of galaxy mass and physical separation from the QSO sightline.

We find that high-column density circumgalactic material is associated with galaxies at high statistical significance out to 2 Rvir, whereas H i absorption with NHI < 1014 cm−2 is more broadly distributed in both impact parameter and velocity space and may not be associated directly with massive galaxies. Our kinematic analysis reveals that the detected strong H i is most likely gravitationally associated with the most nearby galaxy, while weaker H i components seen at absolute velocity offsets ≳500 km s−1 may be instead associated with extended large-scale structures. We find generally good agreement between our sample and the prior studies that have examined the CGM of low-redshift galaxies out to similar and larger separations.

We define the cool CGM as the region surrounding a galaxy in which the probability of observing an absorber with NHI > 1014.0 cm−2 is >50%. Our column density threshold, NHI > 1014.0 cm−2, is motivated by previous observational work that examines the statistical, large-scale (megaparsec) correlations between galaxies and QSO absorption lines. In essence, our definition of ${R}_{\mathrm{CGM}}^{14}$ demands that around a given galaxy, one is more likely than not to find material that has been empirically associated with galaxies. The picture that emerges is of a diffuse, CGM extending to ∼350 kpc around galaxies with stellar mass ≳ 109 M at z ≲ 0.5. For galaxies of lower stellar mass, the extent of the CGM is <200 kpc. At all stellar masses, the extent of this CGM exceeds a virial radius, especially for galaxies with intermediate masses where the ${R}_{\mathrm{CGM}}^{14}$ exceeds 2 Rvir. Therefore, these results imply that using Rvir as a proxy for the characteristic edge of the CGM may significantly underestimate its true extent. The detailed nature of the mass dependence of the CGM will be examined in a forthcoming paper. Additional follow-up studies using CGM2 data will consider transitions from a wide range of ionized metals and absorption-line profile analyses to characterize the ionization state, metallicity, kinematics, and mass of the CGM at low redshift.

M.C.W., J.K.W., and K.T. acknowledge the people of the Dkhw'Duw'Absh, the Duwamish Tribe, and other tribes on whose traditional lands we study and work at the University of Washington. M.C.W., J.K.W., and K.T. acknowledge support for this work from the NSF under Grant No. NSF-AST-1812521, and the contributions from the following University of Washington current and former undergraduate Astronomy majors who participated in the Werk SQuAD from 2017–2020: Steven Bet, Douglas Branton, Joseph Breneman, Dustin Burnham, Olivia Petry Caplow-Munro, Justin Collins, Apurva Goel, Samuel Johnson, Courtney Klein, Dalton LaCoste, Christina Lindberg, Camellia Magness, Travis Mandeville, Karalyn Ostler, Magdalyn Paige, Locke Patton, Brittany Platt, David Staker, Matthew Strasbourg, Sophia Taylor, Mercedes Thompson, and Leonardo Zhu. Additionally, the authors thank the anonymous referee for a productive report that improved the analysis.

M.C.W. would like to thank Daniel J. D'Orazio and David Fleming for useful discussions.

This work was based on observations obtained at the Gemini Observatory, which is operated by the Association of Universities for Research in Astronomy, Inc., under a cooperative agreement with the NSF on behalf of the Gemini partnership: the National Science Foundation (United States), National Research Council (Canada), CONICYT (Chile), Ministerio de Ciencia, Tecnología e Innovación Productiva (Argentina), Ministério da Ciência, Tecnologia e Inovação (Brazil), and Korea Astronomy and Space Science Institute (Republic of Korea). The Gemini North telescope is located within the Maunakea Science Reserve and adjacent to the summit of Maunakea. We are grateful for the privilege of observing the universe from a place that is unique in both its astronomical quality and its cultural significance.

This research has made use of the SVO Filter Profile Service (http://svo2.cab.inta-csic.es/theory/fps/) supported by the Spanish MINECO through grant AYA2017-84089.

This work is partly based on tools and data products produced by GAZPAR operated by CeSAM-LAM and IAP.

The Pan-STARRS1 Surveys (PS1) and the PS1 public science archive have been made possible through contributions by the Institute for Astronomy, the University of Hawaii, the Pan-STARRS Project Office, the Max Planck Society and its participating institutes, the Max Planck Institute for Astronomy, Heidelberg and the Max Planck Institute for Extraterrestrial Physics, Garching, The Johns Hopkins University, Durham University, the University of Edinburgh, the Queen's University Belfast, the Harvard-Smithsonian Center for Astrophysics, the Las Cumbres Observatory Global Telescope Network Incorporated, the National Central University of Taiwan, the Space Telescope Science Institute, the National Aeronautics and Space Administration under Grant No. NNX08AR22G issued through the Planetary Science Division of the NASA Science Mission Directorate, the National Science Foundation grant No. AST-1238877, the University of Maryland, Eotvos Lorand University (ELTE), the Los Alamos National Laboratory, and the Gordon and Betty Moore Foundation.

The Legacy Surveys consist of three individual and complementary projects: the Dark Energy Camera Legacy Survey (DECaLS; NOAO Proposal ID 2014B-0404; PIs: David Schlegel and Arjun Dey), the Beijing-Arizona Sky Survey (BASS; NOAO Proposal ID 2015A-0801; PIs: Zhou Xu and Xiaohui Fan), and the MzLS (NOAO Proposal ID 2016A-0453; PI: Arjun Dey). DECaLS, BASS, and MzLS together include data obtained, respectively, at the Blanco Telescope, Cerro Tololo Inter-American Observatory, National Optical Astronomy Observatory (NOAO); the Bok Telescope, Steward Observatory, University of Arizona; and the Mayall Telescope, Kitt Peak National Observatory, NOAO. The Legacy Surveys project is honored to be permitted to conduct astronomical research on Iolkam Du'ag (Kitt Peak), a mountain with particular significance to the Tohono O'odham Nation.

NOAO is operated by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the National Science Foundation.

This project used data obtained with the Dark Energy Camera (DECam), which was constructed by the Dark Energy Survey (DES) collaboration. Funding for the DES Projects has been provided by the U.S. Department of Energy, the U.S. National Science Foundation, the Ministry of Science and Education of Spain, the Science and Technology Facilities Council of the United Kingdom, the Higher Education Funding Council for England, the National Center for Supercomputing Applications at the University of Illinois at Urbana-Champaign, the Kavli Institute of Cosmological Physics at the University of Chicago, Center for Cosmology and Astro-Particle Physics at the Ohio State University, the Mitchell Institute for Fundamental Physics and Astronomy at Texas A&M University, Financiadora de Estudos e Projetos, Fundacao Carlos Chagas Filho de Amparo, Financiadora de Estudos e Projetos, Fundacao Carlos Chagas Filho de Amparo a Pesquisa do Estado do Rio de Janeiro, Conselho Nacional de Desenvolvimento Cientifico e Tecnologico and the Ministerio da Ciencia, Tecnologia e Inovacao, the Deutsche Forschungsgemeinschaft and the Collaborating Institutions in the Dark Energy Survey. The Collaborating Institutions are Argonne National Laboratory, the University of California at Santa Cruz, the University of Cambridge, Centro de Investigaciones Energeticas, Medioambientales y Tecnologicas-Madrid, the University of Chicago, University College London, the DES-Brazil Consortium, the University of Edinburgh, the Eidgenossische Technische Hochschule (ETH) Zurich, Fermi National Accelerator Laboratory, the University of Illinois at Urbana-Champaign, the Institut de Ciencies de l'Espai (IEEC/CSIC), the Institut de Fisica d'Altes Energies, Lawrence Berkeley National Laboratory, the Ludwig-Maximilians Universitat Munchen and the associated Excellence Cluster Universe, the University of Michigan, the National Optical Astronomy Observatory, the University of Nottingham, the Ohio State University, the University of Pennsylvania, the University of Portsmouth, SLAC National Accelerator Laboratory, Stanford University, the University of Sussex, and Texas A&M University.

BASS is a key project of the Telescope Access Program (TAP), which has been funded by the National Astronomical Observatories of China, the Chinese Academy of Sciences (the Strategic Priority Research Program "The Emergence of Cosmological Structures" Grant XDB09000000), and the Special Fund for Astronomy from the Ministry of Finance. The BASS is also supported by the External Cooperation Program of Chinese Academy of Sciences (Grant No. 114A11KYSB20160057), and Chinese National Natural Science Foundation (Grant No. 11433005).

The Legacy Survey team makes use of data products from the Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE), which is a project of the Jet Propulsion Laboratory/California Institute of Technology. NEOWISE is funded by the National Aeronautics and Space Administration.

The Legacy Surveys imaging of the DESI footprint is supported by the Director, Office of Science, Office of High Energy Physics of the U.S. Department of Energy under Contract No. DE-AC02-05CH1123, by the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility under the same contract; and by the U.S. National Science Foundation, Division of Astronomical Sciences under Contract No. AST-0950945 to NOAO.

Software: SExtractor (Bertin & Arnouts 1996), pygim (Prochaska et al. 2017), PypeIt (Prochaska et al. 2020), ASTROPY (Astropy Collaboration et al. 2013; Price-Whelan et al. 2018), PyRAF (Science Software Branch at STScI 2012), linetools (Prochaska et al. 2016), REDROCK (https://github.com/desihub/redrock), CIGALE (Noll et al. 2009; Boquien et al. 2019), seaborn (Waskom & The Seaborn Development Team 2020), pandas (Pandas Development Team 2020; McKinney 2010), corner (Foreman-Mackey 2016), emcee (Foreman-Mackey et al. 2013).

Footnotes

Please wait… references are loading.
10.3847/1538-4357/abea14