The following article is Open access

ALMA Reveals a Large Overdensity and Strong Clustering of Galaxies in Quasar Environments at z ∼ 4

, , , , , , , , and

Published 2022 March 4 © 2022. The Author(s). Published by the American Astronomical Society.
, , Citation Cristina García-Vergara et al 2022 ApJ 927 65 DOI 10.3847/1538-4357/ac469d

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/927/1/65

Abstract

We present an Atacama Large Millimeter/submillimeter Array (ALMA) survey of CO(4–3) line emitting galaxies in 17 quasar fields at z ∼ 4 aimed at performing the first systematic search of dusty galaxies in high-z quasar environments. Our blind search of galaxies around the quasars results in five CO emitters with S/N ≥ 5.6 within a projected radius of R ≲ 1.5 h−1 cMpc and a velocity range of δv = ±1000 km s−1 around the quasar. In blank fields, we expect to detect only 0.28 CO emitters within the same volume, implying a total overdensity of ${17.6}_{-7.6}^{+11.9}$ in our fields, and indicating that quasars trace massive structures in the early universe. We quantify this overdensity by measuring the small-scale clustering of CO emitters around quasars, resulting in a cross-correlation length of ${r}_{0,\mathrm{QG}}={8.37}_{-2.04}^{+2.42}\,{h}^{-1}\,$ cMpc, assuming a fixed slope γ = 1.8. This contradicts the reported mild overdensities (x1.4) of Lyα emitters (LAEs) in the same fields at scales of R ≲ 7 h−1 cMpc, which are well described by a cross-correlation length ${3.0}_{-1.4}^{+1.5}$ times lower than that measured for CO emitters. We discuss some possibilities to explain this discrepancy, including low star formation efficiency, and excess of dust in galaxies around quasars. Finally, we constrain, for the first time, the clustering of CO emitters at z ∼ 4, finding an autocorrelation length of r0,CO = 3.14 ±1.71 h−1 cMpc (with γ = 1.8). Our work, together with the previous study of LAEs around quasars, traces simultaneously the clustering properties of both optical and dusty galaxy populations in quasars fields, stressing the importance of multiwavelength studies, and highlighting important questions about galaxy properties in high-z dense environments.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Theoretical studies and simulations of the formation and evolution of large-scale structure in the universe have been largely developed in the last decades, but observational constraints in this field remain one of the major challenges for astrophysicists. In the standard current paradigm, the cosmological model is parameterized by the Lambda cold dark matter (ΛCDM) model, and the structures grow hierarchically through gravitational instability (e.g., Dodelson 2003; Padmanabhan 2006; Schneider 2015), forming a filamentary structure of dark matter and galaxies. In this framework, the most massive galaxies in the early universe are placed in the most massive dark matter halos (Springel et al. 2005; Angulo et al. 2012), tracing the most massive structures. Since the dark matter halo mass can be directly related to the clustering of a population (Cole & Kaiser 1989; Mo & White 1996; Sheth & Tormen 1999), the generic prediction is that massive galaxies should have a large autocorrelation function.

The population with the largest autocorrelation function in the early universe are quasars, and thus they represent key objects to test structure formation models at early epochs. Quasars are accreting supermassive black holes (SMBHs), and are some of the most luminous sources in the universe. They are observed to peak at z ∼ 2–3 (e.g., Richards et al. 2006) and have been detected up to z = 7.5 (Bañados et al. 2018; Yang et al. 2020). The quasar autocorrelation has been measured as a function of redshift up to z ∼ 4, and it has been found to rise steadily over the redshift range 0 ≲ z ≲ 2 (Croom et al. 2005; Myers et al. 2006; Porciani & Norberg 2006; Shen et al. 2007; da Ângela et al. 2008) with a clustering similar to that of local galaxies, and to strongly increase between z ∼ 2 and z ∼ 4 (Shen et al. 2007; White et al. 2012; Eftekharzadeh et al. 2015; Timlin et al. 2018).

Additionally, the quasar autocorrelation function has been shown to get progressively steeper on small (R ≲ 40 h−1 pkpc) scales at z < 3 (Hennawi et al. 2006), and the small-scale clustering has been shown to also increase from z ∼ 3 to z ∼ 4 (Shen et al. 2010). The measurements of z ∼ 4 quasar clustering result in an autocorrelation length r0 = 24.3 ± 2.4 h−1cMpc (at a fixed slope γ = 2.0), implying that quasars are the most clustered population at this redshift and that they inhabit very massive (Mhalo > 6 × 1012 M h−1) dark matter halos (Shen et al. 2007), in agreement with theoretical predictions (Springel et al. 2005; Angulo et al. 2012). These results imply that z ∼ 4 quasars are tracers of massive structures, and thus they should be surrounded by a large overdensity of galaxies.

Several efforts have been made to detect such overdensities in quasars' environments, but the results reveal a contradictory and confusing picture. Most of the quasar environment studies at z ≳ 4 have been performed at optical wavelengths, and they have been aimed at detecting overdensities of Lyman break galaxies (LBGs) or Lyα emitters (LAEs) around individual quasars. Some of the studied fields show overdensities of galaxies (Stiavelli et al. 2005; Zheng et al. 2006; Kashikawa et al. 2007; Kim et al. 2009; Utsumi et al. 2010; Capak et al. 2011; Swinbank et al. 2012; Morselli et al. 2014; Adams et al. 2015; Farina et al. 2017; Balmaverde et al. 2017; Kikuta et al. 2017; Ota et al. 2018) whereas others exhibit a similar number density of galaxies compared with blank fields (Kim et al. 2009; Willott et al. 2005; Bañados et al. 2013; Husband et al. 2013; Simpson et al. 2014; Mazzucchelli et al. 2017; Goto et al.2017). It thus remains unclear whether z ≳ 4 quasars reside in overdensities, as would be implied by their strong autocorrelation. The contradictory results obtained so far might be a consequence of heterogeneous methodologies, depth, and probed physical scales, together with the low-number statistics and the large cosmic variance expected to affect these studies (Buchner et al. 2019; García-Vergara et al. 2019).

One strategy that has been recently used to overcome the low-number statistics and high cosmic variance is to characterize quasar environments by targeting a larger number of fields and measuring the quasar–galaxy cross-correlation function. Using this technique, García-Vergara et al. (2017) demonstrate that LBGs at scales R ≲ 9 h−1 cMpc are strongly clustered around z ∼ 4 quasars with a cross-correlation length ${r}_{0}={8.83}_{-1.51}^{+1.39}\,{h}^{-1}\,\mathrm{cMpc}$ (at a fixed slope γ = 2.0) and an overall overdensity of 1.5. They find a very good agreement with the expectations assuming a deterministic bias model, in which quasars and galaxies trace the same underlying dark matter distribution. García-Vergara et al. (2019) studied the environments of 17 bright quasars at z ∼ 4 at scales of R ≲ 7 h−1cMpc traced by LAEs (with $\mathrm{log}({{\rm{L}}}_{\mathrm{Ly}\alpha }[\mathrm{erg}\,{{\rm{s}}}^{-1}])\,\geqslant 42.61$) selected using narrowband (NB) imaging. They find an average LAE overdensity around quasars of 1.4 for the full sample, and a cross-correlation length of ${r}_{0}={2.78}_{-1.05}^{+1.16}\,{h}^{-1}\,\mathrm{cMpc}$ (at a fixed slope γ = 1.8). Although the cross-correlation was found to be consistent with a power-law shape, indicative of a concentration of LAEs centered on quasars, they find 2.1 times fewer LAEs than the expectation computed by assuming a deterministic bias model.

This work contradicts the findings of recent deeper ($\mathrm{log}({{\rm{L}}}_{\mathrm{Ly}\alpha }[\mathrm{erg}\,{{\rm{s}}}^{-1}])\geqslant 42.1$) observations of 27 z = 3–4.5 quasar environments performed with the Multi Unit Spectroscopic Explorer (MUSE; Bacon et al. 2010) on the Very Large Telescope (VLT) that revealed a large presence of LAEs (Fossati et al. 2021). They find an overdensity of 4.1 at scales R ≲ 0.6 h−1cMpc, and they measure a cross-correlation length ${r}_{0}={3.15}_{-0.40}^{+0.36}\,{h}^{-1}\,\mathrm{cMpc}$ at a fixed slope γ = 1.8. 8

The aforementioned works statistically demonstrate the existence of galaxy overdensities concentrated around quasars at z ∼ 4, but they are still contradictory in the detection of the ubiquitous large galaxy concentration expected based on the theoretical deterministic bias model. Various possibilities can be cited to explain this discrepancy. First, most of the previous studies rely on quasar redshifts determined from rest-frame UV-emission lines that are known to be offset from the quasar systemic redshifts (e.g., Richards et al. 2002; Shen et al. 2007, 2016; Coatman et al. 2017). If these offsets are large, it would imply that the search for galaxies has actually been performed at higher or lower radial comoving distances from the quasar, where the clustering signal is vanishingly small, resulting in a lower number density of galaxies, and thus the expected large overdensities would not be detectable.

A second possibility is that the complex physical processes associated with quasar radiation affect the visibility of LAEs in their vicinity. On one hand, ionizing radiation coming from the quasar could suppress star formation in nearby galaxies (e.g., Francis & Bland-Hawthorn 2004; Bruns et al. 2012), making the Lyα line emission and UV continuum less intense—and implying a lower number of detectable galaxies in quasar environments. On the other hand, the ionizing radiation from the quasar could induce Lyα fluorescent emission (Cantalupo et al. 2012), resulting in an increase of the LAE number density. These two effects work in opposite directions, and disentangling them becomes extremely difficult unless the environments are traced by a galaxy population less affected by such feedback effects. Note, however, that only small-scale environment studies would be more impacted by the mentioned effects, since the ionizing radiation is not expected to extend up to the large scales at which most previous quasar environment studies have been performed.

Finally, galaxies in quasar environments could be more dusty, and thus the optical/UV emission would be slightly obscured, making them undetectable at optical wavelengths. This would explain why only deep LAE studies reveal large overdensities (e.g., Fossati et al. 2021), while shallower observations miss a large fraction of them (e.g., García-Vergara et al. 2019). Submillimeter observations, which are less affected by dust, would be required to explore this possibility. Indeed, some serendipitous Atacama Large Millimeter/submillimeter Array (ALMA) detections of dusty galaxies have been recently reported in high-z quasars environments, on scales of R < 0.5 cMpc (Decarli et al. 2017; Trakhtenbrot et al. 2017; Venemans et al. 2020), suggesting a possible strong clustering of dusty galaxies around quasars. However, the quasar–galaxy cross-correlation function has not been constrained.

In summary, incomplete sampling of the galaxy population around quasars could be responsible for the contradictory results obtained so far. Systematic and multiwavelength studies are still required to statistically quantify galaxy overdensities of different galaxy populations in z ∼ 4 quasar environments.

In this paper, we present an ALMA survey of CO(4–3) line emitters in the environs of 17 quasars at z ∼ 4, the same quasar sample recently studied for LAE overdensities by García-Vergara et al. (2019), to constrain the small-scale (R ≲ 1.5 h−1 cMpc) clustering of CO emitters, and ultimately trace simultaneously the clustering properties of both optical and dusty galaxy populations around quasars. This is the first quasar sample observed with ALMA for environmental studies purposes, and thus provides an opportunity to statistically trace—for the first time—the overdensity and clustering of dusty galaxies around high-z quasars.

Additionally, our observations provide a more precise measurement of the quasar redshift, previously determined from rest-frame UV-emission lines, which is important for clustering analyses. We note that, with our observations, we could also trace the number density of continuum sources; however, this information is not useful for clustering analysis. Given the roughly constant flux density of galaxies at submillimeter wavelengths across the redshift range 1 ≲ z ≲ 7 (Blain et al. 2002), we would not distinguish between foreground and background sources, and therefore all the detected sources would be included in the clustering computation. The clustering signal would thus be integrated over large comoving volumes, which would strongly dilute the real clustering signal in the close environment of quasars.

The focus of the present paper is on the CO(4–3) emitter number counts in quasar environments, while two forthcoming papers (C. García-Vergara et al. in preparation) analyze the quasar host galaxy properties and the multiwavelength properties of galaxies in quasar fields, respectively.

The outline of the paper is as follows. In Section 2, we describe the targeted quasar sample, the available ancillary data in the fields, the ALMA observations, and data reduction. In Section 3, we describe the detection and selection of CO(4–3) emitting galaxies and present the final catalog. The galaxy number counts and clustering analysis are presented in Section 4. We discuss and interpret our results in Section 5, and we summarize in Section 6. The Appendix discusses the impact of several factors on our clustering measurement. Throughout the paper, we adopt a ΛCDM cosmological model h=0.7, Ωm = 0.30, ΩΛ = 0.70, and σ8 = 0.8, which is consistent with Planck Collaboration et al. (2020). Comoving and proper Mpc are denoted as "cMpc" and "pMpc," respectively. Magnitudes are given in the AB system (Oke 1974; Fukugita et al. 1995).

2. Observations and Data Reduction

2.1. Quasar Sample

The quasar sample used in this study is the same sample presented by García-Vergara et al. (2019), who observed 17 quasar fields at optical wavelengths to search for LAEs in z ∼ 4 quasar environments. We refer the reader to the aforementioned work for details of the quasar selection strategy. Briefly, the 17 quasars were selected from the Sloan Digital Sky Survey (SDSS; York et al. 2000) and the Baryon Oscillation Spectroscopic Survey (BOSS; Eisenstein et al. 2011; Dawson et al. 2013) quasar catalog (Pâris et al. 2014) to lie within a redshift window of z ∼ 3.862–3.879 (corresponding to δv = 1066 km s−1 at z = 3.87). This redshift window was chosen based on the coverage of the optical narrowband filter used to detect LAEs. The quasar redshifts were determined based on one or more rest-frame UV-emission lines (S iv λ 1396, C iv λ 1549, and C iii] λ 1908) and using the calibration of emission-line shifts from Shen et al. (2007) to estimate the systemic redshift. All the quasars have 1σ redshift uncertainties ≲800 km s−1. We tabulate the quasar positions, redshifts, and i-band magnitudes in Table 1.

Table 1. Optical Properties of the Targeted Quasars

Target IDR.A.Decl.Redshift i mag
(SDSS)(J2000)(J2000)  
J0040+170600:40:17.426+17:06:19.783.873 ± 0.00818.91
J0042-102000:42:19.748−10:20:09.533.865 ± 0.01218.57
J0047+042300:47:30.356+04:23:04.733.864 ± 0.00819.94
J0119-034201:19:59.553−03:42:16.513.873 ± 0.01320.49
J0149-055201:49:06.960−05:52:18.853.866 ± 0.01319.80
J0202-065002:02:53.765−06:50:44.543.876 ± 0.00820.64
J0240+035702:40:33.804+03:57:01.593.872 ± 0.01220.03
J0850+062908:50:13.457+06:29:46.913.875 ± 0.01320.40
J1026+032910:26:32.976+03:29:50.633.878 ± 0.00819.74
J1044+095010:44:27.798+09:50:47.983.862 ± 0.01220.52
J1138+130311:38:05.242+13:03:32.613.868 ± 0.00819.10
J1205+014312:05:39.550+01:43:56.523.867 ± 0.00819.37
J1211+122412:11:46.935+12:24:19.083.862 ± 0.00819.97
J1224+074612:24:20.658+07:46:56.333.867 ± 0.00819.08
J1258-013012:58:42.118-01:30:22.753.862 ± 0.00819.58
J2250-084622:50:52.659−08:46:00.223.869 ± 0.01219.44
J2350+002523:50:32.306+00:25:17.233.876 ± 0.01220.61

Note. Quasar positions are determined from optical images (SDSS/BOSS quasar catalog; Pâris et al. 2014), redshifts are based on one or more rest-frame UV-emission lines and calibrated to estimate the systemic redshift (see Section 2.1), and magnitudes correspond to the i-band magnitudes from SDSS.

Download table as:  ASCIITypeset image

2.2. Ancillary Data

The 17 quasar fields were previously observed (Program ID: 094.A-0900) using the FOcal Reducer and low-dispersion Spectrograph 2 (FORS2; Appenzeller & Rupprecht 1992) instrument on the VLT in 30 hours of observing time. Optical deep observations were acquired using the narrow band HeI (λc = 5930 Å, FWHM = 63 Å) and the broad bands gHIGH (λc =4670Å, hereafter g) and RSPECIAL (λc = 6550 Å, hereafter R) to detect LAEs around the quasars. The total area observed per field was 6.8 × 6.8 arcmin2, with an image pixel scale of 0.25''/pix. The medians of the 5σ limiting magnitudes reached in the fields were 24.45, 25.14, and 25.81 for the HeI, R, and g images, respectively.

A catalog of 25 LAEs within R ≲ 7 h−1 cMpc in the quasar fields is available (García-Vergara et al. 2019). LAEs were selected based on a color selection criteria such that they have a (rest-frame) EWLyα > 28Å and a solid detection of S/N ≥ 5.0 in the HeI band. All the LAEs lie approximately within ±1598 km s−1 (corresponding to ± 18.7 cMpc at z = 3.78) from the quasar systemic redshift, as given by the coverage of the used NB filter. Only two out of 25 LAEs lie within a radius of 59'' from the central quasar, which is the area covered by the ALMA observations presented in this study.

2.3. ALMA Band 3 Observations

Our observations were carried out during ALMA Cycle 7, as part of the observing program #2019.1.00411.S (PI: C. Garcia Vergara). The data were taken in Band 3 (84–116 GHz) between 2019 October 4 and 2020 March 19. Most targets were observed in a single scheduling block with 45–50 minutes on-source, except for SDSS J0040+1706, SDSSJ1138+1303, SDSSJ1205+0143, and SDSSJ1224+0746, which were observed in two scheduling blocks.

The 17 pointings were centered at the optical coordinates of each quasar (specified in Table 1). The spectral setup consists of four spectral windows (SPWs). One of the SPWs (hereafter SPW0), with a 1.875 GHz bandwidth, was centered on the expected observed frequency of the CO(4–3) emission line from the central quasar, given the quasar systemic redshifts reported in Table 1. The 1.875 GHz bandwidth correspond to δv ∼ 6000 km s−1 (or δ z = 0.098) at the observed frequencies in SPW0. The SPW0 was placed into the lower sideband and configured in Frequency Division Mode (FDM), which is usually used for spectral-line observations, and provides a spectral channel width of 0.488 MHz. Channels were then averaged together resulting in a final resolution of 7.8125 MHz (equivalent to ∼25 km s−1 at the observed frequency).

The remaining three SPWs were located at higher frequencies, with a 2.0 GHz bandwidth. One of them (SPW1) was adjacent to SPW0, and the other two (SPW2 and SPW3) were in the upper sideband (at ∼12 GHz from the lower sideband). SPWs 1, 2, and 3 were configured in Time Division Mode (TDM), which is usually used for continuum observations because it optimizes the continuum sensitivity. This provides a spectral channel width of 15.625 MHz.

Most observations were taken in the C43-4 array configuration with a maximum baseline length of 783 m, but several observations were taken in a more extended configuration, with a maximum baseline length of 1231 or 2517 m. Table 2 summarizes the details of individual observing blocks. The shortest baseline was always 15 m, and the maximum recoverable scale ranged between 10'' and 14''. The number of 12 m antennas used ranged between 32 and 49.

Table 2. ALMA Observations and Imaging Summary

Target IDDates observed Nant Max baselineBeam size (PA) σcont σSPW0 log ${L}_{\mathrm{CO}(4-3)}^{{\prime} }$
   [m][arcsec × arcsec], [deg] μJy beam−1 mJy beam−1 [K km s−1pc2]
J0040+17062020 Mar 17321231 m1.21 × 1.00 (36)12.80.329.78
 2020 Mar 18391231 m  
J0042-10202019 Oct 1949783 m1.42 × 1.19 (−85)8.90.239.64
J0047+04232019 Oct 1448783 m1.38 × 1.26 (−37)9.00.249.65
J0119-03422019 Oct 1943783 m1.41 × 1.22 (−78)8.40.239.64
J0149-05522019 Oct 1949783 m1.39 × 1.19 (−65)8.80.249.66
J0202-06502019 Oct 1949783 m1.49 × 1.17 (−62)8.50.239.64
 2019 Oct 14 a 43783 m   
J0240+03572019 Oct 4482517 m1.24 × 0.91 (−53)14.10.359.82
J0850+06292020 Mar 244783 m1.38 × 1.22 (49)10.10.259.67
J1026+03292019 Oct 1946783 m1.37 × 1.24 (−58)9.80.239.63
J1044+09502020 Mar 243783 m1.41 × 1.30 (23)9.50.239.64
J1138+13032020 Mar 1643783 m1.30 × 1.17 (42)9.30.249.66
 2020 Mar 19451231 m   
J1205+01432019 Oct 1946783 m1.33 × 1.23 (−76)8.50.239.64
 2020 Mar 343784 m   
J1211+12242020 Mar 1543783 m1.47 × 1.20 (−32)11.80.299.74
J1224+07462020 Mar 1440783 m1.38 × 1.24 (−40)8.70.239.64
 2020 Mar 1540783 m   
J1258-01302020 Mar 344783 m1.46 × 1.18 (−46)10.70.309.75
J2250-08462019 Oct 1847783 m1.39 × 1.18 (−86)10.40.259.67
J2350+00252019 Oct 1448783 m1.35 × 1.24 (−50)9.00.259.67

Notes. For each target, we list the dates of observations, the number of antennas used, and the baseline range. The synthesized beam sizes are for naturally weighted images for the final data products. We include the rms noise measured in the dirty continuum images and the rms noise per channel (channel width of 7.8125 MHz, equivalent to ∼25 km s−1 at the observed frequency) measured in the dirty spectral cube created from SPW0 (which is targeting the expected CO(4–3) emission). We also report the limiting luminosity of the CO line at 5.6σ (the S/N threshold used in this work), assuming a line width of 331 km s−1 (see details in Section 4.1). Since the rms is measured in the non-primary beam corrected image, the reported limiting luminosity corresponds to the center of the pointing for each field, but this increases with increasing radius.

a Semi-pass observation.

Download table as:  ASCIITypeset image

The primary beam FWHM is 66'' at 94.5 GHz, and the typical size of the synthesized beam is 1farcs4 × 1farcs2 (see Table 2).

2.4. Imaging

All data processing and imaging was performed using Casa (Common Astronomy Software Applications package, McMullin et al. 2007), version 5.6.1. We process the data using the standard ALMA pipeline, however, some data sets required further manual flagging of problematic antennas. Specifically, we flagged the antennas DA60 and DA58 in J1258-0130, DV04 and DV11 in J0240+0357, DV19 in J1138+1303 (2020 March 19 observation), and DV17 for J1044+0950. The sources were too faint for a successful self-calibration.

For each quasar field, we used Casa's task tclean to create both dirty-image and clean spectral cubes at native frequency resolution. We use the Högbom cleaning algorithm and recalculate the noise per visibility (setting fastnoise=False). All images were created using natural weighting to maximize the surface brightness sensitivity. For the cleaned images, we used a stopping threshold of 1.5σ without manual masking. We set the pixel size to 0farcs2, which ensures that the synthesized beam is sufficiently subsampled. Note that the line search was performed on the dirty cubes, and the clean cubes are only used to extract the detected sources (see Section 3.1).

J0240+0357 was observed with a rather sparse UV-plane coverage, resulting in a noticeably non-Gaussian dirty beam with large sidelobes. To improve the image quality, we reimaged this source using an outer Gaussian taper of 1farcs5.

We use the dirty spectral cubes created from SPW0 (the cube at the expected observed frequency of the quasar emission line) without a primary beam correction to compute the rms noise per 7.813 MHz channel throughout the scanned frequency range in a rectangular region that covers most of the region within the primary beam FWHM. The rms noise varies only slightly per channel, (typically ∼3%), so we compute the median rms noise measured in all the channels of the spectral cube for each field and report these values in Table 2.

Finally, we create a continuum image by combining all four SPWs. For each quasar field, we compute the rms noise in the dirty continuum images (without primary beam correction) over the same region as for the SPW0 cube and tabulate the values in Table 2. The achieved rms is in agreement with expectations from the ALMA sensitivity calculator. Note that, in our analysis, we do not subtract the continuum from the data.

3. Emission Line Catalog

In this study, we focus on the measurement of the small-scale clustering of CO(4–3) emission lines around the central quasar. Therefore, we limit our emission line search to the spectral cube created from SPW0, where the central quasar is expected to be located. The procedure and catalog presented in this section only contain sources detected in this spectral cube, which cover ±3000 km s−1 around the quasar. At the end of this section, we briefly present the detection of the quasars, and the comparison between their optical and ALMA redshifts (see Section 2.1). We present further details about the quasars in a forthcoming paper (C. García-Vergara et al. in preparation).

3.1. Line Search Algorithm

We perform a blind search for emission lines in the quasar fields (i.e., on the 17 spectral cubes extracted from the SPW0). Since we do not expect to detect extremely bright sources in our cubes, we prefer to use the "dirty" cubes to perform the line search, allowing us to preserve the intrinsic properties of the noise in the cubes. We use findclumps, an IRAF-based routine originally developed for the blind search of CO lines performed in the ALMA spectroscopic survey in the Hubble Ultra Deep Field (ASPECS; Walter et al. 2016; Decarli et al. 2019), a 3D survey of gas and dust in distant galaxies. We refer the reader to these works for further details about the algorithm implemented in the findclumps routine. Briefly, findclumps performs a top-hat convolution using Nchn consecutive frequency channels of the cube to create a combined image, with Nchn ranging from 3 up to 19 (corresponding to 74–470 km s−1 at z = 3.87) in steps of two channels. For each combined image, the rms is computed 9 and a search for sources is performed using SExtractor (Bertin & Arnouts 1996), resulting in one source catalog. The signal-to-noise (S/N) ratio of each source in this catalog is computed using the peak flux of the source and the rms computed for the combined image. We only keep sources with S/N ≥ 3.5 in the catalog. The search is performed not only within the primary beam FWHM but over the entire area covered by our observations, which is given by a circular area with a radius 59'' (at which the telescope sensitivity is 10% of the maximum).

Since the source search is performed repetitively using different numbers of channels, we obtain several individual source catalogs, and we combine these into one final catalog per cube; however, the resulting final catalog contains duplications. For example, if the FWHM of an emission line is equivalent to 19 channels, the algorithm will detect such a line with the maximum S/N in the combined image created by using 19 consecutive frequency channels. However, that source may also be detected in the combined image created by using fewer consecutive channels, although with lower S/N because in this case only part of the total flux is encompassed in the combined image.

Since the S/N of a line is maximized when detected combining a number of channels equivalent to the exact width of the line, we look for all the sources that fall within 1farcs5 (which is similar to the size of the synthesized beam in our images) and 650 km s−1 (0.21 GHz at z = 3.87), and we only keep in our final catalogs the source with the highest S/N. This procedure effectively removes the duplications, but we could also be missing some real close pairs of sources (for example mergers). We repeat this line search process on all our cubes, resulting in 17 source catalogs.

3.2. Fidelity

The assessment of the reliability of the sources in our catalogs is crucial to distinguish real detections from spurious detections only caused by noise peaks exceeding our S/N threshold. The reliability of a line can be determined by exploring the noise properties of each cube. Specifically, we apply the same line search algorithm described in Section 3.1 to the negative cubes, and we create final catalogs containing (unphysical) negative lines detected with S/N ≥ 3.5. Since negative lines are only produced by noise fluctuations, this catalog can be used to determine the probability that a positive line detection is due to noise as a function of their S/N. Note, however, that this probability also depends on the line width, such that at a fixed S/N, a source with a wider line has a higher probability to be real than a source with a narrower line (see a detailed discussion in González-López et al. 2019).

To estimate the noise distribution of each data cube, and under the assumption of a Gaussian distribution for the noise, we fit a Gaussian (centered at S/N = 0) to the S/N histogram of the negative lines for the different width lines. To improve the statistics, we have grouped detections with different widths together before performing the fitting; specifically, we grouped lines with channels width Nchn 3 and 5, 7 and 9, etc. To better take into account the low-number statistics in the tails of the noise distribution, we perform the fitting using a Poisson maximum likelihood estimator.

We compute the reliability (or fidelity F) of a detection as:

Equation (1)

where Npos(S/N, Nchn) and Nneg(S/N, Nchn) are the number of positive emission lines and the expected number of negative emission lines, respectively, in each S/N bin. The expected number of negative emission lines is computed using a Gaussian function with the best-fitted parameters found by the fitting process described above.

In some previous works (e.g., Walter et al. 2016; Loiacono et al. 2021), the fidelity has been computed using the cumulative number counts (i.e., Nneg( ≥ S/N) and Npos( ≥ S/N)) instead of the differential number counts. However, if a cumulative approach is used, the computed fidelity at intermediate-S/N values (for instance at S/N ∼ 4.5) would be completely dominated by the higher-S/N detections (for instance, a detection with S/N ∼ 9.0), resulting in a boost of the fidelity for lower-S/N detections. To avoid this artificial boost of fidelity of intermediate-S/N sources, here we use Nneg and Npos as the differential numbers of detections in bins of S/N values.

The noise properties varies among the 17 cubes, so we repeat this process for all our 17 cubes individually and determine the fidelity of each detected source in the cubes. For visualization purposes only, we computed the median fidelity over the cubes and show this in Figure 1. Since we compute fidelity using the differential number of detections, the fidelity cannot be constrained in S/N bins where no positive lines are detected. For computing the median, we only use the bins with constrained fidelity over the fields (data points in Figure 1) and we linearly interpolate between the S/N values.

Figure 1.

Figure 1. Median fidelity of a line candidate as a function of S/N for different line widths. In our analysis, we use a fidelity computed for each cube separately, but we show the median value in this figure for visualization. Taking into account all the fields together, we find that fidelity 0.8 is typically reached at S/N ∼ 5.6, S/N ∼ 5.7, S/N ∼ 5.8, and S/N ∼ 6.0 for sources with line widths 15–19 (371–470 km s−1), 11–13 (272–322 km s−1), 7–9 (173–223 km s−1), and 3–5 (74–124 km s−1) channels, respectively.

Standard image High-resolution image

3.3. Line Catalog

The choice of the fidelity threshold to keep a source in our catalogs is crucial and has to be taken carefully. This choice varies between different studies, depending on the aim of the analysis. Decarli et al. (2019) is aimed to constrain the CO luminosity function (used in our work as a reference for the computation of the background number counts as described in Section 4.1), and they choose a fidelity threshold of 0.2. However, they treat the fidelity as an upper limit and use a Monte Carlo simulation to generate several realizations of their final catalog, in which they randomly assign a fidelity between 0.0 and their upper limits, and they only keep sources if this random fidelity value exceeds their threshold of 0.2. This means that lower-fidelity sources are less likely to be kept in the catalog for each realization.

This approach is adequate for measuring the luminosity function because, even if sources with low fidelity (likely to be spurious) are included in some realizations, the overall statistics of the number counts will stay mostly unchanged (as explicitly checked in Decarli et al. 2019). Additionally, low-fidelity sources are mostly composed of faint sources, only affecting the faint end of the luminosity function, which has little impact on the overall fitting for the number counts.

In the case of clustering measurements, secure detections are required, since all the sources contribute with the same weight to the final measurement, no matter their intrinsic flux. Including spurious detections would have a strong impact on the final measurement, especially for sparse samples. For this reason, we prefer to choose a more conservative fidelity threshold of ≥0.8, and we only keep sources fulfilling this criterion in our final catalogs. We nevertheless explore the impact of the fidelity threshold choice on our clustering results in the Appendix.

From our median fidelity computation (see Figure 1), we find that fidelity 0.8 is typically reached at S/N ∼ 5.6, S/N ∼ 5.7, S/N ∼ 5.8, and S/N ∼ 6.0, for sources with line widths 15−19 (371–470 km s−1), 11−13 (272–322 km s−1), 7−9 (173−223 km s−1), and 3−5 (74−124 km s−1) channels, respectively.

Since the findclumps algorithm determines the source S/N using a global rms computed on the primary beam FWHM region of the ALMA pointing, the S/N of the detected candidates located close to the edges of the pointing may be overestimated due to noise fluctuation affecting these regions of the pointing. However, we still could detect reliable sources in these regions, so we include sources within the entire area covered by our observations, but we increase the fidelity threshold up to ≥0.9 for sources located in the edges of the pointing (where the telescope sensitivity is lower than 20% of the maximum—or equivalently, at radii larger than 50.14'' from the central quasar).

Considering all the fields together, our final catalogs contain nine sources (excluding the detection of the central quasar). Note that this is the total number of sources detected in all the SPW0 cubes, but not all of them are necessarily included for the clustering computation (see details in Section 4.1).

We extract the 1D spectra for all the selected candidates from the clean cubes. At the resolution of our observations, the sources are not significantly resolved. Therefore, we perform the spectral extraction on the brightest pixel of the source, and we perform a Gaussian fit with a flat continuum to constrain the line center (νline) and the line standard deviation (σline). We use the immoments task from the Casa package to create a 0th moment map for each source. For that, the data cube is integrated over a frequency width given by 2.8σline centered on νline, which recovers ∼84% of the line flux.

We measured the total integrated line flux from the 1D spectra by adding the flux from all the channels within ±1.4σline from the emission line. We note that this may result in a slightly different S/N ratio than the reported from the line search algorithm, since the computation is made over a different number of channels in some cases (see the gray area versus the dotted lines in the 1D spectra shown in Figure 2). The fluxes are not corrected by the primary beam response of the telescope. Note that the integrated line flux of the individual objects is not relevant for the clustering analysis, since clustering is based only on the position of the sources. The only requirement is that the total integrated line flux of all the sources should be always greater than or equal to 5.6σ (the S/N threshold used in this work), with σ being the limiting flux of the survey. The limiting flux of the survey is the only relevant flux measurement used in the clustering analysis (see Section 4.1), and it is computed based on the rms of the cubes (see Table 2).

Figure 2.

Figure 2.  Left: Line maps for all the sources detected in our survey with fidelity ≥0.8 (or ≥0.9, if they are located at radius >50farcs14 from the pointing center), integrated over a frequency width given by 2.8σline around the center of the line. Contours start at ±2σ and increase in steps of 1σ. The white ellipse in the lower left corner shows the FWHM beam size. Right: 1D extracted spectra on the brightest pixel of each source. We show the Gaussian plus flat continuum fit as a red curve. The shaded gray area shows the line width as detected by the line search algorithm. The vertical lines show the channels where the integrated line flux was measured, corresponding to a width of 2.8σline centered on νline. In each panel, we report the best-fitted FWHM value, the S/N of the line determined by the line search algorithm (which is computed over the gray area shown in the spectra), and the fidelity. Line maps and spectra are both extracted from the cleaned cubes, and they are not corrected by the primary beam response.

Standard image High-resolution image

In Figure 2, we show the moment 0 maps and the 1D extracted spectra for all the sources in our catalogs, and the best-fitted parameters for the lines are listed in Table 3. In Table 5, we summarize the number of emission lines in each individual field.

Table 3. Properties of the Emission Lines Detected in All the 17 Fields Properties of the Emission Lines Detected in All the 17 Fields

IDR.A. (J2000)Decl. (J2000) νline FWHMline S/NFidelity δ θ $\delta {\rm{v}}$ F Optical Counterpart
 [deg][deg][GHz][km s−1]  [''][km s−1][mJy km s−1] 
(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)
J0042-1020.200:42:18.48−10:20:36.4895.061296.010.9033.0−1767.1978.08 ± 13.75 R > 25.6, g > 26.3
J0119-0342.101:20:00.37−03:42:59.9095.33585.760.8545.1−2266.3648.65 ± 9.14 R > 25.7, g > 26.4
J0119-0342.301:20:02.15−03:42:16.9693.901285.710.8539.02270.9470.32 ± 14.13 R > 25.7, g > 26.4
J0149-0552.2 a 01:49:08.41−05:52:41.3494.79475.910.8031.3−851.9742.31 ± 9.73 R > 25.7, g > 26.5
J0202-0650.1 a 02:02:54.72−06:50:51.8894.8217313.111.0016.1−829.85193.75 ± 15.79 R = 24.4, g = 25.7
          Lyman break detection
          with gR = 1.3
          located at 1farcs2
J0850+0629.1 a 08:50:14.2206:29:49.9394.8042817.601.0011.9−710.84556.37 ± 27.58 R > 25.7, g > 26.3
J1211+1224.1 a 12:11:45.0612:25:06.2594.65756.260.9354.9559.3190.54 ± 16.91 R > 25.3, g > 26.3
          LAE located at 1farcs6
J1258-0130.112:58:43.81−01:31:09.3594.852995.970.9153.1−1220.26120.69 ± 26.47 R > 25.7, g > 26.4
J2250-0846.2 a 22:50:50.92−08:45:40.3994.437288.501.0032.7398.63277.69 ± 34.97 R > 25.6, g > 26.4

Notes. This table contains all the emission lines detected with Fidelity $\ge 0.8$ (or $\ge 0.9$ if they are Located at Radius >50farcs14 from the Pointing Center). Columns (1), (2), and (3) indicate the name of the candidate and the R.A., decl. sexagesimal coordinates. Columns (4) and (5) indicate the central frequency and the FWHM of the detected line as determined by the best Gaussian+continuum fitting. Column (6) shows the S/N of the detection as determined by the line search algorithm. Column (7) shows the fidelity computed using Equation (1). Columns (8) and (9) show the angular distance and the velocity distance, respectively, between the emission line and the central quasar, assuming that the observed line corresponds to the CO(4–3) transition. For the computation of the velocity distance, we use the redshift based on the ALMA detection of the quasar, when it is detected with fidelity ≥0.8; otherwise, we use the redshift as determined from the optical quasar spectrum. Column (10) indicates the (non-primary beam corrected) integrated line flux measured from the 1D spectra presented in Figure 2. Column (11) shows the optical color–magnitude g and R measured in a 2'' aperture in the position of the source (or at the position of a counterpart, if one exists within 2''). The lower limits for the magnitude correspond to 3σ limits. We provide comments about optical counterparts for the source.

a Included for the clustering measurement.

Download table as:  ASCIITypeset image

One of the sources (J2250-0846.2) is identified by our algorithm as two close (δ θ = 0farcs4, δv ∼ 625 km s−1) sources. This could be either a galaxy merger or a galaxy with a two-peaked emission line, with total width FWHM > 700 km s−1. In this study, we consider this as a single object, since even in the merger scenario this would likely trace the same dark matter halo. We study this object further in a forthcoming paper (C. García-Vergara et al. in preparation).

Our candidates span a large range of line widths, ranging from 47 km s−1 to 728 km s−1, with three of the lines showing small FWHM values (<100 km s−1). 10 To explore whether the existence of such narrow lines is expected and reasonable, we checked the line widths of the 16 CO-emitting galaxies blindly found in the ASPECS survey with fidelity >0.9 (González-López et al. 2019). We find that their FWHMs range from 40 km s−1 to 617 km s−1, including two very narrow lines with 40 km s−1 (with S/N = 7.9) and 50 km s−1 (with S/N = 9.5), respectively. Using HST counterparts for the CO emitters detected in ASPECS, Aravena et al. (2019) suggest that these galaxies are likely very face-on, which would explain the detection of such narrow line widths in CO-emitting galaxy surveys.

3.4. Line Identification and Contamination

As we targeted quasars with known redshifts, which are expected to be surrounded by large overdensities of galaxies, and given that we are tracing relatively small areas in the sky, we assume that all the detected lines correspond to the CO(4–3) transition and that the serendipitous detection of emission lines at other redshifts is negligible. To support this assumption, here we quantify the probability that each of our detected lines is actually a galaxy at a different redshift, and we also use our optical images to look for a counterpart of the detected emission lines.

3.4.1. Probability of Low-z Contaminants

First, we compute the probability that each source in our catalog is a low-z galaxy. For this computation, we first explore what transitions at low-z could be detectable given the frequency setup of our survey, and we compute the comoving volume traced for them in each quasar field. We consider a circular area given by R ≤ 59'' (the same used to look for emission lines) and a redshift coverage given by (i) δv = ±3000 km s−1, corresponding to the width of the whole cube, the same as where we detect our nine candidates, and (ii) δv = ±1000 km s−1, corresponding to the width used for the clustering analysis, where we detect five candidates (see details in Section 4.1). We summarize this information in Table 4. The main possible contaminants in our sample are the CO transitions CO(1–0) at z ∼ 0.2, CO(2–1) at z ∼ 1.4, and CO(3–2) at z ∼ 2.7.

Table 4. Probability of the Presence of Low-z Contaminants in Our Survey

Transition ${z}_{\min }$ ${z}_{\max }$ Volume3000 N3000 N1000
   [h−1cMpc ]  
(1)(2)(3)(4)(5)(6)
CO(1–0)0.2060.2316.4860.0230.008
CO(2–1)1.4131.461147.2760.2500.083
CO(3–2)2.6192.691253.0830.1030.034

Note. For the different transitions, we report the redshift and the comoving volume traced in one quasar field at the redshift of the transition. The comoving volume is given by a circular area with radius R ≤ 59'' and a redshift coverage given by δv = ±3000 km s−1. Columns (5) and (6) show the expected number of lines per quasar field in the comoving volume indicated in column (4), and in 1/3 of that volume (i.e., assuming δv = ±1000 km s−1), respectively. Note that, in the absence of clustering, the number of expected lines only depends on the volume; therefore, the number of lines simply decrease by a factor of three in column (6) compared with column (5).

Download table as:  ASCIITypeset image

To compute the probability, we use the CO luminosity function of the different CO transitions (Decarli et al. 2019) at the corresponding redshift, and integrate it down to the 5.6σ limiting luminosity of our survey (this is the same S/N threshold that is used in this work for the detections of CO emitters; see Section 3.3) to compute the number density of sources for each transition. We multiply this quantity by the traced comoving volume to obtain the expected number of contaminants in one field, and we show the result in Table 4. The limiting luminosity at the center of the pointing is reported in Table 2. Note that the limiting luminosity varies over the ALMA pointing, due to the decrease of sensitivity with the radius, thus we include this in our computation, following the procedure described in Section 4.1.

The probability that each of the lines presented in our final catalog is CO(1–0), CO(2–1), and CO(3–2) is 2%, 25%, and 10%, respectively. If we only focus on the sample used for the clustering analysis composed of five sources, we find that the probability that each of these sources is CO(1–0), CO(2–1), and CO(3–2) is 0.8%, 8.3%, and 3.4%, respectively.

3.4.2. Optical Counterparts

The second approach that we use to explore possible contamination from low-z galaxies is to cross-match the spatial position of the detected lines with the position of LAEs detected in the optical images available for our 17 fields. We limit the search for the optical counterparts of our emission lines to a radius of 2''. Using the available LAE catalog (see section Section 2.2), we find that one of the detected candidates (J1211+1224.1) coincides with the position of an LAE detected in our previous survey. We also explored the optical images at the position of all the line detections, but we do not find any LAE candidate at these positions, even when relaxing the LAE detection threshold down to S/N > 3.

We also check the possibility of including additional low-fidelity line emission sources at the position of the only other S/N > 5 LAEs that were located within the ALMA Field-of-view. However, even by relaxing the fidelity criterion down to 0.3 in the ALMA catalog, no line detections were found within 3'' from the position of the LAE.

Since the UV-continuum emission of z ∼ 4 galaxies is characterized by a strong flux break at λRF = 1216 Å (the so-called Lyman break), owing to the absorption of photons with λRF ≤ 1216Å by neutral hydrogen in the intergalactic medium, we can use the R- and g-band optical images to detect the expected flux break. Although all the galaxies should exhibit this break, not all of them will exhibit the LAE emission line, and thus they may not be included in the LAE catalog. Note that the presence of the flux break by itself cannot confirm the redshift of the source, since we would need an additional observation on the UV continuum at longer wavelengths to exclude possible interlopers, but the absence of the break can be used to confirm that the source is located at lower redshift.

For all the detected lines, we search for a ≥3σ detection in the R and g images within 2''. If a source is found in either of these bands, we perform a 2'' aperture photometry on that position in the two bands following the same criteria as in García-Vergara et al. (2019). If no source is found, we perform the aperture photometry at the position of the detected line and compute 3σ upper limits for the fluxes in the g and R bands. In all the cases, we compute the color–magnitude gR and report these values in Table 3. As in García-Vergara et al. (2019), the Lyman break is defined by the color criteria gR ≥ 0.7.

In addition to the LAE mentioned above (J1211+1224.1), we find that one other CO-emitting galaxy (J0202-0650.1) exhibits the Lyman break, suggesting that it is possibly an LBG at z ∼ 4. All the other galaxies are not detected in either g and R bands, so we do not have information to trace the Lyman break, and we can not confirm that any of these are low-z interlopers. However, the lack of an optical counterpart at the VLT survey (with 3σ limiting magnitude 25.61 and 26.37 for the R and g images respectively) makes these objects unlikely to be lower-z (z ∼ 0.2 for CO(1–0), z ∼ 1.4 for CO(2–1), and z ∼ 2.6 for CO(3–2); see Table 4) galaxies.

Specifically, we check the typical R magnitude for CO(2–1) emitters at z ∼ 1.4, which are our main potential contaminants (see Table 4). We used the photometric catalog available from Skelton et al. (2014) to check the median R magnitudes 11 for the 11 CO(2–1) sources detected with high fidelity in the ASPECS survey (see Table 6 in González-López et al. 2019). We find that these sources have a median magnitude R = 24.83, which is much lower than the 3σ limiting magnitude in the R band for our candidates (see Table 3), making them unlikely to be CO(2–1) emitters at z ∼ 1.4.

3.5. Quasar Detection

We detect an emission line in 10/17 quasars, which we assume to be the CO(4–3) transition, as they all have an optical counterpart and spectroscopic redshift from SDSS spectra. All 10 lines have fidelity ≥0.8 and are detected with S/Ns ranging from 5.9 up to 24.5. We relax our S/N criterion down to S/N > 4, and we look for possible line candidates for the other seven quasars, restricting the search to detections with line widths ≥124 km s−1. When more than one candidate was found at the expected sky projected position of their optical counterpart (within 3''), we selected the one with higher S/N. We do not impose a requirement in the velocity space position of the quasar counterpart, since the redshift of optical quasars could be associated with larger uncertainties than reported in previous studies. An uncertainty in the optical computed redshift of 2970 km s−1 (one half of the bandwidth) could be still possible. This search results in the detection of all the other seven quasar counterparts, detected with S/Ns ranging from 4.1 up to 5.3, but with low fidelity (<0.4). Therefore, some of these detections could be noise fluctuations.

We compute the offset between the CO(4–3) redshift versus the redshifted based on rest-frame UV-emission lines for all the quasars, and we show our results in Figure 3 and Table 5. For the secure quasar detections, we find a median velocity offset of ∣$\delta {\rm{v}}$∣ = 738 ± 651 km s−1. This is consistent with the typical reported uncertainties of the optical-based redshifts (see Table 1). The only exception is the quasar located in the field J1224+0746, which exhibits a large offset of ∣$\delta {\rm{v}}$∣ = 2386 km s−1 and therefore is located at 1004 km s−1 from the edge of the SPW0. For completeness, we also include the velocity offset computed for the quasar detected with low fidelity in our ALMA observations, but in this study, we only use the redshifts from the secure detections. We present a detailed analysis of the quasar properties in a forthcoming study (C. García-Vergara et al. in preparation).

Figure 3.

Figure 3. Velocity offset between the quasar redshifts determined from the CO(4–3) emission lines and the rest-frame UV-emission lines. We include the secure quasar detections (black) and the low-fidelity detections (gray). Based only on the secure detections, we find a median offset of ∣δv∣ = 738 ± 651 km s−1.

Standard image High-resolution image

Table 5. Quasar Redshift Determined from ALMA Observations, and Number of Emission Lines in Each Field

Field zopt zALMA δv [km s−1] N3000 (δCO) N1000 (δCO) δLAE
(1)(2)(3)(4)(5)(6)(7)
J0040+17063.8733.910 a 2266 a 0 (00.00)0 (00.00)2.66
J0042-10203.8653.8788051 (17.77)0 (00.00)0.00
J0047+04233.8643.8777800 (00.00)0 (00.00)0.00
J0119-03423.8733.874 a 46 a 2 (34.90)0 (00.00)3.41
J0149-05523.8663.8797811 (18.61)1 (55.87)0.00
J0202-06503.8763.864 a −741 a 1 (17.63)1 (52.91)0.00
J0240+03573.8723.8836610 (00.00)0 (00.00)0.92
J0850+06293.8753.854 a −1310 a 1 (19.40)1 (58.22)2.41
J1026+03293.8783.8843800 (00.00)0 (00.00)2.16
J1044+09503.8623.861 a −37 a 0 (00.00)0 (00.00)0.78
J1138+13033.8683.867−620 (00.00)0 (00.00)0.00
J1205+01433.8673.864 a −161 a 0 (00.00)0 (00.00)1.34
J1211+12243.8623.875 a 805 a 1 (25.97)1 (77.95)3.75
J1224+07463.8673.90523550 (00.00)0 (00.00)0.00
J1258-01303.8623.88212031 (27.34)0 (00.00)1.08
J2250-08463.8693.8764331 (19.61)1 (58.86)0.77
J2350+00253.8763.8919200 (00.00)0 (00.00)0.59
ALL   9 (10.56)5 (17.60)1.36

Notes. For each field, column (2) shows the quasar redshift determined from UV rest-frame emission lines, column (3) shows the redshift determined from the CO(4–3) emission line, column (4) shows the velocity offset between these (with typical uncertainties of 651 km s−1), and columns (5) and (6) show the number of CO emitters and the corresponding overdensity within ±3000km s−1 and ±1000 km s−1 from the quasar redshift, respectively (using the redshift determined from the CO(4–3) line, if detected with high fidelity (≥0.8), and the UV rest-frame emission lines otherwise). Note that, due to the quasar redshift offsets, we are not tracing a symmetric volume of ±3000 km s−1 in all the fields; however, we are symmetrically tracing 1000 km s−1 in all the cases, since all the quasars are located at least at 1000 km s−1 from the edge of the cube. The overdensity is computed as the number of detected galaxies over the number of expected galaxies in blank fields over the same traced volume (see Section 4.1). Column (7) show the overdensity of LAEs at R ≲ 7 h−1 cMpc within ±1600 km s−1 in each field (García-Vergara et al. 2019).

a Indicates the cases where the CO(4–3) line is detected with low (<0.8) fidelity.

Download table as:  ASCIITypeset image

In Figure 4, we show the sky distribution of the detected companion lines around the central quasar, and in Figure 5 we show the distribution of the velocity offsets between the lines and the central quasar for all our fields together. For the computation of the velocity offsets, we use the redshift based on the ALMA detection of the quasar, when it is detected with fidelity ≥0.8; otherwise, we use the redshift as determined from the optical quasar spectrum (reported in Table 1).

Figure 4.

Figure 4. Sky distribution of the emission lines around the central quasar for our 17 fields combined. The central black diamond indicates the quasar position. The lines are indicated as filled circles, with different colors indicating different fields in which they were detected. We include the position of the LAEs detected with S/N ≥ 5 in the field as open squares. The largest dashed line circle shows the region where we search for line candidates, which corresponds to the whole ALMA pointing (a radius of 59'' from the central quasar) . The smaller dashed line circle shows the radius at which the telescope sensitivity is ≥20% of the maximum. Sources located outside of this limit radius were included in our catalog only if they have a fidelity ≥0.9.

Standard image High-resolution image
Figure 5.

Figure 5. Velocity offset distribution of the CO(4–3) lines around the central quasar for our 17 fields together. The vertical dashed lines indicate the velocity window that was chosen to measure the clustering of sources around quasars (see Section 4.1).

Standard image High-resolution image

4. Clustering Analysis

In this section, we measure the clustering properties of the CO(4–3) emitters around quasars, following a procedure analogous to the one in our previous studies about the clustering of LAE and LBG around z ∼ 4 quasars (García-Vergara et al. 2017, 2019). We refer the reader to these works for detailed descriptions, but in Section 4.1 we provide a brief overview. We also use the measured cross-correlation function to infer the clustering of CO emitters in blank fields in Section 4.3. We note that, in this study, we do not analyze the clustering of continuum sources, since the clustering signal would be strongly diluted when projected over the large radial comoving distance traced by the almost flat selection function of these sources over the redshift range 1 ≲ z ≲ 7.

4.1. Quasar–CO Cross-correlation Function

We measure a volume-averaged projected cross-correlation function between quasars and CO(4–3) emitters defined by

Equation (2)

where ξ(R, Z) is the real-space correlation function, assumed to have a power-law shape $\xi {(r)=(r/{r}_{0,\mathrm{QG}})}^{-\gamma }$, where r0,QG and γ are the correlation length and the slope of the correlation function, respectively. The real-space separation between objects, r, has been written in Equation (2) as a function of their two components: the transverse comoving separation R, and the radial comoving separation Z. Veff is the effective volume of the survey, which is a cylindrical shell centered on each quasar, with inner and outer radius given by ${R}_{\min }$ and ${R}_{\max }$, respectively, and with a height defined by the radial comoving distance ${Z}_{\max }-{Z}_{\min }$.

We compute χ(R) in logarithmically spaced radial bins centered on the quasar by using the estimator

Equation (3)

where 〈QG(R)〉 is the number of quasar–galaxy pairs observed in our survey, within the cylindrical shell volume, and 〈QR(R)〉 is the number of quasar–galaxy pairs that we expect to observe within the same volume but in the absence of clustering (i.e., assuming the background number density of CO-emitting galaxies at z ∼ 4).

If we had ALMA observations covering large areas of the sky at random locations (i.e., regions not containing quasars), using the same setup as for our data, we could directly measure 〈QR(R)〉 from the data. However, this is not the case. We cannot use the three quasar–free SPWs (SPWs 1, 2, and 3) for this purpose, due to the following reasons. First, the central frequency (at least for SPW1) is relatively close (<6000 km s−1) to the quasar location, and thus the background number counts could not necessarily be reached. 12 This leaves us with only the other two SPWs, which are far enough away, reaching the background number density, although they would be tracing the number density of CO(4–3) at slightly lower redshift (z ∼ 3.2). Second, the volume covered by SPW2 and SPW3 is small, resulting in extremely low statistics for the number counts. 13 This would result in large uncertainty in the computed 〈QR(R)〉, and therefore in the χ(R) value.

Third, the SPW1, SPW2, and SPW3 were observed in TDM mode, whereas the SPW0 was observed in FDM mode (see Section 2.3). This results in different spectral noise properties, as well as different spectral resolutions (the spectral resolution in SPW1, SPW2, and SPW3 is two times lower than the resolution in SPW0), which complicates the comparison of the line number density in the different cubes (we would be sensitive to a different type of lines in SPW1, SPW2, and SPW3 compared to the lines detected in SPW0). Considering all the mentioned reasons, we thus decided to estimate 〈QR(R)〉 using the CO(4–3) luminosity function measured at z = 3.8 from the ASPECS survey (Decarli et al. 2019). We detail this in what follows.

The number of quasar–galaxy pairs that we expect to observe in the absence of clustering is given by $\langle {QR}(R)\rangle ={\int }_{{V}_{\mathrm{eff}}}{n}_{\mathrm{CO}}(L^{\prime} \geqslant {L}_{\mathrm{Lim}}^{{\prime} }){dV}$, and for the cylindrical shell volume defined above, we can write

Equation (4)

where ${n}_{\mathrm{CO}}(L^{\prime} \geqslant {L}_{\mathrm{Lim}}^{{\prime} })$ is the number density of CO(4–3) emitters with line luminosity above the limiting luminosity of our survey. To compute the limiting luminosity, we follow Solomon et al. (1997),

Equation (5)

where ν0 is the rest-frame frequency of the line (ν0 = 461.04 for CO(4–3)), DL is the luminosity distance at the redshift of the source, z, and ${F}_{\mathrm{Lim}}$ is the limiting integrated line flux.

The limiting integrated line flux for each quasar field is computed using the rms noise of the cubes from Table 2, as well as the S/N threshold defined by our fidelity threshold. Specifically, we assume a typical FWHM line width of 331 km s−1, which is the median width measured in the high-fidelity CO line sample of the ASPECS survey 14 (González-López et al. 2019), and we use the corresponding S/N threshold for such a width (S/N = 5.6 for a detection with fidelity 0.8 as shown by the magenta curve in Figure 1). The limiting luminosity of the survey at the center of the pointing is reported in Table 2.

If the rms sensitivity of the survey were roughly flat, the term ${n}_{\mathrm{CO}}(L^{\prime} \geqslant {L}_{\mathrm{Lim}}^{{\prime} })$ in Equation (4) could be taken out from the integral and it could be simply computed by integrating the CO(4–3) luminosity function down to the limiting luminosity of the survey computed from Equation (5). However, since the ALMA sensitivity decreases when the radius within the pointing increases, the limiting luminosity actually depends on R, and therefore we have to model this dependency and numerically compute the integral over R in Equation (4).

We model the sensitivity variation over the pointing as a 2D Gaussian with FWHM given by the primary beam size (∼66''), and compute the primary-beam-corrected limiting integrated line flux as a function of R, which gives us the limiting luminosity as a function of R from Equation (5).

Finally, the limiting luminosity as a function of R is used to compute ${n}_{\mathrm{CO}}(L^{\prime} \geqslant {L}_{\mathrm{Lim}}^{{\prime} })$ at every dR integration step in Equation (4). Here, we use the CO(4–3) luminosity function at z = 3.8 measured by Decarli et al. (2019), which is given by the Schechter parameters $\mathrm{log}{{\rm{\Phi }}}_{* }[{\mathrm{Mpc}}^{-3}{\mathrm{dex}}^{-1}]=-3.43$, ${\mathrm{logL}}_{* }^{{\prime} }[{\rm{K}}\,\mathrm{km}\,{{\rm{s}}}^{-1}\,{\mathrm{pc}}^{2}]=9.98,$ for a fixed α = −0.2.

In principle, in Equation (4) we should include a multiplicative term encapsulating the completeness of the sample. This completeness refers to the fraction of sources detectable by the search algorithm, and it is dependent on the line flux, line width, fidelity, and S/N of the sources. 15 Based on previous estimations of the performance of the line search algorithm findclumps using simulated mock sources (e.g., González-López et al. 2019; Decarli et al. 2020; Loiacono et al.2021), we expect that at the high fidelity and S/N threshold used in our study, the completeness is near 100%, so we have neglected this completeness correction. This could result in a slight underestimation of the clustering of sources around quasars, but our main conclusions will not significantly change.

The computation of 〈QG(R)〉 and 〈QR(R)〉 is performed individually for each of the 17 fields, and then they are stacked to obtain the final χ(R) value using Equation (3). We measure the clustering using ${Z}_{\max }=-{Z}_{\min }=8.19\,{h}^{-1}\,\mathrm{cMpc}$ (corresponding to δv = ±1000 km s−1 at z = 3.87). We note that the quasar with the largest redshift offset is located at 1004 km s−1 from the edge of SPW0 (see Section 2.1), and therefore the choice of δv = ±1000 km s−1 allows us to trace a symmetric and complete volume around the quasar for all the fields. This choice results in a total of five sources included in the clustering analysis (see Table 3). Given the small size of our sample, we assume that Poisson error dominates our measurement, and we compute the one-sided Poisson confidence interval for small number statistics from Gehrels (1986). We show the measured cross-correlation in Figure 6 and tabulate it in Table 6.

Figure 6.

Figure 6.  Top: The cumulative number counts of CO(4–3) lines observed in our 17 quasar fields (〈QG( ≤ R)〉) within δ v = 1000 km s−1 from the central quasar (red points) with Poisson error bars, compared to the expectation for CO(4–3) lines in blank fields in our 17 quasar fields (〈QR( ≤ R)〉) computed using Equation (4) (black line). The gray area shows the uncertainty in this expectation. Our observations yield five CO(4–3) lines from the ensemble of the 17 fields, while we expect only 0.28 CO(4–3) lines from the background alone, resulting in a total overdensity of ${17.6}_{-7.6}^{+11.9}$. Bottom: Quasar–CO cross-correlation function χ(R) with 1σ Poisson error bars for the 17 fields (black data points) computed using Equation (3), and the best maximum likelihood estimator for both r0,QG, assuming a fixed γ = 1.8 (black line). For comparison, we show the quasar–LBG and quasar–LAE cross-correlation function (blue dashed lines) observed at z ∼ 4, and their respective uncertainties showed as blue shaded areas (García-Vergara et al. 2017, 2019).

Standard image High-resolution image

Table 6. Quasar–CO Cross-correlation Function

${R}_{\min }$ ${R}_{\max }$ QG(R)〉QR(R)〉 χ(R)
h−1 cMpc h−1 cMpc   
0.0960.1650.0000.008−1.00${}_{-0.00}^{+226.13}$
0.1650.2820.0000.025−1.00${}_{-0.00}^{+74.91}$
0.2820.4832.0000.065 ${29.89}_{-19.96}^{+40.75}$
0.4830.8272.0000.119 ${15.79}_{-10.85}^{+22.15}$
0.8271.4171.0000.063 ${14.88}_{-13.14}^{+36.53}$

Download table as:  ASCIITypeset image

Note that choosing a larger radial comoving range (δv) would result in more sources for the measurement, increasing the statistics, but the clustering signal would be diluted because we are integrating the signal in a larger volume and up to large distances from the quasar, where the background is close to being reached. We also note that keeping the volume small also decreases the probability of having a contaminant in the sample (see Section 4). We explore the effect of the chosen δv in the Appendix.

For the seven quasars not (securely) detected in our observations, we use the optical-based redshifts, which could have an offset from the ALMA-based redshift of ∼700–800 km s−1 (see Figure 3). This would mean that, for these quasars, we may be including or excluding companions at larger and smaller redshift distances, respectively. The confirmation of the precise redshift for these quasars is the only way to correct for these uncertainties in our clustering measurement, but we explore how much the clustering signal changes if we use ALMA-based redshifts for all the quasars, even if they are detected at low fidelity. We find that the same five CO emitters are kept in this case, confirming the stability of the obtained correlation function.

To determine the real-space cross-correlation parameter r0,QG that best fits our data, we use a Poisson maximum likelihood estimator. Given the noise of the data and the small physical scales proved in our study, we follow common practice and fit our data assuming a fixed slope γ = 1.8 (e.g., Ouchi et al. 2004; García-Vergara et al. 2017, 2019; Fossati et al. 2021). We find that the maximum likelihood and 1σ confidence interval for the correlation length is ${r}_{0,\mathrm{QG}}\,={8.37}_{-2.04}^{+2.42}\,{h}^{-1}\,\mathrm{cMpc}$. We use the best-fitted parameter in Equation (2) to compute the corresponding χ(R) value, which is shown as a black line in Figure 6.

Finally, we use our 〈QG(R)〉 and 〈QR(R)〉 binned values to compute the observed and expected cumulative number counts of CO(4–3) lines in our 17 quasar fields (〈QG( ≤ R)〉), and show this in Figure 6. In the whole volume survey (1751.7 h−3 cMpc3 for the 17 fields over δv = 1000 km s−1), we find a total of five CO(4–3) lines, while we expect only 0.28 CO(4–3) lines from the background alone, resulting in a total CO(4–3) line overdensity of ${17.6}_{-7.6}^{+11.9}$ in quasar fields. We note that, due to the low-number statistics, the computed overdensity is significant at the 2.2σ level, and either deeper observations or observations of a larger sample of quasar fields would be required to confirm the overdensity with higher significance. We also compute the total overdensity per field as the ratio of 〈QG(R)〉 per field over 〈QR(R)〉 integrated over the radial bins and provide these values in Table 5. Although the individual overdensity is dominated by low-number statistics and affected by cosmic variance, we include this to study possible correlations between the overdensities and the quasar properties (C. García-Vergara et al. in preparation).

4.2. Comparison with the Clustering of Other Populations around Quasars

We compare our results with the clustering of LBG and LAE around z ∼ 4 quasars. Since the previous LBG and LAE clustering studies extend up to larger scales (R ≲ 9 h−1 cMpc) than those traced in our study (R ≲ 1.5 h−1 cMpc), for this comparison we assume that the small-scale quasar–CO cross-correlation function can be extrapolated toward larger scales following a single power-law shape.

We find that the QSO-CO cross-correlation length is slightly lower, but consistent within error bars with the QSO-LBG cross-correlation length, which is given by ${r}_{0,{\rm{Q}}-\mathrm{LBG}}={9.78}_{-1.86}^{+1.68}\,{h}^{-1}\,\mathrm{cMpc}$ for a fixed γ = 1.8 16 (García-Vergara et al. 2017). This suggests that CO emitters and LBGs would inhabit dark matter halos of similar masses at z ∼ 4 (we further discuss this point in Section 4.3).

We can also compare our measurements with the clustering of LAEs around this same quasar sample, thus providing a direct comparison of optical and dusty galaxy populations in these fields. We find that the cross-correlation length for CO-emitting galaxies is ${3.0}_{-1.4}^{+1.5}$ times higher than the cross-correlation length for LAEs (${r}_{0,{\rm{Q}}-\mathrm{LAE}}={2.78}_{-1.05}^{+1.16}\,{h}^{-1}\,\mathrm{cMpc}$ with γ = 1.8; García-Vergara et al. 2019) around quasars, although we caution that the uncertainties are still large, and therefore the cross-correlation lengths are consistent within 2σ error bars. We note that the redshift window traced by both studies is similar (δv = ±1000 km s−1 for CO emitters, and δv = ±1600 km s−1 for LAEs), and thus we do not expect that this discrepancy is the result of a dilution in the signal due to differences in the traced volume. We have explicitly checked this in the Appendix, where we find that the quasar–CO cross-correlation length is only 1.1 times smaller if we assume a δv = ±1600 km s−1 (see Figure 8).

Tracing a different quasar sample at z = 3–4.5, and using deeper optical observations, (Fossati et al. 2021) measure a slightly higher quasar–LAE cross-correlation length of ${r}_{0}={3.15}_{-0.40}^{+0.36}\,{h}^{-1}\mathrm{cMpc}$ at a fixed slope γ = 1.8. 17 This is still ${2.7}_{-0.8}^{+0.8}$ times lower than the cross-correlation length for CO-emitting galaxies. We note that the physical scale traced in this study (R ≲ 0.6 h−1cMpc) is slightly smaller than that traced in our analysis.

The difference in the clustering of CO emitters and LAEs around quasars is unlikely to be caused by differences in the halo mass hosting both populations (see Section 4.3), therefore we suggest that this discrepancy is related to physical processes affecting the visibility of the LAEs around quasars. We further discuss this interpretation in section Section 5.

4.3. Autocorrelation of CO Emitters at z ∼ 4

The autocorrelation of a galaxy population is a powerful tool, since it can be directly related to the dark matter halo mass in which that population reside (e.g., Cole & Kaiser 1989; Mo & White 1996; Sheth & Tormen 1999). The autocorrelation of CO-emitting galaxies at z ∼ 4 has never been measured before, mainly due to the lack of large and deep surveys of these galaxies at high-z. The largest samples of CO emitters at z ∼ 4 currently available are composed of only a few tens of sources (e.g., Decarli et al. 2016, 2019), which do not provide enough statistics for an autocorrelation measurement. Therefore, up to now it has been challenging to compute precise masses of CO-emitting galaxies and understand how they fit into different evolutionary scenarios for high-z galaxies.

However, under certain assumptions, the cross-correlation between CO emitters and quasars can be used to infer the clustering of CO-emitting galaxies in blank fields, providing initial constraints on the clustering of this population at z ∼ 4.

First, we assume that our small-scale cross-correlation can be extrapolated toward larger scales following a power-law shape given by $\xi {(r)=(r/{r}_{0,\mathrm{QG}})}^{-\gamma }$. Although the autocorrelation function of quasars and galaxies has been found to slightly deviate from a power law toward smaller (≲0.2 h−1cMpc) scales (e.g., Ouchi et al. 2005; Hennawi et al. 2006, but see also Shen et al. 2010), likely due to the transition between the one-halo to two-halo terms, the autocorrelation is typically reasonably well-approximated as a power-law, and thus we assume that the one-halo term does not strongly boost the signal in our measurement.

Second, we assume a deterministic bias model, in which the QSO–galaxy cross-correlation function can be written as ${\xi }_{\mathrm{QG}}=\sqrt{{\xi }_{\mathrm{QQ}}{\xi }_{\mathrm{GG}}}$, where ξQQ and ξGG are the autocorrelation of quasar and galaxies, respectively. We also assume that ξQQ and ξGG have a power-law shape with the same slope γ = 1.8. Under these assumptions, the correlation lengths can be related by

Equation (6)

Using the quasar autocorrelation length previously reported at z ∼ 4, given by r0,QQ = 22.3 ± 2.5 h−1cMpc (recomputed from Shen et al. 2007 with a fixed γ = 1.8), and our measured QSO–galaxy cross-correlation length, we obtain that the autocorrelation length of CO emitters at z ∼ 4 is given by r0,GG = 3.14 ± 1.71 h−1 cMpc.

This measurement is slightly lower than the LBG autocorrelation length at z ∼ 4 (${r}_{0,\mathrm{LBG}}={4.1}_{-0.2}^{+0.2}\,{h}^{-1}\,\mathrm{cMpc};$ Ouchi et al. 2004), and is slightly higher than the LAE autocorrelation length at z ∼ 4 (${r}_{0,\mathrm{LAE}}={2.74}_{-0.72}^{+0.58}\,{h}^{-1}\,\mathrm{cMpc}$, Ouchi et al. 2010). We caution that the uncertainties of our estimation are still large, which makes the CO autocorrelation length consistent within uncertainties with the autocorrelation length of both LAE and LBG.

Interestingly, the reported autocorrelation length of CO emitters is lower than the autocorrelation length of S870μ m > 4.0 mJy submillimeter galaxies (SMGs) detected with ALMA at 1.5 < z < 3 (${r}_{0,\mathrm{SMGs}}={7.7}_{-2.6}^{+2.8}\,{h}^{-1}\,\mathrm{cMpc};$ Stach et al. 2021, but see García-Vergara et al. 2020). If the SMG clustering does not strongly evolve with redshift as suggested by Stach et al. (2021), then this would mean that CO emitters are lower-mass galaxies compared to the population of SMGs typically detected in continuum surveys.

While our measurements are still noisy, they provide a first rough constraint of the clustering of CO-emitting galaxies, allowing us for the first time to locate them within the context of evolutionary galaxy models. Larger and deeper surveys of emitting lines around quasars are still needed in order to constrain the clustering of CO emitters more precisely. Alternatively, large and deep surveys of CO-emitting galaxies in blank fields would offer an independent and more direct constraint of the CO clustering at these redshifts. However, such large surveys are very expensive and extremely challenging because of the required sensitivity and the low number density of CO emitters in blank fields.

5. Discussion

Our study reveals a large overdensity of CO-emitting galaxies (${17.6}_{-7.6}^{+11.9}$) and a strong clustering of them around quasars (${r}_{0,\mathrm{QG}}={8.37}_{-2.04}^{+2.42}\,{h}^{-1}\,\mathrm{cMpc}$) at scales R ≲ 1.5 h−1cMpc. This result helps clarify the current confused picture of quasar environments at high z and provides strong observational evidence in favor of z ∼ 4 quasars as tracers of massive structures. By comparing with the previous measurement of clustering of LAEs around the same quasar sample at scales R ≲ 7 h−1cMpc, we find that CO-emitting galaxies are more clustered than LAEs around quasars (with a cross-correlation length ${3.0}_{-1.4}^{+1.5}$ times higher), resulting in large overdensities of these galaxies, whereas only a mild overdensity of LAEs (${1.4}_{-0.4}^{+0.4}$) was found in these fields (García-Vergara et al. 2019). A comparison with the other available LAE study in quasar environments (Fossati et al. 2021), which is performed at smaller physical scales (R ≲ 0.6 h−1cMpc), leads to similar conclusions, with the quasar–CO cross-correlation length being ${2.7}_{-0.8}^{+0.8}$ times higher than the quasar–LAE cross-correlation length (see Section 4.2). In the following, we discuss the possible reasons that could explain this discrepancy.

First, we explore the possibility that CO-emitting galaxies inhabit more massive dark matter halos compared to LAEs, which would result in a significant difference in the quasar–galaxy cross-correlation for both populations. Although the dark matter halos for CO emitters at z ∼ 4 have not been constrained yet, the LAE autocorrelation length at z ∼ 4 is well-constrained and is found to be ${r}_{0,\mathrm{LAE}}={2.74}_{-0.72}^{+0.58}\,{h}^{-1}\,\mathrm{cMpc}$, (Ouchi et al. 2010). If we focus on clustering hierarchy arguments only and assume a deterministic bias model, the difference of a factor of three in the measured quasar–CO and quasar–LAE cross-correlation length implies that the autocorrelation length of CO emitters would be nine times larger than the autocorrelation length of the LAEs (see Equation (6)), resulting in r0,CO ∼ 25 h−1 cMpc.

This is even higher than the quasar autocorrelation length at z ∼ 4 (r0,QQ = 22.3 ± 2.5 h−1 cMpc (Shen et al. 2007), which implies halo masses of Mhalo > 6 × 1012 M h−1), and it would imply that CO emitters inhabit halos more massive than quasars. This seems to be an unlikely scenario and inconsistent with the CO dark matter halo mass inferred from our results, assuming a deterministic bias model (${M}_{\mathrm{halo}}={8.31}_{-8.27}^{+49.29}\times {10}^{10}{M}_{\odot }$; see Section 4.3).

Another possibility to explain the discrepancy in the overdensity of CO emitters and LAEs around quasars is related to the uncertainties on the rest-frame UV quasar redshift of our targets. If these quasar redshifts were associated with large uncertainties, the LAE search would have been offset from the real quasar position, which would result in a lower number density of detected galaxies, and thus a lower quasar–LAE cross-correlation. However, with our ALMA observations, we could confirm the redshift of 10 quasars (and tentatively do so for the other seven quasars detected with lower fidelity), and for them we find relatively low redshift offset compared with the ones determined from rest-frame UV-emission lines (∣δv∣ = 738 km s−1; see Section 3.3). These uncertainties in the quasar redshifts are much smaller than the FWHM of the narrow band used in the LAEs study (3197 km s−1), implying that the LAE search was performed at the correct redshift.

If we assume that the remaining seven quasars have similar redshift uncertainties (as indeed suggested by their lower-fidelity detections; see Table 5), then this is not a convincing explanation for the observed discrepancy. Additionally, we note that five out of these seven quasars exhibit an overdensity of LAEs (see Table 5), which is an indication that the LAE search was performed at the correct redshift in these cases. We note that one of the quasars exhibits a larger offset (∣δv∣ = 2386 km s−1 for J1224+0746), in which case the LAE search was performed at larger distances from the quasar, where the background number density would be expected. However, if we exclude that field in the LAE study, we find that the total overdensity increases from 1.36 to 1.43 and the autocorrelation length increases by a factor of 1.09, which would not change our main conclusions.

Ruling out the two mentioned explanations as the main reasons to explain the large overdensity of CO emitters compared to the mild overdensity of LAEs in the same fields, we explore the possibility that particular physical properties in galaxies around quasars could be impacting their visibility and detection. While the CO(4–3) emission line traces the molecular gas in a galaxy, 18 the Lyα emission line is a tracer of instantaneous star formation, and thus, a relatively small star formation efficiency in galaxies around quasars could explain the lack of LAEs in these fields.

Although the molecular gas reservoirs in galaxies have been found to correlate well with their star formation rate, and stellar mass (e.g., Kennicutt 1998; Bigiel et al. 2008; Leroy et al. 2008), these scaling relations have been mostly measured in field galaxies, but the validation of such relations in high-z dense environments is still poorly constrained. The few available studies have been aimed to trace the properties of galaxies in z ∼ 1.5 galaxy clusters, and they show evidence of systematic deviations from the scaling relations of the field toward larger molecular gas masses (e.g., Noble et al. 2017; Hayashi et al. 2018, but see also Rudnick et al. 2017). Although such systems have a much more dense and evolved intercluster medium than that in protoclusters at z ∼ 4, the results of these studies suggest that complex physical processes may be involved in the galaxy evolution in dense environments.

We note, however, that the Lyα emission does not depend only on the instantaneous star formation, but it also depends on other factors such as the interstellar and circumgalactic medium conditions, which impact the complex radiative transfer process of the Lyα photons. Exploring the aforementioned scenario in the context of our galaxy samples would require the acquisition of more observations. Specifically, multiwavelength photometry would be useful to constrain the properties of the galaxies, including SFRs. Additionally, James Webb Space Telescope (JWST; Gardner et al. 2006) observations can add key information of kinematics, stellar masses, and SFRs, which would help to build a physical scenario for these systems. And at mm wavelengths, ALMA [C ii] 158 μ m emission observations are sensitive tracers of the interstellar medium conditions, enabling the study of scaling relations in high-z dense environments.

Finally, as previously suggested by García-Vergara et al. (2019), galaxies around quasars could be more dusty, affecting the visibility of the Lyα line. Since Lyα photons are easily absorbed by dust, the Lyα emission line could be partially suppressed, becoming hardly detectable. This would result in a decreased number of detected LAEs around quasars. The CO emission line is instead unaffected by dust absorption, making it detectable even in galaxies with large fractions of dust. This could explain why the deeper LAE study (Fossati et al. 2021) reveals a stronger clustering of LAEs around high-z quasars compared to that computed from shallower LAEs observations (García-Vergara et al. 2019).

We note that, in this scenario, the detection of LBGs would also be affected by the dust, implying a lower LBG overdensity around quasars than that expected. However, the escape of the Lyα photons is particularly sensitive to the presence of dust, resulting in a strongly attenuated Lyα emission line, whereas the UV continuum may be less impacted, making LBGs still detectable. ALMA continuum observations of our quasar fields would allow us to directly trace the dust in these galaxies, and together with additional multiwavelength data, would provide invaluable information to completely characterize the galaxy properties through SED modeling, testing their dependence on environment.

We caution that our clustering constraints are still dominated by the low-number statistics provided by our relatively small quasar sample. Deeper and/or larger surveys of quasar fields would constrain the galaxy clustering around quasars with higher S/N than that achieved in this work. Alternatively, targeting a brighter emission line, such as the emission line [C ii] at 158-μm (which is the strongest line from star-forming galaxies at radio wavelengths; Carilli & Walter 2013) would offer better statistics with the same exposure time despite the smaller area (R ≲ 0.3 h−1cMpc) that we would trace using ALMA/band 8, at which this line is detectable at z = 3.87. Specifically, we computed that, for the same exposure time used in our CO emitters study, we expect to detect 17 [C ii] companions for all the quasar fields over the area covered with band 8. 19 Finally, we stress the importance of performing quasar environment studies using multiwavelength information, since the visibility of the galaxies around quasars seems to be strongly dependent on the studied wavelengths.

6. Summary

We use ALMA band 3 observations to perform a blind search for CO(4–3) emitting galaxies in the environment of 17 z ∼ 4 quasars. The quasars were selected from the SDSS and BOSS quasar catalogs to lie within a redshift window of z ∼ 3.862−3.879 set by an optical narrow-band filter used in a previous work to detect LAEs around the quasars. The spectroscopic redshifts of the quasars are determined from the UV-emission lines (with typical uncertainties of <800 km s−1).

We explore a cylindrical volume around the quasars defined by a projected radius of R ≲ 1.5 h−1cMpc and a velocity coverage of δv ∼ ±3000 km s−1. We find nine CO-emitting galaxies with S/N ≥ 5.6, and line widths ranging from 47 km s−1 to 728 km s−1. The S/N threshold was chosen to only select sources with fidelity ≥0.8 in our survey.

We also detect the CO(4–3) emission line from 10 quasars with fidelity ≥0.8, and they typically show relatively low redshift offsets (the median offset is ∣δ v∣ = 738 km s−1) with respect to the redshifts determined from the UV-emission lines, except for one quasar that exhibit an offset of ∣δv∣ =2386 km s−1. The other seven quasars were tentatively detected with lower fidelity (<0.4), and S/Ns ranging from 4.1 to 5.3.

We quantify the clustering properties of the galaxies around the quasars. For this, we only focus on a small velocity range of δv ∼ ±1000 km s−1 around the quasar, to avoid dilution of the small-scale clustering signal. Five of the CO-emitting galaxies lie within this volume, and we use this sample to measure the volume-averaged cross-correlation function. For this measurement, we estimate the background number density from the CO luminosity function previously measured at z = 3.8 from blank fields (Decarli et al. 2019).

We find that the expected number density of CO(4–3) emitting galaxies in blank fields is 0.28 for the whole sample of quasars, while five were detected in our survey, which results in an overdensity of ${17.6}_{-7.6}^{+11.9}$. We also fit the quasar–CO cross-correlation in our fields and find a cross-correlation length of ${r}_{0,\mathrm{QG}}={8.37}_{-2.04}^{+2.42}\,{h}^{-1}\,\mathrm{cMpc}$, assuming a fixed slope γ = 1.8.

In a previous study, we performed a search for LAEs in the same 17 quasar fields, allowing us to simultaneously trace the clustering properties of optical and dusty galaxies around quasars for the first time. That study revealed only a mild overdensity (x1.4) of LAEs in these fields, and a quasar–galaxy cross-correlation length of ${r}_{0,{\rm{Q}}-\mathrm{LAE}}={2.78}_{-1.05}^{+1.16}\,{h}^{-1}\,\mathrm{cMpc}$ (García-Vergara et al. 2019), which is ${3.0}_{-1.4}^{+1.5}$ times lower than the cross-correlation length found for CO emitters.

We argue that differences in the halo mass hosting the two galaxy populations, as well as uncertainties associated with the optically based quasar redshifts, are unlikely reasons to explain the observed discrepancy. We suggest instead that the properties of galaxies in quasar environments could impact their visibility and thus detectability in Lyα emission. Specifically, galaxies in quasar environments could have low star formation efficiency, or they could have an excess of dust, thus becoming more difficult to detect in Lyα emission. Exploring these mentioned scenarios would only be possible with the acquisition of additional multiwavelength observations.

Finally, we use our quasar–CO cross-correlation to infer the clustering of CO-emitting galaxies at z ∼ 4. Assuming a deterministic bias model, and extrapolating the observed small-scale cross-correlation up to larger scales, we find an autocorrelation length for CO emitters of r0,CO = 3.14 ± 1.71 h−1 cMpc (for a fixed slope γ = 1.8), which agrees well, within the 1σ error bars, with the clustering of LBG and LAE at z ∼ 4, but is lower than the clustering of SMGs at 1.5 < z < 3. Assuming that there is not a strong evolution of the clustering of SMGs with redshift, this would imply that the population of CO emitters inhabit less massive halos compared to the general population of SMGs. Larger surveys of CO emitters are required in order to independently constrain their autocorrelation.

As the first quasar sample targeted for clustering studies of both optical and dusty galaxies, our study demonstrates the importance of tracing different galaxy populations, and it also opens new questions about environmental effects on galaxy evolution, highlighting the importance of characterizing galaxies in the vicinity of high-z quasars.

This paper makes use of the following ALMA data: ADS/JAO.ALMA #2019.1.00441.S. ALMA is a partnership of ESO (representing its member states), NSF (USA), and NINS (Japan), together with NRC (Canada), MOST and ASIAA (Taiwan), and KASI (Republic of Korea), in cooperation with the Republic of Chile. The Joint ALMA Observatory is operated by ESO, AUI/NRAO, and NAOJ. The authors acknowledge assistance from Allegro, the European ALMA Regional Center node in the Netherlands.

M.R. acknowledges the support of the NWO Veni project "Under the lens" (VI.Veni.202.225). M.R. and J.H. acknowledge support of the VIDI research program with project number 639.042.611, which is (partly) financed by the Netherlands Organisation for Scientific Research (NWO). M.A. acknowledges support from FONDECYT grant 1211951, "CONICYT + PCI + INSTITUTO MAX PLANCK DE ASTRONOMIA MPG190030" and "CONICYT+PCI+REDES 190194".

Appendix: Impact of the Fidelity and δv Choice on the Cross-Correlation Function

In this appendix, we explore how the fidelity and δv choices impact the cross-correlation measurement that we present in Section 4.

We first check the impact of the fidelity choice on the measurement. For this, we create source catalogs with different fidelity thresholds ranging from 0.7 to 0.9 in steps of 0.05 (to follow the same criteria as those described in Section 3.3, in all the cases we increase the fidelity threshold by 0.1 for sources selected at distances larger than 50.14'' from the central quasar, corresponding to the radius at which the telescope sensitivity is ≥20% of the maximum). We compute the cross-correlation function following the same procedure described in Section 4.1, and we fit the correlation with a fixed slope γ = 1.8. We show our results in Figure 7. We find that the cross-correlation decreases with increasing fidelity threshold, and it converges for fidelity ≥0.8, which motivates the choice of this as the threshold for our study.

Figure 7.

Figure 7. Impact of the fidelity threshold adopted to create the catalog of sources. The results converge at fidelity ≥0.8. The clustering signal becomes flat when including sources with lower fidelities, which means that we are including contaminants (i.e., noise fluctuations are being detected as real sources by the algorithm).

Standard image High-resolution image

We also note that, at lower fidelity (<0.8), we include more sources, but most of them are impacting the last bin of the measurement (at θ ∼ 50''), making the cross-correlation flatter, with the fixed slope at γ = 1.8 poorly constraining their shape. This suggests that we are including contamination in these samples (i.e., probably fake noise fluctuations that are detected as real sources by the line search algorithm), which dilute the power-law signal, causing the flattening.

We also explore the impact of the radial comoving range (δv) choice. For this, we only focus on the sources with fidelity ≥0.8 (i.e., on the nine sources of Table 3), and compute the correlation function using different δv values ranging from 850 km s−1 to 3000 km s−1 (i.e., the whole cube). We note that some quasars show large redshift offsets, and thus their locations within the ALMA cube are not centered. Specifically, for the two cases where the ALMA redshift and optical redshifts have a difference of ∼2000 km s−1, the quasar in the ALMA data cube is located at ∼1000 km s−1 from the edge of the cube (see Section 2.1). When exploring δv ≤ 1000 km s−1, we are tracing the complete volume around all the quasars, but when exploring δv = ±2000 km s−1, we are only exploring an asymmetric volume covering +2000 km s−1 and −1000 km s−1 around these two quasars, affecting the observed 〈QG(R)〉 value. Similarly, when exploring δv = ±3000 km s−1, most of the fields (13 out of 17) are affected by this issue. We have taken into account this effect when computing the 〈QR(R)〉 term in Equation (4), such that we only integrate over the effective volume traced in each individual field. In this way, the volume traced to compute 〈QG(R)〉 and 〈QR(R)〉 is the same, resulting in a correct clustering computation.

We show our results in Figure 8. As mentioned in Section 4.1, the choice of a larger radial comoving range increases the statistics, reducing the error bars in the measurement, but it also dilutes the strong small-scale signal, because we are integrating over larger volumes and at large distances from the quasar where the number counts of the background start to be reached. This effect is relatively small, but can be seen to happen at δv ≥1600 km s−1. Our choice of δv = ±1000 km s−1 (corresponding to 8.19 h−1 cMpc at z = 3.87) is a balance between having good statistics but not strongly diluting the signal.

Figure 8.

Figure 8. Impact of the radial comoving range around the quasar (δv) used to measure the clustering. The clustering slightly decreases when increasing the δv, because small-scale signal starts to be diluted when integrating over a large volume, covering distances from the quasar at which the background number density is close to being reached.

Standard image High-resolution image

Footnotes

  • 8  

    The r0 value published in Fossati et al. (2021) has recently been recomputed. Here, we quote the most updated r0 value, which was kindly provided by the authors via private communication.

  • 9  

    We use a rectangular region that covers most of the region within the primary beam FWHM, since the primary beam pattern should be very well-defined, and thus the image before the primary beam correction should be flat, without spurious features, and with (mostly) Gaussian noise.

  • 10  

    We note that one of these narrow lines is the object J1211+1224.1, which has a LAE counterpart at the same redshift.

  • 11  

    R magnitudes correspond to that measured in images taken using ESO/WFI with the filter Rc (see Skelton et al. 2014 for details).

  • 12  

    At this distance, we expect that the number density of CO(4–3) lines is 2.3 times the background number density. We have computed this quantity assuming a power-law shape for the cross-correlation function with cross-correlation length r0 = 8.37 h−1 cMpc (the cross-correlation length measured in this work) for a fixed γ = 1.8. We take into account the volume of the SPW1 cube, and its distance from the quasar.

  • 13  

    We computed the probability to detect a CO(4–3) line in each SPW, assuming the background number density from the CO(4–3) luminosity function measured in Decarli et al. (2019; more details of this computation are provided in the main text). This probability is 7% and 7% for SPW2 and SPW3, respectively, which would result in a total of ∼2.4 CO(4–3) lines for the 17 fields detected in the volume covered by the two SPWs.

  • 14  

    We note that this is larger than the median FWHM of the five sources used for our clustering analysis (median FWHM is 173 ±161 for the five sources with fidelity >80% and 301 ± 183 for the four sources with fidelity >90%), but still consistent within error bars. We nevertheless checked that the assumed line width has only a small impact on our final clustering results.

  • 15  

    Note that this is not the completeness due to the sensitivity variation over the pointing, which is indeed corrected in our computation as explained above.

  • 16  

    García-Vergara et al. (2017) fitted the quasar–LBG cross-correlation function using a fixed γ = 2.0, therefore we refitted their measurements using a fixed γ = 1.8, which results in ${r}_{0,{\rm{Q}}-\mathrm{LBG}}={9.78}_{-1.86}^{+1.68}\,{h}^{-1}\,\mathrm{cMpc}$.

  • 17  

    The r0 value published in Fossati et al. (2021) has recently been recomputed. Here, we quote the most updated r0 value, which was kindly provided by the authors via private communication.

  • 18  

    The luminosity of the CO(4–3) emission line ${L}_{\mathrm{CO}(4-3)}^{{\prime} }$ can be converted to a molecular hydrogen gas mass Mmol by assuming an excitation correction ${L}_{\mathrm{CO}(4-3)}^{{\prime} }/{L}_{\mathrm{CO}(1-0)}^{{\prime} }$ and a conversion factor αCO (e.g., Carilli & Walter 2013).

  • 19  

    For this computation, we have used the [C ii] luminosity function from Loiacono et al. (2021), and assumed that the quasar–C ii cross-correlation function is the same as the quasar–CO cross-correlation function at z ∼ 4.

Please wait… references are loading.
10.3847/1538-4357/ac469d