Target Selection and Sample Characterization for the DESI LOW-Z Secondary Target Program

Elise Darragh-Ford; John F. Wu; Yao-Yuan Mao; Risa H. Wechsler; Marla Geha; Jaime E. Forero-Romero; ChangHoon Hahn; Nitya Kallivayalil; John Moustakas; Ethan O. Nadler; Marta Nowotka; J. E. G. Peek; Erik J. Tollerud; Benjamin Weiner; J. Aguilar; S. Ahlen; D. Brooks; A. P. Cooper; A. de la Macorra; A. Dey; K. Fanning; A. Font-Ribera; S. Gontcho A Gontcho; K. Honscheid; T. Kisner; Anthony Kremin; M. Landriau; Michael E. Levi; P. Martini; Aaron M. Meisner; R. Miquel; Adam D. Myers; Jundan Nie; N. Palanque-Delabrouille; W. J. Percival; F. Prada; D. Schlegel; M. Schubnell; Gregory Tarlé; M. Vargas-Magaña; Zhimin Zhou; H. Zou

doi:10.3847/1538-4357/ace902

1. Introduction

Mapping the low-redshift Universe with a dense galaxy survey is a key goal of astronomy and cosmology, with diverse science applications, including understanding the properties of dwarf galaxies, identifying transient and gravitational-wave hosts, measuring peculiar velocities, and mapping the detailed connection between galaxies and the matter density.

A large sample of low-redshift dwarf galaxies can inform several important aspects of galaxy evolution, quasar physics, and dark matter physics. This includes studying the dwarf galaxy luminosity function, the best current estimates of which are from Sloan Digital Sky Survey (SDSS; Blanton et al. 2005), Galaxy And Mass Assembly Survey (GAMA; Loveday et al. 2015), and H i measurements (e.g., Jones et al. 2018). However, for the faintest objects, large samples remain lacking. Measurements of the faintest end of the luminosity function and of the clustering properties of dwarf galaxies can help address important uncertainties in the galaxy–halo connection, such as what kind of halos dwarf field galaxies live in and the efficiency of galaxy formation and baryonic feedback at these scales (Wechsler & Tinker 2018). Many of these faint galaxies also exist as satellites in larger halos, which allows us to estimate the scatter in the galaxy–halo connection through satellite kinematics (Cao et al. 2020). This scatter constrains the correlation between galaxy formation and halo formation, providing a critical test of galaxy formation models. Additionally, characterizing the field dwarf galaxy luminosity function serves as a stepping stone to place studies of ultra-faint dwarf galaxies in the Local Volume (e.g., Martin et al. 2016; Drlica-Wagner et al. 2020; Nadler et al. 2020; Carlsten et al. 2022; Nashimoto et al. 2022) in a cosmological context, reducing key uncertainties in these analyses, and connecting near-field studies to outstanding questions of halo and galaxy assembly bias.

Furthermore, obtaining larger samples of field dwarf galaxies can help reduce uncertainties on quenched fraction measurements at the faint end (e.g., Geha et al. 2012), improving our understanding of low-mass galaxy formation. In addition, a wide-field sample of dwarf galaxies can be used to study the effects of environment on quenching (Davies & Robotham 2019) and place constraints on galactic conformity (Treyer et al. 2018). A better understanding of quenching at low stellar masses can help determine the key feedback processes relevant for dwarf galaxies, for example, understanding where reionization versus environmentally driven quenching dominates.

A comprehensive catalog of low-redshift galaxies is also relevant to the task of efficiently identifying transient and gravitational-wave hosts. For example, the upcoming Laser Interferometer Gravitational-Wave Observatory run expects to be sensitive to binary neutron star mergers out to 160–190 Mpc for the two original detectors, with the Virgo and KAGRA instruments having a more limited range (Abbott et al. 2020). Despite the relatively small distances, optical follow-up is limited by relatively poor source localization—10²–10³ deg² for two detectors, 10–10² deg² for three detectors, and ≲10 deg² for four detectors—along with the sheer number density of all galaxies (∼3000 deg⁻² at r < 21), only ∼10 deg⁻² of which we expect to be truly low-redshift. Thus, a comprehensive catalog of low-redshift objects significantly reduces the number of potential host galaxies for a given event, increasing the likelihood of the successful observation of an optical counterpart. Such a catalog could also provide redshifts for standard siren measurements of the Hubble constant (Schutz 1986; Abbott et al. 2017; Chen et al. 2022; Palmese et al. 2023).

The Dark Energy Spectroscopic Instrument (DESI; Abareshi et al. 2022) is an excellent tool for providing a large area, low-redshift spectroscopic survey. DESI, on the 4 m Mayall telescope at Kitt Peak National Observatory, is a new massively multiplexed instrument capable of taking spectra of 5000 objects simultaneously, with a target density of ∼700 objects per square degree and a spectral resolution of 2000 < λ/Δλ < 5500 (Aghamousa 2016; Miller et al. 2023; Silber et al. 2023). The DESI Bright Galaxy Survey (BGS) is already set to enhance existing surveys by going significantly deeper than SDSS and wider than GAMA. Still, it is limited to r < 19.5 for its main magnitude-limited sample (Hahn et al. 2022).

Part of the difficulty in obtaining comprehensive samples of faint, low-redshift objects is due to the fact that although these objects are nearby on cosmic scales, separating them from the dominant background of high-redshift objects remains challenging. Significant effort has gone into accurate photometric redshifts (photo-z's) for galaxy evolution and cosmology (Baum 1962; Benítez 2000; Collister & Lahav 2004; Feldmann et al. 2006; Ilbert et al. 2006; Brammer et al. 2008; Lee & Chary 2020; Li et al. 2023), and also into designing photometric cuts to efficiently select high-redshift objects (e.g., Steidel et al. 1996; Daddi et al. 2004; Bouwens et al. 2015; Finkelstein et al. 2015; Ono et al. 2018; Bowler et al. 2020; Kauffmann et al. 2022). However, analogous algorithms for selecting low-redshift objects have historically received less attention. This has made low-redshift surveys costly and time-consuming, as they must either accept a high rate of contamination of higher-redshift objects or invest significant time into cleaning photometric catalogs by eye.

Recent efforts to design photometric cuts for efficient low-redshift galaxy selection and more accurate low-redshift photo-z's have produced impressive results. Machine-learning methods have been able to achieve high accuracy in photo-z's for the lowest redshift objects (Pasquet et al. 2019; Dey et al. 2022). Meanwhile, the Satellites Around Galactic Analogs (SAGA) Survey (Geha et al. 2017; Mao et al. 2021) has made significant progress toward converging on a set of photometric cuts optimized for low-redshift science targets that significantly reduce the target density while retaining high purity out to z ∼ 0.03. This was validated using a targeted redshift survey of about 67,000 objects around very nearby (z < 0.01) galaxies.

Here, we present the new DESI LOW-Z survey, designed to efficiently target faint, low-redshift (z < 0.03) objects. LOW-Z is a DESI spare fiber program, meaning it takes advantage of fibers not being used for primary DESI targets. The LOW-Z target selection strategy builds off of work done by the SAGA Survey in two key ways: (1) to define catalog-level cuts that efficiently select low-redshift galaxies, and (2) as a training set for a convolutional neural network (CNN) that can increase this efficiency further using imaging data. In this work, we detail the LOW-Z target selection strategy and characterize its efficiency (purity) and completeness at selecting z < 0.03 targets relative to a full magnitude-limited survey. We do not attempt to account for surface brightness or other forms of incompleteness in the underlying photometric catalogs. In addition, while we characterize our redshift failure rates and fiber allocation fraction, correcting our completeness calculation for these effects requires careful modeling and is beyond the scope of this introductory paper. Since the main DESI survey strategy is optimized for cosmology, and as such, it is optimized to probe large volumes and measure the expansion history and growth rate of structure (Levi et al. 2013); at present, our program is a low-priority program using spare fibers. However, we show here that our strategy can already compete with previous low-redshift surveys and can extensively inform future programs for efficiently and completely surveying the low-redshift Universe.

2. The LOW-Z Survey

The LOW-Z Survey is a DESI secondary target survey designed to target faint, low-redshift (z < 0.03) dwarf galaxies in dark time. DESI secondary target surveys make use of spare fibers (i.e., fibers that are not being used to target primary targets) to complement the main survey and its goals. LOW-Z targets are selected between 19 < r < 21 using a set of color and surface brightness cuts. LOW-Z targets are further sorted into three tiers of priority, with the highest priority tier selected using a CNN trained on images of low-redshift galaxies from the SAGA Survey. The remaining objects are split into two tiers based on their catalog-level photometric properties, with objects in the second tier corresponding to regions in parameter space where previous work indicates the majority of low-redshift dwarf galaxies are expected to lie (Geha et al. 2017; Mao et al. 2021). The full LOW-Z target sample consists of approximately 300 objects per square degree. However, the observed sample density is limited by the number of spare fibers available in a given pointing. In addition to getting redshifts for hundreds of thousands of low-redshift objects, the LOW-Z survey serves as a pilot program to refine methods for optimally selecting faint low-redshift targets for future campaigns in DESI II³⁵ and beyond.

A flowchart describing the LOW-Z targeting strategy for the first year (Y1) of DESI operations can be seen in Figure 1. This lays out the steps for target selection and tier identification for the LOW-Z program. We discuss each of the steps individually in Section 3. We present the early LOW-Z sample in Section 4, which consists of approximately 140,000 objects with spectra taken during the first year of DESI operations (approximately 17,000 of these objects were allocated fibers in dark time specifically as part of the LOW-Z program, while the remainder are objects that overlap with BGS and were allocated fibers in bright time). Using this sample as a benchmark, we validate the effectiveness of the LOW-Z Y1 targeting strategy for selecting a high completeness, magnitude-limited sample of low-redshift objects. Based on this analysis, we present slight modifications to the LOW-Z targeting strategy in Section 6 for the second year of DESI operations (Y2), which are currently ongoing. This includes using the Y1 data to provide updated target estimates from the CNN. Target density information can be found in Table 1.

**Figure 1.** (Upper panel) A flowchart showing how targets are selected and observed in the DESI LOW-Z Survey. The chart references sections in the text (§) where each process is described in more detail. (Lower left panel) The photometric cuts for each LOW-Z tier are illustrated graphically using a color–surface brightness schematic diagram. (Lower center panel) A table indicates overlap with BGS and targeting catalog surface density for each LOW-Z tier (see also Table 1). Tier 3 comprises two components, Tier 3A and Tier 3B, which have the same DESI fiber allocation priority (see Section 3.3 for a description of the LOW-Z tiers). (Lower right panel) For Tiers 1–3, we display Legacy Survey DR9 *grz*-band ${72}^{{\prime\prime} }\times {72}^{{\prime\prime} }$ image cutouts for three random non-BGS galaxy targets.
Download figure:
Standard image High-resolution image

2.1. LOW-Z as an Extension of BGS

The LOW-Z sample was designed specifically to complement the main DESI BGS sample. The DESI BGS consists of two samples: the BGS Bright sample, which targets all objects with r < 19.5, and the BGS Faint Sample, which targets objects between 19.5 < r < 20.175 with an additional set of color-dependent cuts (Table 2). The BGS program is significantly larger than the LOW-Z sample: 864 objects per square degree in the Bright sample and 533 objects per square degree in the Faint sample. It is expected to achieve >80% fiber allocation for BGS Bright targets and >95% redshift success rates for both samples.

However, DESI BGS is a bright-time program, meaning that targets are observed during bright conditions (determined based on observing conditions such as seeing, transparency, airmass, and sky brightness). This means that BGS is limited in its ability to obtain redshifts for the faintest and lowest surface brightness objects. In contrast, LOW-Z is a dark-time program. This allows the LOW-Z survey to complement the BGS in two key ways: (1) LOW-Z goes over half a magnitude fainter than the BGS Faint sample and a full magnitude and a half fainter than the BGS Bright sample, helping to fill in objects at the faint end of the galaxy luminosity function; and (2) LOW-Z objects are observed in dark rather than bright time, which allows us to target objects fainter than the BGS fiber magnitude cut at r_fib = 22.9 without drastically increasing our redshift failure rate (see discussion in Section 4.6), meaning that the LOW-Z sample will be more complete than BGS for very low surface brightness objects.

Due to the DESI fiber assignment strategy, most objects that overlap between the two samples will be allocated fibers in bright time as part of the main BGS survey.³⁶ However, for BGS objects in the LOW-Z sample that do not receive fiber allocation in bright time, LOW-Z provides a second opportunity for fiber assignment with a higher likelihood of redshift success for objects with 22 < r_fib < 22.9 (Section 4.6). Since BGS observations supersede LOW-Z observations in terms of priority, objects that overlap between the two samples and receive fiber allocation in bright time are removed from dark-time target lists.

3. LOW-Z Y1 Targeting Strategy

3.1. Imaging Data

We select objects using the catalog from the Data Release 9 (DR9) of the DESI Legacy Imaging Surveys (Zou et al. 2017; Dey et al. 2019; D. Schlegel et al. 2023, in preparation).³⁷ The DR9 catalog consists of data from three imaging projects: the Beijing-Arizona Sky Survey, the DECam Legacy Survey, and the Mayall z-band Legacy Survey.

We use the TYPE flag to identify galaxies as all objects whose TYPE ≠ PSF and remove duplicated Gaia entries using TYPE ≠ DUP. We use FLUX and MW_TRANSMISSION to calculate dereddened magnitudes. We use SHAPE_R as our effective photometric radius, R_r,eff. For bands in grz we additionally define SIGMA_GOOD for the purpose of implementing quality cuts. Unless explicitly defined below, these quantities come directly from the DR9 catalog³⁸ and the definitions can be found in the relevant citations above.

$\begin{eqnarray*}&&{\mathtt{SIGMA}}\_{\mathtt{GOOD}}=\left\{\begin{array}{l}{\mathtt{FLUX}}\times \sqrt{{\mathtt{FLUX}}\_{\mathtt{IVAR}}},\mathrm{if}\ {\mathtt{RCHISQ}}\lt 100;\\ 0,\mathrm{otherwise}.\end{array}\right.\end{eqnarray*}$

While the DR9 photometric catalog is generally very clean, it still contains some spurious objects, including shredded sources, false-positive detections, and sources with highly overestimated magnitudes. We apply a set of quality cuts to remove the majority of these spurious objects from our targets. Specifically, we only include objects that satisfy all of the following criteria:

$\begin{eqnarray*}\begin{array}{rcl} & & {\mathtt{SIGMA}}\_{\mathtt{GOOD}}\geqslant 5.0\ (\mathrm{any}\ \mathrm{two}\ \mathrm{bands});\\ & & {\mathtt{FRACFLUX}}\leqslant 0.35\ (\mathrm{any}\ \mathrm{two}\ \mathrm{bands});\\ & & {\mathtt{RCHISQ}}\leqslant 2.0\ (\mathrm{any}\ \mathrm{two}\ \mathrm{bands});\\ & & {\mathtt{SIGMA}}\_{\mathtt{GOOD}}\geqslant 30\ \mathrm{or}\ {\mathtt{RCHISQ}}\leqslant 0.85\ (\mathrm{any}\ \mathrm{two}\ \mathrm{bands});\\ & & {\mathtt{g}}-{\mathtt{r}}\gt -0.1.\end{array}\end{eqnarray*}$

These criteria were first developed for the SAGA Survey (Mao et al. 2021), and later adopted for cleaning the LOW-Z sample. The criteria on SIGMA_GOOD aim to remove false-positive detections, those on FRACFLUX aim to remove shredded sources from a brighter companion, those on RCHISQ aim to remove sources with very inaccurate model fits, and finally, those on g − r aim to remove sources with very different fits in g and r bands. We visually inspect the resulting targets to set the thresholds in these criteria so that they remove the majority of these spurious objects without impacting our target selection completeness.

We exclude objects that are within 1.5 times the radius of an object in the Siena Galaxy Atlas (SGA) catalog (Moustakas et al. 2023) or within 4 times the half-light radius of any non-SGA objects in DR9 catalogs brighter than r = 16. Galactic radii in SGA are defined as the radius at the 25 mag arcsec⁻² surface brightness isophote. This was done as a further cleaning step to avoid targeting misidentified remnants of bright galaxies and was designed to remove only those objects that significantly overlap with the light of a brighter galaxy. This should not strongly impact the sample satellite galaxies in the LOW-Z survey. However, for a detailed comparison of the differential impact of environment on isolated and satellite dwarf galaxies, the LOW-Z sample is well suited for comparison with satellites from the SAGA Survey, as the two were selected using nearly identical color and surface brightness criteria and span a similar range in magnitudes and distances.

3.2. LOW-Z Catalog-level Photometric Cuts

Accurately identifying low-redshift galaxies using only photometric data is difficult, even when spectroscopic training sets are available. Most current photometric redshift algorithms have been trained on data that has been explicitly color selected for high-redshift galaxies. In addition, low-redshift objects are vastly outnumbered by higher-redshift objects in almost every available training set. There are a few thousand objects per square degree between 19 < r < 21, all but tens of which we expect to be bright galaxies at a higher redshift (z > 0.03). Thus, efficiently selecting low-redshift objects in this regime requires careful study.

Here, we present a set of catalog-level photometric cuts designed specifically for the target selection of low-redshift (z < 0.03) objects to high completeness (hereafter referred to as the z < 0.03–complete photometric cuts). These cuts are developed based on the photometric cuts first introduced by the SAGA Survey (Mao et al. 2021). The SAGA Survey Stage II targeting cuts were tested extensively by the SAGA Survey team, including tests with a complete spectroscopic survey of objects around two SAGA systems. These cuts were found to be complete out to z < 0.01 at a target density of ∼200 objects per square degree; hence, we will refer to the SAGA Survey Stage II targeting cuts as the z < 0.01–complete photometric cuts hereafter. The z < 0.03–complete cuts presented here are identical to the z < 0.01–complete photometric cuts but with an increase in the surface brightness and color thresholds used:

$\begin{eqnarray}{\mu }_{{r}_{o},\mathrm{eff}}+{\sigma }_{\mu }-0.7\,({r}_{o}-14)\gt \left\{\begin{array}{ll}16.8 & (z\lt 0.03-\mathrm{complete})\\ 18.5 & (z\lt 0.01-\mathrm{complete})\end{array}\right.,\end{eqnarray} \tag{ 1 }$

$\begin{eqnarray}{(g-r)}_{o}-{\sigma }_{\mathrm{gr}}+0.06({r}_{o}-14)\lt \left\{\begin{array}{ll}0.99 & (z\lt 0.03-\mathrm{complete})\\ 0.9 & (z\lt 0.01-\mathrm{complete})\end{array}\right.,\end{eqnarray} \tag{ 2 }$

where r_o, g_o are the extinction-corrected r- and g-band apparent magnitudes respectively, ${\mu }_{{r}_{o},\mathrm{eff}}$ is effective surface brightness, σ_μ is the error on ${\mu }_{{r}_{o},\mathrm{eff}}$ , and ${\sigma }_{{gr}}\equiv \sqrt{{\sigma }_{g}^{2}+{\sigma }_{r}^{2}}$ is the error on the (g − r)_o color. We calculate ${\mu }_{{r}_{o},\mathrm{eff}}$ and σ_μ analogously to Mao et al. (2021). We present the validation of the completeness z < 0.03–complete photometric cuts in Section 4.5.

3.3. LOW-Z Tier Assignment

The full LOW-Z target sample consists of all objects at 19 < r < 21 passing the z < 0.03–complete photometric cuts (Equations (1) and (2)). However, in order to maximize our observed sample of low-redshift objects, we split the target sample into three tiers, which roughly correspond to our expectation of a given object being legitimately low-redshift. A CNN algorithm selects the first tier, while the second and third tiers correspond to different regions in color–surface brightness parameter space. The LOW-Z tiers are hierarchical, such that objects in Tier 1 are excluded from Tier 2, and objects from Tiers 1 and 2 are excluded from Tier 3.

3.3.1. Tier 1: CNN Selection from Imaging

We use a CNN to select our Tier 1 sample on the basis of their imaging (Wu et al. 2022). A CNN is a parametric model that can be optimized to make predictions purely from images as inputs. Understanding how CNNs work (as well as they do) remains an active field of research, but we attempt to provide some intuition here. A CNN can be thought of as a multi-scale matched-filtering algorithm with fully learnable filters (see, e.g., Mallat 2016). In other words, the input image is decomposed into multicolor morphological features at various scales. Crucially, each convolution with a learned filter is also followed by a nonlinear operation and a pooling layer, which decreases the resolution while increasing the receptive field. Additionally, residual layers in the CNN permit interactions between different scales (He et al. 2016). These ingredients enable the CNN developed here to efficiently identify low surface brightness features and other distinguishing elements of low-redshift galaxy images.

We trained a CNN to separate low-redshift (z < 0.03) galaxies from high redshift (z > 0.03) using grz-band 144 × 144-pixel image cutouts from the DESI Legacy Imaging Surveys DR9. The CNN prediction p_CNN can range between 0 and 1, where 1 represents the highest confidence that the input image is a low-redshift system. The CNN training details are presented in Appendix A. In the interest of incorporating all of the valuable data for training the CNN, we use the SAGA redshift catalog that is identical to the one used in Wu et al. (2022). This catalog contains 112,016 galaxy redshifts that the SAGA Survey team has measured or compiled around SAGA hosts. Among these galaxy redshifts, 2550 are at z < 0.03. The majority (89%) of the z < 0.03 galaxies in this catalog lie within the z < 0.01–complete photometric cuts, and almost all (98.5%) of the z < 0.03 galaxies lie within the z < 0.03–complete photometric cuts. Additional details about our spectroscopic data set can be found in Section 2.2 of Wu et al. (2022).

We use the CNN to select approximately 20 objects per square degree from the sample selected using z < 0.03–complete photometric cuts. In other words, we train the CNN on the full SAGA redshift sample, including objects outside the catalog-level cuts, but we use the CNN to select targets from within these cuts. In the north, we remove all objects with p_CNN < 0.2503, and in the south, those with p_CNN < 0.3308. We use slightly different cutoff thresholds in the two regions to ensure approximately constant density across the whole sky (similar to the photometric offsets found in Zarrouk et al. 2022). From training and cross-validation experiments, we find that our CNN selection achieves ∼45% purity and ∼85% completeness on the SAGA redshift catalog.

3.3.2. Tier 2 and Tier 3: Catalog-level Photometric Selection

Tier 2 and Tier 3 are selected using purely catalog-level photometric criteria. Tier 2 corresponds to objects within the z < 0.01–complete photometric cuts outside of the BGS sample, while Tier 3 consists of objects in the z < 0.01–complete photometric cuts that overlap with BGS (Tier 3A) as well as a random sampling of the remaining objects between the z < 0.01–complete photometric cuts and the z < 0.03–complete photometric cuts (Tier 3B). The objects in Tier 3B are, by definition, redder and more compact than the Tier 2 objects and thus have a lower probability of being legitimate low-redshift objects.

In practice, the deprioritization of objects in the z < 0.01–complete photometric cuts that overlap with BGS from Tier 2 to Tier 3A has a negligible impact on the number of these targets that are allocated fibers. This is due to the fact that BGS targets supersede LOW-Z targets in terms of fiber allocation priority, meaning that most of these objects will be assigned as bright-time targets as part of BGS and therefore will not be included as LOW-Z targets during fiber assignment. However, the split is useful for analysis as it ensures that all Tier 2 targets were specifically targeted as part of the LOW-Z program in dark time. After removing the overlap with BGS, Tier 2 consists of approximately 80 objects per square degree.

Finally, due to survey limitations for our total target density, we downsample objects in Tier 3B to approximately 80 objects per square degree. The downsampled objects in Tier 3B are selected in the following two stages: (i) all objects within the z < 0.01–complete surface brightness cuts and between the z < 0.01–complete and z < 0.03–complete g − r cuts (∼40 objects per square degree); (ii) random sample of the remaining objects between the z < 0.01–complete and z < 0.03–complete cuts, that are not in (i) (∼40 objects per square degree). We prioritize the redder objects in Tier 3B to ensure we have an accurate representation of the population of quenched low-mass dwarfs. The random sampling in (ii) downsamples the total number of objects by a factor of ∼2 in these photometric regions. After downsampling, the total combined target density of Tiers 3A and 3B is approximately 200 objects per square degree.

3.3.3. Summary of LOW-Z Tiers

Tier 1 (∼22 objects per square degree) consists of objects selected by the CNN from the z < 0.03–complete photometric cuts sample. Approximately six objects per square degree in this sample overlap with the DESI BGS sample.

Tier 2 (∼80 objects per square degree) consists of objects from the z < 0.01–complete photometric cuts sample that are outside of the main BGS color cuts (Table 2) (Hahn et al. 2022).

Tier 3 (∼200 objects per square degree) consists of Tier 3A—the remaining objects from z < 0.01–complete photometric cuts sample that overlaps with the main BGS sample (∼120 objects per square degree), as well as Tier 3B—the objects from the z < 0.03–complete photometric cuts sample that are outside of the z < 0.01–complete photometric cuts (∼80 objects per square degree).

The density for each of the three tiers can be found in Table 1. On average, the CNN tends to select objects that are larger and have lower surface brightness. In addition, it selects a higher fraction of blue objects than the z < 0.01–complete photometric cuts sample. Meanwhile, the z < 0.03–complete photometric cuts sample is on average redder and more compact due to the relaxation of the g − r and μ_r,eff cuts (Equations (1) and (2)).

Table 1. Target Density for the Three Tiers in the LOW-Z Survey

Tier	Y1 Targets	Y1 Targets	Y1 Observed	Y1 Observed	Y2 Targets	Y2 Targets
	All	BGS Overlap	All	BGS Overlap	All	BGS Overlap
Tier 1	22 deg⁻²	6 deg⁻²	11 deg⁻²	6 deg⁻²	97 deg⁻²	1.7 deg⁻²
Tier 2	80 deg⁻²	⋯	41 deg⁻²	⋯	325 deg⁻²	1.3 deg⁻²
Tier 3A	120 deg⁻²	120 deg⁻²	120 deg⁻²	120 deg⁻²	⋯	⋯
Tier 3B	80 deg⁻²	⋯	30 deg⁻²	⋯	⋯	⋯
z < 0.03	⋯	⋯	3.7 deg⁻²	1.6 deg⁻²	⋯	⋯

Notes. Columns 1 and 2: submitted target densities for the Y1 survey. Columns 3 and 4: observed target densities for Y1 survey. All objects in Columns 1–4 are between the Y1 LOW-Z magnitude cuts of 19 < r < 21. Columns 5 and 6: submitted target densities for the Y2 survey. All objects are between the Y2 LOW-Z magnitude cuts of 19 < r < 21.15. The full BGS target density is 1400 targets per square degree (864 deg⁻² in the Bright sample and 533 deg⁻² in the Faint sample).

Download table as: ASCII Typeset image

Table 2. Color Cuts for the BGS Bright and BGS Faint Samples (Hahn et al. 2022)

BGS Sample	r	r_fib	Color	Density
BGS Bright	r < 19.5	r_fib < 22.9	⋯	864 deg⁻²
BGS Faint	19.5 < r < 20.175	r_fib < 21.5 if color ≥0 or r_fib < 20.75	(z − W1) − 1.2(g − r) +1.2	533 deg⁻²

Download table as: ASCII Typeset image

Due to an error in target selection, Tier 1 and Tier 3 had slightly different selection criteria for the first few months of DESI Y1 (the sample considered here). Outside of the BGS color–magnitude cuts, Tier 1 and Tier 3B only contain objects in the z < 0.03–complete photometric cuts region that are outside of both the z < 0.01–complete surface brightness and g − r cuts. This means that only objects with both g − r or μ_r,eff outside of the z < 0.01–complete photometric cuts are included in the extended sample in dark time. In addition, in the northern sky, Tier 1 only contains objects within the z < 0.01–complete photometric cuts.

4. Characterizing the Early LOW-Z Sample

The initial data for the LOW-Z program was taken between 2021 April and June as part of the DESI One-Percent Survey (April–May) and early Main Survey (May–June). The early Main Survey data represents the first 2 months of data taken for Y1 of the DESI survey and should be representative of the full Y1 data set. The DESI One-Percent Survey took place before the beginning of data taking for Y1 of the DESI Main Survey. It was designed to operate similarly but with more passes per tile and longer exposure times. In total, the One-Percent Survey covered an area of 180 deg². The One-Percent data set will be released as part of the Early DESI Data Release, expected in mid-2023 (DESI Collaboration et al 2023). The LOW-Z targeting strategy was the same for both DESI One-Percent and Y1 (see Section 3). LOW-Z targets will be identifiable in all DESI Data Releases by selecting targets with

$\begin{eqnarray*}&&\begin{array}{l}{\mathtt{SCND}}\_{\mathtt{TARGET}}={2}^{15}\ (\mathrm{for}\ \mathrm{tier}\ 1);\\ {\mathtt{SCND}}\_{\mathtt{TARGET}}={2}^{16}\ (\mathrm{for}\ \mathrm{tier}\ 2);\\ {\mathtt{SCND}}\_{\mathtt{TARGET}}={2}^{17}\ (\mathrm{for}\ \mathrm{tier}\ 3).\end{array}\end{eqnarray*}$

4.1. Redshift Sample by Tier

Between the beginning of the DESI One-Percent Survey through the end of the scheduled summer shutdown in July, redshifts were obtained for 143,486 unique LOW-Z targets. Of the full sample, 6633 are from Tier 1 (4.6%), 9479 are from Tier 2 (6.6%), and the remaining 127,374 are from Tier 3 (88.8%) (Table 3). The Tier 3 objects dominate the sample due to the overlap with BGS. BGS targets have higher priority than LOW-Z and therefore have a higher fiber allocation fraction. Of the total number of objects that were allocated fibers in Tier 3, 120,002 (94.2%) received fibers as part of the BGS sample (along with 5478 (82.6%) objects in Tier 1 and 724 (7.6%) objects in Tier 2).³⁹ The redshift distribution of all three tiers can be seen in the left panel of Figure 2. This figure represents 96% of the LOW-Z sample, with only 4% of objects having redshifts z > 0.3 (the high-z tail is not plotted for visual clarity). The median redshift of all objects in Tier 1 is 0.05, for Tier 2, it is 0.12, and for Tier 3, it is 0.15. While this demonstrates the effectiveness of the whole LOW-Z program at selecting low-redshift galaxies, it especially exemplifies the efficacy of the CNN at selecting a sample of the lowest redshift objects (z < 0.03).

**Figure 2.** Left: redshift distribution for all objects in the LOW-Z sample. The color represents which LOW-Z tier the objects come from. The redshift limits (0.001, 0.3) include 95% of the full sample, with a small tail to higher redshifts. Center: redshift distribution for all objects between 0.001 < z < 0.03. This redshift range is dominated by the Tier 1 objects selected by the CNN. Right: redshift distribution for all of the dark-time objects in the LOW-Z sample. The color represents which LOW-Z tier the objects come from. The redshift limits (0.001, 0.3) include 90% of the full dark-time sample, with a small tail to higher redshifts.
Download figure:
Standard image High-resolution image

Table 3. Observed Number of Targets in LOW-Z Survey Split by Tier and Redshift

	z < 0.01	z < 0.03	All Redshifts
One-Percent Survey
Tier 1	26	382	2015
Tier 1 (excl. BGS)	12	167	992
Tier 2	5	100	7445
Tier 3	3	179	27,021
Tier 3 (excl. BGS)	2	37	5906
Main Survey
Tier 1	53	875	4618
Tier 1 (excl. BGS)	4	34	163
Tier 2	1	22	2034
Tier 3	3	461	100,353
Tier 3 (excl. BGS)	0	4	1413

Notes. The One-Percent survey and main survey samples are non-overlapping. The Main Survey results presented here include just the first 2 months of DESI Y1.

Download table as: ASCII Typeset image

Focusing on the lowest redshift objects (z < 0.03) in the LOW-Z sample, we are left with a sample of 2019 objects: 1257 are from Tier 1 (62.3%), 122 are from Tier 2 (6.0%), and 640 are from Tier 3 (31.7%) (Table 3). Approximately 20% of the CNN-selected sample consists of objects at z < 0.03, consistent with the expected purity based on CNN cross-validation results. Of the z < 0.03 sample, 356 are faint (r > 19.5) non-BGS targets: 201 from Tier 1, 114 from Tier 2, and 41 from Tier 3.

4.2. LOW-Z Dark-time Sample and BGS Overlap

We can also examine the sample of objects that received fibers specifically as part of the LOW-Z program (rather than BGS targets in the LOW-Z sample). These targets are interesting because they were observed in dark time, making it possible to get successful redshifts for fainter and lower surface brightness objects. Out of the 143,486 LOW-Z objects, 17,437 were observed during dark time. While 155 of these represent dark-time observations of BGS objects (due to overlap with the luminous red galaxies (LRG) or emission line galaxies (ELG) samples), the rest are objects outside the BGS main sample (Table 2). Out of the 17,437 objects, 1160 (6.6%) are from Tier 1, 8757 (50.2%) are from Tier 2, and 7520 (43.1%) are from Tier 3. On average, the dark-time sample has slightly higher redshifts than the full sample. However, the median redshifts in Tier 1 and Tier 2 are the same as for the full sample, indicating this is mainly driven by the objects in Tier 3, which have a median redshift of 0.20. This result is likely due to the targeting error referenced in Section 3.3.3, meaning that the majority of Tier 3 objects in this regime are being sampled from the z < 0.03–complete photometric cuts outside of both the z < 0.01–complete color and surface brightness cuts. Since these objects are the reddest and most compact objects we target, we expect this sample to contain the lowest density of low-redshift objects. The full redshift distribution can be seen in the right panel of Figure 2. As shown in Figure 10, we are significantly more likely to obtain successful redshifts for low r_fib objects if they were observed in dark time.

In Figure 3, we also plot the apparent magnitude–redshift distribution for both the LOW-Z, BGS, and overlapping samples at z < 0.03 for the One-Percent and Early Main Survey data. About 80% of the z < 0.03 objects are in the overlapping sample. A further 17% of objects are exclusively LOW-Z galaxies; these galaxies tend to be fainter than the overlapping sample, as expected given the apparent magnitude range of the two surveys. The final 3% of objects are exclusively BGS objects. These objects tend to be higher redshift and are discussed further in Section 4.5.

4.3. Galaxy Properties of the Early LOW-Z Sample

The absolute r-band magnitude and stellar mass of the full LOW-Z sample at z < 0.03 is shown in the left and center panels of Figure 4. K-corrected r-band absolute magnitudes are derived using the program FastSpecFit.⁴⁰ Stellar masses are derived using g − r color and absolute r-band magnitude following Mao et al. (2021). While the distributions of Tier 2 and Tier 3 distributions look similar, the CNN-selected objects tend to be fainter in M_r and at lower stellar masses. The LOW-Z sample contains a considerable number of galaxies with M_* < 10⁹ M_⊙, making it an interesting data set for studying dwarf galaxies. Out of the full LOW-Z sample, 22,679 objects have M_* < 10⁹ M_⊙, 2011 objects have M_* < 10⁸ M_⊙, and 98 objects have M_* < 10⁷ M_⊙. The right panel of Figure 4 shows the distribution of stellar mass as a function of redshift colored by tier. On average, both redshift and stellar mass increases as a function of tier, with Tier 1 objects making up the tail end of the stellar mass and redshift distribution (as can also be seen in the center panel of Figure 4). The median stellar mass for Tier 1 is 10^8.4 M_⊙ compared with 10^9.0 and 10^9.7 M_⊙ for Tiers 2 and 3, respectively. Example objects at z < 0.03 can be seen in Figure 5, sorted by decreasing surface brightness and increasing stellar mass. Since stellar mass depends on color and surface brightness, higher stellar mass objects can be seen to be redder and have larger absolute magnitudes.

**Figure 4.** Left panel: absolute r-band magnitude distribution for the LOW-Z sample at z < 0.03 labeled by tier. Center: Stellar mass distribution for the LOW-Z sample at z < 0.03 labeled by tier. Right panel: scatter plot of redshift vs. ${\mathrm{log}}_{10}{M}_{* }$ (M_⊙) colored by tier. The histograms on the top and left are normalized to show the shape of the distributions. By tier, the median redshift is [0.05, 0.12, 0.16], respectively, and the median stellar mass is [10^8.4, 10^9.0, 10^9.7], respectively.
Download figure:
Standard image High-resolution image

**Figure 5.** Example observed LOW-Z objects with successful redshifts at z < 0.03. Objects increase in stellar mass from top to bottom and decrease in surface brightness from left to right. The redshift and magnitude for each of the galaxies is labeled in white.
Download figure:
Standard image High-resolution image

The effective surface brightness versus physical radius for the LOW-Z galaxies at M_* < 10⁹ M_⊙ is shown in Figure 6. Of the LOW-Z dwarf galaxies, 469 are in the ultra-diffuse regime as defined by van Dokkum et al. (2015).

4.4. LOW-Z Fiber Allocation

As the LOW-Z program is a secondary target program, not all targets will be observed during the DESI survey. To understand the observed density of targets, we examine completed tiles taken as part of the One-Percent Survey. Out of the full sample, 36,481 objects received fibers during the One-Percent Survey, corresponding to an observed target density of ∼200 objects per square degree ([11, 41, 150] per square degree in Tier [1–3]; Table 1) or approximately a 67% fiber allocation fraction for the LOW-Z program. These numbers are consistent with close to 100% fiber allocation for objects that overlap with the BGS survey and approximately 30% fiber allocation for objects observed in dark time as part of the LOW-Z survey (Table 1). Since we are a spare fiber program, the observed target density will vary over the sky depending on the density of the primary targets. However, because we are a dark-time survey, our targets are primarily being displaced by ELG, LRG, and quasar targets. All of these surveys are focused on much higher-redshift targets (z > 0.4), so our observed target density should not depend on the local density of objects at low redshift but rather should vary approximately independently of the low-redshift environment across the sky.

Since the One-Percent Survey had a different survey strategy than the main survey, which may have led to more LOW-Z targets receiving fibers than in the main survey, we verify these numbers using the DESI fiber assignment code (A. Raichoor et al. 2023, in preparation) run on a small patch of the sky. After seven passes, we find that ∼30% of our dark-time targets are assigned fibers, while for BGS, after four passes in bright time, we find that ∼75% of targets are assigned fibers. This is lower than our estimate from the One-Percent Survey, which is most likely due to the extra passes per tile completed during the One-Percent Survey. Combining the fiber allocation between bright and dark times gives a total fiber allocation for the LOW-Z survey of ∼50%.

Using data from the One-Percent Survey, we can also estimate the number of low-redshift (z < 0.03) targets per square degree we can expect to be observed as part of the LOW-Z Survey. During the One-Percent Survey, the LOW-Z sample selection returned 661 objects with z < 0.03. Since the One-Percent Survey covered 180 deg², this corresponds to approximately 3.7 observed objects per square degree (Table 1). This number may be a slight overestimation for the full survey as the One-Percent Survey had longer exposures and more passes per tile than the main survey.

4.5. Sample Selection Validation

We further validate our sample selection methods by using all redshifts from DESI Y1 data, including redshifts that are not from the LOW-Z program. As these objects are not part of the LOW-Z photometric sample, it allows us to examine if any low-redshift objects are missed from our sample selection. Due to the overall design of the DESI survey, at low redshifts (z < 0.1), this sample is dominated by galaxies from the BGS galaxy sample. While the BGS sample is only complete out to r = 19.5, it is not subject to the same color and surface brightness cuts as the LOW-Z sample, allowing us to validate the completeness of the current set of catalog-level photometric cuts for a sample of objects outside of the LOW-Z selection. However, this calculation is limited by the dominance of the LOW-Z sample at low redshifts in the DESI data as well as underlying incompleteness in the Legacy Imaging DR9 photometric catalogs used to select all DESI targets. We discuss these limitations further at the end of this section.

Figure 7 shows the low-redshift galaxies in DESI as a function of r, g − r, μ_r,eff, and redshift. The gray line is constrained to have the same slope as the z < 0.01–complete and z < 0.03–complete photometric cuts, and the intercept is varied to capture 95% of the sample. We see a steady redshift-dependent evolution for both the g − r and μ_r,eff fits. We recover that the z < 0.01–complete photometric cuts are complete to the SAGA goal of z < 0.01. Furthermore, the difference between the fit and the z < 0.03–complete photometric cuts is negligible for both g − r and surface brightness. This indicates that the z < 0.03–complete photometric cuts are indeed quite complete out to the LOW-Z sample goal of z < 0.03 relative to the broader cuts used to select BGS galaxies.

Focusing on the z < 0.03 objects that are not in the LOW-Z sample, we can divide the objects into three categories: (1) objects that are outside the LOW-Z color–surface brightness cuts, (2) objects that are within the LOW-Z color–surface brightness cuts but that were excluded from the LOW-Z sample due to our catalog cleaning cuts (Section 3.1), and (3) junk objects. The third category mainly consists of misclassified pieces of brighter galaxies. Examples of objects from each of the three categories can be seen in Figure 8.

**Figure 8.** Example DESI objects at z < 0.03 that are not in the LOW-Z sample. (1) Top row: objects that are outside of the z < 0.03–complete color cuts. (2) Second row: objects that are outside of the z < 0.03–complete surface brightness cuts. (3) Third row: objects that are within the z < 0.03–complete photometric cuts but are removed by our photometric cleaning cuts. (4) Bottom row: junk objects (generally parts of larger galaxies).
Download figure:
Standard image High-resolution image

The objects in the first and second categories tend to be compact, high-surface brightness objects, while the objects in category three tend to be miscentered large, bright nearby galaxies. Out of the 994 objects with DESI spectra at z < 0.03 that are not part of the LOW-Z survey, only 97 are outside of the z < 0.03–complete photometric cuts. This aligns with what we see in Figure 7 that ∼95% of the sample at z < 0.03 is within the z < 0.03–complete photometric cuts. These 97 objects represent less than 5% of the sample at z < 0.03. A further 397 are within the z < 0.03–complete photometric cuts but are removed from the sample due to the photometric cleaning cuts imposed in Section 3.1. The majority of these objects are removed by the cut on SIGMA_GOOD ≥30 and RCHISQ ≤0.85. As a result, we modify our target selection in Y2 so that they do not include these requirements (see Section 6 for details). The remaining are junk objects.

In order to better understand the objects that were being missed outside of the z < 0.03–complete photometric cuts, we visually inspected the spectra of the 97 objects. Twenty of these objects were found to be either quasars or stars misidentified as galaxies. Another six were objects at z ∼ 0.1 misclassified as z < 0.03. Removing these objects left us with 71 objects that were actually galaxies at z < 0.03 outside of the z < 0.03–complete photometric cuts, all of which come from the BGS sample. Out of the 71 objects, 20 are outside of the z < 0.03–complete color cuts, 41 are outside of the z < 0.03–complete surface brightness cuts, and a further 10 are outside of both the color and surface brightness cuts. The color–surface brightness distribution of objects can be seen in Figure 9. The vast majority of these objects are at z > 0.02 (66 out of 71).

**Figure 9.** Magnitude–color (left) and magnitude–surface brightness (right panel) plots for the z < 0.03 objects outside of the z < 0.03–complete photometric cuts colored by redshift. All of these objects come from the BGS sample. The color–surface brightness distributions of the LOW-Z objects within the z < 0.03–complete photometric cuts at z < 0.03 are plotted in black.
Download figure:
Standard image High-resolution image

We further investigate the 30 objects that fall outside of our z < 0.03–complete color cuts. These objects are of particular interest as we want a complete sample of quenched objects in order to further understand dwarf galaxy formation as a function of environment with the LOW-Z sample. Of these objects, 28/30 are at z > 0.02, and all of them are at z > 0.01. Four of these objects are blended objects with incorrect photometry. Most of the remaining are in extremely high-density environments (13/30 are members of the Coma Cluster), where we expect to find the reddest and most compact low-mass objects.

Removing all objects in Coma and with obvious photometric errors, we are left with only 13 objects. While these objects represent an interesting sample for further follow-up, they do not indicate a significant population of isolated quenched objects outside of our z < 0.03–complete color cuts.

As stated above, this analysis is limited by the sample of redshifts available in DESI, which is dominated by objects selected by the LOW-Z program. Since LOW-Z is pushing the forefront for faint low-redshift surveys, accurate characterization of its completeness is difficult given the lack of available data with which to compare; externally validating our redshift completeness will remain an active area of research for the program going forward. We are also limited by catalog-level incompleteness in the Legacy Imaging Survey DR9 catalogs used to select DESI targets, especially for the lowest surface brightness objects. We anticipate that these biases will be better characterized by current (e.g., Aihara et al. 2018; Danieli et al. 2020; Carlsten et al. 2022) and future low surface brightness galaxy surveys (e.g., Spergel et al. 2015; Ivezić et al. 2019; Borlaff et al. 2022), and partially ameliorated by more advanced techniques for constructing and cleaning photometric catalogs (e.g., Walmsley et al. 2019; Greco et al. 2021; Tanoglidis et al. 2021; Di Teodoro et al. 2023).

In addition to incompleteness in our target selection, an additional source of incompleteness comes from observed sources for which we are unable to accurately determine a redshift. We discuss redshift failure rates further in the following section. However, since we do not see evidence for a population of galaxies we are missing with our current z < 0.03–complete photometric cuts, we do not propose an update to the photometric selection for DESI Y2 (Section 6.2).

4.6. Redshift Success Rate

Since our sample extends to fainter r-band apparent magnitudes than the BGS sample, we are interested in the redshift success rates for these objects. This has important implications both for understanding the power of the DESI instrument as well as understanding the completeness of the observed LOW-Z sample. We define a successful redshift as a redshift with ZWARN = 0 and Δχ² > 30, indicating no warning flags raised and a high level of redshift confidence (Δχ² is the difference in χ² for the two best-fitting models). Redshift failure as a function of r, g − r, μ_r,eff, and r_fib is shown in Figure 10. We separate out bright and dark-time observations to show the dependence of the failure rate on observing conditions. However, we do not separate between observations taken in Y1 and the One-Percent Survey. Despite the differences in survey strategy and longer exposure times for the One-Percent data, we do not find a significant difference in the redshift failure rates as a function of any of the variables we consider between DESI One-Percent and Y1, leading to our choice to show results from the combined sample.

**Figure 10.** Fractional redshift failure as a function of r (left), g − r (center left), r_fib (center right), and μ_r,eff (right). The sample is split between spectra obtained in bright time (red) and dark time (purple) with Poisson error bands. The increase in redshift failures in dark time at r < 19.5 is being driven by the low fiber magnitude of these large nearby objects, all of which are beyond the BGS fiber magnitude cut at r_fib = 22.9.
Download figure:
Standard image High-resolution image

Redshift failure rates show negligible evolution as a function of r and g − r for both bright and dark-time targets, indicating that DESI is able to capture redshifts out to our apparent magnitude limit of r = 21 in dark-time and out to the BGS limit of r = 20.2 in bright time across the full range of g − r colors included in the z < 0.03–complete color cuts. The increase in redshift failures in dark time at r < 19.5 is due to the fact that the only objects in this regime observed in dark time are objects outside of the BGS r_fib cut and thus correspond to a sample of objects with r_fib > 22.9. Therefore, the increasing failure rate can be attributed to their high r_fib rather than a magnitude dependence. We do see a significant increase in redshift failures for the lowest surface brightness and r_fib objects. The failure rate increases to almost 40% at μ_r = 27 mag arcsec⁻² for both dark- and bright-time targets. The trend with r_fib is even more dramatic, with failures increasing to around 70% at the bright-time limit of r_fib = 22.9 for objects observed in bright time and to a similar rate at r_fib = 24 for objects observed in dark time. This indicates that object surface brightness and, by extension, fiber magnitude rather than apparent magnitude is the biggest limitation for getting successful LOW-Z redshifts with DESI. In addition, it can be seen in Figure 11 that redshift failure is correlated with p_CNN at fixed r_fib, which is statistically driven by bright-time observations (the dark-time spectroscopic failure rates do not show a significant trend). This is concerning because it indicates that redshift failure rates may be correlated with the likelihood of an object being low redshift. Observational effects may be able to explain this trend: the spectra of lower redshift galaxies feature the [O ii] doublet emission line at bluer observed wavelengths, where the DESI spectrograph is less sensitive (see, e.g., Abareshi et al. 2022).⁴¹ Thus, it becomes more difficult to confirm the redshift for bona fide lower redshift galaxies via the distinguishing [O ii] spectral feature. If p_CNN truly selects lower redshift galaxies, then we may expect targets with higher values of p_CNN to result in a higher rate of redshift failures. We expect to be able to characterize this potential effect significantly better using the full Y1 and Y2 data sets.

**Figure 11.** Fractional redshift failure as a function of r_fib and p_CNN. Objects with p_CNN > 0.6 are assigned a value of p_CNN = 0.6 to increase statistics in the highest bin.
Download figure:
Standard image High-resolution image

5. Discussion: The LOW-Z Survey in Context

The sample of low-redshift galaxies from LOW-Z is already significant when compared to previous surveys. The SDSS main survey (covering 9380 deg²) only has ∼0.5 objects per square degree at z < 0.03 and only a few hundred are at r > 19 (Aihara et al. 2011). GAMA, meanwhile, has about 17 objects per square degree at z < 0.03 but only covers 250 deg² of the sky (Driver et al. 2022). With more than 2000 objects at z < 0.03, the LOW-Z sample is already competitive with the GAMA and SDSS samples, which each contain around 5000 objects at z < 0.03. Additionally, SDSS is only complete down to r = 17.77 and GAMA to r = 19.65. The LOW-Z sample also already contains roughly the same number of objects at r > 19 as the SAGA sample. SAGA has 1440 objects at z < 0.03 (420 of which are at r > 19). This corresponds to approximately 17 (5 at r > 19) objects per square degree (because the number density is enhanced by satellite galaxies around SAGA hosts).

We use the GAMA luminosity function (Loveday et al. 2015) to estimate the total expected density of z < 0.03 objects in the sky to a given magnitude. At r < 21, we expect approximately 16 objects per square degree (8 deg⁻² between 19 < r < 21). For the LOW-Z sample, we find an observed density within this magnitude and redshift range of 3.7 objects per square degree (Table 1). Correcting for the low fiber allocation fraction of the LOW-Z program (approximately 67% in the One-Percent Survey; Section 4.4), this gives us an estimated completeness of ∼70%. This is slightly lower than our estimate of >95% target completeness for the z < 0.03–complete photometric cuts (Section 4.5). This underestimate could be due to the over-representation of BGS objects in the One-Percent sample (see Section 4.4), which skews the sample to brighter magnitudes where we expect a lower number density of z < 0.03 objects, due to uncertainties about our completeness (see Section 4.5), or due to sample variance in the GAMA estimate. For comparison, at z < 0.03, BGS Bright has about 1.5 objects per square degree at r < 19.5, and BGS Faint has about 0.5 between 19.5 < r < 20.3. Assuming >95% fiber allocation for BGS during the One-Percent Survey, we estimate that the BGS target selection is close to 100% complete at r < 19.5 and 15% complete between 19.5 < r < 20.3 for objects at z < 0.03. Figure 3 already shows how the LOW-Z survey complements the BGS Faint sample by filling in the lowest redshift galaxies between 19.5 < r < 20.3.

A comparison between the number of galaxies at M < M_* between LOW-Z, GAMA (Driver et al. 2022), and SDSS-DR8 (Kauffmann et al. 2003; Aihara et al. 2011; Blanton et al. 2011) is shown in the left panel of Figure 12. At M_* < 10⁹ M_⊙, LOW-Z already has more galaxies than GAMA and is competitive with SDSS. The gray-shaded region gives the estimated number of LOW-Z galaxies that would be observed over the full 14,000 deg² DESI footprint. This estimate is done by rescaling the number of galaxies in the 180 deg² region covered by the One-Percent Survey to the full survey area and does not account for the updates in targeting described in Section 6.2. Even in the lower limit where no targeting improvements are included, we predict that by the end of the 5 yr DESI survey, if the LOW-Z survey continued as it did in Y1, it will have surpassed the number of dwarf galaxies (M_* < 10⁹ M_⊙) identified by the SDSS main survey and GAMA by an order of magnitude.

The right panel of Figure 12 shows the median stellar mass as a function of redshift for the LOW-Z, GAMA, and SDSS-DR8 samples. For comparison, the dashed lines show the median redshift for a complete magnitude-limited survey assuming the GAMA luminosity function and a luminosity–stellar mass relation fit to the GAMA data. The LOW-Z sample has a lower median stellar mass at all redshifts than either the GAMA or SDSS-DR8 samples and lies close to the theoretical line for a complete magnitude-limited survey to r < 21.

6. The Future of LOW-Z

The LOW-Z program will continue to survey a highly complete sample of z < 0.03 objects. Using our results from the survey validation and the first 2 months of DESI Y1 observations, we implement the following updates to the LOW-Z survey targeting strategy for DESI Y2. These updates (a) reduce overlap with BGS, and (b) improve the completeness for the more efficient CNN selection. Based on Figure 10, we extend the faint end of the LOW-Z survey to r = 21.15. However, in combination with this extension, we implement a fiber magnitude cut at r_fib < 23.5 to avoid targeting objects with a low likelihood of redshift success. We also remove the cleaning cut mentioned in Section 4.5. Additionally, we remove objects that overlap with the BGS survey given the high fiber allocation fraction and redshift success BGS has achieved so far (Hahn et al. 2022). Since we are removing the BGS targets, we expect the Y2 LOW-Z fiber allocation fraction to be lower than that found for Y1; we estimate that it will be ∼30%, with all of these objects receiving fibers in dark time. The exception will be objects with r_fib > 22, where we expect to achieve higher-redshift success in dark time (Figure 10). We expect that this change, which allows us to include all objects in the z < 0.03–complete photometric cuts without subsampling, to maximize the number of z < 0.03 objects targeted by DESI between the BGS and LOW-Z samples. Combined, these changes only require a slight increase in the LOW-Z target density (∼425 targets per square degree).⁴²

6.1. CNN Retraining

Our resulting LOW-Z sample provides a more comprehensive data set for retraining and validating the CNN. During the same time period, the SAGA Survey has obtained more redshifts for objects within the projected virial radius of z ∼ 0.01 host galaxies (Y.-Y. Mao, in preparation). Our updated training set consists of 29,537 SAGA redshifts and 139,245 DESI LOW-Z objects within the expanded color–surface brightness selection that includes 95% of z < 0.03 galaxies (Equations (3) and 4). We retrain our CNN using the same architecture and framework described in Appendix A using the updated redshift catalog.⁴³ Because our aim is to form a complete survey of z < 0.03 galaxies and BGS has already demonstrated a high level of completeness for low-redshift objects, we evaluate the retrained CNN performance on targets outside of the BGS cuts.

We perform k = 5 fold cross validation and save each CNN model trained on an 80% subset of the data; our results are based on the averaged predictions over the ensemble of five CNNs. From the cross-validation results, we are able to assess the completeness as a function of p_CNN, or equivalently, target density. A given p_CNN threshold corresponds to different number densities in the northern and southern skies due to differences in telescope instrumentation and BGS selection criteria. We propose a CNN-selected target density of 95 deg⁻² for objects outside of the BGS cuts and in the magnitude regime 19 ≤ r < 21.15, which corresponds to p_CNN > 0.0894 in the northern sky and p_CNN > 0.1198 in the southern sky. For 19 ≤ r < 21 objects outside BGS cuts, we expect that the retrained CNN can achieve 85%–90% completeness for z < 0.03 objects. This target list comprises ∼10⁵ objects at r < 21 that would otherwise not be targeted by BGS. We also note that the p_CNN thresholds correspond to >95% completeness for objects in our entire redshift catalog (including BGS objects). Our CNN forecasts for Y2 are shown in Table 4. We compare the performance of the Y1 and Y2 CNNs in Appendix B.

Table 4. CNN Forecasts for Y2 LOW-Z Tier 1

Sample	Target Density [deg⁻²]	Estimated Completeness (z < 0.03)	Number of z < 0.03 Objects
North (p_CNN > 0.0894)
19 < r < 21	78.2	0.911 ± 0.067	∼3.6 × 10⁴
21 < r < 21.15	16.8	⋯	⋯
r_fib > 22, r < 21 (in BGS)	1.7	⋯	⋯
South (p_CNN > 0.1198)
19 < r < 21	78.7	0.867 ± 0.056	∼6.2 × 10⁴
21 < r < 21.15	16.3	⋯	⋯
r_fib > 22, r < 21 (in BGS)	1.6	⋯	⋯

Notes. p_CNN refers to the threshold value for CNN-selected targets. Forecasts for completeness of the z < 0.03 sample and the total number of z < 0.03 galaxies are based on CNN cross validation on 19 ≤ r < 21 objects and an assumed density of eight low-z objects per square degree.

Download table as: ASCII Typeset image

6.2. LOW-Z Y2 Selection

Our Y2 sample, which began getting data in Fall 2022, consists of all objects between 19 ≤ r < 21.15 and r_fib < 23.5 within the z < 0.03–complete photometric cuts (Section 3.2) and excluding objects in the BGS Bright and Faint samples for objects with r_fib < 22. The full sample of LOW-Z targets for Y2 is then divided into two tiers of priority (Table 1):

Y2 Tier 1 (∼97 objects per square degree) consists of objects selected by the retrained CNN from the z < 0.03–complete photometric cuts sample. Our selection includes the top-ranked 95 objects per square degree outside BGS cuts in the 19 < r < 21.15 range, in addition to the top ∼1.7 CNN-selected objects per square degree with r_fib > 22. This latter set of objects represents BGS targets that may encounter high-redshift failure rates in bright observing time.

Y2 Tier 2 (∼325 objects per square degree) consists of all objects from the z < 0.03–complete photometric cuts sample that are outside of the main BGS color cuts (Ruiz-Macias et al. 2020) or at r_fib > 22.0 (and are not in Tier 1).

We plan to continue to characterize the Y2 sample as new data comes in, although at present we do not plan to make further significant updates to our targeting strategy during the DESI main survey.

6.3. Optimizing Catalog-level Photometric Selection as a Function of Redshift for Future LOW-Z Surveys

We parameterize the fit in Section 4.5 as a function of redshift and find that the redshift evolution of the intercept is well described by a linear fit. For the color cuts, the redshift evolution is described by the equation

$\begin{eqnarray}&&{(g-r)}_{o}-{\sigma }_{{gr}}+0.06({r}_{o}-14)\gt 2.62\times z+0.90,\end{eqnarray} \tag{ 3 }$

while the surface brightness cuts evolve as

$\begin{eqnarray}&&{\mu }_{{r}_{o},\mathrm{eff}}+{\sigma }_{\mu }-0.7\,({r}_{o}-14)\lt -14.35\times z+17.33.\end{eqnarray} \tag{ 4 }$

This parameterization gives us a way to estimate target density as a function of redshift and magnitude for a complete LOW-Z survey using only catalog-level photometric information from DR9.

An example of projected target density as a function of redshift for a range of cuts in apparent r-band magnitude is shown in Figure 13. As expected, there is a trade-off between maximum apparent magnitude and redshift. We estimate that for z < 0.03, we could be complete out to r < 21 at 350 targets per square degree and r < 22 at 800 targets per square degree. We can use the GAMA luminosity and stellar mass functions to translate our completeness to a function of stellar mass (Loveday et al. 2015; Wright et al. 2017). For a survey of z < 0.03 galaxies out to r < 22 we expect to be complete for galaxies with M_* > 10⁷ M_⊙. These results can help inform future planning for DESI II and beyond on how to design an optimally targeted low-redshift survey.

7. Conclusions

We have described the DESI LOW-Z survey, a DESI secondary target program that has already generated a large and scientifically interesting survey of low-redshift objects and dwarf galaxies in the early stages of the DESI survey. This survey (including overlap with DESI BGS selection) includes over 140,000 objects with redshifts, over 22,000 dwarf galaxies (M_* < 10⁹ M_⊙), and over 2000 low-redshift objects (z < 0.03), rivaling SDSS and GAMA for the total number of low-redshift dwarf galaxies. Using the first few months of data from the DESI Y1 survey, we have validated the completeness of our photometric cuts at capturing the population of low-redshift galaxies. While we use all available low-redshift objects to evaluate our completeness, we note that the LOW-Z sample dominates the data set. We have also studied the properties of a CNN-selected sample with lower target density, trained on low-redshift data from the SAGA Survey.

We find that:

1.
Our z < 0.03–complete photometric cuts are ∼95% complete at z < 0.03 between 19 < r < 21.
2.
Our CNN is approximately 20% efficient at selecting low-redshift galaxies, compared to efficiencies of ∼1% using traditional photometric methods.
3.
We achieve ∼75% fiber allocation for objects that overlap with BGS and ∼30% fiber allocation for objects outside of the BGS Bright and BGS Faint samples for a combined fiber allocation fraction of ∼50%.
4.
We find no evidence of increasing redshift failures with r-band magnitude, but see a strong increase in the redshift failure rate as a function of r_fib for objects at r_fib > 23 in dark time and r_fib > 22 in bright time. We also find that this increase in redshift failure is correlated with p_CNN at fixed r_fib, indicating somewhat lower redshift success for true low-redshift galaxies.
5.
The LOW-Z survey is currently observing 3.7 low-redshift galaxies (z < 0.03) per square degree. We expect this to be a lower limit for DESI Y2 observations, given improved targeting strategies.

Based on these data, we have retrained a new CNN to select a complete and efficient sample of low-redshift galaxies. Using this retrained CNN, we estimate that we can achieve 85%–90% completeness within our catalog-level photometric cuts to z < 0.03 with ∼80 targets per square degree for 19 < r < 21. Using this information, we update our Y2 targeting strategy to target objects outside of the BGS survey to a slightly fainter magnitude limit (r < 21.15) with a fiber magnitude cut at r_fib < 23.5.

Beyond Y2, the LOW-Z survey provides a blueprint for the design of a higher-priority low-redshift survey as part of Y3–Y5 or DESI II. In the future, we estimate that we could run a complete low-redshift survey (z < 0.03) at 350 targets per square degree at z < 21 or 800 targets per square degree at z < 22. Translating this to stellar mass would correspond to a complete survey for galaxies with M_* > 10^7.5 M_⊙ or M_* > 10^7.0 M_⊙ respectively. Such a dense map of the local universe would provide an incredibly rich data set for studying the local density and velocity field and the relation of galaxy properties to this field, for identifying the host galaxies of transients and gravitational waves, and for expanding our understanding of galaxy formation at the lowest masses.

Acknowledgments

We thank Mia de los Reyes and Kelly Douglass for their helpful comments on the draft. We would also like to thank the DESI Collaboration internal reviewers, Rita Tojeiro and Jeremy Tinker, for helpful feedback that improved the paper. We also thank Mike Blanton for suggesting an observational bias that could explain lower redshift success rates for lower redshift galaxies. We are grateful to the anonymous referee for comprehensive comments that significantly improved the presentation of the paper.

This work received support from the Kavli Institute for Particle Astrophysics and Cosmology at Stanford University and SLAC National Accelerator Laboratory and from the U.S. Department of Energy under contract No. DE-AC02-76SF00515 to SLAC National Accelerator Laboratory. Support for Y.Y.M. was partly provided by NASA through the NASA Hubble Fellowship grant No. HST-HF2-51441.001, awarded by the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Incorporated, under NASA contract NAS5-26555.

The SAGA Survey (sagasurvey.org) is a spectroscopic survey with data obtained from the Anglo-Australian Telescope, the MMT Observatory, and the Hale Telescope at Palomar Observatory. The SAGA Survey made use of public imaging data from SDSS, the DESI Legacy Imaging Surveys, and the Dark Energy Survey, and also public redshift catalogs from SDSS, GAMA, WiggleZ, 2dF, OzDES, 6dF, 2dFLenS, and LCRS. The SAGA Survey was supported by NSF collaborative grants AST-1517148 and AST-1517422 to R.H.W. and M.G. and by Heising-Simons Foundation grant 2019-1402.

This material is based upon work supported by the U.S. Department of Energy (DOE), Office of Science, Office of High-Energy Physics, under Contract No. DE-AC02-05CH11231, and by the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility under the same contract. Additional support for DESI was provided by the U.S. National Science Foundation, Division of Astronomical Sciences under Contract No. AST-0950945 to the NSF's National Optical-Infrared Astronomy Research Laboratory; the Science and Technologies Facilities Council of the United Kingdom; the Gordon and Betty Moore Foundation; the Heising-Simons Foundation; the French Alternative Energies and Atomic Energy Commission (CEA); the National Council of Science and Technology of Mexico (CONACYT); the Ministry of Science and Innovation of Spain (MICINN), and by the DESI Member Institutions. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the U.S. National Science Foundation, the U.S. Department of Energy, or any of the listed funding agencies.

The authors are honored to be permitted to conduct scientific research on Iolkam Du'ag (Kitt Peak), a mountain with particular significance to the Tohono O'odham Nation.

Data Availability

LOW-Z data will be released as part of the DESI data releases. A portion of the data analyzed here will be released as part of the Early DESI Data Release, expected in mid-2023.

All data points shown in the figures are available in a machine-readable form at doi:10.5281/zenodo.7422591.

Appendix A: CNN Optimization Details

We train CNNs to identify z < 0.03 galaxies from optical image cutouts. The optimized model acts as a mapping between images ( $x\in {{\mathbb{R}}}^{3\times 144\times 144}$ ) to a scalar prediction, p_CNN ∈ [0, 1]; if p_CNN exceeds some threshold value, then the input image can be classified as a low-redshift galaxy candidate. Because modern neural networks have ${ \mathcal O }({10}^{7})$ tunable parameters (He et al. 2016), the optimization process must be done carefully. We closely follow the methodology of Wu et al. (2022), which uses a trained CNN to identify z < 0.03 galaxies with balanced purity and completeness (i.e., similar levels of false positives and false negatives). In this work, we have selected galaxies at a higher level of completeness at the cost of lower purity (accomplished by using a threshold for p_CNN below the value of 0.5 used by Wu et al. 2022).

We use an extended version of the Mao et al. (2021) SAGA redshift catalog as the ground truth data set for training the model. We use an 80%/20% training/validation split, such that a random 80% subset is used for training, and the remaining 20% is used for evaluating the CNN. We focus on the accuracy, purity (precision), and completeness (recall) metrics for evaluating CNN performance on z < 0.03 predictions. These metrics allow us to compare different combinations of hyperparameters, such as the model architecture, optimization objective, and optimization schedule. By examining these validation metrics, we can gauge whether the model has overfit, leading to strong performance on the training data but poor generalization on unseen data, or if the model has converged.

We briefly summarize the hyperparameter choices used in our CNN model. Our model architecture is a 34 layer residual neural network with several modifications that allow for efficient processing of sparse astronomical images (Wu & Peek 2020). Because the training data are heavily imbalanced in favor of high-z examples, we adopt the Focal Loss function for optimization (Lin et al. 2017). We train the CNN using the Ranger optimizer⁴⁴ and a one-cycle schedule for the learning rate and momentum hyperparameters (Smith 2018) for 10 epochs.

Appendix B: Comparison of the Y1 and Y2 Trained CNNs

The Y1 CNN and the Y2 retrained CNN are validated on 19 < r < 21 objects with redshifts in the DESI survey. In Figure 14, we estimate the low-redshift purity and completeness as a function of target density. Objects that fall outside BGS cuts are shown in the left panel, while all objects in DESI are shown in the right panel of Figure 14.

The Y2 CNN purity and completeness are determined using k = 5 cross validation in order to ensure independent training/validation sets, while the Y1 CNN performance is characterized using a single CNN trained on all of the then-available data; this may result in a slight underprediction of the Y2 performance. Additionally, the Y1 CNN was used to select part of the sample that was used for cross validation, thereby inflating the Y1 CNN completeness in that regime. Nonetheless, we find that the Y2 retrained CNN has improved completeness and purity as a result of its larger training set (see Section 6.1).

Target Selection and Sample Characterization for the DESI LOW-Z Secondary Target Program

Article metrics

Author e-mails

Author affiliations

ORCID iDs

Dates

Abstract

1. Introduction