Overview of the DESI Milky Way Survey

Andrew P. Cooper; Sergey E. Koposov; Carlos Allende Prieto; Christopher J. Manser; Namitha Kizhuprakkat; Adam D. Myers; Arjun Dey; Boris T. Gänsicke; Ting S. Li; Constance Rockosi; Monica Valluri; Joan Najita; Alis Deason; Anand Raichoor; M.-Y. Wang; Y.-S. Ting; Bokyoung Kim; Andreia Carrillo; Wenting Wang; Leandro Beraldo e Silva; Jiwon Jesse Han; Jiani Ding; Miguel Sánchez-Conde; Jessica N. Aguilar; Steven Ahlen; Stephen Bailey; Vasily Belokurov; David Brooks; Katia Cunha; Kyle Dawson; Axel de la Macorra; Peter Doel; Daniel J. Eisenstein; Parker Fagrelius; Kevin Fanning; Andreu Font-Ribera; Jaime E. Forero-Romero; Enrique Gaztañaga; Satya Gontcho A Gontcho; Julien Guy; Klaus Honscheid; Robert Kehoe; Theodore Kisner; Anthony Kremin; Martin Landriau; Michael E. Levi; Paul Martini; Aaron M. Meisner; Ramon Miquel; John Moustakas; Jundan J. D. Nie; Nathalie Palanque-Delabrouille; Will J. Percival; Claire Poppett; Francisco Prada; Nabeel Rehemtulla; Edward Schlafly; David Schlegel; Michael Schubnell; Ray M. Sharples; Gregory Tarlé; Risa H. Wechsler; David H. Weinberg; Zhimin Zhou; Hu Zou

doi:10.3847/1538-4357/acb3c0

1. Introduction

The Dark Energy Spectroscopic Instrument (DESI) is currently the premier multiobject spectrograph for wide-field surveys (DESI Collaboration et al. 2016a, 2022). DESI deploys 5000 fibers over a 3 fdg 2 diameter field of view at the prime focus of the Mayall 4 m telescope at Kitt Peak National Observatory (DESI Collaboration et al. 2016b; T. Miller et al. 2022, in preparation; Silber et al. 2023). The fibers feed 10 identical three-arm spectrographs, each spanning 3600–9824 Å at an FWHM resolution of about 1.8 Å. Each fiber can be positioned individually by a robotic actuator within a radius of 1 farcm 48, with a small overlap between the patrol regions of adjacent fibers. The total time to slew the telescope, read out the spectrograph, and reconfigure the focal plane between successive survey exposures can be ≲2 minutes (DESI Collaboration et al. 2022). This allows large areas of the sky to be surveyed very rapidly at a density of ∼600 targets per square degree.

Although DESI is optimized for galaxy redshift surveys, it is also well suited to observing large numbers of Milky Way stars over wide areas at low spectral resolution. DESI will therefore carry out a Milky Way Survey (MWS) alongside its primary 5 yr cosmological program (DESI Collaboration et al. 2016a). MWS will operate during bright-sky conditions (when high-redshift galaxy observations are inefficient) and will share the DESI focal plane with a low-redshift Bright Galaxy Survey (BGS; Hahn et al. 2022). Although BGS galaxies will be prioritized for fiber assignment, there is often no BGS target in the patrol region of a DESI fiber. This provides an excellent opportunity to obtain large numbers of stellar spectra, which MWS is designed to exploit.

The primary MWS program will observe approximately seven million stars to a limiting magnitude of r = 19 across the full DESI bright-time program footprint. As shown in Figure 1, this footprint covers most of the northern Galactic cap region and a significant fraction of the southern cap. It includes many known substructures in the Milky Way stellar halo, such as the Sagittarius, Orphan, and GD 1 streams; the Hercules–Aquila and Virgo overdensities; and many dwarf satellite galaxies and globular clusters. MWS will provide radial velocities, stellar parameters, and metallicities for an order of magnitude more stars than found in existing samples with similar resolution from the SEGUE and LAMOST surveys, in the same range of magnitudes (r > 16) and Galactic latitudes (see Section 2).

**Figure 1.** The DESI MWS footprint. Gray lines indicate the approximate Galactic latitude limit of the survey, b ± 20°. The density of MWS targets is shown in gray scale. Colored symbols indicate known Milky Way satellites (stars) and globular clusters (triangles). Points and tracks (colors given in the legend) show the four most prominent streams, as reported in the `galstreams` compilation (Mateu 2017): Sgr, represented by the Law & Majewski (2010) model; Palomar 5 (Price-Whelan et al. 2019); GD 1 (Price-Whelan & Bonaca 2018); and Orphan (Koposov et al. 2019). We also show the approximate extent of the Hercules–Aquila cloud as reported in `galstreams` (based on Grillmair & Carlin 2016). Many other less prominent streams and stellar overdensities are known to be in the MWS footprint (see, e.g., Mateu 2023).
Download figure:
Standard image High-resolution image

The MWS selection function is designed to be inclusive, minimally biased, and amenable to forward modeling. The main MWS sample will focus on large-scale spatial and kinematic structures up to 150 kpc from the Sun, using three target categories, main-blue, main-red, and main-broad, which in combination cover the full color–magnitude space within 16 < r < 19 (see Section 4.2). main-blue randomly samples all point sources in this magnitude range with blue optical colors (g − r < 0.7). It will be dominated by metal-poor main-sequence turnoff stars in the thick disk and inner halo, and will also contain more distant horizontal branch stars. We expect these targets to have a spectroscopic completeness of ≈30% and to comprise 56% of the main MWS sample. main-red (≈30% complete, 12% of the main sample) applies Gaia proper-motion and parallax criteria to sources with redder colors (g − r > 0.7) to boost the probability of observing distant halo giants. Sources with g − r > 0.7 that do not meet those astrometric criteria, mostly redder main-sequence stars in the thin disk, are targeted at lower priority in the main-broad category (≈20% complete, 32% of the main sample). Based on survey forecasts (see Section 5), we expect to observe ≈6.6 million unique targets in the MWS main sample (90% of all MWS spectra). These data will constrain the star formation history and chemical and dynamical evolution of the Galactic thick disk and stellar halo, and allow us to identify the low-contrast remnants of ancient dwarf galaxy accretion events. Radial velocities from MWS combined with Gaia astrometry will constrain the three-dimensional distribution of dark matter in the Galaxy.

As part of the primary MWS, the generously selected main sample will be supplemented by several much smaller but highly complete samples of rare stellar types with very low density on the sky. MWS will target a near-complete sample of white dwarfs to the Gaia magnitude limit in order to obtain an independent measurement of the star formation history of the disk and halo populations. It will also collect a highly complete sample of stars in the Gaia catalog within 100 pc, to investigate the fundamental properties of low-mass stars and measure the stellar initial mass function. Blue horizontal branch stars (BHBs) and RR Lyrae variables will be prioritized for their use as tracers of the distant metal-poor halo. In addition to these primary samples, the DESI surveys will allocate a small number of fibers to specialized secondary science programs, some of which will focus on stars. The survey will also operate a backup program for poor observing conditions, which we expect to yield spectra for several million stars brighter than those in the primary MWS sample. The secondary and backup programs will be described in separate publications; here we provide only brief summaries of their relevance to the goals of MWS.

This paper presents the context (Section 2) and scientific motivation of MWS (Section 3), the target selection scheme (Section 4), and the survey strategy (Section 5), including forecasts for the final size of each primary sample. In Section 6 we describe the core components of our spectroscopic analysis pipeline. From 2019 November to 2021 May, DESI carried out a survey validation (SV) campaign consisting of three subprograms (SV1, SV2, and SV3). The SV1 and SV3 data sets each contain spectra for ∼200,000 unique stars, covering a superset of the MWS selection function. These observations will be the basis for the first DESI MWS data release. They include observations of calibration fields, such as open and globular clusters, and an ≈100 deg² high-completeness minisurvey representative of the northern Galactic cap. We give an overview of these data sets in Section 7 and use them to demonstrate that our survey design and analysis pipeline meet the requirements set by the ambitious science goals of MWS. We summarize in Section 8.

Machine-readable tables of the data shown in the figures of this paper are available at 10.5281/zenodo.7013864. The complete sets of targets selected for the main MWS and the DESI SV programs are available at https://data.desi.lbl.gov/public/ets/target/ as described by Myers et al.(2023).

2. DESI MWS in Context

DESI is the first on-sky instrument among a new generation of multiobject survey spectrographs with a 4 m aperture, high fiber density over a wide field of view, and rapid fiber positioning. These advances allow a new approach to large-scale stellar spectroscopy, which seeks to minimize selection biases and to provide uniformly high spectroscopic completeness over a large fraction of the sky. As we describe below, even subject to operational constraints imposed by the DESI cosmological programs, MWS expects to assemble an effectively flux-limited sample of ≃7 million stellar spectra to a magnitude limit r ≲ 19 over 5 yr. In comparison to existing surveys, DESI MWS will provide a much higher density of faint stars to search for substructures and to probe the kinematics and chemistry of the outer thin disk, thick disk, tidal streams, and diffuse stellar halo.

Table 1 lists the parameters of other recent spectroscopic surveys of >10⁵ stars with spectral resolution comparable to or greater than that of DESI. The closest existing counterparts to DESI MWS in scientific scope and spectral resolution (R ≡ λ/FWHM ≃ 2000) are the Sloan Digital Sky Survey (SDSS) optical surveys (York et al. 2000; Eisenstein et al. 2011; Blanton et al. 2017) and the LAMOST survey (Cui et al. 2012).

Table 1. Summary of Completed and Ongoing Stellar Spectroscopic Surveys with ≳10⁵ Targets, Discussed in Section 2

Survey	N_star/10⁶	Mag. Range	R	λ (Å)	Release	Reference
DESI MWS	7.2^a	16 < r < 20	2500–5000	3600–9900	⋯	⋯
SEGUE I	0.2	16 < r < 20	1850–2200	3800–9200	DR17	Yanny et al. (2009)
SEGUE II	0.1	16 < r < 20	1850–2200	3800–9200	DR17	Rockosi et al. (2022)
LAMOST LRS	1.8^b	16 < G < 18^c	≈1800	3700–9000	DR8	Luo et al. (2015)
	5.0^d	10 < G < 16	≈1800	3700–9000	DR8	Luo et al. (2015)

LAMOST MRS	1.1	10 < G < 15	≈7500	4900–6800	DR8	Luo et al. (2015)
RAVE	0.5	9 < I < 12	7500	8410–8795	DR6	Steinmetz et al. (2020b)
Gaia RVS	33^e	G < 14	11,500	8450–8720	DR3	Katz et al. (2022)
Gaia-ESO	0.1	17 < r < 18	20,000	3700–9500	DR5	Gilmore et al. (2012)
GALAH	0.6	9 < V < 14	20,000–50,000	4718–7890^{^f}	DR3	Buder et al. (2021)
APOGEE	0.7	10 < G < 17	22,500	15,140–16,960	DR17	Majewski et al. (2017)
H3	0.3	15 < r < 18	32,000	5150–5300	⋯	Conroy et al. (2019)

Notes. Columns give the number of targets (either in the latest data release or from forecasts) and the approximate magnitude range, spectral resolution R, and wavelength coverage. The rightmost columns give the most recent data release (except where noted, N_star is the number of unique stars in that release) and the reference for each survey. For LAMOST, we count the unique Gaia source IDs with spectral class STAR and separate the low-resolution survey (LRS) into two rows, for stars in the MWS magnitude range and for brighter stars. The table is divided into low-resolution and medium/high-resolution surveys.

^aForecast.^b1.2M at ∣b∣ > 20, of which 0.6M have stellar parameters.^c69,233 with G > 18.^d2.9M at ∣b∣ > 20, of which 2.3M have stellar parameters.^e5.6M have stellar parameters (Recio-Blanco et al. 2022).^fFour bands of width 200 Å.

Download table as: ASCII Typeset image

The primary SDSS low-resolution stellar surveys were SEGUE-1 (Yanny et al. 2009) and SEGUE-2 (Rockosi et al. 2022), which together observed ≃300,000 stars. Stars were also observed as calibration targets and serendipitously by other SDSS programs, including ∼380,000 stars by the BOSS/eBOSS cosmological surveys (Dawson et al. 2013). The SEGUE surveys each covered ≲1500 deg² in total, with individual fields distributed over the SDSS imaging footprint. The SEGUE-2 target selection focused on the distant stellar halo, with similar scientific goals to MWS. The most significant differences of MWS from SEGUE are its contiguous sky coverage, much larger number of spectra, and broader, simpler selection function.

LAMOST (Cui et al. 2012; Luo et al. 2015) is an ongoing survey covering 17,000 deg² in the northern hemisphere (0° < δ < 60°) including the Galactic anticenter. To date, LAMOST has obtained more than 11 million R ≈ 1800 stellar spectra in its LRS, primarily for stars brighter than r ∼ 18. The most recent public data release (DR8 v2) contains 10.3 million LRS stellar spectra. Stellar parameters have been derived for 6.7 million of these, corresponding to ≈4.8 million unique sources in the Gaia catalog. LAMOST also includes a medium-resolution survey (≈1.1M stars in LAMOST DR8). In comparison to LAMOST, MWS will provide many more spectra for fainter stars in the outer thick disk and stellar halo. Of the 4.8 million unique stars with stellar parameters in LAMOST LRS (DR8), only ≈580,000 are at Galactic latitudes ∣b∣ > 20° and fainter than Gaia G = 16.

High-resolution Milky Way surveys, such as Gaia-ESO (Gilmore et al. 2012), GALAH (De Silva et al. 2015), RAVE (Steinmetz et al. 2020b), APOGEE (Zasowski et al. 2013; Majewski et al. 2017), and the ongoing H3 survey (Conroy et al. 2019), have collected spectra for substantial samples of bright stars (≲1 million) with R ≃ 20,000–50,000. APOGEE has observed ≈400,000 stars at ∣b∣ > 20°, of which ∼95% are brighter than G = 17. Although APOGEE observations have concentrated on the Galactic bulge and disk, a number of fields have been observed in the halo. Some of these fields targeted have known halo substructures, including the Sagittarius stream (e.g., Hasselquist et al. 2019), and include target selections that overlap in distance with MWS. The brightest MWS targets observed with the APOGEE halo data set will be useful for calibrating MWS parameter and abundance measurements.

The H3 survey is using MMT Hectochelle (Szentgyorgyi et al. 2011) to observe 300,000 high-resolution optical spectra (R = 32,000) in the northern hemisphere sky at ∣b∣ > 20° and δ > −20°, sparsely sampling 15,000 deg². H3 targets a magnitude range similar to that of MWS, and has similar science goals focused on the stellar halo. Like the MWS selection function, the H3 selection function is close to magnitude-limited, with a weak parallax selection, and priority given to sparsely distributed halo giant candidates, BHBs, and RR Lyrae variables. In many respects H3 therefore provides a complementary (slightly shallower) high-resolution counterpart to MWS.

Radial velocity surveys with intermediate resolution, such as RAVE (Steinmetz 2003), have also focused on brighter targets than those in MWS. By far the largest of these is the Gaia Radial Velocity Spectrometer survey (RVS; Cropper et al. 2018). Both RAVE and Gaia RVS use a narrow spectral window around the Ca ii triplet (8400 Å ≲ λ ≲ 8800 Å). Despite this limited wavelength coverage, multiple individual abundances have been recovered for the majority of the 0.5 million R ≈ 7500 RAVE spectra, including measurements of [Fe/H], to an accuracy of ≈0.2 dex (Boeche et al. 2011; Steinmetz et al. 2020a). Recio-Blanco et al. (2022), using an extension of the same core pipeline, report abundances for ∼5 million Gaia RVS spectra with G < 14. MWS will provide both radial velocities and chemical abundance information, based on measurements over a wider range of wavelengths, for a significant fraction of much fainter stars, including many with Gaia astrometric measurements but no useful Gaia RVS data.

It has previously been shown that multiple elemental abundances can be extracted from low-resolution spectra (e.g., Fernández-Alvar et al. 2015; Ting et al. 2017; Xiang et al. 2019). These approaches can be further developed with DESI, which, compared to SDSS and LAMOST, has better sensitivity at λ ≲ 4000 Å and slightly higher resolution. Precise spectrophotometric distances can also be estimated based on spectra at the resolution of DESI, to better than 10% at signal-to-noise ratios (S/N) > 50 (Hogg et al. 2019; Xiang et al. 2021) and to better than 20% at S/N > 20. Furthermore, DESI's broad optical wavelength range will provide useful information about stellar ages for red giants inferred through mass estimates from C and N abundances (Masseron & Gilmore 2015; Martig et al. 2016; Ness et al. 2016; Sanders & Das 2018; Shetrone et al. 2019).

3. Science Goals of DESI MWS

DESI MWS will assemble an extremely large sample of radial velocities and chemical abundances, predominantly for distant stars at high Galactic latitude. In this section we review how these data, combined with spectrophotometric distance estimates and Gaia astrometry, will advance understanding of the dark and luminous structure of the Milky Way, the history of stellar mass growth through tidal stripping, in situ star formation, and stellar (and planetary) astrophysics.

3.1. Probing the Dark Matter and Accretion History of the Milky Way

Dark matter constitutes approximately 86% of the gravitating mass in the present-day universe. Galaxies form in dark matter potential wells. While the large-scale distribution of galaxies traces the dark matter distribution on cosmological scales, the internal kinematics of galaxies provides a uniquely powerful laboratory in which to study the small-scale structure of dark matter.

Galaxies build their stellar halos through mergers with satellite galaxies, which may be accompanied by their own globular cluster systems. As dwarf galaxies are tidally disrupted, their stars and globular clusters are pulled out into thin tidal streams. Mergers with lower mass ratios may also generate or enhance a galactic thick disk component. Because dynamical times in the outer halo are long assuming approximate global dynamical equilibrium, the kinematics of halo stars and globular clusters probe the mass distribution and dynamics of the dark matter halo (for recent reviews, see Bland-Hawthorn & Gerhard 2016; Wang et al. 2020, and references therein). The merger history of the Milky Way is preserved in the clustering of stellar debris from past accretion events, and correlations between halo stars in phase space and in abundance space encode the assembly history of the Galaxy (see, e.g., Helmi 2020).

3.1.1. The Shape and Mass of the Dark Matter Halo

Characterizing the dark matter halo of the Milky Way is crucial for understanding our Galaxy in its cosmological context. This requires accurate constraints on its total mass (M_vir) within the virial radius (r_vir), the form of the radial mass density profile, and its three-dimensional shape (spherical, axisymmetric, or triaxial).

ΛCDM cosmological simulations predict that the smooth components of stellar halos have power-law or broken-power-law stellar density profiles (e.g., Cooper et al. 2010; Deason et al. 2013; Amorisco 2017; Font et al. 2020) and predominantly radially anisotropic velocity dispersion profiles (Bullock & Johnston 2005; Abadi et al. 2006; Loebman et al. 2018). Current observations are in broad agreement with these predictions; namely, the Milky Way stellar halo has a "broken" density profile (Deason et al. 2011; Sesar et al. 2011), and the orbits of halo stars, at least in the inner ∼30–50 kpc, are highly eccentric (Deason et al. 2018; Bird et al. 2019; Cunningham et al. 2019; Iorio & Belokurov 2019, 2021; Lancaster et al. 2019; Hattori et al. 2021).

Dark matter halos are believed to depart significantly from spherical symmetry. Cosmological simulations assuming collisionless dark matter produce halos that are triaxial and have an almost constant shape at all radii (Jing & Suto 2000). Simulations that include baryons produce halos that are oblate-axisymmetric within the inner one-third of the virial radius, but become triaxial or prolate at larger radii (Kazantzidis et al. 2004; Debattista et al. 2008; Zemp et al. 2012; Prada et al. 2019). Cosmological simulations with warm dark matter (sterile neutrinos; Bose et al. 2016) and self-interacting dark matter (Peter et al. 2013; Vargya et al. 2022) also predict triaxial dark matter halos, albeit with small but quantifiable differences in the variation of shape with radius.

Estimates of the shape of the Milky Way's dark matter halo from halo field stars, based on samples within ≈30 kpc, imply a nearly spherical inner halo (Wegg et al. 2019; Hattori et al. 2021). Recent attempts to probe the mass and shape of the dark matter halo have been greatly advanced by all-sky Gaia proper-motion measurements for individual stars to G ∼ 20 and averaged proper motions for more distant satellites and globular clusters (e.g., Callingham et al. 2019; Li et al. 2020; Deason et al. 2021). Studies combining Gaia proper motions and radial velocities from the H3 survey suggest that the dark matter halo of the Milky Way may be triaxial or tilted relative to the Milky Way disk (Han et al. 2022), although self-consistent modeling is required to confirm this. More generally, the total virial mass, inner density slope, concentration parameter, and local dark matter density of the halo remain uncertain (Callingham et al. 2019) and depend quite strongly on the assumed properties of the baryonic components (de Salas et al. 2019).

Despite evidence that the Milky Way experienced a significant merger event (the "Gaia Sausage/Enceladus" (GSE); see Section 3.1.3) approximately 8–11 Gyr ago and is currently interacting with the Large Magellanic Cloud (LMC; see below), most estimates of the global mass distribution of dark matter in the Galaxy assume that it is in dynamical equilibrium. In many contexts, this assumption is justified; tests with mock data from cosmological simulations of Milky Way analogs including those with ongoing interactions (e.g., Kafle et al. 2018; Hattori et al. 2021; Rehemtulla et al. 2022) show that both the mass and average flattening of the halo can be estimated with 15%–25% accuracy using data sets that cover large areas of the sky and extend to large distances from the Galactic center.

DESI MWS will greatly improve our understanding of the mass and shape of the dark matter halo by significantly increasing the samples of halo tracers (∼1 million turnoff and subgiant branch stars between 10 and 30 kpc, and ∼10,000 giant stars in the halo beyond 30 kpc; see below) with precise radial velocities (velocity errors ${\sigma }_{{v}_{\mathrm{los}}}\sim 1\mbox{--}20\,\mathrm{km}\,{{\rm{s}}}^{-1}$ ) and spectrophotometric distances. MWS will also obtain spectra for half of all Gaia-detected RR Lyrae variables with brightness 14 mag < G < 19 mag in the survey footprint and increase the sample of known BHBs with radial velocities by a factor of ≈3 (see Sections 4.4.2 and 4.4.4). Radial velocities, in conjunction with better proper-motion measurements from future Gaia data releases and proper motions from the Roman Observatory's High Latitude Survey (WFIRST Astrometry Working Group et al. 2019), will enable us to more fully characterize the dark matter halo of the Milky Way, significantly reducing the current factor of ∼2 uncertainties on the cumulative mass of the Milky Way within ∼100 kpc.

Mapping the interaction with the LMC. It has become clear in the past 15 yr that the Milky Way can no longer be modeled to high accuracy as an isolated galaxy. There is considerable evidence that the ongoing gravitational interaction with the LMC may have distorted the halo and displaced the stellar component of the Galaxy away from the center of the dark matter potential. Hubble Space Telescope proper-motion measurements of the LMC have been used to show that the LMC is on its first infall and has only recently passed its pericenter (Besla et al. 2007). Furthermore, the LMC is much more massive than previously believed, ∼2 × 10¹¹ M_⊙ (Besla et al. 2010), massive enough to cause a significant reflex motion of ∼40–60 km s⁻¹ of the Milky Way disk (Gomez et al. 2015).

The dynamical response of the Galactic stellar and dark matter halo to the LMC has two primary components. First, a classical dynamical friction wake is produced behind the LMC in the Galactic southern hemisphere as it orbits the Milky Way, accompanied by a density enhancement in the Galactic northern hemisphere, referred to as the "collective response." Second, the fact that both the LMC and the Milky Way orbit their common barycenter results in a "reflex velocity" of the stellar halo relative to the Milky Way disk. This was predicted to appear as a dipole in the velocity distribution of halo stars (Garavito-Camargo et al. 2019) and has been detected (Petersen & Peñarrubia 2020). More recently, Conroy et al. (2021) have used star counts from Gaia to measure the stellar overdensity due to the dynamical friction wake and collective response. Both of these overdensities have been observed in nearly the same locations predicted by N-body simulations. Although the wake is in the Galactic southern hemisphere and outside the DESI footprint, the "collective response" in the Galactic north lies partially within the footprint of the main survey and will also be covered by the DESI Backup Program. A velocity accuracy for DESI MWS of 1–20 km s⁻¹ will be sufficient to yield constraints on the predicted dark matter wake and large-scale motions in the stellar halo associated with the ongoing gravitational interaction between the Milky Way and the LMC.

3.1.2. Small-scale Substructure in the Dark Matter Halo

The distribution of dark matter on small scales, including the subhalo mass function and the central density profiles of dwarf galaxies, is an extremely important probe of the nature of dark matter (e.g., Zavala & Frenk 2019). MWS can constrain these properties in the Milky Way and Local Group through observations of tidal streams and dwarf galaxies.

Tidal streams. Kinematically cold structures like stellar tidal streams are ideal for probing small-scale substructures in the Milky Way's dark matter halo. Interactions between cold streams and dark matter subhalos can produce stream gaps, overdensities, and characteristic off-stream structures ("S-shaped" kinks and "spurs"; Siegal-Gaskins & Valluri 2008; Carlberg 2013; Erkal & Belokurov 2015a), as well as perturbations to the line-of-sight velocities and proper motions of stars near the gaps (Erkal & Belokurov 2015b). The velocities of stars near gaps in cold, thin globular cluster streams (such as the kink, spur, and gaps observed in GD 1, or the ATLAS and Aliqa Uma streams; e.g., Price-Whelan & Bonaca 2018; Li et al. 2021) will constrain the clumpiness of the dark matter distribution encountered by the stream (e.g., Carlberg 2012; Erkal et al. 2016; Bovy et al. 2017; Bonaca et al. 2019). In addition, it has been recently shown from simulations that the secular tidal evolution of accreted globular cluster streams can be used to probe the density profiles of the parent dwarf galaxies from which the clusters were accreted by the Milky Way (Malhan et al. 2021). Such secular tidal evolution produces broader "cocoon" components around thin streams (Carlberg 2018) and increases the velocity dispersion over large swaths of the stream (Malhan et al. 2021). DESI spectra will provide the stellar chemical abundances that are crucial for determining stream membership and will enable discrimination between localized perturbations from subhalos and secular tidal evolution. Furthermore, the chemical abundance and radial velocity gradients along dwarf galaxy streams will constrain the rate of stellar mass loss, providing further clues to their dark matter distributions (Errani et al. 2015).

Many known tidal streams are covered by the MWS footprint and target selection criteria (e.g., Sagittarius, GD 1, Palomar 5, and the Orphan stream), including 43 of the 73 streams and other halo substructures in the galstreams catalog⁵⁵ (Mateu 2017; Mateu et al. 2018) and tens of more recently discovered Gaia streams (e.g., Ibata et al. 2019; Mateu 2023). For each of these, we expect MWS to identify on the order of ∼10–100 member stars per stream. For example, based on the selection criteria and fiber assignment forecasts described below, we expect to obtain spectra for ∼600 candidate members of the GD 1 stream (satisfying a broad cut in position and proper motion), of which ∼10%–30% may be true members. This number may be even greater for the Sagittarius stream: we expect to obtain spectra for ∼6600 of the ∼33,600 candidate stream members identified by Antoja et al. (2020) in our footprint with magnitudes 15 < G < 20, and a further ∼1800 Sagittarius RR Lyrae variables from the catalog of Ramos et al. (2020) in the same magnitude range. For comparison, Vasiliev et al. (2021) find that 4465 stars in their catalog of 55,192 probable Sagittarius members have existing radial velocities. We expect to observe ≈2800 of the 5600 Vasiliev et al. candidate members in the MWS footprint, of which 306 have existing radial velocities. Of course, MWS may also observe members of Sagittarius, GD 1, and other streams that are not identified in existing catalogs. MWS radial velocities, chemical abundances, and spectrophotometric distances for candidate stream members, in conjunction with Gaia proper motions, will enable us to better assess membership and chemical homogeneity (or the opposite) within streams, and to search for yet unknown phase-space substructures.

Since tidal streams approximately trace the orbits of their progenitors, they have been widely used for measuring the shape of the Milky Way's dark matter halo (Johnston et al. 1999, 2005; Helmi 2004; Fellhauer et al. 2006; Koposov et al. 2010; Law & Majewski 2010; Sanders & Binney 2013; Bovy 2014, 2016; Gibbons et al. 2014; Bowden et al. 2015; Küpper et al. 2015; Malhan & Ibata 2019; Vasiliev et al. 2021). A high-precision model of the Sagittarius stream (Vasiliev et al. 2021) has been derived from Gaia proper motions (Antoja et al. 2020), following the realization that the LMC gives rise to a significant reflex motion of the Milky Way's center of mass (Gomez et al. 2015). This nonequilibrium model implies a radially varying and time-dependent shape for the Milky Way halo. However, even this model relies on a limited sample of stars with radial velocities. MWS will provide six-dimensional kinematic information from multiple stellar streams, which can be jointly used to constrain the Milky Way's dark halo density profile to even greater precision (Bonaca & Hogg 2018). Furthermore, some of these streams can be used to probe the mass of the LMC at large radii. Erkal et al. (2018) first argued that the LMC could induce a substantial proper motion perpendicular to the track of the stream on the sky, and this offset could be used to measure the mass of the LMC. Such a misalignment of proper motions with the stream track has been found in the Orphan–Chenab stream (Koposov et al. 2019) and used to constrain the LMC and Milky Way potential simultaneously (Erkal et al. 2019). A similar analysis has been carried out for the Sagittarius stream (Vasiliev et al. 2021). Recently, Shipp et al. (2021) extended this approach to five stellar streams with proper motions measured by Gaia EDR3 and radial velocities measured by the Southern Stellar Stream Spectroscopic Survey (S⁵; Li et al. 2019, 2022). Using this six-dimensional kinematic information for an ensemble of streams, they found a mass of the LMC ranging within ∼1.4–1.9 × 10¹¹ M_⊙.

Satellite dwarf galaxies. The satellite galaxies of the Milky Way provide an additional testing ground for the nature of dark matter on small scales (Simon & Geha 2007; Nadler et al. 2021). The particle physics governing dark matter may have observable effects on the luminosity function of satellite galaxies, the density profiles of their dark matter halos (cusps versus cores), and the production of energetic Standard Model particles through annihilation or decay. Satellites also provide a window into the formation of the oldest and least massive galaxies, inaccessible to direct observation (Kirby et al. 2008; Bland-Hawthorn et al. 2015). Many open questions remain concerning the structure, stellar populations, and star formation histories of these galaxies, the answers to which would shed light on their past interactions with the Milky Way and complement surveys of the diffuse stellar halo.

Although significant resources have been invested to discover and characterize dwarf galaxies, very few spectroscopic observations have been taken in their outskirts, mainly due to the limited field of view of existing spectroscopic facilities. Recent work suggests at least some of these galaxies may be surrounded by a significant low surface brightness structure, which simulations suggest may occur even for galaxies that are not presently losing significant mass through tidal stripping (e.g., Wang et al. 2017). For example, Chiti et al. (2020) used deep SkyMapper UV photometry to find candidate members of the ultrafaint dwarf galaxy Tucana II out to 7 half-light radii. Spectroscopic follow-up is now required to confirm their association, and more generally to address the prevalence and properties of similar features in other satellite galaxies.

The DESI MWS footprint includes 31 known dwarf spheroidal and ultrafaint dwarf galaxies (according to McConnachie 2012; see Figure 1). Sixteen of these have a distance modulus smaller than 20 and hence, potentially, red giant branch (RGB) stars within the MWS main survey selection. The primary MWS target selection categories will therefore provide a sparse but extremely wide-field sampling of potential members in the outskirts of these galaxies, most of which have half-light radii well within the 3 fdg 1 field of view of DESI. In addition, a DESI secondary program will allocate fibers at higher priority to potential members (selected in a broad window of proper-motion and color–magnitude space) around dwarf galaxies in the DESI footprint, in both the bright- and dark-time DESI surveys. The confirmation of distant members will enable the study of large-scale metallicity gradients and tidal effects.

3.1.3. The Assembly History of the Milky Way Halo

The accretion and star formation histories of the Milky Way are encoded in its stellar populations. The Milky Way was formed through the accumulation of stars, gas, and dark matter through mergers, which continues to the present. The stellar halo and thick disk still retain kinematic signatures of this process, which MWS seeks to recover through measurements of the line-of-sight velocity, metallicity, and α-abundance, in combination with Gaia proper motions. DESI also has the potential to measure individual elemental abundances including C, Mg, Ca, and Fe abundances. These measurements will enable the identification of fossil remnants of the assembly process in a multidimensional phase space, including the relating of tidally stripped halo stars to their parent objects (i.e., dwarf galaxies and globular clusters).

Our view of the stellar halo has changed dramatically since the first data releases of the Gaia mission. It is now understood that the inner halo (within 20–30 kpc) is dominated by one massive dwarf progenitor, namely the GSE (Belokurov et al. 2018; Helmi et al. 2018). This major event in the Milky Way's assembly history was predicted from the observed halo star counts by Deason et al. (2013), who argued that the "break" in the stellar halo density profile (at 20–30 kpc) signifies the apocenter of a massive accretion event. Our view of the (inner) halo is thus intricately linked with the properties of the GSE event. The large increase in stellar halo tracers provided by DESI in the inner region of the halo dominated by the GSE will be essential to quantifying the details of this accretion event and the properties of the progenitor dwarf galaxy. For example, the two-dimensional phase space of Galactocentric distance and radial velocity can be used to identify shells (overdensities around common orbital apocenters) linked to the GSE. These in turn can be used to constrain the orbit and accretion time of the progenitor, while the metallicity distribution function (MDF) and chemical abundances can be used to infer its stellar mass and star formation history.

The DESI footprint covers many well-known Milky Way globular clusters. Finding and characterizing tidal features around globular clusters is crucial to understanding their orbits and mass-loss histories. Estimates of the fraction of mass contributed to the Milky Way stellar halo by disrupted globular clusters range from 50% (Martell & Grebel 2010; Martell et al. 2011) to negligible (e.g., Deason et al. 2015). Significant mass loss (≥90%) from globular clusters has been invoked to explain the presence of multiple stellar populations within them (see, e.g., Bastian & Lardo 2018, and references therein); however, such extreme mass loss requires globular clusters to be born with very much larger effective radii than currently observed.

Several imaging studies, especially those carried out in combination with Gaia proper motions, have led to the discovery of tidal tails associated with a few globular clusters (e.g., Shipp et al.2020; Sollima 2020). At the same time, the lack of convincing signs of tails around other Milky Way globular clusters (e.g., Leon et al. 2000; Kuzma et al. 2016) has led to speculation that they may be embedded in their own dark matter halos (e.g., Peñarrubia et al. 2017; Carlberg & Grillmair 2021), which prevents stars from escaping at the expected rate (Moore 1996). However, it is likely that the extremely low surface brightness of these features limits the discovery potential of imaging alone. DESI's spectroscopic observations can complement these results by identifying stripped stars through kinematic and chemical tagging. This will allow a better characterization of the tidal features themselves and the frequency with which they occur around globular clusters.

More generally, DESI MWS radial velocities and stellar abundances, in conjunction with Gaia proper motions, will enable us to determine what fraction of the halo is made up of merger or tidally stripped remnants, and whether there exists a smooth, isotropic component that is chemically and dynamically old. As demonstrated by Belokurov & Kravtsov (2022) using APOGEE abundances, such data can also constrain the properties, and hence the origin, of an "in situ" stellar halo component. Distinguishing between the wide range of processes that produce in situ stellar halos in cosmological simulations (including early chaotic gas accretion and disk collapse, dynamical heating, and star formation in cooling instabilities or accreted gas; see, e.g., Cooper et al. 2015) would be a significant advance in the understanding of early galaxy formation as a whole, and would address a major source of uncertainty in the galactic archeology of the Milky Way and other galaxies.

3.1.4. The Formation History of the Milky Way Thick Disk

The disk of the Milky Way, visible from any dark site, is the feature that defines its morphology. The disk is made up of at least two components of different thickness, and there is evidence that the outer disk may show warps and flares (Djorgovski & Sosin 1989; Drimmel & Spergel 2001; López-Corredoira et al. 2002; Ted Mackereth et al. 2017). Past studies of stars in the Milky Way disk have focused on the inner disk (R ≲ 13 kpc) and generally been restricted to the thin (and young) disk component (e.g., Martig et al. 2014; Aumer et al. 2016; Mackereth et al. 2019; Ting & Rix 2019). DESI MWS will focus primarily on the thick disk.

The thick disk of the Milky Way is believed to be an ancient structure that has been dynamically heated over time. Several scenarios have been proposed for its origin: secular evolution due to scattering with giant molecular clouds (e.g., Aumer et al. 2016; Ting & Rix 2019) or dark matter substructures (Church et al. 2019), in situ formation from cooling of a thick and turbulent interstellar medium (so-called "upside-down growth"; Bird et al. 2013; Grand et al. 2016), heating of a protodisk by a massive merger (e.g., Walker et al. 1996; Velazquez & White 1999), and early scattering by massive clumps (Bournaud et al. 2009; Clarke et al. 2019; Beraldo e Silva et al. 2020). A combination of dynamical and chemical data on thick disk stars is required to distinguish between these various scenarios. In particular, the long dynamical times, especially in the outer disk, result in a persistent memory of past perturbations, either secular or external, which have created lasting Galactic warps and flares (Momany et al. 2006; Minchev et al. 2015; Laporte et al. 2022).

DESI MWS will yield chemodynamical measurements of ∼4 million thick disk stars. The DESI spectral resolution is sufficient to determine the abundance of C, Mg, Ca, and Fe to an internal statistical precision of ≈0.1–0.2 dex for the majority of stars, and of additional elements (e.g., Al, Si, and Cr) for a subset with higher S/N (see Ting et al. 2017; Xiang et al. 2019, 2020; Sandford et al. 2020). Chemical and age estimates (derivable, for giant stars, using C and N features; Masseron & Gilmore 2015; Martig et al. 2016; Ness et al. 2016; Sanders & Das 2018; Ting & Rix 2019; Vincenzo et al. 2021) as a function of the scale height and orbital properties will help constrain the origin of the thick disk and the relative ages of the main disk components (e.g., Belokurov et al. 2020; Beraldo e Silva et al. 2021; Ciucă et al. 2021; Montalbán et al. 2021; Xiang & Rix 2022). MWS will also study open clusters (OCs) in the thick and thin disks. Although most OCs are found at ∣b∣ < 20, several fall within the MWS footprint, including M67 and M44. These well-studied clusters can be used to calibrate abundance measurements and assess systematic differences with other surveys. DESI spectra can, in principle, be used to explore chemical abundance inhomogeneities in these clusters, probing the diffusion of heavy elements in stellar atmospheres (e.g., Souto et al. 2019) and perhaps constraining cluster formation timescales. The broad sky coverage of MWS will enable searches for former cluster members as they disperse through the Galaxy, and will potentially contribute to the discovery and characterization of old OCs at high latitude (e.g., Schmeja et al. 2014; Cantat-Gaudin & Anders 2020).

3.1.5. Primordial Stars in the Milky Way

Primordial Milky Way stars are expected to form with extremely low metal abundance (perhaps zero). Stars that form after the pristine interstellar medium has been metal-enriched by the first supernova are predicted to reflect exotic abundance ratios and can inform models of yields from the first generation of supernovae. For example, dramatically high carbon-to-iron ratios have been found and are interpreted to be the result of fallback of the innermost layers of supernova progenitors (Ishigaki et al. 2014; Nordlander et al. 2019).

Studies of the MDF show that only 1 in 10,000 halo stars has a metallicity [Fe/H] < −4 (Beers & Christlieb 2005; Starkenburg et al. 2017). This, together with the decrease in metallicity in the outer halo, makes large and distant samples a necessity in order to search efficiently for the first-generation low-mass stars that have survived. Surprisingly, a significant fraction of the stars with the lowest iron abundances have their orbits in the Galactic plane (Sestito et al. 2020, 2021).

Most of the extremely metal-poor stars known have been identified from deep surveys of the Galactic halo, such as the Hamburg/ESO Survey (Schörck et al. 2009) and SDSS (Yanny et al. 2009; Rockosi et al. 2022). Although metallicity precision is likely to be poor at [Fe/H] < −4 for spectra at the resolution of DESI, such stars can, at least, be identified efficiently for follow-up observations (e.g., Allende Prieto et al. 2015; Frebel et al. 2015). Given the large sample size of MWS and its focus on faint stars at high latitude, we expect to uncover over 100 new stars with [Fe/H] < −4, which would triple the number of currently known stars in this regime. We also hope to identify new stars with iron abundances [Fe/H] < −7, comparable to the current lowest known measurements (Nordlander et al. 2019). The statistics of the parameters and chemical abundances of these stars will shed light on the first generation of stars, and their supernovae (Ishigaki et al. 2014; Nordlander et al. 2019).

3.2. The DESI White Dwarf Survey

White dwarfs are the final stage of evolution for stars with masses ≲8–10 M_⊙ (Iben et al. 1997; Dobbie et al. 2006), a destiny that the majority of A/F-type stars in the Milky Way have already reached. As such, white dwarfs play a central role across a variety of areas in astrophysics. These dense stellar remnants are chemically stratified with atmospheric compositions dominated by hydrogen and/or helium (e.g., Eisenstein et al. 2006; Giammichele et al. 2012), although ≃20% of white dwarfs display spectroscopic peculiarities (see Figure 2 for examples). Spectroscopy spanning the full optical range is therefore critically important for the study of white dwarfs.

**Figure 2.** Various types of white dwarf systems observed by DESI. Most white dwarfs have atmospheres dominated by hydrogen or helium, and their spectra contain only Balmer (DA) or He I (DB) lines. As the white dwarfs cool, they eventually cease to produce these lines, resulting in featureless spectra (DC). However, ≃20% of white dwarfs exhibit spectroscopic peculiarities. Accretion of planetary material results in photospheric contamination by metals (DAZ, DBZ, and DZ). The presence of carbon in the atmospheres of white dwarfs can indicate the dredge-up of material from the white dwarf core, or, in higher-mass white dwarfs, is an indication of a merger, possibly descending from R Corona Borealis stars (DQ). Up to 10% show magnetic fields (up to 10⁹ MG) via Zeeman splitting across all atmospheric compositions (e.g., DAH), and serve as laboratories for atomic physics under extreme conditions. Finally, white dwarf binaries span a wide range of evolutionary channels. One common type of white dwarf binary is a cataclysmic variable (CV), where a main-sequence companion accretes onto a white dwarf via an accretion disk that can be identified from double-peaked line profiles.
Download figure:
Standard image High-resolution image

Homogeneous samples of white dwarfs with accurate physical parameters are essential for constraining and calibrating stellar evolution theory (Williams et al. 2009; Cummings et al. 2018), internal rotation profiles and loss of angular momentum (Hermes et al. 2017), and fundamental nuclear reaction rates (Kunz et al. 2002), with important implications for stellar population synthesis and galaxy evolution theory (Maraston 1998; Kalirai et al. 2014). Because of their well-constrained cooling ages, white dwarfs provide an insight into the age of the Galactic disk (Winget et al. 1987; Oswalt et al. 1996), OCs (García-Berro et al. 2010), and globular clusters (Hansen et al. 2007), and can even trace variations in the Galactic star formation rate (Tremblay et al. 2014). White dwarfs also indirectly allow the investigation of other areas of astrophysics, such as main-sequence stars in binaries with white dwarfs (Zorotovic et al. 2010; Rebassa-Mansergas et al. 2016; Toonen et al. 2017), planetary systems (Zuckerman et al. 2007; Farihi et al. 2009; Gänsicke et al. 2012; Koester et al. 2014; Hollands et al. 2018; Vanderburg et al. 2020; Guidry et al. 2021), and extreme physics (Guan 2006; Kowalski 2006).

Gentile Fusillo et al. (2019) assembled the first unbiased all-sky magnitude-limited (G ≃ 20) sample of ≃260,000 white dwarf candidates using Gaia DR2. While the Gaia data are sufficient to identify white dwarfs with high confidence, follow-up spectroscopy is required (Figure 2) to determine their physical properties and derive fundamental properties that are necessary to address the science areas outlined above.

The DESI white dwarf survey will target ≃70,000 white dwarfs largely from the Gentile Fusillo et al. (2019) sample, and will roughly double the number of white dwarfs with high-quality spectroscopy in the northern hemisphere (Kleinman et al. 2013; Gänsicke Fusillo et al. 2015). This will be the first large and homogeneous spectroscopic sample that is not subject to complex selection effects, and is therefore ideally suited for detailed statistical analyses of white dwarfs in the context of galactic, stellar, and planetary structures and evolution. The large sample size will also result in the identification of rare white dwarf species, tracing the extremes of parameter space of short-lived phases in their evolution. The combination of accurate Gaia parallaxes and photometry with spectroscopic mass determinations will result in an extremely stringent test of the mass–radius relation of white dwarfs (Tremblay et al. 2017). The DESI spectroscopy will also provide radial velocity measurements, which will complement the Gaia proper motions and produce the first large sample of white dwarfs with full 3D kinematics. With this sample, the thin disk, thick disk, and halo populations can be distinguished (Pauli et al. 2003, 2006; Anguiano et al. 2017), with an expected ≃1% of halo white dwarfs. Furthermore, the kinematics will provide constraints on the age–velocity dispersion relation, and insight into the mass distribution of white dwarfs that formed via binary mergers (Wegg & Phinney 2012).

3.3. The DESI Nearby Star Survey

Prior to Gaia, our knowledge of the solar neighborhood was extremely limited: the RECONS survey (Henry et al. 1994, 2018) provided only a nearly complete view of the stellar population within 10 pc, comprising ≃300 stellar systems, of which about a third are binaries or higher multiples. Thanks to Gaia, the inventory of nearby stars has considerably increased, and is now ≃90% complete for stars down to the hydrogen burning limit, M_G ≃ 15, within 100 pc (Gaia Collaboration et al. 2021b).

MWS will include a highly complete spectroscopic census of stars within 100 pc of the Sun. As described below, these stars will be selected in the DESI footprint based on Gaia parallaxes, in the magnitude range 16 mag ≤ G ≤ 20 mag (i.e., below the faint limit of the Gaia RVS spectrograph). This sample will be heavily dominated by M-dwarfs, which represent the bulk of the Galactic stellar mass. Much of the current framework for the star formation history of the Milky Way disk and the low-mass end of the stellar initial mass function is based on smaller and brighter analogs of this sample (Freeman & Bland-Hawthorn 2002; Nordström et al. 2004; Henry et al. 2018). The MWS 100 pc sample will provide the most complete census of the kinematics, chemical evolution, and initial mass function in the extended solar neighborhood to date. DESI will cover two major indicators of stellar activity, Hα emission and the Ca ii H and K doublet, and the 100 pc MWS sample will provide detailed constraints on the fraction of active M-dwarfs; on empirical relations between magnetic activity, rotation rate (which can be determined from, e.g., Zwicky Transient Facility light curves), and age (Skumanich 1972; Pizzolato et al. 2003; Barnes 2007; Mamajek & Hillenbrand 2008); and on the role of multiplicity in magnetic activity (notably for older stars; Newton et al. 2016; Stauffer et al. 2018). Correlations between kinematics (i.e., tangential velocity, velocity dispersion, or vertical action) and stellar activity (Gizis et al. 2000; Schmidt et al. 2007; Kiman et al. 2019; Angus et al. 2020) can also be explored with this sample.

Our knowledge of the coldest (late T and Y type) brown dwarfs (350 K ≲ T_eff ≲ 500 K) is similarly restricted to this volume (Kirkpatrick et al. 2019; Kota et al. 2022). In addition to the Gaia 100 pc sample, a DESI secondary program will target fast-moving high-proper-motion stars identified using combinations of multi-epoch data from SDSS, the NOIRLab Source Catalog (Nidever et al. 2021), and the Wide-field Infrared Survey Explorer (WISE; e.g., Meisner et al. 2021). These data will sample the faint end of the brown dwarf luminosity function, and provide better constraints on the low-mass end of the initial mass function.

3.4. Survey Requirements

The scientific goals above impose common requirements on the sample of stars to be observed, and on the performance of DESI and its pipelines, particularly on the accuracy of key measurements (stellar parameters, radial velocities, and abundances) on the final data set. The rest of this paper presents the design of the survey and our first evaluation of our pipeline against those requirements.

In the next section, we describe our target selection procedure and forecast the size and content of the full sample. We determine the completeness of each target class by applying the DESI fiber assignment algorithm, accounting for the prioritization of targets from BGS. For the bulk of our targets the completeness is ≈30% averaged over the footprint. Using a simple mock catalog based on an empirical model of the Galactic structure, we demonstrate that our target selection recovers tracers with sufficient density and distance coverage to address the core dark matter and stellar halo structure science goals. In future work, we will apply our target selection to mock catalogs derived from ab initio cosmological simulations, to better understand the effects of halo-to-halo variation and substructures.

We then introduce our analysis pipelines and use data from early DESI observations to demonstrate that we can recover radial velocities and [Fe/H] to the necessary precision of 1 km s⁻¹ and ≈0.2 dex, respectively. We identify systematic offsets with respect to other surveys and literature [Fe/H] measurements for globular clusters and dwarf galaxies. We also demonstrate that metal pollution signatures of planetary systems can be identified in DESI spectra of white dwarfs.

We identify priorities for development of the current pipelines, which we expect to include [α/Fe] measurements, more robust fitting of very cool stars, and potential improvements to radial velocity accuracy below 1 km s⁻¹. These improvements will be addressed in future publications, along with more detailed studies of the white dwarf and nearby samples, spectrophotometric distance estimates, and the recovery of individual element abundances.

4. Selecting Milky Way Targets for DESI

This section describes the selection criteria for MWS targets. This description updates the summary given in Allende Prieto et al. (2020) and corresponds to the final design of MWS at the start of the main survey in 2021 May. We refer the reader to Myers et al. (2023) for further details of the DESI target selection process, the desitarget software, and associated data products.

4.1. Overview of MWS Target Categories

Primary targets. The MWS main sample is designed to address the dark matter halo, stellar halo, and thick disk science cases described in Sections 3.1.1, 3.1.3, and 3.1.4. It comprises the bulk of the stellar targets that will be observed by DESI MWS, split into three categories,⁵⁶ main-blue, main-red, and main-broad. The union of these categories is essentially a magnitude-limited selection and is therefore well suited to characterizing stellar halo substructures associated with disrupted Milky Way satellites and globular clusters (Section 3.1.2) and to the serendipitous discovery of rare types of stars (such as extremely metal-poor stars; Section 3.1.5).

The main sample selection function is deliberately generous. Any starlike source in the survey footprint with magnitude 16 < r < 19 has a finite probability of being observed in one of the three MWS main target categories (excluding only sources that are masked or have poor data quality in the input imaging). As we describe in detail below, main-blue (defined by the simple color criterion g − r < 0.7) will provide a highly complete sample of metal-poor main-sequence, turnoff, and bluer subgiant stars, with distances in the range 5–30 kpc and no kinematic selection bias. This will be a dense, high-fidelity data set with which to study the thick disk and inner halo over the likely extent of the GSE debris, as well as Sagittarius and many other known streams.

We assign redder stars (g − r > 0.7) in the same 16 < r < 19 magnitude range to the main-red category if Gaia astrometry indicates they are more likely to be distant halo giants, and to the main-broad category otherwise. We give main-red targets the same fiber assignment priority as main-blue targets. We give main-broad targets lower priority in order to select more main-red giant candidates to probe the kinematics and substructure of the outer stellar halo, particularly to provide dynamical constraints on the dark matter halo mass over a large fraction of the virial radius. Of course, not all stars in main-red will be distant giants. The astrometric selection criteria are "mild": as described in more detail below, they accept relatively high contamination from nearby red main-sequence stars as a trade-off to minimize kinematic bias. Based on the current DESI bright-time observing plan, focal plane state, and fiber assignment strategy, we expect main-red spectra to account for ≈12% of the main sample and to have a completeness of ≈32% (similar to that of main-blue). The fraction of true distant halo stars in main-red will be sensitive to the form of the stellar halo density profile at large distances; below, we present a fiducial model of the halo density, which predicts ∼5000–10,000 main-red giants beyond 50 kpc and ∼1000 beyond 100 kpc. Although main-broad targets will have lower spectroscopic completeness than main-red (∼19%), those spectra will still account for ≈32% of the main MWS sample. They will serve both as a constraint on forward models of the MWS data set and as an important scientific sample in their own right, providing radial velocities for thin disk stars without Gaia RVS spectra, as well as more detailed abundance measurements.

In addition to the main sample, MWS includes several high-priority samples of targets with high scientific value and low surface density (≲10 per square degree), comprising white dwarfs (mws-wd, Section 3.2), faint nearby stars (mws-nearby, Section 3.3), RR Lyrae variables (mws-rrlyr), and horizontal branch stars (mws-bhb). These target categories are given higher fiber assignment priority than the main sample to ensure high completeness, as discussed in Section 5. Two further categories, faint-blue and faint-red, target fainter stars in the same color ranges as the main-blue and main-red samples, at lower priority and hence at much lower completeness.

F-type stars in a similar magnitude range to the main MWS sample will be targeted as flux standards across all bright- and dark-time DESI programs.⁵⁷ The algorithm for selecting flux standards is more complex than that for the main-blue sample, of which they are essentially a subset (flux standards can be slightly brighter than the flux limit of the main sample). Their targeting and prioritization is independent of MWS; hence they may also be observed as part of the MWS main sample or as one of the high-priority MWS samples. Main MWS targets will be observed only once by design; they will only be reobserved if no unobserved MWS targets or targets from any other program are available to a fiber. However, stars may be observed as flux standards multiple times in both bright and dark conditions.

We refer to the MWS main, high-priority, and faint samples collectively as the primary MWS target categories. All these targets are selected within the bright-time program footprint, shown in Figure 1. The following subsections give the specific target selection criteria for each category. Most criteria are based on optical photometry from Data Release 9 of the DESI Legacy Imaging Survey (LS DR9; Zou et al. 2017; Dey et al. 2019; D. J. Schlegel et al. 2022, in preparation), which also includes infrared photometry derived from WISE data (Meisner et al. 2017), and is matched to photometry from Gaia DR2 and astrometric measurements from Gaia EDR3 (Gaia Collaboration et al. 2021a).

Table 2 lists the number of targets per category in the bright-time footprint (categories are not mutually exclusive). Approximately 30.5 million unique LS sources meet the primary sample selection criteria. Section 5 describes the relative prioritization among the categories and the resulting fiber assignment efficiencies over four passes of the footprint, which are also listed in Table 2. The complete survey will contain ≃7.2 million spectra of unique stars from the primary MWS categories, of which ≃6.5 million are from the three main sample categories.

Table 2. Summary of Main MWS Target Classes

Class	N_footprint	N_fiber	%_{4 pass}
main-red	2,589,211	805,794	31
main-blue	11,450,969	3,693,518	32
main-broad	10,783,830	2,077,222	19
mws-wd	66,811	66,365	99
mws-nearby	33,788	20,892	62
mws-rrlyr	17,303	8981	52
mws-bhb	32,353	17,706	55
faint-red	960,134	71,771	7
faint-blue	4,606,314	486,568	11

Total (unique)	30,470,033	7,197,722	24

Notes. From left to right, the columns give the name of the class in the mws_mask bitmask, the total number of DR9 targets in the class available to a DESI fiber, and the total number and fraction of fibers assigned to those targets in a fiducial four-pass survey (with the footprint and focal plane state defined in 2022 March; see text). Targets may be counted in more than one class; the final row gives the total number of unique targets.

Download table as: ASCII Typeset image

Secondary targets. As noted in Section 1, the DESI survey also defines a number of secondary science programs, each of which corresponds to one or more secondary target classes.⁵⁸ The secondary programs will use spare fibers or dedicated observations of specific areas (not necessarily within the main survey footprint) to complement the goals of the primary dark- and bright-time DESI surveys. Several secondary programs are complementary to MWS, including observations of fainter BHBs and white dwarf binary candidates using dark time, higher prioritization for proper-motion and color outliers, surveys of M31 and M33 (Dey et al. 2023) , and high-completeness observations of selected Milky Way dwarf satellites and globular clusters. These secondary programs will be described separately. The targets for these secondary programs may overlap with primary MWS targets. Myers et al. (2023) provide a more detailed explanation of the primary and secondary target categories, how they can be identified in DESI target catalogs, and the implications of targets being selected by multiple programs.

Backup targets. Finally, in poor weather conditions and bright skies (including twilight), the DESI backup program will target stars brighter than those in the MWS main sample. The backup program will cover a larger footprint, extending to lower Galactic latitude (∣b∣ ≳ 7°). Backup targets will be prioritized in three categories by magnitude and color, to favor stars for which high S/N can be achieved in poor conditions, with two additional categories to favor brighter halo giants.⁵⁹ This sample will be suited to detailed chemodynamical modeling of the Galactic disk and nearby stellar halo. The total number of spectra obtained by the backup program will depend on observing conditions. Extrapolating from the first months of the main survey, we expect the backup program will obtain at least five million additional stellar spectra. A full description of the backup program will be given in a separate publication.

4.2. Main Sample Selection

The MWS main sample is selected from the LS DR9 source catalog combined with astrometric measurements from Gaia EDR3. It is divided into three subcategories: main-blue, main-red, and main-broad. All three categories share the following definition of an MWS main sample stellar target based on fields in the LS DR9 catalog:⁶⁰

1.
16 < r < 19
2.
r_obs < 20
3.
type = PSF
4.
gaia_astrometric_excess_noise < 3
5.
gaia_duplicated_source = False
6.
brick_primary = True
7.
nobs_{g,r} > 0
8.
{g,r}_flux > 0
9.
fracmasked_{g,r} < 0.5

The observed r-band magnitude is obtained from the LS flux in nanomaggies as ${r}_{\mathrm{obs}}=22.5-2.5{\mathrm{log}}_{10}$ r_flux. The extinction-corrected magnitude is computed as $r={r}_{\mathrm{obs}}\,+2.5{\mathrm{log}}_{10}$ mws_transmission_r. We do not put any requirements on the availability or quality of data in the LS z band.

The main-blue subsample is further defined by the color criterion

g − r < 0.7

with g defined in the same way as r above. main-blue is therefore an approximately magnitude-limited selection, comprised mainly of main-sequence turnoff stars and bluer subgiants. Stars redward of the g − r = 0.7 cut include redder main-sequence stars, subgiants, and giants. To improve the sampling of the more distant galaxy, higher priority is given to main-red targets, which have astrometric cuts designed to reduce nearby main-sequence contaminants. However, Gaia astrometry becomes less precise near the faint limit of MWS, so the lower-priority main-broad target category loosens these astrometric cuts. More specifically, the main-red subsample is defined by the following additional criteria:

1.
g − r ≥ 0.7
2.
astrometric_params_solved ≥ 31
3.
Gaia parallax π < 3σ_π + 0.3 mas
4.
Gaia proper motion $| \mu | \lt 5\sqrt{{f}_{r}/{f}_{19}}\,\mathrm{mas}\,{\mathrm{yr}}^{-1}$ $\left({f}_{x}={10}^{(22.5-x)/2.5}\right)$

The main-broad sample comprises stars with color g − r ≥ 0.7 that satisfy the same magnitude and data quality requirements as the other two categories, but do not satisfy one or more of the astrometric criteria that define main-red (stars that do not have well-measured astrometric parameters in the Gaia catalog are therefore included in main-broad).

The parallax cut applied to main-red with respect to main-broad aims to remove nearby red contaminants in order to recover a larger sample of distant halo giants. However, because Gaia parallax accuracy is not good enough at the faint limit of the survey, r = 19, we also apply a proper-motion cut to reject stars with high tangential velocities that are more likely to be nearby contaminants. The proper-motion cut is made to be a function of the r-band magnitude, corresponding to 5 mas yr⁻¹ at r = 19 and ∼20 mas yr⁻¹ at r = 16. This corresponds to different limits on tangential velocity, depending on the absolute magnitude of the target. For example, for a 10 Gyr old stellar population with [Fe/H] < −0.5, the faintest stars on the giant branch with g − r > 0.7 will have M_r ≲ 1; thus the proper-motion cut will select all stars with tangential velocities below 950 km s⁻¹. For stars with fainter M_r, for example the subgiant branch stars of a 10 Gyr old stellar population with [Fe/H] = 0 that have M_r ∼ 3.5, the tangential velocity cut will correspond to ∼300 km s⁻¹.

The lower probability of observing stars with high tangential velocities in the main-red sample at distances ≲20 kpc does not compromise the science goals of MWS. As shown below, the MWS sample at these distances is overwhelmingly dominated by main-blue targets, which have no astrometric selection bias. It reduces the chance of discovering extremely high-velocity red stars around the base of the giant branch at these intermediate distances, but not to zero, because such objects are still targeted in the main-broad selection.

Finally, the faint-blue and faint-red samples extend the main-blue and main-red classes, respectively, to the fainter magnitude range 19 < r < 20. The selection criteria are otherwise identical to those for the corresponding main classes, except for a slightly fainter limit on the observed r-band magnitude, r_obs < 20.1. These faint categories are intended as targets of last resort and are given the lowest fiber assignment priority of any unobserved target.⁶¹ Although faint-blue and faint-red will be much lower completeness and lower S/N samples, they will provide opportunities for serendipitous discovery and may boost the detection significance of faint substructures.

4.3. Main Sample Targets in the Galaxia Model

Figure 3 shows the color and proper-motion distribution of main-blue, main-red, and main-broad targets⁶² at high Galactic latitude (b > 70°). We compare these distributions to a mock catalog derived from the Galaxia Milky Way model (Sharma et al. 2011), which describes the phase-space distribution of stellar populations in the Milky Way's thin disk, thick disk, and halo. The parameters of the default Galaxia model were calibrated to surveys of relatively bright and nearby stars. More recent studies have improved our knowledge of these parameters, particularly for the stellar halo. The original Galaxia model uses a power-law density profile with a logarithmic slope of −2.44, which is inconsistent with recent evidence that the stellar halo density breaks to a much steeper slope at a distance of ∼25 kpc (e.g., Bell et al. 2008; Deason et al. 2013, 2014). To make more accurate predictions for MWS, we modify the Galaxia model such that the logarithmic slope of the stellar halo density profile⁶³ changes from −2.5 to −4 at a Galactocentric distance of 25 kpc. To turn the output of Galaxia into a mock MWS survey, we convert magnitudes from Galaxia to the LS photometric system via the PS1 color terms given on the LS website, and assign proper-motion and parallax errors using PyGaia.⁶⁴ We then apply exactly the same footprint and target selection functions (except for the LS data quality flags) to obtain mock main-blue, main-red, and main-broad samples.

**Figure 3.** Top panels: Density of MWS main sample targets at high Galactic latitude (b > 70°) in the space of g − r color and proper motion, in our broken-power-law halo Galaxia mock catalog (left) and in the real MWS target catalog (right). The vertical line marks the color separation between the main-blue and main-red/main-broad samples. The horizontal lines show the separation in proper motion between main-red and main-broad at r = 16 (upper dotted line) and r = 19 (lower dashed line). The main-sequence turnoff and giant branch of the globular cluster M15 are visible as density peaks at ${\mathrm{log}}_{10}\,| \mu | /\mathrm{mas}\,{\mathrm{yr}}^{-1}\sim 0.4$ . Bottom panels: The contribution of thick disk (left) and halo stars (right) to the density distribution in the Galaxia mock catalog.
Download figure:
Standard image High-resolution image

The upper panels of Figure 3 show two broad peaks in both the MWS target catalog (top right) and our Galaxia mock catalog (top left). These correspond to the metal-poor halo (bluer and low proper motion, mostly in the main-blue sample) and the thin disk (redder and high proper motion, mostly in the main-broad sample). In the distribution of real main-blue targets, two smaller, concentrated peaks visible at ${\mathrm{log}}_{10}| \mu /\mathrm{mas}\,{\mathrm{yr}}^{-1}| \sim 0.4$ correspond to the main sequence and giant branch of the globular cluster M15. In the Galaxia mock, the lower-density sequence that bridges the disk and halo regions of the diagram corresponds to the thick disk, as shown in the lower left panel, where we isolate the thick disk stars in Galaxia. The lower right panel shows only the halo stars in Galaxia, demonstrating that the three distinct sequences visible in main-blue correspond to the halo main sequence, the metal-poor thick disk main sequence, and the halo giant branch (from blue to red).

Figure 4 illustrates the magnitude-dependent proper-motion cut that divides the g − r > 0.7 sample into main-red and main-broad (this figure includes all targets in the survey footprint, not just those at high latitude). Thick disk stars are included in main-red at r = 16 but almost completely relegated to main-broad at the faint limit of r = 19.

Figure 5 shows the distribution of heliocentric distances in our mock catalog. The mock main-blue targets are dominated by thick disk stars between 1 and 10 kpc. main-blue contains a further ∼500,000 stars between 10 and 20 kpc, which corresponds to the transition between the thick disk and halo in Galaxia. In this distance range the mock MWS sample is dominated by the metal-poor turnoff and subgiant branch. Beyond 20 kpc the mock sample is dominated by the stellar halo. For comparison, the dotted lines in Figure 5 show the number of distant giants predicted by the original Galaxia model (Sharma et al. 2011), which has a much shallower density profile in the outer stellar halo. Our fiducial model predicts MWS will target ∼1000 giants beyond 100 kpc, whereas the original Galaxia model predicts ∼10,000. The right-hand panel of Figure 5 shows that most of the difference is in the main-red and main-broad samples (which include all but the most metal-poor giants at large distances).

Finally, Figure 6 illustrates how halo stars of different types enter the MWS selection function at different distances. For example, the figure shows that, in this MWS mock survey, halo turnoff and subgiant branch stars dominate the sample up to ∼10 kpc. The mock survey targets ∼100,000 horizontal branch stars between 10 and 50 kpc at a volume density of 1–10 kpc^–3, and comparable numbers of RGB and red clump stars in the same region. These are target densities; they can be converted to densities of observed spectra with reference to the fiber assignment simulations described in Section 5. Those simulations show that, in a 5 yr survey sharing the DESI focal plane with BGS, we expect to observe ∼30% of the main-blue and main-red targets (e.g., ∼300 of the ∼1000 giants beyond 80 kpc).

**Figure 6.** The heliocentric distances of MWS main sample stellar halo targets in our Galaxia model variant with a broken-power-law stellar halo density profile. The top panel shows the number of MWS targets beyond a given heliocentric distance. The bottom panel shows the volume density of targets at a given distance. Colors correspond to the eight regions of the ${T}_{\mathrm{eff}}\mbox{--}\mathrm{log}g$ diagram (inset). Dotted lines correspond to main-sequence and turnoff stars, solid lines to subgiants and giants, and dashed lines to horizontal branch stars. The solid black lines correspond to the whole sample.
Download figure:
Standard image High-resolution image

4.4. High-priority Sample Selections

4.4.1. mws-wd

We select white dwarfs using a set of BP − RP color and G-band absolute magnitude criteria based on Gentile Fusillo et al. (2019) that identify the white dwarf cooling sequence in the Gaia photometric catalog alone. This selection is applied to the LS catalog using the properties of cross-matched Gaia EDR3 sources; the LS photometry is not used in this selection. Photometric measurements are taken from Gaia DR2 and astrometric measurements from EDR3.

1.
G_abs > 5
2.
BP − RP < 1.7
3.
G_abs > 5.93 + 5.047(BP − RP)
4.
G_abs > 6(BP − RP)³ − 21.77(BP − RP)² + 27.91(BP − RP) + 0.897

These criteria are applied to a sample defined by extinction-corrected Gaia G flux (gaia_phot_g_mean_mag in the LS catalog) at high latitude, with mild parallax and proper-motion cuts to select against nearby luminous blue stars (early-type main-sequence stars, horizontal branch stars, and subdwarfs) as well as QSOs:

1.
G < 20.0
2.
∣b∣ > 20°
3.
π/σ_π > 1 mas
4.
astrometric_params_solved ≥ 31
5.
∣μ∣ > 2 mas yr⁻¹

We impose the following photometric quality cut, because failing this criterion results in poor astrometry:

phot_bp_rp_excess_factor < 1.7 + 0.06(BP − RP)².

However, we retain objects that have reliable parallaxes and significant proper motions that meet either of the following criteria:

1.
astrometric_sigma5d_max < 1.5
2.
(astrometric_excess_noise < 1) and (π/σ_π > 4) and (∣μ∣ > 10 mas yr⁻¹)

4.4.2. mws-rrlyr

We target Gaia DR2 sources with magnitudes 14 < G < 19 that are classified as RR Lyrae variables by the general variability classifier and the Special Object Studies classifier (Holl et al. 2018; Clementini et al. 2019), comprising all objects from the table vari_rrylrae and any sources from the table vari_classifier_result for which best_class_name includes "RR." For technical reasons, mws-rrlyr targets are associated with the secondary target bitmask in the DESI data products, rather than the MWS bitmask (see Myers et al. 2023).

4.4.3. mws-nearby

To address the goals described in Section 3.3, we select dwarf stars within 100 pc of the Sun based only on the apparent Gaia magnitude and parallax, allowing for moderate parallax uncertainties:

1.
16 < G < 20
2.
π + σ_π > 10 mas
3.
astrometric_params_solved ≥ 31

No extinction correction is necessary to select stars at these distances. This sample corresponds to fewer than 10 targets per square degree. As described below, to ensure high completeness, mws-nearby targets are prioritized over all other MWS targets except mws-wd and mws-rrlyr.

4.4.4. mws-bhb

We select BHBs starting from the basic definition of main sample stars given at the start of this section (common to main-blue and main-red), with the following additional criteria:

1.
G > 10
2.
π ≤ 0.1 + 3σ_π mas
3.
−0.35 ≤ g − r ≤ −0.02
4.
−0.05 ≤ X_BHB ≤ 0.05
5.
r − 2.3(g − r) − W1_faint < −1.5

These criteria exclude nearby stars and select around the BHB locus in a combined LS and WISE color space, defined by

$\begin{eqnarray}\begin{array}{rcl}{X}_{\mathrm{BHB}} & = & {(g-z)-[1.07163(g-r)}^{5}-1.42272{(g-r)}^{4}\\ & & +\,0.69476{(g-r)}^{3}-0.12911{(g-r)}^{2}\\ & & +\,0.66993(g-r)-0.11368]\end{array}\end{eqnarray} \tag{ 1 }$

and

$\begin{eqnarray}&&W{1}_{\mathrm{faint}}=22.5-2.5{\mathrm{log}}_{10}(W1-3{\sigma }_{W1}),\end{eqnarray} \tag{ 2 }$

where W1 and σ_W1 are the WISE 3.4 μm flux and its error, respectively.

4.5. Flux Standards

Stellar flux standards for all DESI dark and bright programs are selected as point sources having blue colors consistent with F-type stars around the metal-poor main-sequence turnoff:

1.
16 < G < 18
2.
0 < g − r < 0.35
3.
r − z < 0.2

Stars in the magnitude range 18 < G < 19 that meet the same color criteria are also considered for use as standards, at lower priority. Myers et al. (2023) describe the full set of criteria for selecting flux standards for DESI, including additional requirements on photometric quality in LS. Since approximately 100 flux standards are assigned at higher priority than all other targets on every configuration of the DESI focal plane, this subset of the main-blue sample will be observed at a higher sampling rate and have a higher fraction of stars with repeat observations. These stars (6000 K ≲ T_eff ≲ 7000 K dwarfs) will have distances ≲20 kpc, with most in the range 5 kpc ≲ d ≲ 10 kpc (see Figure 6). Flux standards can be identified in the DESI data products using the DESI_TARGET bitmask (see Myers et al. 2023).

5. Observing Strategy

5.1. The DESI Bright-time Program

The DESI bright-time program comprises both MWS and BGS. The observing strategy and fiber assignment strategy for the bright-time program are primarily designed to meet the goals of BGS. BGS galaxies are prioritized over all MWS targets when assigning fibers, with the exception of mws-wd targets (∼1 per square degree). The higher priority of BGS galaxies is therefore the main limitation on the fraction of MWS targets that will be allocated a DESI fiber. We describe the relative prioritization of the different MWS target categories in the next subsection.

The bright-time program defines 5675 tiles to cover a footprint of more than 14,000 deg² (a minimum coverage of 9000 deg² is a design requirement of BGS; see Hahn et al. 2022). Each tile corresponds to a single configuration of DESI fibers. The tiles are organized into four passes of ∼1400 tiles each. Tiles within a pass do not overlap. Tiles on different passes are offset such that all points on the sky will be covered by three tiles on average. A detailed description of the tiling strategy is given in A. Raichoor et al. (2022, in preparation).

On successive passes covering a particular area of the sky, fibers are preferentially assigned to unobserved targets. In general, a fiber will only be assigned to reobserve an MWS target if no unobserved target is available.⁶⁵ All bright-time tiles will be observed to an effective exposure time of T_eff = 180 s, defined as the time required to achieve the same S/N that would be obtained for a typical BGS emission line galaxy in an exposure of 180 s under nominal dark conditions (Hahn et al. 2022). Actual exposure times are scaled on the fly, based on real-time measurements of sky brightness, seeing, and transparency; the airmass; and the Milky Way dust attenuation of the tile (D. Kirkby et al. 2022, in preparation; E. F. Schlafly et al. 2022, in preparation). The decision to observe dark, bright, or backup program tiles is also made according to a real-time measure of the survey speed, the rate at which S/N increases for a fiducial target under the prevailing observing conditions (E. F. Schlafly et al. 2022, in preparation). The total open-shutter time on each tile may be split over several exposures, possibly on different nights, until T_eff (computed from the spectra themselves) is reached (J. Guy et al. 2022, in preparation). In Section 7 we show examples of co-added MWS spectra with T_eff ≃ 180 s.

5.2. Target Density and Fiber Assignment

The density of MWS targets varies significantly across the bright-time footprint, which ranges from the edge of the Galactic plane (∣b∣ ≃ 20°) to the Galactic poles. This is illustrated in Figure 7, which shows the range of surface densities of main-red, main-blue, and main-broad targets across the survey footprint. The median density of main-red (main-blue, main-broad) targets is ≳150 (500, 600) per square degree.

**Figure 7.** Fraction of survey footprint having less than a given surface density of targets in the three main MWS target categories individually (solid lines) and in total (dashed line).
Download figure:
Standard image High-resolution image

For comparison, the average density of DESI fibers on a single tile (i.e., in one pass of the survey) is at most ≃667 per square degree (if all positioners are operational). However, not all fibers will be available for allocation to MWS targets. A fiber will be preferentially assigned to a BGS target in its patrol region. Moreover, on each tile, DESI allocates 50 fiber positioners to flux standards and 40–65 fibers to blank sky locations, per spectrograph petal, which leaves at most 4500 fibers available for science targets. This will be reduced further by nonoperational fiber positioners, the number of which will vary over the course of the survey and is ∼15% as of 2021 December (Silber et al. 2023).

Operational positioners are allocated to targets in a predetermined priority order, based on target category. Each category is assigned an integer base priority that establishes the ordering of classes in fiber assignment. Fiber collisions between targets of the same category (and between classes with the same base priority, such as main-blue and main-red) are resolved by a continuous subpriority value, randomly assigned to each target at the start of the survey (see Myers et al. 2023). Further details of the DESI fiber assignment algorithms are given in A. Raichoor et al. (2022, in preparation).

The highest-priority target category in the bright-time program overall is mws-wd, followed by all BGS target categories, and then by all other MWS categories. The order of all MWS categories from highest to lowest priority is therefore mws-wd, mws-rrlyr, mws-nearby, mws-bhb, main-blue and main-red (at equal priority), main-broad, and finally faint-blue and faint-red (at equal priority). This scheme prioritizes the most scientifically valuable but sparsely distributed targets above the bulk of the main MWS sample.

Table 2 gives estimates of the number of targets in each MWS category that will be allocated to a fiber over the course of the survey. These estimates are made by running the DESI fiber assignment algorithm on all bright-time program tiles, assuming the state of the focal plane current in 2022 March. The estimates further assume that all tiles will be observed to their planned effective exposure time, T_eff. Over the entire footprint, we expect to assign a fiber to approximately 30% of the main-blue and main-red targets, falling to 19% for the lower-priority main-broad sample. The higher-priority mws-wd, mws-nearby, mws-rrlyr, and mws-bhb samples have correspondingly higher completeness (99%, 62%, 52%, and 55%, respectively). Completeness increases as target density falls toward the Galactic poles, such that ∼40% of the footprint has ∼50% completeness for main-blue and main-red.

BGS targets have a mean density of ∼1400 per square degree with tile-to-tile fluctuations corresponding to the large-scale structure in the low-redshift galaxy distribution. Since DESI can deploy at most ∼600 fibers per square degree (after accounting for standards and sky fibers), more MWS targets will be observed in the later passes of the survey, when the density of remaining unobserved BGS targets is lower. Approximately 35% of the MWS sample will be observed on the fourth pass, alongside the final 15% of BGS targets. Approximately 11% of the MWS targets will be observed more than once; for the majority of these, no other target is available to a fiber. The tile-to-tile fluctuations in BGS density give rise to corresponding fluctuations in the MWS completeness of ≈±5% at high latitude. The clustering pattern of the extragalactic large-scale structure will therefore be imprinted on the apparent clustering of observed MWS targets.

6. The DESI MWS Data Pipeline

A massive software development effort has been carried out to ensure that the main data products from DESI, such as extracted flux-calibrated spectra and galaxy redshifts, are of the highest possible quality and processed on a daily basis from the beginning of the survey. The main DESI pipeline uses the Redrock spectral fitting code to determine redshifts (S. J. Bailey et al. 2022, in preparation). Redrock classifies spectra into broad categories (galaxy, star, white dwarf, quasar, etc.) but is not optimized for stellar radial velocity measurement at the accuracy required for MWS science. MWS has therefore developed additional software to extract information from stellar spectra, which we refer to collectively as the MWS pipeline.

There are three main components to the MWS pipeline, which we refer to as RVS, SP, and WD. The RVS component derives stellar radial velocities and atmospheric parameters. The SP component deals with the determination of stellar atmospheric parameters and complements the RVS branch by providing a way for inferring abundances for individual elements through the definition of spectral windows. The WD component is focused on determining parameters for white dwarf stars. Each of these components is based on improvements to existing tools for the analysis of stellar spectra.

6.1. RVS

The RVS pipeline determines the radial velocities of all stellar objects and finds the best stellar atmospheric parameters. The pipeline algorithms are based on ideas first presented in Koposov et al. (2011), where stellar spectra, without flux calibration, are fit by stellar templates multiplied by a polynomial continuum. The code has been expanded to perform interpolation across templates for the Gaia-ESO project (Gilmore et al. 2012). A python implementation of these algorithms, RVSpecfit, is publicly available (Koposov 2019),⁶⁶ including a version specific to DESI (which we describe here) and variants for other surveys, such as S⁵ (Li et al. 2019) and the WHT Extended Aperture Velocity Explorer survey (WEAVE; Dalton et al. 2012).

We briefly summarize the algorithm. A stellar spectrum S evaluated at pixels with wavelengths λ_k is modeled as

$\begin{eqnarray}&&M({\lambda }_{k}| {\boldsymbol{\alpha }},v,\phi )=\displaystyle \sum _{i}{\alpha }_{i}{P}_{i}({\lambda }_{k})T\left({\lambda }_{k}\sqrt{\displaystyle \frac{1-v/c}{1+v/c}}| \phi \right)\end{eqnarray} \tag{ 3 }$

where T(λ∣ϕ) is the interpolated stellar template from the library convolved to the resolution of the observed spectrum._Pi(λ) denotes basis functions, such as polynomials λⁱ, Chebyshev polynomials, or radial basis functions (RBFs), which take care of modifying the continuum shape. The vector α is a vector of linear coefficients, v is the radial velocity, and ϕ is the vector of stellar atmospheric parameters, abundances, and potentially stellar rotations $V\sin i$ . As shown in Koposov et al. (2011) the vector of linear coefficients, α, can be determined through simple linear algebra operations. This allows us to compute the log-likelihood for each spectrum given only the stellar atmospheric parameters and radial velocities, $\mathrm{log}P({\bf{S}}| \phi ,v)$ . In the case of DESI, the total log-likelihood function is the sum of the log-likelihoods for each of the three arms of the spectrograph.

The RVS pipeline optimizes the log-likelihood combined across the three arms simultaneously through Nelder–Mead optimization (Nelder & Mead 1965). Since the likelihood surface is extremely complex in the radial velocity dimension, we need to provide a reasonable initial guess. This is obtained from cross-correlation of the continuum-normalized spectrum with a 0.5% subset of continuum-normalized templates from the PHOENIX grid. We continuum-normalize both the observed spectra and the model templates by a spline-based continuum with a regular grid of knots in wavelength separated by 1000 km s⁻¹. The cross-correlation uses the uncertainties and is computed as ${({\bf{S}}/{{\bf{E}}}^{2} \circledast {\bf{T}})}^{2}/(1/{{\bf{E}}}^{2} \circledast {{\bf{T}}}^{2})$ , where T is the template vector, S is the spectrum, E is the uncertainty vector, and ⊛ is a convolution operator that is computed using a fast Fourier transform. The cross-correlation functions from the three arms of the instrument are added together. The best cross-correlation velocity and the stellar template parameters are then used as a starting point for the likelihood optimization.

Both the initial cross-correlation step and the nonlinear optimization steps are conducted in the radial velocity interval of ±1500 km s⁻¹.

The interpolated stellar templates are constructed from the PHOENIX grid of models in a two-step process. First, global RBF interpolation is used to interpolate the original grid from Husser et al. (2013) onto a regular grid without holes, using steps of 0.2 in [Fe/H], 0.25 in [α/Fe], and the original grid points in $\mathrm{log}g$ and T_eff. This grid is then convolved to the average resolution of DESI in the B, R, and Z arms. The resulting templates are then interpolated using multilinear interpolation.

We note that, because we are fitting the product of the template model and a polynomial or RBF continuum modifier, at this stage we are not using large-scale flux calibration information in the spectra. Currently we use 10 basis functions in each arm for the continuum modification. This approach may be refined in the future.

The RVS pipeline quantifies the uncertainty in the parameters of each fit through an estimate of the Hessian matrix at the minimum. However, the uncertainty in radial velocity is estimated somewhat differently, by evaluating the log-likelihood from the best-fit template P( S ∣v, ϕ₀) on a finely spaced grid of radial velocities and computing the standard deviation of the radial velocity (which is equivalent to assuming a uniform prior on the radial velocity and computing the standard deviation over the posterior).

The RVS outputs are the best-fit stellar atmospheric parameters [Fe/H], $\mathrm{log}g$ , and [α/Fe]; the best-fit radial velocity; the stellar rotation $V\sin i$ ; and their uncertainties. We also return the best-fit chi-square values per arm ${\chi }_{\min ,B}^{2}$ , ${\chi }_{\min ,R}^{2}$ , and ${\chi }_{\min ,Z}^{2}$ and the combined χ². To facilitate the identification of sources that are not well fit by stellar templates, such as galaxies, quasars, or unknown types of objects, we also compute the χ² values for a continuum-only model, i.e., a template T(λ) = 1 in Equation (3), providing ${\chi }_{\mathrm{cont},B}^{2}$ , ${\chi }_{\mathrm{cont},R}^{2}$ , and ${\chi }_{\mathrm{cont},Z}^{2}$ . The rationale for computing these chi-squares is that they give a reference value for the goodness of fit, with respect to which we can assess the stellar fits.

The outputs of the pipeline are also used to populate the 64 bit warning bitmask RVS_WARN, as follows. If the difference in χ² of a stellar model with respect to the continuum model is small, $({\chi }_{\min ,B}^{2}+{\chi }_{\min ,R}^{2}+{\chi }_{\min ,Z}^{2})$ − $({\chi }_{\mathrm{cont},B}^{2}+{\chi }_{\mathrm{cont},R}^{2}+{\chi }_{\mathrm{cont},Z}^{2})\lt 50$ , the first bit of RVS_WARN is set to 1. If the radial velocity is within 5 km s⁻¹ of the edges of the interval of radial velocities considered, [−1500, 1500] km s⁻¹, the second bit of RVS_WARN is set to 1. If the radial velocity error is larger than 100 km s⁻¹ the third bit of RVS_WARN is set to 1. A good spectrum without any of those issues has RVS_WARN = 0.

The RVS pipeline, by default, does not require either targeting information or Redrock results. However when processing the main survey data, if Redrock outputs are available we use the classification from Redrock to avoid fitting spectra of quasars and galaxies. Thus we only process through RVSpecfit the spectra that either are classified as type STAR by Redrock or have been targeted as either a Milky Way target, a stellar secondary target, or a spectrophotometric standard.

6.2. SP

The SP pipeline is based on FERRE⁶⁷ (Allende Prieto et al. 2006), a FORTRAN code that fits numerical models to data by optimization and interpolation of a precomputed model grid. FERRE has been used in the analysis of SDSS optical spectra (Yanny et al. 2009; Rockosi et al. 2022) and SDSS/APOGEE near-IR high-resolution spectra (Majewski et al. 2017) and in other surveys such as Gaia-ESO (Gilmore et al. 2012).

A new python software package, piferre,⁶⁸ has been written specifically to handle DESI MWS data. This package reads the reduced spectra, corrects for the radial velocity (measured by the MWS RVS pipeline or the primary DESI pipeline), resamples the spectra, and prepares submission scripts for batch processing, taking advantage of FERRE's OpenMP parallelism. The piferre package manages the independent processing of DESI tiles, exposures, and petals; collects the results; and creates the final data products.

Only the spectra that have been successfully fitted by the RVS pipeline are processed by the SP pipeline. The input data are the observed spectra and associated inverse covariance arrays.

We set up FERRE to work with grids of model spectra having between two and five parameters: mostly, effective temperature T_eff, surface gravity $\mathrm{log}g$ , and metallicity [Fe/H], and, in some instances, microturbulence ξ and [α/Fe] as well. We adopt several sources for the template models, including the same PHOENIX models (Husser et al. 2013) of the RVS pipeline (Section 6.1) for very cool stars (2300 K < T_eff < 5100 K), the Kurucz ATLAS9 models (Kurucz 1979, 1993) for warmer stars (3500 K < T_eff < 30,000 K; Mészáros et al. 2012; Allende Prieto et al. 2018), and models for DA and DB white dwarfs (Koester 2010). Table 3 describes the parameter range, steps, and origin of the models we use to analyze DESI commissioning and SV data. In some cases there are multiple grids for each family: one for the PHOENIX models, five for the Kurucz models, and four for the Koester models.

Table 3. Summary of the Families of Grids of Model Spectra Employed in the SP Branch of the DESI MWS Pipeline

Type of Star	Models	Bands Fit	Parameters	Reference
Cool and very cool (2300 K < T_eff < 5100 K)	Phoenix	R, Z	${T}_{\mathrm{eff}},\mathrm{log}g$ , [Fe/H], [α/Fe]	Husser et al. (2013)
Cool and warm (3500 K < T_eff < 30,000 K)	Kurucz	B, R, Z	${T}_{\mathrm{eff}},\mathrm{log}g$ , [Fe/H], [α/Fe], ξ	Allende Prieto et al. (2018)
White dwarf	Koester	B, R, Z	${T}_{\mathrm{eff}},\mathrm{log}g$	Koester (2010)

Download table as: ASCII Typeset image

FERRE derives all the atmospheric parameters simultaneously and provides error covariance matrices for every spectrum by inverting the Hessian matrix. As in the case of the RVS code, the optimization is based on the Nelder–Mead algorithm (Nelder & Mead 1965). As expected for a purely spectroscopic analysis, there are significant correlations between the uncertainties in T_eff and log g, and especially between T_eff and [Fe/H].

We analyze the continuum-normalized spectra, after dividing the data by a running mean with a width of 500 spectral bins (equivalent to 250 Å). This is similar to the normalization approach described by Aguado et al. (2017), which ignores the continuum information and focuses on the absorption lines. The entire spectral range is fitted for all models, except for the lowest-temperature stars in the Phoenix models, for which the blue (B) spectrograph arm is ignored because including it leads to significant systematic errors. All the spectra are evaluated against all the model libraries in Table 3. The solution with the smallest value of the reduced chi-square is kept and the others are discarded. The results comprise the parameter estimates, the error covariance matrix, and the best-fitting model, which can be directly compared with the continuum-normalized observation. We also store a version of the best model without continuum normalization, which may be used for flux calibration. A success flag is set for fits with χ² < 1.5 and median S/N > 5.

The SP pipeline performs a number of iterations holding the atmospheric parameters fixed, except for [Fe/H] or [α/Fe], to infer the abundances of several elements by evaluating the χ² in regions of the spectrum dominated by transitions associated with those elements. This procedure resembles the methodology followed in APOGEE (García Pérez et al. 2016; Jönsson et al. 2020), and requires a careful definition of the windows to be used for each element. These are created by assigning weights W_λ(X) proportional to the derivative of the flux (F) with respect to the abundance of the element of interest (X), subtracting possible contributions from the rest of the elements,

$\begin{eqnarray}&&{W}_{\lambda }(X)=\displaystyle \frac{\partial {F}_{\lambda }}{\partial X}-\displaystyle \sum _{Y\ne X}\displaystyle \frac{\partial {F}_{\lambda }}{\partial Y},\end{eqnarray} \tag{ 4 }$

where the derivatives are computed using a solar model atmosphere. The weights are set to zero when Equation (4) gives a value smaller than a predetermined threshold.

6.3. WD

In addition to the white dwarf fitting performed by the SP branch of the pipeline described above, we fit DA white dwarfs with a bespoke code using models provided by Koester (2010), which span 6000 K ≤ T_eff ≤ 100,000 K and $4.00\leqslant \mathrm{log}g\,\leqslant 9.75$ . The fitting procedure employed is similar to that outlined in detail in Liebert et al. (2005). We first continuum-normalize the flux of the observed and convolved model spectra around the Balmer lines, Hβ to H8, which are all contained within the blue arm of the DESI spectrograph (see Table 5.5 of DESI Collaboration et al. 2016b). This fitting procedure allows us to use white dwarfs as an independent test of the absolute flux calibration of DESI obtained from F- and A-type stars across the blue, red, and NIR spectrograph arms, because the best-fit white dwarf models are based solely on fits to the blue arm spectra.

After the continuum normalization, we determine the best-fitting models from our grid. As the equivalent widths of the Balmer lines go through a maximum at approximately T_eff ≃ 12,000 K (dependent on $\mathrm{log}g$ ; Daou et al. 1990; Bergeron et al. 1992), we determine two solutions, one hotter and one cooler than this T_eff. We use both initial best-fit grid solutions as our first estimates for a χ² minimization between the observed and linearly interpolated model grid spectra. The degeneracy between the two solutions can be broken either with the continuum flux of the flux-calibrated DESI spectrum, or through use of Gaia photometry and parallaxes. The current WD pipeline uses the DESI continuum flux, but as this can be affected by flux calibration (e.g., Tremblay et al. 2011), we plan to incorporate the Gaia photometry and parallaxes in a future code development. We also plan to extend this code to DB and DBA white dwarf model fitting, using data from both the blue and red arms of the DESI spectrograph.

7. SV

The DESI SV campaign (DESI Collaboration et al. 2023a, in preparation) included observations dedicated to MWS. These were split into two major subprograms, SV1 and SV3. Figure 8 shows all the DESI SV fields and highlights those associated with MWS. The MWS target selection for SV was previously described in Allende Prieto et al. (2020); other DESI SV programs were introduced in Raichoor et al. (2020), Zhou et al. (2020), Yèche et al. (2020), and Ruiz-Macias et al. (2020). The results of these other programs and the final target selections for the DESI cosmological surveys are given in companion papers by DESI Collaboration et al. (2023a, in preparation), Raichoor et al. (2023), Zhou et al. (2023), Chaussidon et al. (2023), and Hahn et al. (2022).

**Figure 8.** SV fields discussed in the text. Orange and green circles indicate fields with large numbers of stellar targets in the SV1 and SV3 programs, respectively, which were selected specifically for the validation of the MWS main survey and prioritized stellar targets. Black circles correspond to the SV2 program, which will be presented separately. Gray circles show SV fields not associated with the stellar SV program. Purple points and tracks show the extent of the Sagittarius, GD 1, and Orphan streams as reported by the `galstreams` compilation; references are given in Figure 1. Gray lines indicate the approximate Galactic latitude limit of the survey, ∣b∣ ± 20°.
Download figure:
Standard image High-resolution image

The following subsections summarize the MWS SV observations. We then use these data to test our analysis pipelines (introduced in Section 6) and to evaluate our survey selection function. These tests reflect the current state of our pipelines, which we will continue to develop ahead of the first MWS data release. The quantity and quality of the DESI SV spectra also allow us to address a number of interesting scientific questions, including the detection of extended features around star clusters and satellite galaxies, the structure of the GD 1 stream, the discovery of rare sources, and the characterization of radial velocity variability in our sample. We will present these scientific results from the MWS SV program in separate publications.

7.1. SV Target Selection

In all the SV3 fields and all but a few of the SV1 fields, targets were selected using a simpler version of the scheme described in Section 4 (see also Allende Prieto et al. 2020). This selection was magnitude-limited between r = 16 and r = 19 and did not include the astrometric criteria we use to separate main-red and main-broad targets in the main survey. The mws-wd and mws-nearby samples were selected as in Sections 4.4.1 and 4.4.3, except that the astrometric criteria were applied to the Gaia DR2 catalog rather than to EDR3.

For the SV, stars in the range 19 < r < 20 were also included at lower priority. A subset of MWS SV1 tiles included additional sets of high-priority targets specific to the region observed (for example, likely members of satellite galaxies and star clusters selected using Gaia astrometry, or stars observed by previous surveys).

7.2. SV1 MWS Data Set

In SV1, the 14 regions listed in Table 4 were observed to significantly greater depth than that of the main survey.⁶⁹ In most cases, these observations consisted of a single fiber configuration (tile) for which multiple exposures were accumulated under different observing conditions. The completeness of MWS targets in these regions is therefore somewhat higher than that for the main survey (because, in SV1, all fibers were allocated to stars) but lower than that for the SV3 fields (because the same targets were observed on all exposures regardless of their accumulated S/N). We use these high-S/N and multi-epoch spectra to validate our analysis pipelines.

Table 4. Summary of Dedicated MWS Observations in SV1

Region Name	R.A. (deg)	Decl. (deg)	Objective	T_eff (s)	N_exp	Target Selection
NGC 2419	114.54	39.38	Globular cluster	1424	14	Standard
GD 1 Low Latitude A	128.50	0.80	Tidal stream	1034	23	Standard
UMa II	132.87	63.73	Dwarf galaxy	1071	10	Standard
UMa II (extra)	132.87	63.73	Dwarf galaxy	85	2	Customized
Sextans	153.26	−1.11	Dwarf galaxy	250	3	Standard
Orphan	159.08	7.50	Tidal stream	1118	5	Standard
GD 1 High Latitude A	163.74	47.86	Tidal stream	1536	18	Standard
GD 1 High Latitude B	173.55	52.86	Tidal stream	2038	14	Standard
NGP	192.86	27.13	Blank sky	914	21	Standard
BOSS 7456	198.04	0.00	Blank sky/calibration	1647	12	Customized
M53+N5053	199.10	18.30	Globular cluster	1829	16	Standard
M53+N5053 (deep)	198.30	17.50	Globular cluster	6178	8	Customized
M5	229.64	2.58	Globular cluster	2698	17	Standard
M13	250.42	36.96	Globular cluster	2787	16	Standard
M92	259.28	43.64	Globular cluster	3446	14	Standard
Draco	260.07	58.42	Dwarf galaxy	2258	16	Standard

Note. From left to right, the columns give the name and central coordinates of the field, the objective of the observation (in most cases the main feature of interest), the total effective exposure time calculated according to the bright-time metric (see text), the number of exposures taken in SV1, and the target selections used for those configurations (standard refers to the MWS SV1 target selection criteria described in the text, and customized to cases in which one or more additional high-priority target categories were added to the standard selection).

Download table as: ASCII Typeset image

Most MWS SV1 regions (orange circles in Figure 8) correspond to known features in the Milky Way halo, including the GD 1 stream, several globular clusters, and the satellite dwarf galaxies Draco, Sextans, and UMa II. Other tiles were placed on blank regions. Since the angular sizes of most of the star clusters and dwarf galaxies are much smaller than the DESI field of view, much of the area on those tiles is also effectively blank. Additional tiles were placed on the UMa II satellite galaxy and the globular clusters M53 and NGC 5053 with customized target selections favoring likely members of those systems. SV1 included many other observations dedicated to testing the DESI cosmological surveys (gray circles in Figure 8), which yielded a small number of additional stars.

The RVS and SP pipelines are run on the co-added spectra of 45,738 unique SV1 targets, of which 38,051 were targeted by the MWS SV1 program, mostly in the MWS SV1 fields but also with spare fibers in the BGS SV1 program. Many of these MWS targets are observed more than once. Among the MWS targets, 1209 (∼3%) cannot be fit reliably by RVS.

The remaining 7687 SV1 spectra comprise brighter flux standards and serendipitous (potentially) stellar contaminants to the galaxy and QSO selections of other DESI SV programs.⁷⁰ We attempt to fit all targets classified as stars by the Redrock pipeline even if they would not be selected in the main MWS survey, although many of these are faint extragalactic sources and therefore have a higher failure rate in our fitting.

7.3. SV3 MWS Data Set

In SV3 (also referred to as the One-percent Survey in other DESI publications), 20 regions with the area of a single DESI field were observed using multiple tiles dithered in a rosette pattern around a common center, each with total coverage comparable to a single DESI tile. Details of these regions and the corresponding DESI observations are given in DESI Collaboration et al. (2023a, in preparation). The total area covered is 100 deg², mostly distributed over Galactic latitudes 25° < b < 65°, with two tiles covering the north Galactic pole.

Unlike the case of the SV1 observations, multiple exposures of each SV3 tile were accumulated up to an effective exposure time only 20% greater than that to be used for each main survey tile, with fibers on subsequent tiles in the dither pattern being reallocated to unobserved targets. Fibers were assigned to BGS and MWS targets on the same focal plane in the same priority order that will be used in the main survey, but with the broader target selection criteria used in SV1 (see below). This strategy mimics the full DESI survey, but with ∼6 bright-time passes on each region rather than 4 (the other tiles in each region were observed with the dark-time and backup program target selections). The result is a highly complete sampling of each region, typically yielding spectra for ≳90% of the MWS targets (see Section 7.7). Figure 8 shows that, in contrast to the SV1 fields, most of the SV3 fields do not overlap with significant Milky Way substructures (most are cosmological deep fields). However, four are close to the midline of the Sagittarius stream and a further four are in its outskirts. Three more regions lie close to the GD 1 stream.

The MWS SV3 data set contains 213,414 spectra taken under bright conditions, almost all of which (99.6%) are for MWS main survey targets. Of these, 982 (≈0.5%) are not fit successfully by the RVS pipeline. Visual inspection suggests the majority of these failures are for QSOs. Other classes of targets for which the current version of the RVS pipeline fails or produces unreliable results include white dwarfs (which we fit separately) and cool M-dwarfs, which we discuss further in Section 7.5.1.

7.4. Validating the Pipelines

7.4.1. Radial Velocities

We validate the radial velocities from the RVS pipeline by comparing them to radial velocities from other large surveys. Figure 9 shows the distribution of radial velocity residuals for the stellar sources observed during the SV program in bright observing conditions. The different curves show comparisons to different individual surveys, while the black curve shows a comparison to the Survey of Surveys (SoS) data set (Tsantaki et al. 2022), a homogenized set of radial velocities from various surveys. We remark that the comparison to high-resolution surveys such as APOGEE and Gaia shows offsets within 1–2 km s⁻¹, while the SoS data set (dominated by low-resolution LAMOST and SDSS spectra) shows a systematic offset of ∼2.5 km s⁻¹. We are investigating the cause of this offset, which is likely associated with the DESI wavelength calibration and adjustments using skylines.

7.4.2. Radial Velocity Accuracy

Figure 10 shows the median radial velocity error reported by the RVS pipeline for the SV spectra, as a function of stellar color and magnitude. The SV data provide a representative sampling of the color and magnitude range, and hence the velocity accuracy, expected for the main survey.

To validate the radial velocity errors, we analyze the pairwise velocity differences from several tiles that were observed multiple times in the SV. Figure 11 shows the distribution of pairwise velocity differences as a function of the combined velocity uncertainty. We also show the robust estimate of the standard deviation of pairwise differences obtained using the 16th and 84th percentiles in bins of velocity uncertainty. The red curve is the expected behavior if the radial velocity errors have an additional systematic component of ∼0.9 km s⁻¹. This extra systematic error component is likely due to the wavelength calibration. Adding this systematic error in quadrature to the radial velocity errors reported by our pipeline, we see that the distribution of pairwise radial velocity errors normalized by the total uncertainty is very close to a normal distribution (see Figure 12).

**Figure 11.** The accuracy of radial velocity errors from individual DESI exposures. Black points show the robust standard deviation estimate (from the 16th and 84th percentiles) for pairwise radial velocity measurements as a function of their combined formal uncertainties $\sqrt{{\sigma }_{1}^{2}+{\sigma }_{2}^{2}}$ . If the radial velocity errors reported by the pipeline are accurate, the black points should lie on a one-to-one line. The red line shows the model curve after a 0.9 km s⁻¹ systematic error has been added in quadrature to the reported errors.
Download figure:
Standard image High-resolution image

**Figure 11.** The accuracy of radial velocity errors from individual DESI exposures. Black points show the robust standard deviation estimate (from the 16th and 84th percentiles) for pairwise radial velocity measurements as a function of their combined formal uncertainties $\sqrt{{\sigma }_{1}^{2}+{\sigma }_{2}^{2}}$ . If the radial velocity errors reported by the pipeline are accurate, the black points should lie on a one-to-one line. The red line shows the model curve after a 0.9 km s⁻¹ systematic error has been added in quadrature to the reported errors.
Download figure:
Standard image High-resolution image

**Figure 12.** The distribution of pairwise radial velocity differences normalized by the radial velocity errors with a 0.9 km s⁻¹ systematic error added. The red curve shows a Gaussian distribution with zero mean and unit variance.
Download figure:
Standard image High-resolution image

7.4.3. [Fe/H] Abundances

To assess the accuracy of iron abundances, we compare the measurements from the SV data with measurements of the same targets in APOGEE DR17, LAMOST DR7 LRS, and SDSS DR14 SSPP. A histogram of the differences is shown in Figure 13. The overlap of DESI with APOGEE, LAMOST, and SDSS comprises ∼400, 16,000, and 10,000 stars, respectively. The top panel of the figure shows the comparison of [Fe/H] measurements from the RVS pipeline, while the bottom panel shows the comparison to the FERRE abundances. This comparison shows that the [Fe/H] calibration of both the RVS and SP pipelines is within 0.1–0.2 dex of that of other surveys. Some scatter in the metallicity residuals can be attributed to the SDSS, LAMOST, APOGEE, and DESI measurement errors.

**Figure 13.** Comparison of abundances determined by the RVS (top panel) and SP (bottom panel) pipelines against data from other surveys. Dark blue curves show the residuals of DESI [Fe/H] measurements relative to the SDSS DR14 measurements, light blue curves show the comparison to LAMOST DR7 LRS measurements, and green curves show the comparison to APOGEE DR17 [Fe/H] measurements. In the legend we provide the 16th/50th/84th percentiles of the [Fe/H] residuals for each survey.
Download figure:
Standard image High-resolution image

To separate the effects of random errors in the DESI [Fe/H] measurements from systematic errors and errors in external catalogs, we also characterize [Fe/H] precision in bins of color and magnitude using repeated observations of the SV fields. Figure 14 shows the distribution of pairwise [Fe/H] residuals for ∼3 million pairs of measurements, using observations with exposure times between 150 and 1000 s. The [Fe/H] accuracy is estimated from the 16th to 84th percentile range of the distribution. The figure shows that, as expected, the [Fe/H] precision is a strong function of magnitude, with values around 0.05 at r = 16, worsening to 0.2–0.4 at r = 19. For blue stars, 0.2 < r < 0.4, the precision is noticeably worse than that for redder stars at the same magnitude. This is likely due to the impact of bright observing conditions, which degrade the signal most strongly in the blue arm of the instrument, and to the decreasing number of detectable absorption lines for hotter stars.

**Figure 14.** The estimates of [Fe/H] random errors from the RVS pipeline as a function of color and magnitude. In each color–magnitude bin we show the typical random error estimated from the 16th–84th percentile range of the pairwise [Fe/H] differences between repeated measurements of the same stars.
Download figure:
Standard image High-resolution image

7.4.4. Stellar Atmospheric Parameters

We do not provide here a detailed comparison between our measurements of $\mathrm{log}g$ and T_eff and those from other surveys, but they show good agreement. The typical deviation (corresponding to the 16th/84th percentile range) of T_eff with respect to APOGEE and LAMOST is −100 K ≲ δ T_eff ≲ 250 K, with a median offset of approximately 100 K. The RVS pipeline gives a broader range of T_eff offsets than the SP pipeline. The $\mathrm{log}g$ differences between DESI and other surveys are $-0.1\lesssim \delta \mathrm{log}g\lesssim 0.5$ with a median offset of 0.2 dex and are similar between the RVS and SP pipelines.

7.5. Abundances of Star Clusters and Dwarf Galaxies

To validate the abundance measurements from the survey, specifically their accuracy, we also look at DESI observations of several globular clusters and OCs as well as dwarf galaxies for which there is a good external measurement of [Fe/H]. Figure 15 shows the distribution of measured abundances for several objects from Table 4 that were observed during SV. For this figure we do not attempt to identify carefully likely members of each object, only selecting stars that have proper motion within 1 mas yr⁻¹ and radial velocity within 10–15 km s⁻¹ of the parent object. The plot provides a broad overview of the [Fe/H] accuracy of DESI. The expectation is that cluster members should show narrow distributions centered on their true [Fe/H] values, while dwarf galaxy members are expected to show some [Fe/H] spread. The observed [Fe/H] spreads on the figure are also expected to be affected by random [Fe/H] errors that are magnitude- and color-dependent, as discussed in Section 7.4.3. We focus here on the [Fe/H] offsets. We clearly see that for several systems, like M13 and M5, DESI measurements (from both the RVS and SP pipelines) are systematically shifted from the literature values. If we compare the median DESI metallicity for each system to other measurements from the literature, we see that the average offset [Fe/H]_DESI − [Fe/H]_external is −0.13 dex for RVS and −0.14 dex for SP. The range spanned by the [Fe/H] offsets is −0.27 dex (M5) to 0.06 dex (M53) for the RVS pipeline, and −0.32 dex (M13) to 0.1 dex (NGC 2419) for the SP pipeline.

**Figure 15.** Comparison of DESI [Fe/H] measurements by the MWS RVS and SP pipelines of stars in star clusters and a dwarf galaxy to literature measurements from Carretta et al. (2009), Kirby et al. (2011), and D'Orazi et al. (2020). Each panel shows a sample of stars observed as part of the DESI SV1 program (Table 4). We only use stars with proper motions and radial velocities matching the expected proper motion (within 1 mas yr⁻¹) and radial velocity (within 10–15 km s⁻¹) of the corresponding object. We only include measurements from spectra with S/N > 6 in at least one of the arms. The curves show kernel density estimates with an Epanechnikov kernel, assuming a bandwidth of 0.3 dex, a conservative estimate of the precision of the DESI metallicity measurements. Orange lines show literature values of the metallicities. For Draco we also show the 1σ metallicity scatter in the system. The number of stars used for the analysis is provided in the legend of each panel.
Download figure:
Standard image High-resolution image

7.5.1. Quality of the Fits

Figure 16 shows five typical spectra for the MWS main sample targets and the corresponding fits from the RVS pipeline. The S/N is representative of what is expected for the survey at the corresponding magnitudes. The agreement between the data and the model is very good even at high S/N. We also note that, since the RVS model includes a polynomial multiplicative correction to the spectrum (see Equation (3)), the agreement between the model and the data directly probes the accuracy of the flux calibration. Figure 17 summarizes the quality of the spectral fits by showing the typical χ² per degree of freedom (DOF) for the RVS pipeline (left) and the SP pipeline (right), as a function of the color and magnitude of the star. In both cases we see that χ² has a strong dependence on magnitude and color. For the RVS fits, the chi-squares are close to unity for faint stars, as expected if the errors are treated correctly. For the SP fits, the chi-squares seem to be slightly below unity for faint stars. However, both pipelines show strong increases in χ²/DOF to ∼3–10 at bright magnitudes and for stars with g − r ∼ 2. This is primarily due to the limitations of the stellar atmospheric grids used, leading to template mismatches that are apparent at high S/N and for cool stars with many absorption lines and molecular bands.

**Figure 17.** The median logarithm of χ² per DOF for spectral fits with the RVS and SP pipelines, as a function of target color and magnitude.
Download figure:
Standard image High-resolution image

7.6. White Dwarf SV Results: Remnant Planetary Systems

DESI has already identified several hundred planetary systems around white dwarfs via the metal pollution of their atmospheres. A dedicated study of these planetary systems is beyond the scope of this paper. Here we showcase the potential of this statistical sample using one of the most polluted white dwarfs so far observed by DESI, GD 362 (Gianninas et al. 2004; Zuckerman et al. 2007; Xu et al. 2013). GD 362 is extreme in both its pollution from planetary material (a total of 17 elements have been previously identified in the white dwarf's atmosphere; Zuckerman et al. 2007; Xu et al. 2013) and its fairly sizable hydrogen content in an otherwise helium-dominated atmosphere (which can be indicative of water accretion; Jura & Xu 2010; Klein et al. 2010; Gentile Fusillo et al. 2017).

The DESI spectrum of GD 362 obtained on 2021 June 21 is shown in Figure 18, revealing strong absorption lines from many metals, including Ca, Mg, and Fe. To analyze the DESI spectrum of GD 362, we compute a grid of white dwarf model spectra using the code of Koester (2010) and by sampling the parameter space in T_eff, $\mathrm{log}g$ , and number abundances $\mathrm{log}[{\rm{H}}/\mathrm{He}]$ and $\mathrm{log}[{\rm{Z}}/\mathrm{He}]$ around the values determined by Xu et al. (2013). We fix the metal-to-metal ratios $\mathrm{log}[{\rm{Z}}/\mathrm{Ca}]$ to those of Xu et al. (2013). Using that grid, we find T_eff = 10,500 K, $\mathrm{log}g=8.00$ , log[H/He] = −1.75, and $\mathrm{log}[\mathrm{Ca}/\mathrm{He}]=-6.84$ as the best-fit parameters. The parameters differ somewhat from those of Zuckerman et al. (2007) and Xu et al. (2013; T_eff = 10,540 K, $\mathrm{log}g=8.24$ , log[H/He] = −1.14, and $\mathrm{log}[\mathrm{Ca}/\mathrm{He}]=-6.24$ ), which is unsurprising as these authors based their analysis on deep Keck High Resolution Echelle Spectrometer data of this star. However, incorporating the Gaia photometry and astrometry in our analysis (Section 6.3) will offset the lower S/N of the DESI spectrum. DESI will identify ≃1000 debris-polluted white dwarfs, allowing the first large-scale statistical analysis of the characteristics of these planetary systems as a function of host star mass and age. The MWS-WD sample will also include many heavily polluted white dwarfs and probe the extreme parameter spaces of remnant planetary systems.

7.7. MWS Targets in SV3 Fields

The 20 SV3 regions (green circles in Figure 8) each comprise 26–28 slightly dithered DESI pointings with an area of overlap equivalent to a single DESI tile. Compared to the four-pass MWS main survey, this dense coverage pattern provides much higher completeness on all target classes. Several of these regions overlap the Sagittarius stream, but the majority do not fall on known Milky Way halo structures.

For a typical SV3 region, nine observations were taken under standard bright-sky conditions comparable to those expected for MWS and BGS, 13 under dark conditions (in which the DESI cosmological surveys will operate), and five under extremely bright conditions (in which the main DESI survey will switch to the backup stellar program). Fibers on the bright SV3 tiles were allocated to targets from the BGS and MWS SV selections, with BGS having higher priority (this is roughly the same way in which fibers will be allocated in the main survey, although the BGS target density was higher in SV3). Fibers were also assigned to flux standard stars and secondary program stellar targets on the dark SV3 tiles.

Figure 19 shows the completeness of MWS main survey targets (main-blue, main-red, and main-broad) from bright-time SV3 observations in a typical SV3 region (this example is near the north ecliptic pole, b ∼ 30°). The completeness is ≳90% in the annulus $24^{\prime} \lt r\lt 87^{\prime}$ . The dither pattern results in slightly lower (≲80%) completeness near the center (slightly extended to the northeast) and at the edge of each region.

$15^{\prime} \times 15^{\prime} $ — **Figure 19.** Fraction of all MWS main sample targets (main-blue, main-red, and main-broad) observed in the SV3 region close to the north ecliptic pole, (l, b) ≈ (92°, 30°), in $15^{\prime} \times 15^{\prime}$ pixels. Circles at $24^{\prime}$ and $87^{\prime}$ indicate the region of ≳90% completeness discussed in the text (the outer circle is slightly smaller than the nominal size of a DESI tile).
Download figure:
Standard image High-resolution image

This high completeness allows us to characterize the parent sample from which the MWS targets are drawn in these regions, using the measurements made by the RVS and SP pipelines (Section 6). In the following comparisons, we use all stars from SV3 that would be selected in one of the three main survey categories (main-blue, main-red, or main-broad). We consider only stars with a successful RVS pipeline velocity measurement (RVS_WARN = 0) and S/N ≳ 2 in all three spectrograph arms. These data quality cuts remove ∼1% of all MWS main survey targets in the SV3 data set. We refer to this subset of SV3 targets as the MWS SV3 sample.

Figure 20 shows the MDF of the MWS SV3 sample in each of the three main survey target categories. As expected, the distribution peaks at [Fe/H] ≈ −0.5, with the main-blue selection (which samples metal-poor turnoff stars in the thick disk and halo) dominating at lower metallicity and the main-broad selection (mostly nearby metal-rich disk stars) dominating at higher metallicity. The main-red selection has a similar distribution to main-broad below [Fe/H] ≈ −0.5 and notably fewer stars around solar metallicity. An offset of ≲0.2 dex is apparent between the RVS and SP pipelines for main-red and main-broad targets, but not for main-blue targets (see Section 7.4.3).

Figure 21 compares the combined MDF of all MWS main survey targets in the SV3 sample (as shown in Figure 20) to data from APOGEE (Majewski et al. 2017), SEGUE (Yanny et al. 2009; Rockosi et al. 2022), and LAMOST (Cui et al. 2012), restricted to the same range of high Galactic latitudes and imposing basic quality cuts for each survey.⁷¹ This comparison illustrates the much broader scope of the MWS target selection compared to these previous surveys (similar considerations motivated the design of the medium-resolution H3 survey; Conroy et al. 2019). The main-blue targets in our SV3 data have a similar distribution to the sample from SEGUE, which selected for the metal-poor halo. The main-red and main-broad samples have distributions similar to those of the data from LAMOST and APOGEE, both of which comprise intrinsically brighter stars and include subsets with selection functions tuned to recovering halo giants.

Figure 22 shows the $\mathrm{log}g$ distributions of main survey targets in the SV3 data. As expected, the main-broad sample is dominated by dwarf stars. The main-red sample was selected to reduce the contamination of foreground dwarfs and recover a higher proportion of giants. Figure 22 shows that, although main-red contains a large number of dwarf contaminants (a necessary consequence of the mild astrometric selection required to avoid strong kinematic bias) there is also a clear second peak at low $\mathrm{log}g$ values, which is absent in main-broad. The main-blue sample contains a large number of giants and subgiants; as shown in Figure 6, in the MWS magnitude range main-blue probes the metal-poor main-sequence turnoff, subgiant branch, and horizontal branch at distances 20 < r < 60 kpc. The $\mathrm{log}g$ distributions from the RVS and SP pipelines broadly agree, the most notable exception being a slightly higher number of main-blue and main-broad stars with $\mathrm{log}g\lt 4$ in the SP results.

Figure 23 shows separate metallicity distributions for dwarfs and giants. main-blue giants (median [Fe/H] ≈ −1.5) are clearly sampling the metal-poor halo. main-red giants are notably more metal-rich ([Fe/H] ≈ −0.9), with a distribution similar to that of main-blue dwarfs. The main-red selection is weighted toward the lower RGB out to ∼30 kpc (at larger distances the density of giants falls rapidly), and is therefore likely to be dominated by the relatively metal-rich Gaia Enceladus feature (Belokurov et al. 2018; Helmi et al. 2018; Naidu et al. 2020). The main-red and main-broad dwarf samples have median [Fe/H] ≈ −0.4 and −0.3 respectively, consistent with the expectation that they are dominated by the outer thin and thick disks. The main-blue dwarf sample is biased toward the metal-poor tail of those populations.

**Figure 23.** [Fe/H] distributions for main-blue, main-red, and main-broad targets in the SV3 data set, divided into "giant" ( $\mathrm{log}g\leqslant 3;$ solid) and "dwarf" ( $\mathrm{log}g\gt 3;$ dashed) subsets (as shown in Figure 22, there are very few main-broad giants in the data set).
Download figure:
Standard image High-resolution image

Figure 24 provides another perspective on the same data, showing color–magnitude diagrams for giants in different ranges of metallicity. The isochrones in this figure (from the MIST library; Paxton et al. 2011, 2013, 2015; Choi et al. 2016; Dotter 2016) illustrate the types of stars entering the selection function at different distances. A metal-rich structure is clearly visible in the main-red sample at 30–50 kpc.

**Figure 24.** Color–magnitude diagrams for giants ( $\mathrm{log}g\lt 3.5$ ) in the SV3 fields, separated into bins of [Fe/H] (panels; range shown in each panel). We overplot MIST isochrones (corresponding to the central [Fe/H] of each bin) at 15, 30, 60, 90, and 180 kpc (line styles, as shown in the leftmost panel). Larger points indicate the small number of giants in main-broad. The vertical line shows the separation in color between the main-blue and main-red selections.
Download figure:
Standard image High-resolution image

The figures above demonstrate that the MWS main target selection function will yield a representative survey of distant Galactic structures at high latitude. Comparison to predicted distributions for the same fields by the Galaxia model (Sharma et al. 2011) supports this conclusion. The metallicity and velocity distributions for the SV3 data have shapes and amplitudes in broad agreement with the default Galaxia model. Our fiducial broken-power-law halo variant (see Section 4.2) improves this agreement by reducing the counts of distant red giants in the main-red sample. Some discrepancy in detail between Galaxia and the SV3 data is expected: the MWS selection is sensitive to stars from the Gaia Enceladus progenitor and the Sagittarius stream (which are not included in the Galaxia model) and to the slope of the smooth stellar halo density profile.

Finally, Figure 25 compares the $\mathrm{log}g$ distributions of the SV3 data and our Galaxia mock catalog with a broken-power-law halo (convolved with a fiducial Gaussian error ${\sigma }_{\mathrm{log}g}=0.2$ ). Galaxia predicts substantially more high-proper-motion giants in the main-broad selection than we see in the data. These are associated with the thin and thick disks. The counts of giants in the main-red sample are in reasonable agreement at $\mathrm{log}g\approx 2$ but are overpredicted by Galaxia at both higher $\mathrm{log}\,g$ (an excess of thick disk giants) and lower $\mathrm{log}\,g$ (an excess of halo giants). The latter difference is sensitive to the outer slope of the density profile.

8. Summary

We have presented the scientific motivation and final target selection criteria for DESI MWS, which will obtain radial velocities, stellar parameters, and chemical abundances for approximately seven million stars over contiguous footprints in the north and south Galactic caps totaling ∼14,000 deg². The overarching goal of MWS is to provide new constraints on the assembly history of the Milky Way and the distribution of dark matter through measurements of chemical composition and radial velocity. The main MWS sample focuses on stars in the outer thick disk and stellar halo. It is essentially magnitude-limited for colors g − r < 0.7 (the halo and thick disk turnoff; main-blue) and has only a weak astrometric selection applied to redder stars to favor the targeting of distant giants (main-red). These two samples will have ≃30% spectroscopic completeness. Red stars failing the astrometric selection (main-broad) will still be observed, with spectroscopic completeness ≃19%. This straightforward and inclusive selection function will greatly simplify the forward modeling of the survey. It will also favor serendipitous discoveries; essentially every 16 < r < 19 star in the DESI footprint has a finite probability of being observed by MWS. Combined with Gaia, the main MWS sample will be an excellent resource for mapping structures and substructures in the thick disk and accreted halo, probing distances from 1 to ≳100 kpc with multiple overlapping tracer populations. MWS will also provide highly complete samples of white dwarfs, faint M-dwarfs, BHBs, and RR Lyrae variables. These samples will address the star formation history of the Milky Way disk, provide additional probes of the Galactic structure, and serve as a resource for a wide range of stellar and planetary physics. MWS will be supplemented by a backup observing program targeting brighter stars and several more specific secondary science programs, which will be described in a separate publication.

The first DESI data release (DESI Collaboration et al. 2023b, in preparation) will include approximately 500,000 stellar spectra from the MWS SV program, introduced in Section 7. We have used these data to show that, with our current analysis pipelines, DESI can measure stellar radial velocities to an accuracy ≃1 km s⁻¹ and [Fe/H] to ∼0.2 dex, for representative main-sequence turnoff and giant branch halo tracers in our main sample, given observing conditions comparable to those used for MWS targets in the full bright-time MWS. This velocity precision meets the requirements of the scientific program described in Section 3. It is sufficient to distinguish kinematic substructures (for example, tidal streams and shells) from the bulk stellar halo, to explore large-scale velocity perturbations associated with the LMC, and to characterize the internal velocity dispersion of features as dynamically cold as the tidal streams of globular clusters.

We have quantified systematic offsets in our measurements with respect to previous spectroscopic surveys and between our two main analysis pipelines. The discrepancies are most significant in the parameters and abundances measured for very cool stars. The results in Section 7.4.3 give us confidence that [Fe/H] uncertainties from DESI are, at least, comparable to those from other surveys at similar resolution, such as SDSS SEGUE. The random [Fe/H] uncertainties span a range of 0.05 dex for our brightest targets to 0.4 dex for the faintest. When comparing to external high-resolution measurements of clusters and dwarf galaxies, we see a typical systematic offset ∼0.15 dex in the DESI measurements. However, this varies from system to system, likely indicating systematic errors that are a function of temperature, [Fe/H], or [α/Fe]. We plan to investigate and where possible correct these deficiencies in preparation for the first DESI data release.

The radial velocity, metallicity, and surface gravity distributions we obtain from the highly complete and higher-S/N SV3 minisurvey over ≈100 deg² are broadly consistent with predictions from the Galaxia Milky Way model (Sharma et al. 2011). The agreement in giant counts is greatly improved if we assume a steeper decline in the density of halo stars at Galactocentric distances ≳25 kpc, relative to Galaxia's original "unbroken"-power-law halo density profile. A mock realization of the full MWS based on this modified Galaxia model predicts ≃1 million MWS targets beyond 15 kpc, ∼100,000 beyond 30 kpc, and ∼1000 beyond 100 kpc, of which we will be able to observe ≈30%. We therefore expect spectrophotometric distance estimates based on MWS data to provide strong constraints on the outer density profile of the stellar halo.

MWS is one of several active or near-future large-scale Galactic spectroscopy projects, including H3 (a smaller medium-resolution stellar survey with similar goals to MWS; Conroy et al. 2019), 4MOST (in the southern hemisphere; de Jong et al. 2019), and WEAVE (in the northern hemisphere; Dalton et al. 2012). The 4MOST and WEAVE spectrographs have a high-resolution mode (R ∼ 20,000) in addition to an R ∼ 5000 mode capable of surveys comparable in scope to MWS. These surveys complement one another. MWS is particularly well suited to a large-scale mapping of the outer thick disk, the Gaia Enceladus debris, the Sagittarius stream, and other northern-sky overdensities. The MWS data should greatly improve our understanding of how these features contribute to the overall stellar halo density, metallicity, and velocity dispersion profiles, and provide a robust foundation for detailed comparisons with theoretical models.

We thank the reviewer, Matthias Steinmetz, for his thorough reading of our manuscript and constructive suggestions. A.P.C. and N.K. are supported by a Taiwan Ministry of Education (MoE) Yushan Fellowship awarded to A.P.C., and by Taiwan National Science and Technology Council (NSTC) grant 109-2112-M-007-011-MY3. This work used high-performance computing facilities operated by the Center for Informatics and Computation in Astronomy (CICA) at National Tsing Hua University. This equipment was funded by MoE, NSTC, and National Tsing Hua University. C.A.P. acknowledges financial support from the Spanish Ministry of Science and Innovation (MICINN) projects AYA2017-86389-P and PID2020-117493GB-I00. C.J.M. acknowledges financial support from Imperial College London through an Imperial College Research Fellowship grant. M.V. and L.B.e.S. acknowledge support from NASA-ATP award 80NSSC20K0509. The work of A.D. and J.N. is supported by NOIRLab, which is managed by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the National Science Foundation. T.S.L. acknowledges financial support from the Natural Sciences and Engineering Research Council of Canada (NSERC) through grant RGPIN-2022-04794. B.T.G. was supported by grant ST/T000406/1 from the Science and Technology Facilities Council (STFC). This project has received funding from the European Research Council under the European Union's Horizon 2020 research and innovation program (grant agreement No. 101020057). This research made use of computing time available on the high-performance computing system at the Instituto de Astrofisica de Canarias. The authors thankfully acknowledge the technical expertize and assistrovided by the Spanish Supercomputing Network (Red Espanola de Supercomputacion), as well as the computing resources provided by the LaPalma Supercomputer and the Diva cluster, both located at the Instituto de Astrofisica de Canarias.

This research is supported by the Director, Office of Science, Office of High Energy Physics of the U.S. Department of Energy (DOE), under contract No. DE-AC02-05CH11231, and by the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility under the same contract. Additional support for DESI is provided by the U.S. National Science Foundation, Division of Astronomical Sciences, under contract No. AST-0950945 to the NSF's National Optical-infrared Astronomy Research Laboratory; the Science and Technology Facilities Council of the United Kingdom; the Gordon and Betty Moore Foundation; the Heising-Simons Foundation; the French Alternative Energies and Atomic Energy Commission (CEA); the National Council of Science and Technology of Mexico (CONACYT); MICINN; and the DESI member institutions: https://www.desi.lbl.gov/collaborating-institutions.

The DESI Legacy Imaging Surveys consist of three individual and complementary projects: the Dark Energy Camera Legacy Survey (DECaLS), the Beijing–Arizona Sky Survey (BASS), and the Mayall z-band Legacy Survey (MzLS). DECaLS, BASS, and MzLS include data obtained, respectively, at the Blanco telescope, Cerro Tololo Inter-American Observatory, NSF's NOIRLab; at the Bok telescope, Steward Observatory, University of Arizona; and at the Mayall telescope, Kitt Peak National Observatory, NOIRLab. NOIRLab is operated by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the National Science Foundation. Pipeline processing and analyses of the data were supported by NOIRLab and the Lawrence Berkeley National Laboratory (LBNL). The Legacy Surveys also use data products from the Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE), a project of the Jet Propulsion Laboratory/California Institute of Technology, funded by the National Aeronautics and Space Administration. The Legacy Surveys were supported by the Director, Office of Science, Office of High Energy Physics of the U.S. DOE; the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility; the U.S. National Science Foundation, Division of Astronomical Sciences; the National Astronomical Observatories of China, Chinese Academy of Sciences; and the Chinese National Natural Science Foundation. LBNL is managed by the regents of the University of California under contract to the U.S. DOE. The complete acknowledgments can be found at https://www.legacysurvey.org/.

This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement.

Funding for SDSS-IV has been provided by the Alfred P. Sloan Foundation, the U.S. DOE Office of Science, and the participating institutions. SDSS-IV acknowledges support and resources from the Center for High Performance Computing at the University of Utah. The SDSS website is www.sdss.org.

SDSS-IV is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Brazilian Participation Group, the Carnegie Institution for Science, Carnegie Mellon University, the Center for Astrophysics, Harvard & Smithsonian, the Chilean Participation Group, the French Participation Group, Instituto de Astrofísica de Canarias, the Johns Hopkins University, the Kavli Institute for the Physics and Mathematics of the Universe (IPMU)/University of Tokyo, the Korean Participation Group, LBNL, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), the National Astronomical Observatories of China, New Mexico State University, New York University, University of Notre Dame, Observatário Nacional / MCTI, the Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, the United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University.

The Guoshoujing Telescope (the Large Sky Area Multi-object Fiber Spectroscopic Telescope / LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

For the purpose of open access, the authors have applied a Creative Commons attribution (CC BY) license to any author-accepted manuscript version arising from this submission.

The authors are honored to be permitted to conduct scientific research on Iolkam Du'ag (Kitt Peak), a mountain with particular significance to the Tohono O'odham Nation.

For more information, visit desi.lbl.gov.

Facilities: Mayall (Mosaic3) - , Bok (90Prime) - , Blanco (DECam) - , WISE - , Gaia. -

Software: NumPy (Harris et al. 2020), SciPy (Virtanen et al. 2020), Astropy (Astropy Collaboration et al. 2013, 2018), Matplotlib (Hunter 2007), PyGaia (A. Brown; Gaia Project Scientist Support Team and the Gaia DPAC; https://github.com/agabrown/PyGaia), galstreams (Mateu 2017).

Overview of the DESI Milky Way Survey

Article metrics

Share this article

Author e-mails

Author affiliations

ORCID iDs

Dates

Abstract

1. Introduction

2. DESI MWS in Context

3. Science Goals of DESI MWS

3.1. Probing the Dark Matter and Accretion History of the Milky Way

3.1.1. The Shape and Mass of the Dark Matter Halo

3.1.2. Small-scale Substructure in the Dark Matter Halo

3.1.3. The Assembly History of the Milky Way Halo

3.1.4. The Formation History of the Milky Way Thick Disk

3.1.5. Primordial Stars in the Milky Way

3.2. The DESI White Dwarf Survey

3.3. The DESI Nearby Star Survey

3.4. Survey Requirements

4. Selecting Milky Way Targets for DESI

4.1. Overview of MWS Target Categories

4.2. Main Sample Selection

4.3. Main Sample Targets in the Galaxia Model

4.4. High-priority Sample Selections

4.4.1. mws-wd

4.4.2. mws-rrlyr

4.4.3. mws-nearby

4.4.4. mws-bhb

4.5. Flux Standards

5. Observing Strategy

5.1. The DESI Bright-time Program

5.2. Target Density and Fiber Assignment

6. The DESI MWS Data Pipeline

6.1. RVS

6.2. SP

6.3. WD

7. SV

7.1. SV Target Selection

7.2. SV1 MWS Data Set

7.3. SV3 MWS Data Set

7.4. Validating the Pipelines

7.4.1. Radial Velocities

7.4.2. Radial Velocity Accuracy

7.4.3. [Fe/H] Abundances

7.4.4. Stellar Atmospheric Parameters

7.5. Abundances of Star Clusters and Dwarf Galaxies

7.5.1. Quality of the Fits

7.6. White Dwarf SV Results: Remnant Planetary Systems

7.7. MWS Targets in SV3 Fields

8. Summary

Footnotes