LAMOST DR1: Stellar Parameters and Chemical Abundances with SP_Ace

, , , , , , and

Published 2018 April 3 © 2018. The American Astronomical Society. All rights reserved.
, , Citation C. Boeche et al 2018 AJ 155 181 DOI 10.3847/1538-3881/aab5af

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

1538-3881/155/4/181

Abstract

We present a new analysis of the LAMOST DR1 survey spectral database performed with the code SP_Ace, which provides the derived stellar parameters ${T}_{{\rm{eff}}}$, $\mathrm{log}g$, [Fe/H], and [α/H] for 1,097,231 stellar objects. We tested the reliability of our results by comparing them to reference results from high spectral resolution surveys. The expected errors can be summarized as ∼120 K in ${T}_{{\rm{eff}}}$, ∼0.2 in $\mathrm{log}g$, ∼0.15 dex in [Fe/H], and ∼0.1 dex in [α/Fe] for spectra with S/N > 40, with some differences between dwarf and giant stars. SP_Ace provides error estimations consistent with the discrepancies observed between derived and reference parameters. Some systematic errors are identified and discussed. The resulting catalog is publicly available at the LAMOST and CDS websites.

Export citation and abstract BibTeX RIS

1. Introduction

During the last decades, the formation and evolution of the Milky Way (MW) has become a question of major importance in modern astrophysics. The proximity of our Galaxy permits individual star-by-star investigations that would not be possible for external galaxies. This opportunity has been taken by many research groups who have planned and run spectroscopic surveys of the MW at high- and low-spectral resolution (e.g., the Apache Point Observatory Galactic Evolution Experiment, APOGEE, Allende Prieto et al. (2008); the Galactic Archaeology with HERMES (GALAH) Survey, Zucker et al. (2012); the Gaia-ESO Public Spectroscopic Survey, Gilmore et al. (2012); the Sloan Extension for Galactic Understanding and Exploration, SEGUE, Yanny et al. (2009); the RAdial Velocity Experiment, RAVE, Steinmetz et al. (2006); the 4 m multi-Object Spectroscopic Telescope, 4MOST, de Jong et al. (2012); the WHT Enhanced Area Velocity Explorer, WEAVE, Dalton et al. 2012).

While high-resolution spectroscopy can provide higher accuracy in stellar parameters and chemical abundances for relatively small samples of stars that are relatively bright (e.g., 106 stars brighter than V = 14 for GALAH), low-resolution spectroscopy is better suited to collecting spectra of stars of fainter magnitude, thus securing a much larger sample (25 × 106 stars down to V = 20 for 4MOST).

The Large Sky Area Multi-object Fiber Spectroscopic Telescope (LAMOST, Cui et al. 2012) is a telescope with effective aperture of 4 m, which is used to conduct Galactic and extra-galactic spectroscopic surveys at a spectral resolution of R ∼ 2000. The LAMOST Experiment for Galactic Understanding and Exploration (LEGUE, Deng et al. 2012) is an on-going Galactic survey that, with a present sample of more than 5 million stellar spectra of the MW, is the largest spectroscopic survey available to the astronomical community. The first data release (DR1, Luo et al. 2015) includes 2,204,696 spectra, of which 1,944,329 are spectra of stars in the MW (corresponding to around 1.6 × 106 unique stars). Public data releases have followed on an annual basis.8

In our current paper, we present the stellar parameters and chemical abundances obtained by applying the code SP_Ace (Boeche & Grebel 2016, see also Section 3) to the LAMOST DR1 spectra. Stellar parameters, such as ${T}_{{\rm{eff}}}$, $\mathrm{log}g$, and [M/H] have already been derived, most notably by the official LAMOST pipeline, LASP (Luo et al. 2015). This pipeline initially compares the observed spectra to a library of synthetic spectra derived from a Kurucz/ATLAS9 grid in order to obtain a first, coarse, estimate of the parameters. After this, the ULySS method (Koleva et al. 2009; Wu et al. 2011), using the ELODIE spectral library (Prugniel et al. 2007) for the template spectra, is applied. In addition to LASP, other groups have carried out their own analyses. Ho et al. (2017a, 2017b) have applied a method based on the "The Cannon" (Ness et al. 2015), which estimates parameters by training a model on existing data sets (in this instance, from the APOGEE survey). From this analysis, they are able to derive parameters for giant stars, including [α/Fe], individual [C/M] and [N/M] abundances, and, from these latter two abundances, masses and ages. Xiang et al. (2015) present another analysis, like LASP also matching to template spectra (in this case observed libraries from MILES and ELODIE). A recent update to this pipeline now delivers alpha-element abundances and, by applying a machine learning algorithm to the giant stars, [C/N] and [N/H] abundances (Xiang et al. 2017). By using SP_Ace, we derive the stellar parameters ${T}_{{\rm{eff}}}$, $\mathrm{log}g$, [Fe/H], and the alpha abundances [α/H] for both dwarf and giant stars.

2. Data

We employ the LAMOST spectra from the first data release (DR1, Luo et al. 2015) using the latest internal DR3 reduction. Out of the 2,204,969 spectra belonging to the DR1 catalog, 1,944,329 were classified as stars, the rest as galaxies, quasars, or other non-stellar objects. These spectra are radial velocity (RV) corrected for the Earth motion (only), flux calibrated, and are the result of joining the two spectra obtained by the blue and red arms of the spectrographs (Luo et al. 2015). The spectra are re-binned to a constant velocity dispersion so that the pixel interval is constant in $\mathrm{log}\lambda $. The calibration is in the vacuum wavelength. To process these spectra with SP_Ace and derive stellar parameters and chemical abundances, we must re-shape the spectra by (i) converting the dispersion from $\mathrm{log}\lambda $ to λ (angstrom), (ii) converting the wavelength from vacuum to air, and (iii) normalizing the flux. We used the IRAF9 task disptrans to convert the wavelength calibration from vacuum to air and from $\mathrm{log}\lambda $ to λ. This conversion renders a spectral dispersion that is not constant. To flux normalize the spectra, we used the IRAF task continuum with the settings

Out of the 2,204,969 DR1 spectra, we processed 2,052,662 spectra; 152,300 were not present in the internal DR3 catalog, mostly objects with low (<20) signal-to-noise ratio (S/N), while 7 spectra failed the IRAF tasks described above. We ran all of the spectra through SP_Ace, including the ones classified as non-stellar objects, to cover possible mis-classifications. Because SP_Ace was designed for stellar spectra, we expect no convergence (and therefore null values to be output) for spectra of objects that are not stars (this is discussed in Section 3.3).

3. The SP_Ace Code

The SP_Ace software (Boeche & Grebel 2016) derives stellar parameters and chemical abundances of FGK stars from the analysis of their spectra in the wavelength windows 5212–6960 Å and 8400–8920 Å. The software uses a novel method to derive these parameters. Many pipelines rely on equivalent width (EW) measurements (among others, Fast Automatic Moog Analysis, FAMA, Magrini et al. 2013; GALA, Mucciarelli et al. 2013) or on libraries of synthetic spectra (among others, the MATrix Inversion for Spectral SynthEsis, MATISSE, Recio-Blanco et al. 2006; FERRE, Allende Prieto et al. 2006), while others rely on training sets of standard stars (The Cannon, Ness et al. 2015; ULySS, Koleva et al. 2009; LASP, Luo et al. 2015). Unlike these, SP_Ace relies on a library of general curves-of-growth (GCOG). The GCOG is a function that describes the EW of an absorption line as a function of the stellar parameters ${T}_{{\rm{eff}}}$, $\mathrm{log}g$, and [El/H], where "El" is the element the line belongs to. The GCOG is, therefore, the generalization of the classical curve-of-growth extended to the parameters ${T}_{{\rm{eff}}}$ and $\mathrm{log}g$. Given the stellar parameters ${T}_{{\rm{eff}}}$, $\mathrm{log}g$, and the abundances [El/H], SP_Ace takes the GCOG of the lines stored in the GCOG library for the wavelength interval under consideration and computes the expected EWs. Then, assuming a Voigt line profile, SP_Ace constructs a model spectrum by subtracting these line profiles (with the just-computed EWs) from a continuum normalized to 1. This is described in more detail in Section 7.1 of Boeche & Grebel (2016) (see also Figure 1 of the current paper). In this way, SP_Ace constructs many model spectra of different stellar parameters and searches for the one that minimizes the χ2 between the observed and the model spectrum. The χ2 is minimized following the Levenberg–Marquadt method, details of which are given in Section 7.3 of Boeche & Grebel (2016). This method of comparing model spectra and observed spectra puts SP_Ace in the "global fitting methods" category. We remind the reader that, as input, SP_Ace takes spectra that have been wavelength calibrated, RV corrected to the rest frame, and continuum normalized. However, SP_Ace can apply changes in RV to the model spectrum (see Section 7.3 of the Boeche & Grebel 2016 paper). This feature was implemented in the code because experience showed that SP_Ace could detect small wavelength shifts in observed spectra, even when these were previously RV corrected to the rest frame. Since this can badly affect the χ2 analysis, we allowed SP_Ace to perform small shifts (no larger than ∼1 FWHM) to the spectrum model. This was supposed to be a mere internal setting performed in order to better match the observed spectrum and increase the accuracy of the derived stellar parameters. In fact, in the case of high-resolution spectra, a shift of 1 FWHM in wavelength corresponds to a small shift in RV (at R ∼ 20,000 this corresponds to ∼15 km s−1). The case of low-resolution spectra (like LAMOST) is different, because a 1 FWHM shift corresponds to ∼200 km s−1 in RV. Because the LAMOST spectra are corrected for the Earth's motion only, these internal RV corrections correspond to the heliocentric RVs of the LAMOST stars. From now on, we refer to these shifts as the SP_Ace RV corrections. However, SP_Ace was not designed to measure RVs and has never been tested for this purpose, therefore will not use them as such. Official LAMOST RVs have been determined by the LAMOST 1D pipeline (see Luo et al. 2015).

Figure 1.

Figure 1. LAMOST spectrum spec-55932-GAC_061N46_V1_sp03-061 (in red) and the best matching model found by SP_Ace (in blue) with derived parameters of ${T}_{{\rm{eff}}}$ = 4642 K, $\mathrm{log}g$ = 2.60, [M/H] = 0.24 dex. The observed spectrum has S/N = 44 and it has been continuum re-normalized with the internal SP_Ace procedure. Shaded areas are the rejected wavelength intervals.

Standard image High-resolution image

Similarly, SP_Ace applies a continuum re-normalization to the (already normalized) observed spectra as an internal trim of the continuum to improve the χ2 analysis. This internal setting was implemented because the normalization of spectra (done with IRAF or similar tools) are often not optimal in the case of wide absorption lines and/or high metallicity. This is due to the difficulty in distinguishing the continuum level from a pseudo-continuum generated by the wide blends of lines or wings of wide strong lines. However, in order to deal with certain idiosyncrasies of the LAMOST spectra, we have changed some parts of the code, as explained in the following section.

3.1. The LAMOST version of SP_Ace

To fully exploit the information carried by the LAMOST spectra, we need to consider the largest wavelength range possible. This means that both the blue and red parts of the spectra must be used for the analysis. However, the blue and red parts differ from each other as follows:

  • 1.  
    Spectral dispersions: The LAMOST spectra have been re-binned to a wavelength width that is constant in $\mathrm{log}\lambda $. This means that the binning in wavelength (employed by SP_Ace) varies along the whole spectrum.
  • 2.  
    S/N: The intensity of the blue and the red parts of a spectrum change not only as a function of the temperature of the stars, but also because the efficiency of the two spectrograph arms differs. Therefore the S/N of these two parts is different. Furthermore the S/N changes inside each of the two parts because the central part of the spectral orders receives more light than the borders (blaze function).
  • 3.  
    Spectral resolutions: The instrumental full-width-half-maximum (FWHM) of the blue and red arm of the spectroscope differ, with the former being smaller than the latter.
  • 4.  
    Wavelength calibration: There are differences in the wavelength calibration between the blue and red parts of the spectra, since they are done independently. This can lead to small differences in RV.

The most recent public version of SP_Ace is able to handle spectra with a dispersion and S/N varying along the spectrum. However, for the analysis of a given spectrum, the public version of SP_Ace assumes one single FWHM and RV correction. This makes the public version of SP_Ace unsuitable for a full exploitation of the LAMOST spectra. To overcome this limitation, we have made a custom version of SP_Ace capable of analyzing the blue and red parts of the spectra separately. This version estimates an individual RV correction and instrumental FWHM for each of the two parts, while simultaneously searching for a single set of stellar parameters and chemical abundances with a unique χ2 analysis. Unlike the public version of SP_Ace, this version also makes use of the Hα absorption line because it improves the parameter estimation of the LAMOST spectra for dwarf stars. For giant stars, deviations from local thermodynamic equilibrium (non-LTE) can affect Hα significantly, but for dwarf stars (i.e., stars with high gravities, $\mathrm{log}g$ > 4), the LTE assumptions are good enough to predict a reliable strength for this line. Therefore, the LAMOST version of SP_Ace uses this line by assigning different weights to the pixels associated with the line10 as follows,

Equation (1)

This provides a significant improvement in $\mathrm{log}g$ for dwarf stars, while leaving the analysis of giant stars unaffected. For this work only, the Hα GCOG employed has been obtained with the same procedure followed for the other lines, but we used the spectrum synthesis code SPECTRUM (Gray & Corbally 1994) instead of MOOG (Sneden 1973). This was done because, with the same atomic parameters and stellar atmospheres employed as for the official SP_Ace GCOG library, the strength of the Hα line synthesized with SPECTRUM is bigger than the one in MOOG and it provides a better match to that of the Sun and Procyon. For this line only, we adopted the approximated Voigt profile by Bruce et al. (2000),

Equation (2)

where L and G are the Lorentzian and Gaussian functions and EW is the EW expressed in Å. While the FWHM of the Gaussian function is equal to the spectral resolution FWHM (one of the variables that SP_Ace actively looks for), the σ of the Lorentzian profile depends on the EW according to the following relation:

Equation (3)

where σL is expressed in Å, and the r parameter in Equation (2) is defined as

Equation (4)

These are empirical relations that have been chosen to provide an optimal match to the observed Hα line profile.

Although the LAMOST catalog provides an estimate of the error on the flux of a spectrum (inverse variance), it has been noted that this overestimates the true uncertainty (e.g., Xiang et al. 2015; Ho et al. 2017a). If we use these values in our fitting, this will result in unreliable parameter uncertainties, so we have chosen to estimate the error ourselves. The uncertainty on the ith pixel is taken as the standard deviation of the residuals computed over an interval with 25 pixels of half-width and centered on the ith pixel (after the pixels deviating more than 3σ were rejected). This approach ensures that the best-fit reduced χ2 is approximately one for most spectra and hence the parameter uncertainties should be reliable.

With this version of SP_Ace, we processed the 2,052,662 LAMOST DR1 stellar spectra, along with more than 50,000 spectra from later data releases. This latter sample consists of spectra of stars with existing stellar parameters in the literature, chosen so that we can validate our results (see Section 4). We limited the analysis to the wavelength intervals 5212–5700 Å, 5900–6270 Å, and 6320–6860 Å. The first neglected interval (5700–5900 Å) avoids the overlapping region of the blue and red arm, while the second one (6270–6320 Å) avoids the region affected by telluric lines. The processing was carried out with the SP_Ace options "ABD_loop" and "alpha" switched on. The first option forces SP_Ace to estimate the stellar parameters and chemical abundances using a loop, whereby the stellar parameters are derived again using the last abundance estimation, and vice versa, until they reach convergence. The second option imposes that the absorption lines of the elements Mg, Si, Ca, and Ti must be derived as if they were one single element called the α-element11 by forcing the relative abundances of these elements to be equal to each other. Similarly, all of the other non-α elements are derived as though they were the same element that we call "Fe." Although this may appear imprecise, this parameter traces Fe more closely than the other elements because most of the absorption lines that drive it are in fact iron lines. The choice of grouping alpha elements and heavy elements together is due to the features of our spectra. A good fraction of the LAMOST spectra have low signal-to-noise and their low spectral resolution does not allow individual fit for most of the absorption lines. By grouping together absorption lines for similarly behaving elements, the total absorbed flux is bigger than for the individual elements and so it makes detectability easier in low signal-to-noise spectra. The "lack of purity" for the α- and Fe-abundance is the price we pay to obtain abundance measurements for a higher number of spectra and with greater precision.

3.2. Complementary Parameters

The first few columns of the SP_Ace output report some parameters that, although usually not used for science, can be helpful in determining features and quality of the spectra. These parameters are as follows:

  • 1.  
    FWHM: The standard SP_Ace pipeline estimates only one FWHM for the whole spectrum. Because LAMOST spectra are actually composed of two separate spectra (the blue and the red part), which have different spectral resolutions, the LAMOST version of SP_Ace estimates FWHMs for the blue and the red spectrum parts independently. These are output as FWHMb and FWHMr, respectively. The FWHMb and FWHMr differ and their values are (on average) ∼2.9 and ∼4.3 Å, respectively (see top panel of Figure 2). Only ∼2% of the spectra have FWHMb > 4 Å and ∼3% have FWHMr > 6 Å. These values may be used as quality selection criteria, although we do not adopt any such cuts in this study as the number of stars with very high FWHM is negligible. Spectra with unusually large FWHM may be out-of-focus spectra, fast rotator stars, or have bad seeing, for which we do not expect reliable parameter estimations. As expected, the two FWHMs are weakly correlated, meaning that if the FWHM is large in the blue arm, there is a tendency for it to be large for the red arm as well.
  • 2.  
    Radial Velocity: Like the FWHM, the RV correction is also independently estimated for the blue and the red parts of the spectrum (RVb and RVr, respectively). Note that SP_Ace was designed to take RV corrected spectra and, in the case of small errors in the wavelength calibration or RV correction, it can correct for this by shifting the central wavelengths of the lines in the model spectrum. The limit of this shift is 1.27 FWHM, which is equivalent to a 3σ shift for a Gaussian profile; this limit corresponds to an RV offset of ∼±200 km s−1 for an average FWHMb = 2.9 Å. Because the FWHM varies for different spectra, the RV correction limit varies as well.12 For stars having an RV that causes a wavelength shift larger than 1.27 FWHM, SP_Ace quits the analysis and reports no results. Since the LAMOST spectra are corrected for the Earth's motion only (i.e., they are in the heliocentric frame), the RV correction performed by SP_Ace corresponds to the heliocentric RV of the star. This means that the RVb and RVr should agree to within their uncertainties. If the difference between RVb and RVr is large, it may indicate a mis-calibration of the wavelength or an incorrect RV convergence of SP_Ace in one of the two parts of the spectrum. In the bottom panel of Figure 2 the gray solid histogram shows the distribution of RVb minus RVr for the first full year of the survey.13 This distribution is not symmetric and peaks at ∼−7 km s−1, which means that typical spectra have a value of RVr that is 7 km s−1 larger than RVb. This offset remains even if we select only high S/N spectra. The cause of this shift may be related to the 5.7 km s−1 difference between LAMOST and APOGEE RVs reported by Tian et al. (2014) (see Section 2.3). If we compare our RVs to those from the LAMOST pipeline, we find that less than 0.5% of stars with S/N > 40 have offsets greater than ±30 km s−1. From this, we conclude that our stellar parameters are unlikely to be affected by problems with the RV correction.
  • 3.  
    S/N: This is a quality indicator of the spectra. As discussed above, each spectrum will have an S/N that varies with wavelength. SP_Ace computes a pixel-by-pixel S/N based on the discrepancy between the observed spectrum and the best matching model, using a 51 pixel interval (25 pixel half-width) centered on the pixel under consideration. However, in its output, SP_Ace delivers a single value for the S/N. This is computed as S/N${}_{\mathrm{SPAce}}$ = 1/σtot, where σtot is the standard deviation of the residuals between the observed spectrum and the best matching model over the whole spectrum, i.e., this can be considered as an average S/N.14 In Figure 3, we compare S/N${}_{\mathrm{SPAce}}$ to the I-band signal-to-noise (SNRI) given in the LAMOST catalog, where we only include spectra that have stellar parameters in both catalogs. The two S/Ns show a fair 1:1 trend, although for some spectra the difference can be significant (note that the SNRI is evaluated in a wavelength window centered on 6220 Å). The peak of the S/N distribution is around 30, with the SP_Ace distribution being slightly higher. This shift is probably due to the aforementioned issue regarding LAMOST's flux error estimate (see Section 3.1). For lower S/N spectra, one or both methods often fail to converge, mainly due to a lack of information in the spectra. Other issues that affect convergence are cosmetic defects (such as cosmic rays, fringing, or dead pixels), emission lines, or any other peculiar/unexpected features, like non-stellar objects. As mentioned in the previous bullet point, SP_Ace may also fail to converge if the RV correction is too large. We return to the issue of performance later in Section 5.2 when we compare the results of SP_Ace to those from the official LAMOST pipeline LASP.

Figure 2.

Figure 2. Top panel: the distribution of the FWHMs for the blue part (blue solid line) and the red part (red dashed line) of the spectra. Bottom panel: the distribution of radial velocities for the blue (RVb) and the red (RVr) part of the spectra (blue solid and red dashed histograms, respectively). The solid gray histogram represents the distribution of the difference RVb minus RVr. For this panel, we have only plotted data from the first full year of the survey (see Section 3.2 for details).

Standard image High-resolution image
Figure 3.

Figure 3. Top panel: 1:1 comparison between S/N${}_{\mathrm{SPAce}}$ and the LAMOST SNRI (S/N in the I band). Bottom panel: distributions of S/N${}_{\mathrm{SPAce}}$ (blue solid line) and LAMOST SNRI (red dashed line) for the spectra that have results in both catalogs. The contours enclose 34%, 68%, 95%, and 99% of the sample.

Standard image High-resolution image

3.3. Objects Class Discrimination

Although the majority of the LAMOST spectra processed with SP_Ace are stellar objects, a fraction of the sample are non-stellar such as galaxies, quasars, and planetary nebulae. Because SP_Ace has been designed to derive stellar parameters of FGK stars only, it is expected not to converge (and to exit without results) for other objects. According to the LAMOST 1D pipeline's classification (CLASS), 95% of our sample are stars. SP_Ace converges for 56% of these, which is a similar success rate as for the LAMOST pipeline (58%), and most have good S/N (96% have S/N${}_{\mathrm{SPAce}}$ > 20). On the other hand, from the 5% of spectra classified as non-stellar objects, SP_Ace converges for about 1% of them (1331 objects). A by-eye inspection shows that these appear to be mostly bona fide stars, indicating that the LAMOST CLASS classification is not perfect. Note that we are unable to determine the false-positive rate for CLASS, i.e., the fraction of non-stellar spectra that were classified as stars.

4. Validation

To verify the reliability of the LAMOST stellar parameters and chemical abundances obtained with SP_Ace, we compared them against stars with stellar parameters in the literature. For these tests, we restricted ourselves to LAMOST spectra with sufficient S/N to allow for a fair estimation of the chemical abundance, so we only selected stars with S/N${}_{\mathrm{SPAce}}$ > 40. We return to the issue of performance as a function of S/N${}_{\mathrm{SPAce}}$ later in Section 4.4.

4.1. Comparison to APOGEE

APOGEE (Holtzman et al. 2015) is a large spectroscopic survey that has collected over ∼150,000 spectra in the near-infrared with a spectral resolution of ∼22,500. The APOGEE stellar parameters and chemical abundances are derived with the APOGEE Stellar Parameters and Chemical Abundances Pipeline (ASPCAP; García Pérez et al. 2016), which compares the observed spectra to libraries of theoretical spectra and then calibrates the resulting parameters and abundances using an observed calibration sample. Although method and data are different from LAMOST's, the comparison is useful. The complete DR12 APOGEE sample has 34,783 stars in common with LAMOST (45,193 LAMOST spectra, including repeat observations). Out of these spectra, we ignored the 27,021 spectra that are flagged by APOGEE as having possible problems in the spectrum (for instance, the stars may have a bright neighbor that can pollute the spectrum, particularly broad lines or low S/N that can badly affect the parameter derivation).15 This left us with 18,172 stellar spectra that have no reported problems, of which 13,351 are giant stars ($\mathrm{log}g$ < 3.5) with calibrated stellar parameters. We neglect the dwarf stars because APOGEE does not provide reliable parameters for these stars (Holtzman et al. 2015). SP_Ace converged for 10,879 spectra of those giant stars, of which 9189 had S/N${}_{\mathrm{SPAce}}$ above our aforementioned threshold of 40.

The performance of the two catalogs is shown in Figure 4. The top panels show the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane, together with a set of fiducial isochrones by Bressan et al. (2012) corresponding to typical thin-disc, thick-disc, and halo populations. SP_Ace provides a good match to the expected distribution, as demonstrated by the agreement with the isochrones. In the bottom panels of Figure 4, we directly compare the SP_Ace and APOGEE stellar parameters, while a more detailed plot showing the correlations between the parameters is given in Appendix B (Figure 18). Since APOGEE gravities are calibrated using asteroseismic gravities from Kepler, we defer the discussion of the gravity performance to the following section where we directly compare to data from Kepler (Section 4.2). The distributions show some systematic differences (shifts and/or correlations) between APOGEE and SP_Ace parameters. When making this comparison, we should first point out that prior tests of the SP_Ace pipeline have shown a tendency to slightly underestimate [Fe/H], at a level of around −0.05 dex (see Boeche & Grebel 2016, their Figure 17 and accompanying discussion). We find that SP_Ace [Fe/H] and [α/Fe] abundances appear to be underestimated with respect to APOGEE's. However, for both parameters the offset appears to be independent of [Fe/H] or [α/Fe], which means that it is straight-forward to shift the SP_Ace abundances onto the APOGEE scale. A closer look (Figure 18) shows that the underestimation in [α/Fe] has a very weak correlation with the other parameters. [Fe/H], on the other hand, does show stronger correlations (most notably with gravity and temperature), but the offset appears to be independent of [Fe/H] itself.

Figure 4.

Figure 4. An analysis of APOGEE stars with spectra also observed by LAMOST. Top: distributions of the stars with ASPCAP calibrated stellar parameters from the APOGEE spectra (top left panel) and SP_Ace stellar parameters (top right panel) obtained from the LAMOST spectra with S/N${}_{\mathrm{SPAce}}$ > 40. The red, light blue, and blue lines represent isochrones for [M/H] = 0.0 dex (of 5 Gyr age), −1.0 dex and −2.0 dex (10 Gyr age), respectively. Bottom: comparison of the same SP_Ace stellar parameters from the LAMOST spectra to the reference ASPCAP parameters from the APOGEE spectra. The reference iron abundance is [Fe/H] (not [M/H]) from ASPCAP. The contours enclose 34%, 68%, 95%, and 99% of the sample. A complete version of this plot is given in Figure 18 of the appendix.

Standard image High-resolution image

Note that, in the above discussion, we have used the ASPCAP iron abundance [Fe/H], not the total metallicity [M/H] (PARAM_M_H). The latter quantity has been calibrated onto an [Fe/H] scale using literature abundances for star clusters, but the former has not (see Section 5.4.3 of Holtzman et al. 2015). The lack of an external calibration may influence our findings and so we check the comparison between SP_Ace and ASPCAP [M/H] in Figure 5. For completeness, we also include [Fe/H] and [α/Fe] from the BACCHUS pipeline (Hawkins et al. 2016), which provides a complementary abundance analysis using APOGEE stars with asteroseismic gravities from Kepler. For the BACCHUS [α/Fe] we take the average of the Mg, Si, Ca, and Ti abundances. We can see that, when using the ASPCAP [M/H], there exists a noticeable correlation, which may be due to uncertainties in the external calibration of [M/H] at the metal-poor end. The correlation disappears (but an offset remains) when comparing SP_Ace to both ASPCAP [Fe/H] and BACCHUS [Fe/H]. The only differences between BACCHUS and ASPCAP [Fe/H] is a marginal reduction in both the scatter and offset in the former. For [α/Fe], the results are also similar between BACCHUS and ASPCAP, although SP_Ace appears to exhibit a very slight correlation in [α/Fe] compared to BACCHUS.

Figure 5.

Figure 5. Residuals between SP_Ace and a selection of different reference abundances available for the APOGEE data (SP_Ace minus reference value) for stars with S/N${}_{\mathrm{SPAce}}$ > 40. The top panel shows the ASPCAP [M/H] and the middle panel the ASPCAP [Fe/H] and [α/Fe], all of which are described in Holtzman et al. (2015). The lower panel shows [Fe/H] and [α/Fe] from the BACCHUS pipeline (Hawkins et al. 2016). The horizontal axes denote the reference abundance for that particular panel. The contours enclose 34%, 68%, 95%, and 99% of the sample.

Standard image High-resolution image

We include three further plots in Appendix B. Figure 19 shows the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) distributions for stars measured by ASPCAP, BACCHUS, and SP_Ace, divided into bins of [Fe/H]. For each bin, we have over plotted isochrones for the corresponding metallicity. The placement of the SP_Ace red clump (RC) is slightly more consistent with the corresponding isochrones, compared to both APOGEE/ASPCAP and BACCHUS. Furthermore, at lower metallicities ([Fe/H] ≲ −0.4), the SP_Ace red giant branch (RGB) appears to match the isochrones better than ASPCAP.

The [α/Fe] performance is characterized in Figures 20 and 21 of Appendix B. The former shows the distribution of stars in the ([Fe/H], [α/Fe]) plane for SP_Ace and ASPCAP, while the latter shows a one-to-one comparison for bins of [Fe/H]. The low- and high-alpha sequences are correctly estimated by SP_Ace, in that SP_Ace detects a clear shift in alpha between stars from the low- and high-alpha sequences (Figure 21). Although the LAMOST alphas are not as precise as those from APOGEE (e.g., there is no visible gap between the two sequences), the performance of SP_Ace is impressive considering the fact that the LAMOST data have a 10 times lower resolution.

4.2. Comparison with Kepler Stars

Huber et al. (2014) report the properties of ∼20,000 stars observed by the the NASA Kepler mission. While the stellar parameters of most of these stars come from a collection of different catalogs with different observational techniques (photometry, spectroscopy), ∼15,500 of these stars have known oscillations that permit a precise measurement of their surface gravity through asteroseismology. These stars can therefore be used as reference in $\mathrm{log}g$ for comparison purposes. Our LAMOST sample has 6684 stars in common with Huber et al. (2014). Most of these are giant stars, with a small number of sub-giant and turn-off stars. In Figure 6, we show the distribution of these stars in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane along with a direct comparison of the parameters, while in Figure 22, we show the correlations between stellar parameters and the residuals with respect to the reference. In these figures, we see an apparent offset in ${T}_{{\rm{eff}}}$ and a poor match in [Fe/H] between SP_Ace and the reference values. The temperature offset appears to be in agreement with the one found by Huber et al. (2014; top panel of their Figure 7), namely a systematic overestimation in their ${T}_{{\rm{eff}}}$ value due to the limitations of the Kepler Input Catalog (KIC) from which they were obtained. Similarly the metallicity offsets are not important, since the Huber et al. (2014) metallicities, which are mainly derived from the KIC, are known to have been underestimated (e.g., Dong et al. 2014). Furthermore the KIC metallicities show an unnatural discrete distribution, which demonstrates their lack of precision. We do know that Huber's $\mathrm{log}g$ values are extremely accurate (typically 0.03 dex), and the central bottom panel of Figure 6 (or the middle row of panels in Figure 22) shows a very good match with the SP_Ace gravity, albeit with an overestimation of ∼0.2 when $\mathrm{log}g$ ≲ 2 dex.

Figure 6.

Figure 6. Top: distributions in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane of the Kepler stars as given by Huber et al. (left panel) and SP_Ace(right panel), where we have restricted ourselves to stars with S/N${}_{\mathrm{SPAce}}$ > 40. The red, light blue, and blue lines represent the same isochrones as in Figure 4. Bottom: comparison of the SP_Ace stellar parameters to the reference Huber et al. parameters. The contours enclose 34%, 68%, 95%, and 99% of the sample. A complete version of this plot is given in Figure 22.

Standard image High-resolution image
Figure 7.

Figure 7. Top panels: comparison of SP_Ace to seismic gravities from the updated (Stello et al. 2013) sample (see Section 4.2) for 1366 red clump stars for stars with S/N${}_{\mathrm{SPAce}}$ > 40. The left panel shows the distribution of SP_Ace parameters in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane, while the right panel shows the $\mathrm{log}g$ residuals between SP_Ace and Stello. Bottom panels: as above, but for 928 red giant stars.

Standard image High-resolution image

Stello et al. (2013) carried out a further analysis of this Kepler sample to distinguish RC and RGB stars following a method similar to Bedding et al. (2011). The analysis by Stello et al. (2013) included only two years of Kepler data. To refine those results, we repeated their analysis on 3.5 years of Kepler data. The comparison with these updated seismic results is presented in Figure 7. In the left panels we notice that SP_Ace correctly positions both the RC and RGB locus on the high metallicity isochrone. The right panels show that SP_Ace is able to recover accurate gravities for both the RC and RGB samples, although the latter have a slight systematic offset (∼−0.08 dex). It should be stressed that these results are obtained directly from SP_Ace without any need for calibrations.

4.3. Main-sequence Stars: Comparison with the Geneva-Copenhagen and Gaia-ESO Surveys

As the previous sections focused mainly on giant stars, we now utilize two surveys that contain large numbers of main-sequence stars. Our first data set is from the Geneva-Copenhagen Survey (GCS, Nordström et al. 2004), which consists of 16,682 FGK dwarf stars in the immediate solar neighborhood. As reference parameters, we adopt those of Casagrande et al. (2011). In this work, they derived ${T}_{{\rm{eff}}}$ and $\mathrm{log}g$ from photometry using an infrared flux method (IRFM), where the latter quantity also incorporated the Hipparcos parallax, and derived metallicity from Strömgren photometry. Although we cannot expect high precision from this approach, their parameters are still of great help in determining the robustness of our parameters for dwarf stars. We have 413 stars in common with GCS, out of which 233 have S/N${}_{\mathrm{SPAce}}$ > 40 and SP_Ace stellar parameters. Their distributions in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane and a direct comparison are shown in Figure 8, while the full correlations between the parameters are in Figure 23.

Figure 8.

Figure 8. Top: distributions in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane of the GCS stars as given by Casagrande et al. stars (left panel) and SP_Ace (right panel) for stars with S/N${}_{\mathrm{SPAce}}$ > 40. The red, light blue, and blue lines represent the same isochrones as in Figure 4. Bottom: comparison of the SP_Ace stellar parameters to the reference Casagrande et al. parameters. A complete version of this plot is given in Figure 23.

Standard image High-resolution image

Our second data set is from the Gaia-ESO survey (GES, Gilmore et al. 2012), which provides precise stellar parameters from high -resolution spectra obtained with the ESO UVES and Giraffe spectrographs (with a spectral resolution of ∼60,000 and ∼20,000, respectively). Using GES DR3 we have found 390 stars in common with LAMOST that have non-null GES parameters ${T}_{{\rm{eff}}}$, $\mathrm{log}g$, and [Fe/H]. Out of these, 146 have S/N${}_{\mathrm{SPAce}}$ > 40 and SP_Ace stellar parameters. Comparisons between SP_Ace and GES are presented in Figure 10 and in Figure 24.

Figure 9.

Figure 9. Top: density distributions for the discrepancies (SP_Ace minus reference) for data sets dominated by giant stars. The left and middle panels show the comparison of SP_Ace and ASPCAP/APOGEE, while the right panel shows SP_Ace compared with the Kepler stars of Huber et al. The dots with error bars represent the average discrepancy and 1σ of the distribution for each bin in S/N${}_{\mathrm{SPAce}}$, while the red dots represent the average uncertainties (±1σ) computed by adding the SP_Ace and reference errors in quadrature. The gray-scale indicates the density of points per pixel (see bar on right), while the contours enclose 34%, 68%, 95%, and 99% of the sample and all stars have S/N${}_{\mathrm{SPAce}}$ > 40. Bottom: density distributions for the discrepancies between SP_Ace and GES, i.e., for a sample dominated by main-sequence stars. Symbols, gray-scale bar, and contours are as in the top panel.

Standard image High-resolution image
Figure 10.

Figure 10. Top: distributions in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane of the stars as given by the GES stars (left panel) and SP_Ace (right panel) for stars with S/N${}_{\mathrm{SPAce}}$ > 40. The red, light blue, and blue lines represent the same isochrones as in Figure 4. Bottom: comparison of the SP_Ace stellar parameters to the reference GES parameters. A complete version of this plot is given in Figure 24.

Standard image High-resolution image

For both the GCS and GES comparisons, we can see that ${T}_{{\rm{eff}}}$ and $\mathrm{log}g$ match the reference parameters fairly well, with small offsets. However, for the GCS, the iron abundance exhibits a negative offset (i.e., SP_Ace is underestimating [Fe/H] compared to GCS), especially for the hotter, lower-gravity dwarfs (see upper-left and upper-middle panels of Figure 23). A handful of these stars also have significantly underestimated gravities compared to the GCS value. There is weak evidence for a similar metallicity trend in the GES comparison (Figure 24), although the paucity of stars in this region of parameter space (i.e., ${T}_{{\rm{eff}}}$ ≳ 6000 K and 3 ≲ $\mathrm{log}g$ ≲ 4 dex) makes it hard to draw any firm conclusions. From Figure 24, we can see that there is an underestimation in [α/Fe] of ∼0.1 dex when comparing SP_Ace to GES, which is similar to the one seen when comparing SP_Ace to APOGEE (Section 4.1).

4.4. Uncertainties

SP_Ace estimates 1σ uncertainties along with the stellar parameters for each star. It does this by considering the shape of the χ2 hyper-surface and identifying the upper and lower parameter limits for which ${\chi }^{2}={\chi }_{\min }^{2}+{\rm{\Delta }}{\chi }^{2}$, where Δχ2 depends on the number of degrees of freedom (for details, see Section 7.6 in Boeche & Grebel 2016). For a correct validation of these uncertainties, we would need to compare the SP_Ace results to reference parameters having no (or very small) errors, which would allow us to infer the precision from the standard deviation of the discrepancies and the accuracy from the systematic offsets. Unfortunately this is not possible because the reference parameters are also affected by stochastic and systematic errors. However, we can overcome this by adding in quadrature the errors from both SP_Ace and the reference data set, although of course this relies on the assumption that the reference errors are accurately reported.

The comparison between the distribution of the discrepancies (SP_Ace minus reference parameter) and the SP_Ace errors for giant stars is reported in the three top left panels of Figure 9. As expected, the discrepancies become larger as the S/N${}_{\mathrm{SPAce}}$ decreases. The red dots show the expected magnitude of the standard deviation, as estimated by adding in quadrature both the SP_Ace and ASPCAP uncertainties. For most of these parameters, the SP_Ace errors look like they are providing a fair approximation to the observed scatter, with the red dots agreeing with the 1σ limit of the distribution of discrepancies (indicated with black error bars). For ${T}_{{\rm{eff}}}$ and [α/Fe], SP_Ace appears to be overestimating the uncertainties slightly, especially for [α/Fe] in the low S/N regime. On the other hand, it appears to underestimate the errors for [Fe/H]. The magnitude of this underestimation is hard to quantify because the reported errors on the ASPCAP [Fe/H] are exceedingly small (0.03 dex) and so it is unclear which method is underestimating the size of their errors. As an example, if we consider moderately low S/N${}_{\mathrm{SPAce}}$ stars (around 40 ≲ S/N${}_{\mathrm{SPAce}}$ ≲ 70), we find that, in order to bring our estimated errors into agreement with the standard deviation of the discrepancy, we would need to inflate both SP_Ace and ASPCAP errors by around 30%. Repeating this exercise for [α/Fe] indicates that for the same stars we would need to reduce the SP_Ace errors by around 40% in order to match the observed standard deviation.

In the top right panel of Figure 9, we compare the SP_Ace $\mathrm{log}g$ to the Huber et al. values. Because of the high precision of the seismic gravities (with typical uncertainties of 0.03 dex), we expect that the scatter reflects the magnitude of the SP_Ace errors. Apart from a slight overestimation at the low S/N${}_{\mathrm{SPAce}}$ end, the $\mathrm{log}g$ errors provide a good match to the observed scatter. However, if we look at the mean of the discrepancy (given by the black dots) there appears to be a slight correlation with S/N${}_{\mathrm{SPAce}}$; as well as the previously identified offset (see Section 4.2), the SP_Ace $\mathrm{log}g$ appears to suffer from a larger underestimation for lower S/N${}_{\mathrm{SPAce}}$ spectra. At higher S/N${}_{\mathrm{SPAce}}$ (∼100), the offset is barely noticeable, amounting to less than 0.05 dex, but this increases to as much as 0.15 dex at lower S/N${}_{\mathrm{SPAce}}$ (∼30). The same analysis is presented on the bottom panels of Figure 9 for the Gaia-ESO survey stars. This sample is more representative of dwarf stars, since it contains only a small fraction of giants. Although there are also far fewer stars to compare to, and hence the distributions (especially ${T}_{{\rm{eff}}}$) are a little harder to interpret, the trends we saw for giants are replicated. Namely our errors appear to be reliable for ${T}_{{\rm{eff}}}$ and $\mathrm{log}g$, but are slightly over- and underestimated for [Fe/H] and [α/Fe], respectively. The correlation between SP_Ace $\mathrm{log}g$ and S/N${}_{\mathrm{SPAce}}$ that was found for the Huber sample appears to persist for the GES, implying that this is common for dwarf stars as well.

5. The Catalog

Out of the 2,052,662 processed spectra, SP_Ace derived parameters for 1,097,231 spectra. For half of the spectra, SP_Ace did not output any results since it failed to converge for one or more parameters. The main reason for the failure of convergence is the low S/N: for spectra with SNRI < 20 (736,929), ∼80% failed to converge, while for spectra with SNRI > 20 (1,315,733), ∼27% failed to converge. As we will show later (Section 5.2), this problem is not unique to SP_Ace; in fact, the fraction of low S/N spectra that can be processed by SP_Ace is similar to that of the official LAMOST pipeline (LASP). Other reasons for failure can be stars with stellar parameters out of the limits of the GCOG library (e.g., too hot or too cool), too high an RV, or spectra of non-stellar or peculiar objects.

The final results are sorted and described as in Table 1. In this table, we include the parameter [M/H], which has not previously been mentioned. This parameter represents the metallicity of the atmosphere model that was used to compute the GCOGs, which were in turn used to derive the spectrum's parameters. These atmosphere models were taken from ATLAS12 by Castelli & Kurucz 2003) and have [α/Fe] = 0. This means that their nominal metallicity is equal to the iron abundance. Since iron is the main driver of the metallicity, one finds that in our catalog the metallicity [M/H] is usually very close (or equal) to [Fe/H]. To obtain a more general metallicity index that also includes the contribution of the α elements, we suggest using the following formula

Equation (5)

given by Salaris et al. (1993).

Table 1.  Description of the Catalog

Field Name Format Unit Description
1 OBSID integer unique identification number of the spectrum
2 decl. string object designation
3 R.A. float deg object R.A.
4 decl. float deg object decl.
5 obsdate date of observation
6 lmjd integer local modified Julian date
7 planid string plan ID in use
8 spid integer spectrograph ID
9 fiberid integer fiber ID of object
10 RVb float km s−1 radial velocity correction of the blue part of the spectrum
11 RVr float km s−1 radial velocity correction of the red part of the spectrum
12 FWHMb float Å full-width-half-maximum of the blue part of the spectrum
13 FWHMr float Å full-width-half-maximum of the red part of the spectrum
14 S/N float signal-to-noise as computed by SP_Ace
15 ${T}_{{\rm{eff}}}$ float K effective temperature
16 inf float K effective temperature lower limit
17 sup float K effective temperature upper limit
18 $\mathrm{log}g$ float dex gravity
19 inf float dex gravity lower limit
20 sup float dex gravity upper limit
21 [M/H] float dex nominal metallicity of the atmosphere model
22 inf float dex nominal metallicity lower limit
23 sup float dex nominal metallicity upper limit
24 [Fe/H] float dex iron abundance
25 inf float dex iron abundance lower limit
26 sup float dex iron abundance upper limit
27 Nlin integer number of absorption lines used to derive the iron abundance
28 [α/H] float dex α-element abundance
29 inf float dex [α/H] lower limit
30 sup float dex [α/H] upper limit
31 Nlin integer number of absorption lines used to derive [α/H]
32 flag integer spectra with FLAG = 0 must be rejected or treated with extreme caution (Section 5.1)
33 class string LAMOST classification of the object (Section 3.3)
34 z float redshift of object (from the LAMOST pipeline)
35 z_err float redshift error (from the LAMOST pipeline)

Note. The corresponding catalog is publicly available at the LAMOST (http://dr1.lamost.org/doc/vac) and CDS (http://cds.u-strasbg.fr) websites and in its entirety in machine-readable FITS format in the online Journal. A description is shown here for guidance regarding its form and content.

Only a portion of this table is shown here to demonstrate its form and content. A machine-readable version of the full table is available.

Download table as:  DataTypeset image

We remember that our [Fe/H] parameter represents the abundance of all non-α-elements as if they were a single element, to which iron is the major contributor. The total number of absorption lines measured to derive the abundance of the [Fe/H] parameter is reported by the Nlin parameter. Consequently, the abundances will be more robust for spectra with higher values of Nlin. The same applies to the [α/H] parameter, which represents the abundance of the α-elements Mg, Si, Ca, and Ti derived as if they were one single element.

In Figures 11 and 12, we show the distributions of $\mathrm{log}g$ versus ${T}_{{\rm{eff}}}$ and [α/Fe] versus [Fe/H] for our SP_Ace catalog, divided into high and low S/N${}_{\mathrm{SPAce}}$. We here employ S/N${}_{\mathrm{SPAce}}$ = 40 as a discriminator between the stellar parameters with "fair" and "good" accuracy and, as expected, the dispersion of the data observed in these figures reflects this. In Figure 11, the red dotted line delineates a sample of cool dwarf stars that, as they are placed far from the expected isochrones, are most likely subject to a systematic error in gravity. This is discussed in the following section.

Figure 11.

Figure 11. Distributions of SP_Ace parameters (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) for the LAMOST DR1 stars with S/N${}_{\mathrm{SPAce}}$ smaller and larger than 40. The red, light blue, and blue solid lines represent the same isochrones as in Figure 4. The cool dwarfs delineated by the red dotted line appear to be subject to a systematic error in gravity (see Section 5.1). The contours enclose 34%, 68%, 95%, and 99% of the sample.

Standard image High-resolution image
Figure 12.

Figure 12. Distributions of SP_Ace abundances ([Fe/H], [α/Fe]) for the LAMOST DR1 stars with S/N${}_{\mathrm{SPAce}}$ smaller and larger than 40. The contours enclose 34%, 68%, 95%, and 99% of the sample.

Standard image High-resolution image

5.1. Known Systematic Errors

In Section 4, we demonstrated the successful performance of SP_Ace and found that there is good agreement with reference data sets. We also pointed out the presence of some systematic errors. The comparison with the Kepler stars showed that SP_Ace underestimates the gravity (∼−0.2) for giants with $\mathrm{log}g$ ≲ 2, while the comparison with the GCS and GES stars revealed an underestimation of the iron abundance for stars with ${T}_{{\rm{eff}}}$ ≳ 6000 K and 3 ≲ $\mathrm{log}g$ ≲ 4 (see also later in this section). Still, we did not test the performance for cool dwarf stars because of a lack of reference stars in this ${T}_{{\rm{eff}}}$$\mathrm{log}g$ region. As was shown in Figure 11, it appears that SP_Ace underestimates gravity for the cool dwarfs. These have been delineated by the red dotted line in this figure. The cause is unclear, but may be due to one or more of the following reasons: the physics of the adopted 1D atmosphere models is deficient, i.e., does not match the real conditions; the assumption of LTE is invalid; and/or the presence of molecular lines (which are neglected by SP_Ace) are affecting the atmospheres of cool dwarfs. Although this manifests itself in an underestimation in gravity, the problem could propagate into the other parameter estimates as well. Therefore, we recommend that the parameters for these cool dwarfs are not to be trusted. Since we have no way to determine precisely where this systematic problem becomes significant, the red dotted boundary in Figure 11 was determined by eye. These are the stars for which ${T}_{{\rm{eff}}}$ < 4800 K and $\mathrm{log}g$ > 0.0015 · ${T}_{{\rm{eff}}}$−3.25 and they are flagged with FLAG = 0 in the catalog. They represent 3% of the total sample.

Additional potential systematic errors are shown in Figure 13. The left and middle panels appear to show a dependence on the mean [Fe/H] with S/N${}_{\mathrm{SPAce}}$, which has an amplitude of ∼0.1 dex, for both dwarfs (left) and giants (middle). Although this may be due to some feature of the survey selection function (e.g., the distribution of latitude or distance), we have been unable to pinpoint a cause and therefore conclude that this is likely to be a systematic problem inherent to SP_Ace. The right panel of Figure 13 shows our final potential systematic problem, which has already been discussed above in Section 4.3. In that section, we pointed out that hot (${T}_{{\rm{eff}}}$ ≳ 6000 K) dwarfs appear to have underestimated [Fe/H]. The same problem appears to manifest itself in the right panel of Figure 13, where there is a clear deficiency of hot, iron-abundant stars, i.e., for these stars, the iron abundance is being underestimated. Note that the ${T}_{{\rm{eff}}}$ upper limit for SP_Ace is 7400 K, which means that for any spectrum of a higher temperature star, SP_Ace is expected to exit with no result. Moreover, we must remember that the SP_Ace line list was built using five standard stars, the hottest of which is Procyon with ${T}_{{\rm{eff}}}$ = 6554 K. As a consequence, for ${T}_{{\rm{eff}}}$ higher than that of Procyon, the stellar spectra may contain absorption lines that are not included in the line list, which could potentially lead to systematic errors.

Figure 13.

Figure 13. Density distributions of [Fe/H] as a function of S/N${}_{\mathrm{SPAce}}$ for dwarf (left panel) and giant stars (middle panel). The red dots trace the average of the [Fe/H] distribution for each S/N bin. Right panel: density distribution of [Fe/H] as a function of ${T}_{{\rm{eff}}}$ for the whole sample. The contours enclose 34%, 68%, 95%, and 99% of the sample.

Standard image High-resolution image

For the sake of completeness, we also mention a S/N dependent bias against low metallicity stars. The lower the metallicity of a star, the weaker are its absorption lines. This means that, for a star of lower metallicity, the S/N must be higher in order to make the lines identifiable through the noise. If too many lines are not identifiable, the analysis can fail, leading to an underestimation of the number of low metallicity stars. To explain the bias better, in the following, we propose a different point of view of this problem.

We usually refer to the S/N as the ratio between the continuum level and the noise level. However, in measuring the strength16 of an absorption line, we should consider the ratio between the line strength and the noise level, a ratio that we may call "line strength-to-noise" ratio (L/N). For a fixed level of noise, the L/N diminishes with the line strengths. Therefore, for small L/N, the measured strength of the line is more uncertain. If L/N < 3 we can no longer firmly detect the line. SP_Ace selects the lines (to be used to build the spectrum model) as a function of their detectability: if a line (or a collection of blended lines) has an expected maximum strength much smaller than the noise, then it is neglected. It follows that the noise level sets a lower limit to the strength of the detectable absorption lines and those that lie under this limit are not considered anymore. Because the strengths of the lines diminish with the metallicity of the stars, the number of the absorption lines considered diminishes with the metallicity of the star. Eventually, the number of measurable lines becomes too small to derive the stellar parameters and the analysis fails. This causes a bias against low metallicity stars, and this bias is greater for lower S/N spectra. The same bias has a greater effect on hot dwarfs, as opposed to cold giants, because the absorption lines of the former have lower EWs (for a given metallicity). Computing a reliable correction for this bias would require complex modeling and hence we do not tackle the problem here.

5.2. Comparison between SP_Ace and LASP

We now compare the performance of SP_Ace and the official LAMOST pipeline LASP17 (Luo et al. 2015), using two different methods applied to the same spectra. The first issue to consider is the total number of converged spectra. In this aspect, the performance of the two pipelines is similar. The total number of converged spectra is 1,125,722 for LASP and 1,097,010 for SP_Ace. The ability of each pipeline to process low S/N spectra is also similar; if we consider the 506,658 stars with LAMOST SNRI of between 20 and 40, we find that LASP converges for 297,597 while SP_Ace converges for 301,939. Note that not all of these spectra will have unusable parameters; of these 301,939 stars, 70% have S/N${}_{\mathrm{SPAce}}$ > 30 and 38% have S/N${}_{\mathrm{SPAce}}$ > 40. On the top of Figure 14, we compare the LASP and SP_Ace distributions in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane for all DR1 spectra that have S/N${}_{\mathrm{SPAce}}$ > 40 and parameters estimated by both codes. There are some differences, such as (i) for cool dwarf stars SP_Ace derives $\mathrm{log}g$ values that are systematically too low (this is clearly detectable by eye at ${T}_{{\rm{eff}}}$ < 4800 K, but it extends, at a lesser degree, to higher ${T}_{{\rm{eff}}}$), (ii) for ${T}_{{\rm{eff}}}$ ≳ 5600 K, the LASP gravity distribution unnaturally peaks at log g ∼ 4.2, and (iii) most of the LASP RGB stars do not have $\mathrm{log}g$ values lower than ∼2, while the SP_Ace gravities extend to lower values. Besides, the parameters from LASP closely follow the isochrones, even for very low S/N where we would expect the errors to be larger (see bottom panels of Figure 14). This is due to one of the LASP features, i.e., stars are placed on areas of the parameter space where the standard stars (which are employed as templates) lie. If there are no templates for a particular location in parameter space (for instance on the horizontal branch or far from the isochrones), then the LASP pipeline is unable to place a star there. As a consequence the LASP pipeline renders a more good-looking distribution, but it can also create hidden systematic errors for those stars that are not represented in the template sample. For low S/N spectra, the LASP parameters lack the natural dispersion expected for such spectra. Figure 15 shows a detailed comparison of the two approaches. They agree fairly well for ${T}_{{\rm{eff}}}$ and $\mathrm{log}g$ although for cool dwarfs we find the systematic errors already discussed in Section 5.1. The good general agreement between the two [Fe/H] values is shown in the same figure, together with the known systematic offsets for both extremes of the temperature range for dwarf stars (over- and underestimated for ${T}_{{\rm{eff}}}$ < 5500 and ${T}_{{\rm{eff}}}$ > 6500 K, respectively).

Figure 14.

Figure 14. Density distribution of the LAMOST spectra in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane for LASP (left panel) and SP_Ace (right panel) for stars with S/N${}_{\mathrm{SPAce}}$ > 40 (top) and S/N${}_{\mathrm{SPAce}}$ < 20 (bottom). The red, light blue, and blue lines represent the same isochrones as in Figure 4. The red dotted line highlights the region where the stars have systematically underestimated gravities. This is discussed in Section 5.1.

Standard image High-resolution image
Figure 15.

Figure 15. Residuals (SP_Ace minus LASP) of the stellar parameters with respect to the LASP values for stars with S/N${}_{\mathrm{SPAce}}$ > 40.

Standard image High-resolution image

5.3. Comparison between SP_Ace and The Cannon

Recently Ho et al. (2017b) used The Cannon (Ness et al. 2015) to derive stellar parameters and abundances of the alpha elements and the individual elements C and N of giant stars from the LAMOST DR2 internal data release. The Cannon has been trained on a set of spectra of objects that LAMOST has in common with APOGEE. In their work, Ho et al. demonstrated the consistency between their stellar parameters and those obtained by APOGEE from high-resolution spectra. As previously done with the LASP pipeline, we now compare the Ho et al. results with the ones obtained with SP_Ace. Out of the 454,450 spectra considered by Ho et al., we have in common 219,360 spectra belonging to LAMOST DR1.

The distribution in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane of the spectra with S/N${}_{\mathrm{SPAce}}$ > 40 is shown on the top of Figure 16. The Cannon places RC stars in a similar position to APOGEE (Figure 4), although the former are located at slightly higher gravity, in a position more suitable for RGB stars. We investigate this further using the asteroseismic $\mathrm{log}g$ values from Huber et al. (2014) for the two samples of RC and RGB stars classified by Stello et al. (which were introduced in Section 4.2, see Figure 7). From its high-resolution spectroscopy, APOGEE finds offsets of 0.14 and −0.03 dex for the RC and RGB, respectively (from samples of 1045 and 661 stars, respectively). These results are to be expected as APOGEE gravities are calibrated onto the asteroseismic RGB sample of Pinsonneault et al. (2014), hence the good agreement for the RGB and systematic overestimation for the RC (see Figure 4 of Holtzman et al. 2015). If we now consider the low-resolution LAMOST spectra as analyzed by The Cannon, we find that its performance is subject to larger systematic errors with offsets of 0.31 and 0.15 dex, respectively (from samples of 265 and 112 stars, respectively). This can be compared to the performance of SP_Ace (Figure 7), which has offsets of only −0.01 and −0.08 dex, respectively (from 1366 and 928 stars, respectively). The scatter for SP_Ace is marginally larger than for The Cannon (0.13 versus 0.10 dex), but the systematic errors are significantly less. The fact that the systematic errors are smaller than even the high-resolution data from APOGEE demonstrates the efficacy of SP_Ace for recovering accurate gravities from LAMOST spectra. Note that other machine learning methods have had more success at fitting gravities (e.g., Liu et al. 2015), but a comprehensive comparison of all pipelines is beyond the scope of this study.

Figure 16.

Figure 16. Density distribution of the LAMOST spectra in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane for The Cannon (left panel) and SP_Ace (right panel) for stars with S/N${}_{\mathrm{SPAce}}$ > 40 (top) and S/N${}_{\mathrm{SPAce}}$ < 20 (bottom). The red, light blue, and blue lines represent the same isochrones as in Figure 4.

Standard image High-resolution image

The correlations between the residuals SP_Ace minus The Cannon and the The Cannon parameters, as shown in Figure 17, are similar to those obtained for the APOGEE data (Figure 18). This is not surprising since The Cannon has been trained with a set of spectra labeled with the APOGEE data. The same systematics are observed in gravity, [Fe/H], and $\mathrm{log}g$. However, there is a group of stars for which SP_Ace assigns a higher ${T}_{{\rm{eff}}}$ than The Cannon (stars at The Cannon log g ∼ 3.5) and the same stars also have higher SP_Ace $\mathrm{log}g$. The Cannon locates these stars on the RGB while SP_Ace classifies them as sub-giants. Unfortunately, in this region of parameter space there are too few stars with asteroseismic gravities to clarify the origin of this offset.

Figure 17.

Figure 17. Residuals (SP_Ace minus The Cannon) of the stellar parameters with respect to the The Cannon values for stars with S/N${}_{\mathrm{SPAce}}$ > 40.

Standard image High-resolution image

On the bottom of Figure 16, we report the distributions of low S/N${}_{\mathrm{SPAce}}$ stars in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$) plane for The Cannon and SP_Ace. While, at low S/N, the SP_Ace parameters show a scatter around the isochrones (as expected), the lack of dispersion shown by The Cannon is similar to the LASP pipeline. We make the same comment as before, namely that codes relying on training sets of spectra that do not uniformly cover the parameter space tend to assign stellar parameters to regions covered by the training sets. This means that they lack the natural scatter due to the uncertainties coming from working at low S/N. This aspect should be further investigated to establish the presence (or lack) of possible systematic errors as a function of the S/N, particularly in regions of parameter space that are not covered by stars in the training set.

6. Conclusions

We have used the code SP_Ace to derive stellar parameters for 1,097,231 stellar objects from the LAMOST DR1 catalog. In addition to the parameters ${T}_{{\rm{eff}}}$, $\mathrm{log}g$, and [Fe/H] (which are also given by the LAMOST LASP pipeline), we have also derived the alpha abundance [α/H]. By comparing our results to high precision parameters for stars from surveys such as APOGEE, the Gaia-ESO survey, the GCS, and the Kepler mission, we have confirmed the ability of SP_Ace to derive stellar parameters and chemical abundances for FGK stars. This has also allowed us to demonstrate the robustness of our results. We have highlighted the presence of some systematic errors in our results, such as the overestimation of $\mathrm{log}g$ for cold dwarfs, an underestimation of gravity for $\mathrm{log}g$ ≲ 2, and a bias against hot metal-rich dwarfs. We have also shown that the SP_Ace error estimates look reliable when compared to the residuals between our results and these reference parameters.

We compared our results to other pipelines that have been used on LAMOST spectra, namely the LASP and The Cannon pipelines. The comparison between the three pipelines can be summarized as follows:

  • 1.  
    SP_Ace shows a systematic error in $\mathrm{log}g$ for dwarf stars cooler than ${T}_{{\rm{eff}}}$ < 4800 K while LASP and The Cannon perform well in this parameter region.
  • 2.  
    LASP shows an excess of stars with $\mathrm{log}g$ ∼ 4 that is not expected and not seen in the SP_Ace results.
  • 3.  
    The LASP results are biased against giant stars with $\mathrm{log}g$ < 2, which is unexpected as these stars are detected by SP_Ace and The Cannon.
  • 4.  
    The position of the RC stars is in good agreement with the isochrones for LASP and SP_Ace, while it is overestimated in $\mathrm{log}g$ by The Cannon (very likely due to the APOGEE training set used by The Cannon, which shows the same systematic problems).
  • 5.  
    At low S/N, SP_Ace's stellar parameters show the natural dispersion expected from spectra that hold little or no information. The other two pipelines follow the isochrones much more closely, which is a consequence of how they estimate parameters (i.e., both techniques use of training sets whose distribution closely match the isochrones). This artificial lack of dispersion is worrying and the results from these pipelines require further investigation, in order to check whether this behavior introduces any systematic errors.

The catalog presented here is publicly available at the LAMOST18 and CDS19 websites, and a machine-readable table can also be found in the electronic journal article. In the future, we would like to extend the catalog to more recent (and larger) LAMOST data releases, using a new version of SP_Ace designed to overcome the limitations identified in this current study.

This work was supported by the National Key Basic Research Program of China "973 Program" 2014 CB845700 & CB845702, and by the National Natural Science Foundation of China (NSFC) under grants 11673083 (PI: M.C. Smith), 11333003 (PI: Shude Mao) & 11373054 (PI: L. Chen).

C.B. and E.K.G. were supported by Sonderforschungsbereich SFB 881 "The Milky Way System" (sub-project A5) of the German Research Foundation (DFG). D.S. is the recipient of an Australian Research Council Future Fellowship (project number FT1400147).

Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope, LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

Appendix A: Identified Problems in SP_Ace

During the progress of this work, we found one bug in the LAMOST version of SP_Ace that affects the mechanism with which SP_Ace aborts the analysis of spectra with large RV. This involves the offset limit in RV correction beyond which SP_Ace stops, and affects the LAMOST version only (the official public version of SP_Ace does not have this bug). Since the LAMOST version of SP_Ace handles both the blue and the red part of the spectrum, SP_Ace stops when one or both parts show a wavelength offset (due to the RV) larger than 1.27 FWHM. For this comparison, the source code translates the RV into Å at the wavelength of 5500 Å for both blue and red parts. This is a mistake because, in this way, the blue and red RV limits at which SP_Ace stops are different, with the blue part giving the more stringent limit (which is ∼±200 km s−1 for an average FWHMb ∼ 2.9 Å). However it must be stressed that, even if this bug were not present, the blue and red RV limits can never be constant because the limit depends on the FWHMb and FWHMr, which are not equal within one spectrum, nor among different spectra. This does not affect the robustness of any of the parameters derived by SP_Ace but it can create a bias against stars with high RV (with respect to the LSR). This will be corrected in our next version of LAMOST SP_Ace.

Appendix B: Validation Plots

For the sake of completeness, we include additional plots illustrating the comparison between SP_Ace and the reference data sets described in Section 4. These plots show the detailed correlations between parameters and are helpful in identifying potential problems with SP_Ace parameters.

Figure 18.

Figure 18. Residuals (SP_Ace minus ASPCAP/APOGEE) of the stellar parameters with respect to the reference parameters for stars with S/N${}_{\mathrm{SPAce}}$ > 40. The solid black lines show the median of the residuals as a function of the reference stellar parameter, while the upper and lower dashed lines denote the interval holding 68% of the sample. These have been constructed using bins of width 200 K in ${T}_{{\rm{eff}}}$, 0.5 in $\mathrm{log}g$, 0.1 dex in [M/H] and [α/Fe].

Standard image High-resolution image
Figure 19.

Figure 19. Distribution of stars in the (${T}_{{\rm{eff}}}$, $\mathrm{log}g$)-plane for the APOGEE/ASPCAP, BACCHUS, and SP_Ace samples, divided into bins of [Fe/H] with half-width of 0.1 dex. The average [Fe/H] of the bins increases from top to bottom. The dotted black, blue dashed, and solid red lines are Bressan et al. isochrones of 1, 5, and 10 Gyr, respectively, with metallicity corresponding to the [Fe/H] bin. The stars included here have S/N${}_{\mathrm{SPAce}}$ > 40. The gray-scale indicates the relative density of stars and the contours enclose 34%, 68%, 95%, and 99% of the sample.

Standard image High-resolution image
Figure 20.

Figure 20. [α/Fe] distributions for APOGEE stars with spectra also observed by LAMOST (and for which S/N${}_{\mathrm{SPAce}}$ > 40). The top panel shows the ASPCAP parameters from the APOGEE spectra, while the bottom panel shows the SP_Ace parameters from the LAMOST spectra.

Standard image High-resolution image
Figure 21.

Figure 21. Comparison between [α/Fe] derived by ASPCAP and SP_Ace, divided in ASPCAP [Fe/H] bins, for stars with S/N${}_{\mathrm{SPAce}}$ > 40.

Standard image High-resolution image
Figure 22.

Figure 22. Residuals (SP_Ace minus the Kepler stars of Huber et al.) of the stellar parameters with respect to the reference parameters for stars with S/N${}_{\mathrm{SPAce}}$ > 40. Black and dashed lines as in Figure 18.

Standard image High-resolution image
Figure 23.

Figure 23. Residuals (SP_Ace minus Casagrande et al.) of the stellar parameters with respect to the reference parameters for stars with S/N${}_{\mathrm{SPAce}}$ > 40. Black and dashed lines as in Figure 18.

Standard image High-resolution image
Figure 24.

Figure 24. Residuals (SP_Ace minus GES) of the stellar parameters with respect to the reference parameters for stars with S/N${}_{\mathrm{SPAce}}$ > 40. Black and dashed lines as in Figure 18.

Standard image High-resolution image

Footnotes

  • Image Reduction and Analysis Facility, http://iraf.noao.edu.

  • 10 

    We classify pixels as belonging to the Hα line if they are within 3σ of the center of the line, where we assume that the line profile is approximately Gaussian (i.e., 1 FWHM = 2.35σ).

  • 11 

    Other α elements, such as O and S, are not considered here because these are among the elements whose abundances derived by SP_Ace are (to date) considered not reliable.

  • 12 

    While this work was in progress, we discovered a bug in the code that can cause the limit of the RV correction to change for different spectra. This is discussed in Appendix A. As a consequence of this bug, some spectra will fail to converge, but the parameters reported in the catalog for the converged stars are unaffected.

  • 13 

    We found that there is a problem with LAMOST's RV calibration for the red part of the spectra prior to MJD = 55,945 and so here we only plot data from the first full year of the survey, i.e., from 2012 September 29. Note that the stellar parameters should be unaffected for these early spectra.

  • 14 

    Throughout this paper, we use the term S/N to refer to the signal-to-noise as general meaning and the term S/N${}_{\mathrm{SPAce}}$ to refer to the signal-to-noise derived by SP_Ace. When we refer to the LAMOST signal-to-noise we use the quantity SNRI, which is calculated in the I band.

  • 15 

    In our APOGEE sample, all the spectra having [Fe/H] < −0.6 dex were flagged as problematic by Holtzman et al. during their abundance estimation. Since we wish to retain the comparison with these stars, we neglected such flags.

  • 16 

    Here the term "strength" refers to the intensity of the absorption line. This can be also taken to mean as the ratio between the depth of the line over the continuum level and a proxy of the EW.

  • 17 

    Here we adopt the parameters derived using the DR3 version of LASP.

  • 18 
  • 19 
Please wait… references are loading.
10.3847/1538-3881/aab5af