Double- and Triple-line Spectroscopic Candidates in the LAMOST Medium-Resolution Spectroscopic Survey

, , , , , and

Published 2021 September 24 © 2021. The American Astronomical Society. All rights reserved.
, , Citation Chun-qian Li et al 2021 ApJS 256 31 DOI 10.3847/1538-4365/ac22a8

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0067-0049/256/2/31

Abstract

The LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS) provides an unprecedented opportunity for detecting multiline spectroscopic systems. Based on the cross correlation function and successive derivatives, we search for spectroscopic binaries and triples and derive their radial velocities (RVs) from the LAMOST-MRS spectra. A Monte Carlo simulation is adopted to estimate the RV uncertainties. After examining over 1.3 million LAMOST DR7 MRS blue-arm spectra, we obtain 3133 spectroscopic binary (SB) and 132 spectroscopic triple (ST) candidates, which account for 1.2% of the LAMOST-MRS stars. Over 95% of the candidates are newly discovered. It is found that all of the ST candidates are on the main sequence, while around 10% of the SB candidates may have one or two components on the red giant branch.

Export citation and abstract BibTeX RIS

1. Introduction

Since approximately half of the stars in our Galaxy are in double, triple, or high-order systems (Raghavan et al. 2010), multiple-star systems play an essential role in astrophysics, especially binary systems. The characterization of multiple-star systems, e.g., orbital parameters can be investigated by combining spectroscopic and photometric information.

Spectroscopic multiple-star systems can be classified according to the number of stellar components in the spectra because their spectral lines split due to the different radial velocities (RVs). A target can be considered as a spectroscopic binary (SB) or spectroscopic triple (ST) candidate if it has a double- or triple-line spectrum. The single-line SBs can also be identified as their RVs are variable.

Catalogs of spectroscopic multiple-star systems that have been published include ${{\rm{S}}}_{{{\rm{B}}}^{9}}$, the recent version of the ninth catalog of SB orbits that includes more than 4004 SBs, around one third of them double-line systems (Pourbaix et al. 2004), and the Geneva–Copenhagen Survey Catalogue that presents 3223 SBs from 16,682 nearby F and G dwarf stars (Nordström et al. 2004; Holmberg et al. 2009). Recent spectroscopic surveys providing many spectra that have substantially expanded the number of spectroscopic multiple-star systems include the Radial Velocity Experiment (RAVE) survey (Matijevič et al. 2010) that has found 123 SBs, the Gaia-ESO survey (Merle et al. 2017) that has detected 342 SBs, 11 STs, and even 1 quadruple-line candidate, the Apache Point Observatory Galaxy Evolution Experiment (APOGEE) survey that has discovered more than 3000 binaries (Fernandez et al. 2017; El-Badry et al. 2018), and the Galactic Archaeology with HERMES (GALAH) survey that has derived 12,760 FGK SBs with stellar properties (Traven et al. 2020).

The Large Sky Area Multi-Object fiber Spectroscopic Telescope (LAMOST) is a 4 m Schmidt telescope with a 5° field of view, and is equipped with 4000 fibers (Zhao et al. 2006, 2012; Cui et al. 2012; Luo et al. 2015). The medium-resolution spectroscopic (MRS) survey includes both blue and red arms, and the wavelength coverage of them are 4950–5350 Å and 6300–6800 Å, respectively. The resolution power of MRS is 7500. There are more absorption lines in the blue arm than in the red one, which enables us to measure the RVs with a precision of 1 km s−1 for most of the stars (Liu et al. 2019). The efficiency of observations and the achieved precision makes LAMOST-MRS an exceptional database for measuring RVs and detecting multiple-star systems.

The purpose of this work is to detect the multiple-line candidates (SBs and STs) from the LAMOST-MRS database following the method of Merle et al. (2017). This paper is organized as follows. In Section 2, we describe the LAMOST-MRS spectral data set. The methods used to normalize spectra, calculate cross correlation functions (CCFs), and detect peak positions in CCFs are introduced in Section 3, and a catalog that contains SB and ST candidates detected from LAMOST-MRS is presented in Section 4. In Section 5, we discuss the detection efficiency, the stellar parameters of the SB and ST candidates, RV differences, and caveats, and the conclusions are given in Section 6.

2. Data Selection

The LAMOST-MRS test observation started in 2017 September, and the LAMOST-MRS survey began in 2018 October. The first- and second-year MRS data has been released as LAMOST Data Release 7 5 (DR7). The wavelength ranges of the LAMOST-MRS blue and red arms are 4950–5350 Å and 6300–6800 Å. Up to 2019 April 11, the LAMOST-MRS survey obtained 5,369,891 spectra of 759,886 objects. Only the spectra of blue arms have been chosen to detect multiple-line candidates and measure their RVs, as there are more absorption lines in the blue arms. The distributions of the signal-to-noise ratio (S/N) of the blue-arm spectra versus the Sloan Digital Sky Survey (SDSS) g magnitude of the first-year survey and the Gaia DR2 G magnitude of the second-year survey are displayed in Figure 1. It can be seen that most of the objects have g or G magnitudes between 10 and 15. It is found that a spectrum with low S/N will lead to challenges for detecting reliable SBs and STs (Fernandez et al. 2017). Therefore, the blue-arm spectra with an S/N higher than 10 have been selected. In addition to S/N, the header keywords objtype and fibermas have also been taken into consideration. Only these spectra with an objtype of star and fibermas of 0 have been chosen, because a header keyword fibermas that is not 0 indicates that this fiber may have problems. Finally, 1,383,831 spectra of 281,437 stars were selected.

Figure 1.

Figure 1. The distributions of S/N vs. SDSS g magnitude of the first-year survey (upper panel) and the Gaia DR2 G magnitude of the second-year survey (lower panel).

Standard image High-resolution image

Figure 2 shows the number of exposures for these stars; most of them have been observed three times because the LAMOST-MRS no-time-domain observation takes three consecutive exposures (Liu et al. 2020). The number and cumulative distribution function versus S/N of all the selected spectra are presented in Figure 3.

Figure 2.

Figure 2. Number of selected stars vs. number of exposures. The number of stars is in a logarithmic scale.

Standard image High-resolution image
Figure 3.

Figure 3. Number and cumulative distribution function diagram of S/N of all the selected spectra.

Standard image High-resolution image

3. Method

Based on the Gaia-ESO survey, Merle et al. (2017) have developed a method to search for multiline spectroscopic candidates. Their method utilizes the Gaussian kernel to smooth the successive derivatives of the CCF. Following their method, we detect SB and ST candidates and derive their RVs from the LAMOST-MRS spectra. A Monte Carlo (MC) simulation is adopted to estimate the RV uncertainties for each spectrum.

3.1. Calculating the CCF

The CCF is usually used to measure RVs and to find multiple-line spectra. We use Equation (1) to calculate the values of the normalized CCF according to the normalized correlation coefficient (Gubner 2006). The range of the normalized CCFs is between −1 and +1, with +1 indicating a perfect correlation and −1 a perfect anticorrelation.

Equation (1)

Here, Oi and Ti,v are the normalized flux of the observed and template spectra at a same wavelength-sampling point i, and the RV value of the template is located at v. $\overline{O}$ and σO are the mean and scatter of the flux values of the observed spectrum, while $\overline{T}$ and σT are the mean and scatter of the flux values for the template. In this work, we generate a set of template spectra with a series of RVs based on the normalized solar spectrum (Kurucz et al. 1984). The RV variations are from −500 to +500 km s−1 with a step of 1 km s−1. To decrease the sampling points, we reduce the resolution of the templates to 100,000.

It is noted that the peaks in some CCFs are too low to be detected, therefore, these spectra with a maximum CCF value less than 0.2 and/or a difference between the maximum and minimum values of CCF less than 0.1 have been not considered. About 80,000 spectra have been excluded by these criteria.

The spectra need to be normalized, and we apply a general normalization procedure to all the selected spectra. The normalization procedure is as follows:

  • 1.  
    splitting each spectrum evenly into 10 bins by wavelength;
  • 2.  
    quadratic spline interpolation with the median flux values of the 10 bins;
  • 3.  
    masking the cosmic rays, absorption, and emission lines.

As the cosmic rays and emission lines will lead to negative values of the CCF, they need to be masked during normalization. Thus, we calculate the standard deviation (σ) of the residual between the observed spectrum and the continuum, and mask these points where the residuals are higher than +5σ.

The whole process is iterated three times to improve the continuum. The left panel of Figure 4 shows an example: the normalized spectrum of HD 47775.

Figure 4.

Figure 4. The normalized LAMOST-MRS spectra (left) and CCF diagrams (right) of HD 47775.

Standard image High-resolution image

3.2. Detecting the Number of RV Components and Determining their RVs

To obtain the number of RV components and their values, Merle et al. (2017) have designed a detection of extrema code to identify the peaks in CCFs. In their Figure 1, Merle et al. (2017) presented simulated CCFs and their derivatives, and there is a peak in the CCF diagram, where 72 km s−1 is the value of RV. By detecting where the first derivative passes zero in the declining phase or the third derivative passes zero in the ascending phase, it is possible to identify the RV components of the spectrum and calculate their RV values.

We note that small bulges (CCF values are around zero) can also be identified as peaks for the first or third derivatives, for instance, in the right panel of Figure 5. The reason is that there is more noise in the higher-order derivatives for a discrete CCF. To avoid the impact of these spurious peaks, it is necessary to select the RV range and smooth the derivatives. Therefore, the Gaussian filter function Gaussian_filter1d of the scipy.ndimage package (Virtanen et al. 2020) in Python is adopted to smooth the derivatives.

Figure 5.

Figure 5. CCFs and derivatives of two SB and one ST candidates. The gray dashed lines are the original CCFs and derivatives; all of the selected range of CCFs and smoothed derivatives are drawn with black solid lines. The red horizontal lines are the thresholds for each candidate, and the black vertical lines are their RVs.

Standard image High-resolution image

Two thresholds are used to select the RV range: the first one is the percentile of the CCF, and the other one is the percentile of the smoothed second derivative. Only the ranges where the CCF values are higher than the 75th percentile and the smoothed second derivative values are lower than the sixth percentile have been selected, as shown in Figure 5. If the selected ranges are separate, some triple-line spectra would be identified as double-line spectra (see the right panel of Figure 5). Thus, the middle part between the separate ranges is also selected.

As shown in the middle panel of Figure 5, the two peaks are strongly blended in the CCF diagram, and only one peak can be detected by the first derivative. Meanwhile, there are still two peaks that can be detected from the third derivative, therefore, we derive the RVs from the third derivatives. A linear fit using the two near zero-points of the ascending third derivative is adopted to calculate the RV value.

An appropriate standard deviation (σ) of the Gaussian kernel is important for smoothing, and we choose an initial σ = 13 km s−1 for the LAMOST-MRS spectra. However, this value is too small for some cases, and it may lead to an extra spurious RV component as shown in the left panel of Figure 6. In this case, we increase the σ by 1 km s−1 until the number of RV components detected with the third derivative is equal to the number of valleys in the second derivative, and the σ has been increased from 13 to 20 km s−1 (see the right panel of Figure 6).

Figure 6.

Figure 6. The example of finding an appropriate σ of Gaussian smoothing.

Standard image High-resolution image

3.3. Uncertainties

An MC simulation is applied to estimate the uncertainties of our RV measurements. The main idea is to generate a set of simulated spectra based on each observed spectrum, and derive their RV values from the CCFs. For each RV component, we take the standard deviation of these RVs as its RV uncertainty.

For each observed spectrum, we generate 100 simulated spectra. For a simulated spectrum, the flux at each wavelength point is a random value generated from a Gaussian distribution, and the mean and variance of the Gaussian distribution are the flux and flux error of the corresponding observed spectrum. The CCFs of an observed spectrum and the 100 simulated spectra are presented in Figure 7. We notice that the amplitudes of the simulated CCFs are lower than that of the observed one because the process of generating simulated spectra introduces additional error, therefore, the S/Ns of the simulated spectra are lower than that of the observed spectrum.

Figure 7.

Figure 7. Original CCF of a double-line spectrum and MC simulation ones.

Standard image High-resolution image

It should be noted that if the number of RV components detected from the simulated spectrum is different from the observed one, this spectrum is discarded.

Figure 8 shows the distribution of S/N and RV uncertainties of all the detected SB candidates. The scatter of the RV errors (∼1 km s−1) is in line with the value of Liu et al. (2019) for LAMOST-MRS spectra.

Figure 8.

Figure 8. The S/N of spectra vs. the RV uncertainties of SB candidates. Red and blue dots are the RV of the left and right peaks of the CCF.

Standard image High-resolution image

4. Results

After examining the LAMOST-MRS spectra, we detect 3133 SB and 132 ST candidates, which account for 1.2% of the LAMOST-MRS stars.

4.1. SB and ST Candidates and RVs

There are 14,647 double- and 1065 triple-line spectra that have been detected with this method, and 11,648 double- and 388 triple-line spectra are confirmed after checking the CCF diagrams by eye. The proportions are 80% and 36% for the double- and triple-line spectra, respectively.

We present all the SB and ST candidates and their RV values and uncertainties in Table 1, including the celestial coordinates, source id from Gaia Early Data Release 3 (Gaia Collaboration et al. 2021), Gaia G magnitude, and the number of exposures. Four items of spectral information from the LAMOST data release (the LAMOST-MRS plan name (planID), spectrograph ID (spID), fiber ID, and local modified Julian minute (LMJM)) are also presented. It needs to be pointed out that the RV values rv1, rv2, and rv3 are arranged in order of values as it is difficult to identify which RV component corresponds to each star when they have a similar spectral type.

Table 1. The Information of SB, ST Candidates and their RVs

R.A.(2000)Decl.(2000)Gaia Source id G (mag)SB/ST ${N}_{\mathrm{Exp}}$ planIDspIDfiberIDLMJMS/Nrv1 (km s−1)rv2 (km s−1)rv3 (km s−1)
0.0194857.4892342258994545586649612.1SB1HIP117842019628360488117−36.7 ± 0.720.7 ± 0.7
0.1349063.8728543163050802410739212.7SB1NGC77880112278365076215−57.0 ± 3.339.5 ± 15.1
0.2383040.43928288198404844807526412.4SB1HIP117769016178360914314−58.5 ± 0.8−6.9 ± 0.5
0.2912558.6955342276864615097369612.2SB3NGC778901122408364785915−9.3 ± 5.562.6 ± 1.6
0.2912558.6955342276864615097369612.2SB3NGC778901122408364787314−11.0 ± 5.663.7 ± 7.8
0.2912558.6955342276864615097369612.2SB3NGC778901122408364788616−19.2 ± 7.559.8 ± 8.5
0.3476741.11409288222586799145305612.2SB9HIP117769016398360480521−95.4 ± 0.432.4 ± 0.7
0.3476741.11409288222586799145305612.2SB9HIP117769016398360481820−96.0 ± 0.331.0 ± 0.8
0.3476741.11409288222586799145305612.2SB9HIP117769016398360483222−94.6 ± 0.330.9 ± 0.7
0.3476741.11409288222586799145305612.2SB9HIP117769016398360770427−85.3 ± 0.69.2 ± 0.4
5.3491858.2883742213285355379481612.9ST3NT002740N583314C0231878419645249−72.7 ± 1.5−0.4 ± 1.561.1 ± 2.2
5.3491858.2883742213285355379481612.9ST3NT002740N583314C0231878419647647−69.6 ± 1.9−2.2 ± 1.058.6 ± 2.0
5.3491858.2883742213285355379481612.9ST3NT002740N583314C0231878419649949−74.7 ± 1.4−1.4 ± 1.163.1 ± 2.4
17.518452.00869253863079070844531210.7ST4TD010605N031628K0171138411899158−20.9 ± 0.563.4 ± 0.4
17.518452.00869253863079070844531210.7ST4TD010605N031628K0171138411901450−20.3 ± 0.664.6 ± 0.5
17.518452.00869253863079070844531210.7ST4TD010605N031628K0171138411903839−18.5 ± 0.665.7 ± 0.7
17.518452.00869253863079070844531210.7ST4TD010605N031628K0171138416638346−65.7 ± 0.5−15.6 ± 0.479.5 ± 0.5
18.1640745.7822240086775321347174411.9ST4TD012220N453143T0114688415635315−97.8 ± 2.10.2 ± 3.1
18.1640745.7822240086775321347174411.9ST4TD012220N453143T0114688415637714−97.3 ± 2.3−0.5 ± 3.6
18.1640745.7822240086775321347174411.9ST4TD012220N453143T0114688415640012−95.3 ± 1.63.2 ± 1.5
18.1640745.7822240086775321347174411.9ST4TD012220N453143T0114688415642311−96.3 ± 2.2−46.4 ± 3.50.3 ± 2.2

Note. The RV values rv1, rv2, and rv3 are arranged in order of values. Forty-one ST candidates show both double- and triple-line spectra.

Download table as:  ASCIITypeset image

We consider a star to be an ST candidate as long as one of its spectra has been detected with three RV components. Accordingly, we classified 41 stars with both double- and triple-line spectra as ST candidates. The RV variations indicate that they are probably physical binaries or triples. Further checking of the RV variations of these candidates will help us to identify more physical multiple-star systems.

Figure 9 shows the distribution of the number of exposures of the detected candidates. There are 525 SB and 31 ST candidates with more than six observations, these need to be investigated as their orbital parameters may be derived by RV curves.

Figure 9.

Figure 9. Number of detected SB and ST candidates vs. number of exposures. The number of stars is in a logarithmic scale.

Standard image High-resolution image

The detected RV difference limit is around 50 km s−1 for all of the SB and ST candidates (Figure 10), which meets the resolution of the LAMOST-MRS spectra. The largest RV differences are around 250 km s−1 and 300 km s−1 for SB and ST candidates, respectively.

Figure 10.

Figure 10. The number distribution of the RV difference of SB and ST spectra.

Standard image High-resolution image

In order to improve the detection efficiency, an artificial method is needed. Among machine-learning and deep-learning (DL) techniques, a recurrent neural network (RNN) is proven to perform well for this classification (Jamal & Bloom 2020). To test its feasibility, we designed an RNN to derive the peak numbers from the CCF diagrams. Our RNN is trained by the CCF diagrams of the confirmed SB and ST candidates, and the details of the RNN can be found in Appendix A. The total classification accuracy of our RNN is 0.96, which is a significant improvement compared to the traditional method. Therefore, we will use the RNN to detect SB and ST candidates in a future study.

4.2. New Binary Candidates

We crossmatch our LAMOST-MRS SB and ST candidates with these of the other binary catalogs, including ${{\rm{S}}}_{{{\rm{B}}}^{9}}$ (Pourbaix et al. 2004), the Geneva–Copenhagen Survey of the Solar Neighborhood III (Holmberg et al. 2009), the RAVE SB2 (Matijevič et al. 2010), the Gaia-ESO multiline SB (Merle et al. 2017), the Washington Visual Double Star (WDS; Mason et al. 2001), the binaries of APOGEE (Fernandez et al. 2017; El-Badry et al. 2018), the FGK binary stars of the GALAH survey (Traven et al. 2020), and the third revision of the Kepler Eclipsing Binary Catalog (KEBIII; Kirk et al. 2016), and there are 107 stars in common. The information about these stars is listed in Table B1.

Table B1. List of the Common SB and ST Candidates with other Binary Catalogs

R.A.(2000)Decl.(2000)Gaia Source id G SB/ST ${N}_{\mathrm{Exp}}$ Catalog
4.311480.55870254557633694305907212.7SB3GALAH
6.3550359.1160442826730810506252811.1SB4WDS
9.7848159.8176442555680578660940810.9SB14WDS
10.1710857.7688742489867935532531211.3SB2WDS
13.4984110.08698258233974087143936012.2SB1GALAH
15.0408712.21674258428959159970406411.5SB1GALAH
15.2353411.58956258404027304247820812.8SB2GALAH
19.443860.62884253504898984659276813.1SB2GALAH
29.7696958.3978150549185088283558411.0ST3WDS
31.9647213.791387720244946251302412.1SB3APOGEE
46.8471440.8011123985458324638540810.3SB1WDS
46.9136666.1767349233932098535564810.6SB3WDS
50.5994551.6655644291279868344908812.1ST9WDS
51.4766518.135605592686940226176012.5SB4GALAH
51.8573819.011995754404484900544013.0SB2GALAH
52.3757919.414925764032083579980812.3SB4GALAH
55.1672245.7776024483396217285772811.2ST3WDS
62.0542019.944185183242124268812812.4SB3WDS
63.0135248.2690524625109521822016011.0SB1WDS
63.0154021.94700526841991566840328.9SB3SB9, WDS
63.2950461.5595047525813296541222413.1SB3WDS
66.4478018.31499331446636013830259213.0SB2GALAH
68.2036316.02178331265258256589363213.0SB9GALAH
71.0448426.7676915453333992381068813.7SB11GALAH
71.4397023.5337014657352857311424012.9SB4GALAH
71.9921622.95037341314158449638579212.0SB11GALAH
72.1620415.36070340502488658096473612.9SB3GALAH
72.2199414.71636330872210337377049613.4SB3GALAH
72.4003456.2888727460189946497459210.6SB3WDS
73.5848020.85980341171654870706137611.7ST8GALAH
73.7212524.23559341965935628298700812.5SB9GALAH
74.9081221.79690341225170163257331211.7ST5GALAH
75.0599824.13008341941076357613235210.9ST12WDS
76.0561824.30754341877400890535270410.3SB3WDS
77.5784124.48785341901249984958246411.5SB3GALAH
83.8843235.8560718316664564003558411.9SB3WDS
92.2326522.14776342354732818268864010.9SB1WDS
98.137018.59674332625855792541593611.2SB3WDS
98.722939.43785332682834546615795210.8SB5WDS
100.448969.86729332673766581810700813.1SB2Gaia-ESO
101.894528.47141313419016307071846410.4ST7WDS
101.9337623.12819337955772338162483211.2SB3WDS
102.2195717.33513335839666003093977612.0SB6WDS
103.7902222.80749338005096172171968010.5SB7WDS
106.8392645.7592597737464634767462412.5SB3WDS
115.9966124.7445186789483475770240013.5SB1WDS
123.6585514.7396665529364436835712011.7SB3GALAH
123.7282116.7123965582914089069401613.0SB5GALAH
123.8536016.2293165572643963289164812.7SB3GALAH
124.0003817.4552065700410650523033612.6SB4GALAH
124.1455719.4129666351455537078822412.4SB7GALAH
124.2419017.6538965701448314618035211.8SB4GALAH
124.4724814.3815965221095736178572811.7SB1GALAH
124.7256118.9090966324737404427891211.8SB5GALAH
124.9666917.3999765628523205796480013.7SB11GALAH
126.1570416.8488965601884100656179211.7SB3GALAH
126.9527713.0325365098221875679398412.8SB5GALAH
126.9564011.5837860115510056516787212.9SB3GALAH
127.1933619.0073866287890021049740811.5SB11GALAH
127.2319418.7637266230973114434918410.9SB5GALAH
130.1775817.2303765848892823676160013.8SB2GALAH
130.8601812.4190260224437299056640011.1SB3WDS
131.0305020.0768866138081561831731210.0SB3SB9
132.5270023.7507968950985781201139212.4SB11WDS
132.7063412.2876760501483305465088011.5SB3APOGEE
132.8232811.6600160489780378603276813.4SB11GALAH
132.8553912.0490160497208954012083213.4SB2APOGEE, GALAH
133.0190719.6029066073007653197401611.2SB14GALAH
133.2746510.3429559781248213677260812.8SB24GALAH
133.3419819.3457466066101345793920011.6SB2GALAH
133.4164123.2157268923594627386201613.1SB21GALAH
133.5789112.6580560513608857078310411.2SB13APOGEE, WDS
133.6860511.5014460471600640963545610.3SB11APOGEE, GALAH
133.8958816.7534761157637055598387212.3SB10GALAH
133.9951522.9246568616438752593164813.0SB29GALAH
143.9677737.4388379909038638895872013.3SB6APOGEE
144.0657537.5289179909251669272934410.3SB5SB9
168.542784.98812381681941454967654411.1SB3GALAH
170.486103.71580381268040610619084812.7SB3WDS
173.172251.59537379992936975887603213.4SB6GALAH
173.697282.06516380006140993759155212.7SB1GALAH
174.261872.27368379934683475468454413.3SB3GALAH
179.395103.22525389309363568095449610.7SB3GALAH
184.9087325.29859400848521205604620811.3SB8APOGEE
185.3248650.97667154767995889954022410.4SB3WDS
207.00384−0.78145366202927423800512011.5SB3WDS
207.24577−8.31973361901005716775180810.9SB10GALAH
207.60620−6.40342362062146017755059211.2SB5GALAH
226.2951226.56544126840788809264934410.5SB1APOGEE
263.8850623.17511455740251518859622410.9SB1SB9, WDS
266.198606.03134447413034492415219212.1SB2GALAH
276.7203225.83382458484749365101606412.1SB2WDS
284.7598641.04340210402552393143270410.5SB2KEBIII
289.8610441.40818210146795680922291210.5SB1KEBIII
289.8839845.30403212709014684846515211.6SB3KEBIII
290.4832043.62231212608330773583193612.3SB1KEBIII
290.5174640.69681210119208247087091211.3SB3KEBIII
290.8066842.34393210194232735600921612.0SB3APOGEE, WDS, KEBIII
290.9513340.54793210151080340276134414.1SB22KEBIII
291.3199143.59553212604424071325030410.7SB26KEBIII
292.0587143.92521212597122626966758412.7SB6KEBIII
292.2967444.28051212635698305172672013.2SB25KEBIII
292.7179141.92241207766796247565286410.1ST1KEBIII
293.2053242.19786207768332127904601612.1SB2APOGEE
294.4782846.86376212853260302516428812.5SB3KEBIII
305.5427740.21833206751867987101516811.7SB4WDS
332.4200258.31327219951034351044876810.3SB3WDS

Download table as:  ASCIITypeset image

Except for these common stars, there are 3034 SB and 124 ST candidates (over 95% of the total candidates) that are newly discovered. These RVs can be used to derive the orbital parameters of these systems (Pan et al. 2021; Wang et al. 2021).

5. Discussion

5.1. Detection Efficiency

To test the detection efficiency of the method, we use the radiative transfer code SPECTRUM (Gray & Corbally 1994) and the Kurucz (Kurucz 1979) stellar atmosphere models to generate six synthetic spectra of the given stellar parameters. They include cool and hot (Teff = 5000, 8000 K), giant and dwarf ($\mathrm{log}g=2.0$, 4.0 dex), and metal-rich and metal-poor ([Fe/H] = 0.0, −2.0 dex) models. All of the synthetic spectra have been reduced to the same wavelength range and resolution (R ∼ 7500) as the LAMOST-MRS blue-arm ones. We generate a set of double-line synthetic spectra by shifting a synthetic spectrum with different RVs and combining it with the original ones, and the synthetic double-line SBs have equal masses.

A similar MC simulation (see Section 3.3) is adopted to test the detection efficiency. We add the random Gaussian distribution noises to each point of the synthetic spectrum and generate 100 spectra for a specific S/N to detect RV components. We count the number of spectra detected as SBs, and the success rate represents the detection efficiency.

Our simulation results are presented in Figure 11, and it can be seen that the detection limit of RV differences is between 40 and 50 km s−1, and this limit is consistent with the LAMOST-MRS resolution power. The detection efficiency of the method is similar except for hot metal-poor dwarfs.

Figure 11.

Figure 11. The detection efficiency of simulated SBs with different atmospheric parameters, RV differences, and S/N levels.

Standard image High-resolution image

5.2. Stellar Parameters

It is useful to investigate the distribution of stellar atmospheric parameters of SB and ST candidates, although the general method for determining the stellar atmospheric parameters may not be appropriate for multiline spectra. We adopt the stellar atmospheric parameters of 1470 SB and 65 ST candidates from the LAMOST DR7 low-resolution spectroscopic survey (LAMOST-LRS; Luo et al. 2015), and present their distribution in Teff versus $\mathrm{log}g$, Teff versus [Fe/H], and $\mathrm{log}g$ versus [Fe/H] plots in Figure 12.

Figure 12.

Figure 12. The distribution of stellar atmospheric parameters of common stars between the LAMOST DR7 LRS and the selected LAMOST-MRS data. The black crosses are the SB candidates, and white pentagons are ST candidates. The left panel is Teff vs. $\mathrm{log}g$, the middle panel is Teff vs. [Fe/H], and the right panel is $\mathrm{log}g$ vs. [Fe/H]. All atmospheric parameters are provided by the LAMOST DR7 LRS.

Standard image High-resolution image

We note that about 90% of the SB and all the ST candidates are on the main sequence, only 155 SBs may have one or two components on the red giant branch ($\mathrm{log}g\lt 3.5$ dex). The reason is that when the massive component climbs the red giant branch, its radius increases, and mass transfer occurs after it reaches the Roche lobe. The interaction between the components can modify the evolution and lead to fewer giant binary systems. The lack of metal-poor candidates is due to fewer low-metallicity stars in our sample.

5.3. Caveats

There are some caveats that we need to point out before utilizing the SB or ST catalog for further studies. Since the completeness of the multiline spectroscopic candidate catalog is not the primary goal of this work, the selection effects of the LAMOST-MRS data and the detection process of the method must be considered in further statistical studies.

Another caveat is that the SB and ST candidates are not necessarily physical binaries or triples. The CCFs of double- and triple-line spectra may be mimicked by emission lines or stellar pulsations, etc. (Merle et al. 2017). Considering the LAMOST fibers have a diameter of 3farcs3 (Cui et al. 2012), one spectrum may contain light from multiple stars when they are very close to each other in the sky. Investigating the RV variations of the SB and ST candidates is essential to identifying the physical binaries and triples.

6. Conclusions

Based on the CCF and Gaussian smoothing for the derivatives of CCFs, we detected 14,647 double-line and 1065 triple-line spectra from the blue-arm spectra with S/N ≥ 10 in the LAMOST DR7 MRS. After checking the CCF diagrams by eye, 11,648 double-line and 388 triple-line spectra are confirmed, 80% and 36% of the total, respectively. They belong to 3133 SB and 132 ST candidates, and these confirmed candidates account for 1.2% of the LAMOST-MRS stars. Among the ST candidates, 41 of them show double- and triple-line cases, which indicates that they are physical binaries because of the variations of RVs. Comparing with the other binary-star catalogs, we find that about 95% of them are newly discovered. For the 1470 SB and 65 ST candidates that have stellar atmospheric parameters from the LAMOST-LRS survey, 90% of the SB and all the ST candidates are on the main sequence.

An MC simulation is used to estimate the RV uncertainties, and the RV error is of the order of 1 km s−1. Our results indicate that the detection limit of RV differences is between 40 and 50 km s−1.

The LAMOST-MRS survey is ongoing, and will continue to provide a great opportunity to study binary- and multiple-star systems.

We thank the referee for the useful comments, which have helped us to improve the manuscript. Our research is supported by the National Key R&D Program of China No. 2019YFA0405502 and the National Natural Science Foundation of China under grant Nos. 12090040, 12090042, 12090044, 11833002, 11833006, 12022304, 11973052, 11973042, and U1931102. This work is supported by the Astronomical Big Data Joint Research Center, cofounded by the National Astronomical Observatories, Chinese Academy of Sciences and Alibaba Cloud. H.-L.Y. acknowledges the support from the Youth Innovation Promotion Association, Chinese Academy of Sciences (id. 2019060) and NAOC Nebula Talents Program. Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope, LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

Appendix A: Detecting Peaks

We design a simple RNN to inspect the peak numbers of CCF diagrams automatically. Because of the incremental data of the LAMOST-MRS survey, an artificial classification method is urgently needed to analyze these spectra. Among the most-common machine-learning and DL techniques, an RNN proves to be highly performant for the classification of one-dimensional data (Jamal & Bloom 2020) compared to a convolutional neural network.

RNNs refer to a class of artificial neural networks where the network architecture is composed of interconnected nodes through a directed graph along a temporal sequence. A long short-term memory network (LSTM; Hochreiter & Schmidhuber 1997) is one of the RNN variants with a gated state or memory. LSTMs have been found to be extremely successful in many applications, such as speech recognition (Graves et al. 2013), handwriting generation (Graves 2013), and machine translation (Sutskever et al. 2014). At the same time, RNNs have been applied on astronomical data (Jamal & Bloom 2020) including variable stars (Naul et al. 2018), periodic variable stars (Tsang & Schultz 2019) and supernovae classifications, (Charnock & Moss 2017) and online transient event detection (Muthukrishna et al. 2019; Möller & de Boissière 2020).

We set up our RNN with a three-layer LSTM, with a hidden size of 256 units, and three output features. This RNN is trained by 12,136 CCF diagrams with labels as the number of peaks (0, 2, 3) together with 4046 groups of test data. We split the training data into batches to speed up the training process. After 100 training epochs, we stop the training process to avoid overfitting (with a training loss of 0.003 and test loss of 0.100). The final classification accuracy and precision are shown in Table A1 and the confusion matrix obtained on the test-set predictions for the RNN classifier is shown in Figure A1.

Figure A1.

Figure A1. Confusion matrix obtained on the test-set predictions for the RNN classifier. The values in each box refer to the number of predictions vs. the true labels.

Standard image High-resolution image

Table A1. The Precision, Recall, and F1 Score of the Prediction of Classification

Peak NumberPrecisionRecall F1 Score
00.910.920.92
20.970.970.97
30.840.720.78

Download table as:  ASCIITypeset images: 1 2

The F1 score is the harmonic mean of precision and recall:

Equation (A1)

Two peaks are recognized well (precision ∼0.97) since there are the most training data for two-peak CCFs. On the contrary, the three-peak group is the most difficult one to recognize due to there being less data. In total, the final classification accuracy is 0.96, which means that for a given CCF diagram, we can detect the correct peak number successfully with a probability of 0.96. This method can improve the productiveness and robustness. We use the DL method along with manual checks to confirm our results.

Appendix B: Common SB and ST Candidates with Other Binary Catalogs

The information about the common stars of between our LAMOST-MRS candidates and the other binary catalogs are listed in Table B1.

Footnotes

Please wait… references are loading.
10.3847/1538-4365/ac22a8