This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.

The California-Kepler Survey. IV. Metal-rich Stars Host a Greater Diversity of Planets

, , , , , , , , , and

Published 2018 January 29 © 2018. The American Astronomical Society. All rights reserved.
, , Citation Erik A. Petigura et al 2018 AJ 155 89 DOI 10.3847/1538-3881/aaa54c

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

1538-3881/155/2/89

Abstract

Probing the connection between a star's metallicity and the presence and properties of any associated planets offers an observational link between conditions during the epoch of planet formation and mature planetary systems. We explore this connection by analyzing the metallicities of Kepler target stars and the subset of stars found to host transiting planets. After correcting for survey incompleteness, we measure planet occurrence: the number of planets per 100 stars with a given metallicity M. Planet occurrence correlates with metallicity for some, but not all, planet sizes and orbital periods. For warm super-Earths having P = 10–100 days and ${R}_{P}$ = 1.0–1.7 ${R}_{\oplus }$, planet occurrence is nearly constant over metallicities spanning −0.4 to +0.4 dex. We find 20 warm super-Earths per 100 stars, regardless of metallicity. In contrast, the occurrence of warm sub-Neptunes (${R}_{P}$ = 1.7–4.0 ${R}_{\oplus }$) doubles over that same metallicity interval, from 20 to 40 planets per 100 stars. We model the distribution of planets as ${df}\propto {10}^{\beta M}{dM}$, where β characterizes the strength of any metallicity correlation. This correlation steepens with decreasing orbital period and increasing planet size. For warm super-Earths β =  $-{0.3}_{-0.2}^{+0.2}$, while for hot Jupiters β =  $+{3.4}_{-0.8}^{+0.9}$. High metallicities in protoplanetary disks may increase the mass of the largest rocky cores or the speed at which they are assembled, enhancing the production of planets larger than 1.7 ${R}_{\oplus }$. The association between high metallicity and short-period planets may reflect disk density profiles that facilitate the inward migration of solids or higher rates of planet–planet scattering.

Export citation and abstract BibTeX RIS

1. Introduction

Exploring the connection between planets and their host stars has been a long-standing focus of exoplanet astronomy. Host star metallicity is thought to reflect the metallicity of the protostellar nebula and the protoplanetary disk from which planets form. Metal-rich protoplanetary disks are thought to have enhanced surface densities of solids. Viewed in the context of core-accretion theory (Lissauer 1995; Pollack et al. 1996), one might expect metal-rich disks to form terrestrial planets and the cores of gas giant planets with greater efficiency than metal-poor disks. If true, metal-rich stars should host greater numbers of gas giant and terrestrial planets. This prediction can be tested by studying the correlation (or lack thereof) between $[\mathrm{Fe}/{\rm{H}}]$ and planet occurrence.

The extent to which stellar metallicity correlates with the presence or absence of planets has been the subject of many previous studies. Gonzalez (1997) observed that the first four extrasolar planets discovered orbited metal-rich stars and concluded that metal-rich stars have metal-rich protoplanetary disks which form planets more efficiently. As the sample of Doppler-detected planets grew into the hundreds, various studies noted a preference for Jovian-mass planets to orbit stars with super-solar metallicities (e.g., Santos et al. 2004; Fischer & Valenti 2005). However, as Doppler surveys pushed into lower regimes of planet mass, various authors noted that the correlation between planet occurrence and host star metallicity appeared to weaken (e.g., Sousa et al. 2008; Ghezzi et al. 2010).

The prime Kepler mission (2009–2013) revealed more than 4000 planet candidates with sizes as small as Mercury (Barclay et al. 2013). In contrast to previous studies that explored the connection between host star metallicity and planet mass, Kepler studies have focused mainly on the connection between metallicity and planet size. While a handful of Kepler planets have well-measured masses through radial velocities (RVs; e.g., Marcy et al. 2014) or transit-timing variations (TTVs; e.g., Hadden & Lithwick 2017). The vast majority of Kepler planets have unknown masses. Current RV instruments require bright targets ($V\lt 13\,\mathrm{mag}$), and TTV measurements require tightly packed, multi-planet systems. Both TTV and RV mass measurements are only possible on a small fraction of the total Kepler planet sample.

While the Kepler sample provided a large sample of planets for planet-metallicity studies, extensive follow-up spectroscopy was first required to measure host star metallicities. Buchhave et al. (2012) measured the metallicities of 152 stars harboring 226 planets and observed that while planets larger than 4 ${R}_{\oplus }$ orbit metal-rich stars, smaller planets orbit stars with wide-ranging metallicities. Later, using an augmented sample of 405 stars hosting 600 planets, Buchhave et al. (2014) argued for three distinct stellar metallicity distributions, with breakpoints at 1.7 ${R}_{\oplus }$ and 3.9 ${R}_{\oplus }$. In contrast, Schlaufman (2015) found no evidence for different metallicity distributions above and below 1.7 ${R}_{\oplus }$ and raised several concerns regarding the statistical validity of the Buchhave et al. (2014) analysis.

RV mass measurements of transiting planets have revealed that planets smaller than 1.7 ${R}_{\oplus }$ have bulk densities consistent with rocky compositions (e.g., Weiss & Marcy 2014; Rogers 2015), at least for the short-period planets ($P\lesssim 20\,\mathrm{days}$) that are amenable to these RV mass measurements. Consequently, the degree to which metallicity correlates with the occurrence of planets smaller than 1.7 ${R}_{\oplus }$ is of particular interest. On this point, there is disagreement in the literature. Wang & Fischer (2015), using low precision metallicities from the Kepler Input Catalog (KIC; Brown et al. 2011), reported that the rate of planets smaller than 1.7 ${R}_{\oplus }$ is ${1.72}_{-0.17}^{+0.18}$ times higher for stars with $[\mathrm{Fe}/{\rm{H}}]$ > 0 dex compared to stars $[\mathrm{Fe}/{\rm{H}}]$ < 0 dex. In contrast, Buchhave & Latham (2015) found no evidence of a metallicity enhancement among stars hosting planets smaller than 1.7 ${R}_{\oplus }$.

A long-standing limitation of Kepler metallicity studies was that reliable metallicities did not exist for a representative number of Kepler field stars. In constructing the KIC, Brown et al. (2011) placed a prior on the Kepler field metallicities based on the metallicities of nearby stars, as measured by Nordström et al. (2004), which have a mean of −0.14 dex and dispersion of 0.19 dex. However, it was not clear whether Kepler field stars, which are typically ∼1 kpc from Earth, would follow the metallicity distribution of nearby stars.

Such metallicity offsets have been invoked to explain differences in planet occurrence rates between the Kepler field and the solar neighborhood. Howard et al. (2012) measured a hot Jupiter occurrence rate of 0.4 ± 0.1%, roughly 40% of that in the solar neighborhood (1.2 ± 0.4%, Wright et al. 2012), and speculated that "a paucity of metal-rich stars in the Kepler sample is one possible explanation."

Today we know that Kepler field stars have a higher mean metallicity than solar neighborhood stars. The Large Sky Area Multi-object Fibre Spectroscopic Telescope (LAMOST; Cui et al. 2012; Luo et al. 2012; Zhao et al. 2012) is instrumented with a multiplexed, low-resolution spectrometer (4000 fibers, R = 1800) and can therefore efficiently gather spectra for large samples of Kepler target stars with and without transiting planets. Dong et al. (2014), through analyzing the LAMOST Data Releases 1 and 2, found that the mean metallicity of 12,000 Kepler field stars was −0.04 dex, much closer to solar than to the value of −0.14 dex assumed in the construction of the KIC. Guo et al. (2017) also found a near-solar mean metallicity of −0.04 dex in an analysis of 610 Kepler field stars observed by the Hectochelle R = 34,000 spectrometer at the MMT. Thus, a metallicity offset cannot explain differences in the hot Jupiter rates.

The LAMOST data sets permitted several breakthroughs in Kepler planet-metallicity studies, by measuring the true metallicity distribution of bright (${Kp}$ < 14) Kepler field stars. Mulders et al. (2016) analyzed the LAMOST metallicities and a sample of 665 planet candidates, and found that the occurrence rate of hot small planets (P < 10 days, ${R}_{P}$ < 4 ${R}_{\oplus }$) is three times higher among super-solar metallicity hosts compared to sub-solar hosts. Dong et al. (2018) also analyzed LAMOST metallicities and a sample of 295 planets and reported a similar trend, also noting that hot Neptune-size planets are typically single.

Here, we examine the connection between planets and stellar metallicity using spectroscopy from the California-Kepler Survey (CKS). Given that the CKS produced a homogeneous set of high-precision metallicities of 1305 stars, we can explore this connection in unprecedented detail. A key advantage of the CKS sample is the highly precise planet radii (10% precision) and stellar metallicities (0.04 dex precision) compared to previous studies. The CKS sample has high purity (i.e., low false positive rate) due to extensive vetting of false positives. This data set has already revealed new features in the planet radius distribution, most notably a gap in the size distribution of small (${R}_{P}$ = 1–4 ${R}_{\oplus }$ planets) reported by Fulton et al. (2017).

We find that planets larger than Neptune are preferentially found around metal-rich stars, while planets smaller than Neptune are found around stars of wide-ranging metallicity. For super-Earth-size planets (${R}_{P}$ = 1.0–1.7 ${R}_{\oplus }$), we observe a positive metallicity correlation for P = 1–10 days, but no correlation for P = 10–100 days. In contrast, rates of sub-Neptune-size planets (${R}_{P}$ = 1.7–4.0 ${R}_{\oplus }$) correlate with metallicity over P = 1–100 days. Planets larger than Neptune are an order of magnitude less common than planets smaller than Neptune, and probing the possible metallicity correlations is more challenging. However, we observe strong metallicity correlations for both Jovian-size (${R}_{P}$ = 8.0–24.0 ${R}_{\oplus }$) and sub-Saturn-size (${R}_{P}$ = 4.0–8.0 ${R}_{\oplus }$) planets.

We describe our planet and stellar samples in Section 2. Section 3 explores the distribution of planets in the P${R}_{P}$ plane as a function of metallicity, and highlights areas where metallicity plays a strong effect. In Sections 46, we compare host star metallicities to the Kepler field star distribution and compute planet occurrence as a function of metallicity. We summarize our findings in Section 7, and offer some interpretations of the observed trends.

2. Sample

Studies of planet occurrence require both a sample of planets ${ \mathcal P }$ and a parent stellar sample ${ \mathcal S }$ from which the planets are drawn. We construct ${ \mathcal P }$ from the CKS sample (Section 2.1) and apply a series of filters aimed at creating a well-defined sample of high purity with well-measured radii (Section 2.2). We then construct ${ \mathcal S }$ from the Kepler field stars after applying the same set of filters used to construct ${ \mathcal P }$ (Section 2.3). Because the metallicities are not known for every star in ${ \mathcal S }$, we employ the LAMOST parameters as a proxy (Section 2.4).

2.1. Initial Planet Sample

CKS is a large-scale spectroscopic survey of 1305 Kepler Objects of Interest (KOIs). The sample selection, spectroscopic observations, and spectroscopic analysis are described in detail in Petigura et al. (2017b, hereafter Paper I). In brief, the sample was initially constructed by selecting all KOIs brighter than Kp = 14.2 mag.9 A KOI is a Kepler target star which showed periodic photometric dimmings indicative of planet transits. However, not all KOIs have received the necessary follow-up attention needed to confirm the planets.

Over the course of the project, we included additional targets to cover different planet populations, including multi-candidate hosts, Ultra Short Period candidates, and Habitable Zone candidates. We obtained spectra at the 10 m Keck Telescope using the High Resolution Echelle Spectrometer at a resolution of R = 60,000 (Vogt et al. 1994). A key feature of the CKS data set is that stars were observed to a consistent signal-to-noise ratio (S/N) of 45 pixel−1 at the peak of the blaze function near 5500 Å.

In Paper I, we extracted the following properties for each star: effective temperature ${T}_{\mathrm{eff}}$, surface gravity $\mathrm{log}g$, metallicity $[\mathrm{Fe}/{\rm{H}}]$, and projected rotational velocity $v\sin i$ using the SpecMatch (Petigura 2015) and SME@XSEDE codes (Paper I).

In Johnson et al. (2017, hereafter Paper II), we converted the spectroscopic properties of ${T}_{\mathrm{eff}}$, $\mathrm{log}g$, and $[\mathrm{Fe}/{\rm{H}}]$ into stellar mass ${M}_{\star }$, radius ${R}_{\star }$, and age. This conversion is facilitated by the publicly available isochrones Python package (Morton 2015). Stellar mass, radius, and age are measured to 4%, 11%, and 30%, respectively. We then recompute planetary radii and equilibrium temperatures using our updated CKS parameters and the results from transit fitting performed by Mullally et al. (2015). Paper II provides updated stellar and planetary properties for 1305 KOIs and 2025 planet candidates, which are the starting point for our planet sample ${ \mathcal P }$.

2.2. Filtered Planet Sample

We applied a series of filters, described below in bullets, to the CKS sample to arrive at our final planet sample ${ \mathcal P }$. The filters restrict the range of stellar properties included in our analysis. Since we aim to explore the connection between metallicity and planet size, we also require the planets have well-determined radii.

Where possible, we filter based on the DR25 stellar properties table of Mathur et al. (2017). These cuts may be applied homogeneously to the field star sample ${ \mathcal S }$. Some filters involve follow-up observations not available for every star in ${ \mathcal S }$. In Section 2.3, we quantify the number of field stars that would have been excluded had comparable follow-up been performed. The applied filters are as follows:

  • 1.  
    Stellar brightness. We restrict our sample to the magnitude-limited sub-sample of the CKS sample (i.e., ${Kp}\lt 14.2\,\mathrm{mag}$).
  • 2.  
    Stellar effective temperature. The spectroscopic tools used in Paper I produce reliable results for ${T}_{\mathrm{eff}}$ = 4700–6500 K. We restrict our analysis to stars having photometric ${T}_{\mathrm{eff}}$ = 4700–6500 K (DR25 stellar properties table).
  • 3.  
    Stellar surface gravity. We restrict our analysis to stars having photometric $\mathrm{log}g$ = 3.9–5.0 dex (DR25 stellar properties table).
  • 4.  
    Stellar dilution. Dilution from nearby stars can also alter the apparent planetary radii. Furlan et al. (2017) compiled high resolution imaging observations performed by several groups.10 When a nearby star is detected, Furlan et al. (2017) computed a radius correction factor (RCF), which accounts for dilution assuming the planet transits the brightest star. We elect to not apply this correction factor, but conservatively exclude KOIs where the RCF is larger than 5%.
  • 5.  
    Planet orbital period. We remove planet candidates with orbital periods longer than 350 days. This excludes planets that only transit once or twice during Kepler observations, which have a higher false positive rate (Mullally et al. 2015).
  • 6.  
    Planet false positive designation. We exclude candidates that are identified as false positives according to Paper I.
  • 7.  
    Planets with grazing transits. Finally, we exclude stars having grazing transits (b > 0.9), which have suspect radii due to covariances with the planet size and stellar limb-darkening during the light curve fitting.

Our successive filters, along with a running tally of the number planets which pass them, are listed in Table 1. The filters on dilution and grazing transits each eliminate a small percentage (8% and 5%) of planets with imprecise radii, even after CKS spectroscopy. Similar filters clarified the bimodal planet radius distribution presented by Fulton et al. (2017), Paper III, in the CKS series.

Table 1.  Filters Applied to Planet Sample

Filter ${n}_{\mathrm{pl},\mathrm{pass}}$ ${n}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}$ ${f}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}$ ${n}_{\star ,\mathrm{pass},\mathrm{run}}$
Full sample 2025 2025 1.000 1279
Kp < 14.2 mag 1359 1359 0.671 954
${T}_{\mathrm{eff}}$ = 4700–6500 K 1977 1328 0.977 929
$\mathrm{log}g=3.9\mbox{--}5.0\,\mathrm{dex}$ 1899 1212 0.913 829
P < 350 days 1965 1191 0.983 817
Not a false positive 1861 1105 0.928 758
Radius correction factor <5% 1891 1017 0.920 695
Not a grazing transit (b < 0.9) 1870 970 0.954 662

Note. Summary of the filters applied to the CKS catalog to create planet sample ${ \mathcal P }$. The column labeled ${n}_{\mathrm{pl},\mathrm{pass}}$ is the total number of KOIs that pass a specific filter and ${n}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}$ is a running tally of KOIs that pass all filters. For filter i, ${f}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}(i)\,={n}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}(i)/{n}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}(i-1)$. For example, ${f}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}(3)\,={n}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}(3)/{n}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}(2)\,=\,1328/1359\,=\,0.977$. The column labeled ${n}_{\star ,\mathrm{pass},\mathrm{run}}$ gives the numbers of unique stars hosting the ${n}_{\mathrm{pl},\mathrm{pass},\mathrm{run}}$ KOIs.

Download table as:  ASCIITypeset image

In total, ${ \mathcal P }$ contains  970  planets orbiting  662  stars that pass all filters. Figure 1 shows planets on the $[\mathrm{Fe}/{\rm{H}}]$${R}_{P}$ plane as successive cuts are applied. Even though planets across this plane are filtered out, planets larger than Neptune have a higher rate of removal. Properties of the stars hosting these planets are shown in Figure 2.

Figure 1.

Figure 1. Host star metallicity vs. planet size after the application of successive filters. Panel (a): all KOIs in the CKS sample. Panels (b)–(j): blue points show the KOIs that pass successive filters in stellar brightness, effective temperature, surface gravity, planetary orbital period, false positive disposition, radius precision, dilution due to nearby stars, and impact parameter. The gray points show KOIs that do not pass one or more of the filters.

Standard image High-resolution image
Figure 2.

Figure 2. The distribution of stellar properties for the three samples of stars considered in this work. Panel (a): blue points show ${T}_{\mathrm{eff}}$ and $\mathrm{log}g$ of CKS planet hosts that passed all filters ${ \mathcal P }$; gray points represent all CKS planet hosts. Panels (b) and (c) show the same quantities as panel (a), but for the Kepler target stars and the LAMOST sample, respectively. Panels (d)–(f): distributions of host star ${Kp}$. Panels (g)–(i): distributions of host star metallicity from different catalogs. The sub-solar mean metallicity of the Kepler target stars (h) is a reflection of the low metallicity prior used in the photometric modeling.

Standard image High-resolution image

2.3. Field Star Sample

In order to compare the properties of stars with and without transiting planets, we need to consider the parent population of Kepler field stars ${ \mathcal S }$, from which ${ \mathcal P }$ is drawn. We begin with the Mathur et al. (2017) catalog that lists all 199,991  stars observed at some point during the Kepler mission. Where possible, we apply the same set of filters to construct ${ \mathcal S }$ that were used to construct ${ \mathcal P }$. Table 2 summarizes the number of stars that pass cuts on ${Kp}$, photometric ${T}_{\mathrm{eff}}$, and photometric $\mathrm{log}g$. A total of  36,959  stars pass all cuts. The distribution of field star properties is shown in Figure 2.

Table 2.  Filters Applied to Stellar Sample

Cut ${n}_{\star ,{\rm{pass}}}$ ${n}_{\star ,\mathrm{pass},\mathrm{run}}$ ${f}_{\star ,\mathrm{pass},\mathrm{run}}$
Full sample 199991 199991 1.000
Kp < 14.2 mag 81758 81758 0.409
${T}_{\mathrm{eff}}$ = 4700–6500 K 168885 62751 0.768
$\mathrm{log}g=3.9\mbox{--}5.0\,\mathrm{dex}$ 162854 36959 0.589

Note. Summary of the filters applied to ${ \mathcal S }$. See Table 1 for column descriptions.

Download table as:  ASCIITypeset image

The filters used to construct ${ \mathcal P }$ were not simply cuts on the stellar properties, but also on quality of the CKS stellar radii and the properties of the planet candidates. We cannot apply these same filters to the field stars on a star-by-star basis because these stars do not have CKS stellar radii or detected planets. However, we must assess whether the remaining filters from Section 2.2 would have excluded a significant fraction of field stars, assuming all stars received similar follow-up attention. Here, we quantify the number of field stars that would have been excluded from ${ \mathcal S }$ had comparable follow-up been performed.

  • 1.  
    Reliable spectroscopic parameters. Our initial sample of planets orbit stars for which the spectroscopic analysis described in Section 2 produced reliable results. For a star to be included, $v\sin i$ must be less than 20 km s−1. This selection effect excludes some stars that are typically near 6500 K, where $v\sin i$ values begin to exceed 20 km s−1. After filtering on ${Kp}$, photometric ${T}_{\mathrm{eff}}$, and photometric $\mathrm{log}g$, 3.1% of stars are excluded because they do not have reliable spectroscopic parameters.
  • 2.  
    Stellar dilution. 7.8% of planet candidates were excluded because the Kepler apertures contained enough flux from neighboring stars, such that the planet radii required a correction factor larger than 5%. The fraction of stars excluded depends on the crowding of Kepler field stars. We make the assumption that this distribution is independent of KOI status. Had all field stars received a similar level of high-contrast imaging follow-up, 7.8% would have companions sufficiently bright and close to warrant a RCF of $\gt 5 \% $.
  • 3.  
    Planet orbital period. For a KOI to be included in our sample, we required that the orbital period be less than 350 days. Such a cut is strictly a cut on the planet properties and does not preclude any stars from being included in ${ \mathcal S }$.
  • 4.  
    Planet false positive disposition. A star must first be identified as a KOI in order to then be designated as a false positive. Therefore, the false positive filter does not exclude significant numbers of field stars from ${ \mathcal S }$.
  • 5.  
    Planets with grazing transits. We require that the planets have non-grazing impact parameters. As with the radius filter, it does not affect inclusion in ${ \mathcal S }$. However, this cut on impact parameter must be factored into the geometric transit probability, which affects the occurrence calculation described in Section 4.

When we account for stars that would have been excluded from our sample due to unreliable spectroscopic parameters or the presence of nearby stars, we find that the planet sample ${ \mathcal P }$ was drawn from a parent stellar sample ${ \mathcal S }$ containing 36,959 × 0.922 × 0.969 = 33,020 stars.

2.4. Metallicity Distribution of Kepler Field Stars

Here, we characterize the metallicity distribution of the parent sample ${ \mathcal S }$. While the KIC (Brown et al. 2011) and its updates (e.g., Mathur et al. 2017) tabulate metallicities for every Kepler target star, they are inadequate for characterizing the metallicity distribution of the Kepler field given their low precision. Instead, we use the LAMOST/LASP stellar parameters (Luo et al. 2015) from Data Release 3 (DR3).11 Following De Cat et al. (2015), we crossmatch the LAMOST-DR3 with the KIC by identifying LAMOST spectra taken within 1.2 arcsec of a KIC target star. This crossmatching results in  29,997  stars in common having ${Kp}$ < 14.2 mag. We note a systematic offset between the LAMOST and CKS metallicity scales of ≈0.04 dex when comparing  476  stars in common. We apply a correction, described in Appendix A, but estimate that residual systematic offsets of ≈0.01 dex remain.

We apply the same set of filters to the LAMOST stars that we applied to ${ \mathcal P }$. These include cuts in ${Kp}$, ${T}_{\mathrm{eff}}$, and $\mathrm{log}g$. After applying these cuts, we are left with  14,382  stars with LAMOST parameters. The filters are summarized in Table 3 and the distribution of LAMOST stellar properties is shown in Figure 2.

Table 3.  Summary of Cuts to LAMOST Sample

Cut ${n}_{\star ,\mathrm{pass}}$ ${n}_{\star ,\mathrm{pass},\mathrm{run}}$ ${f}_{\star ,\mathrm{pass},\mathrm{run}}$
Full sample 29997 29997 1.000
Kp < 14.2 mag 29997 29997 1.000
${T}_{\mathrm{eff}}$ = 4700–6500 K 23551 23551 0.785
$\mathrm{log}g=3.9\mbox{--}5.0\,\mathrm{dex}$ 18625 14382 0.611

Note. Summary of the cuts applied to the LAMOST DR-2 sample. See Table 1 for column descriptions.

Download table as:  ASCIITypeset image

Table 4 summarizes the metallicity distribution of ${ \mathcal P }$ and ${ \mathcal S }$ measured through different methods. The mean metallicity of ${ \mathcal S }$, as measured by LAMOST, is −0.01 dex, similar to the mean metallicity of the filtered planet sample ${ \mathcal P }$, +0.03 dex, as measured by CKS. We note that the mean metallicity of ${ \mathcal S }$, according to the DR25 stellar properties table (Mathur et al. 2017), is −0.19 dex. This low value is due to the low metallicity prior used in the photometric modeling.

Table 4.  Comparison of Planet Host and Field Star Metallicities

  ${ \mathcal P }$ (spec) ${ \mathcal S }$ (phot) ${ \mathcal S }$ (spec)
  (dex) (dex) (dex)
Mean 0.031 −0.187 −0.005
rms 0.185 0.259 0.207
SEM 0.006 0.001 0.002
25% −0.069 −0.320 −0.116
50% 0.052 −0.160 0.020
75% 0.148 −0.020 0.131

Note. Summary of metallicity distributions for different stellar samples. ${ \mathcal P }$ (spec) refers to the CKS metallicities of our filtered list of planet hosts (Section 2.2). ${ \mathcal S }$ (phot) refers to the photometric metallicities of our parent stellar sample (Section 2.3). ${ \mathcal S }$ (spec) refers to the metallicities of ${ \mathcal S }$ measured using LAMOST data (Section 2.4). ${ \mathcal S }$ (phot) and ${ \mathcal S }$ (spec) have significantly different mean metallicities, due the metallicity prior imposed by Mathur et al. (2017).

Download table as:  ASCIITypeset image

3. Metallicities of Planet Hosts

In this section, we examine the metallicities of planet host stars with respect to planet size and orbital period. We first briefly note the trends seen within the planet sample ${ \mathcal P }$, without reference to field star metallicities (Section 3.1). However, features in P${R}_{P}$$[\mathrm{Fe}/{\rm{H}}]$ distribution are much more apparent when compared to the metallicities of the parent stellar sample ${ \mathcal S }$. We perform this analysis in Section 3.2. By searching for evidence of elevated metallicity in the host stars of certain types of planets, we can identify planet classes that show some positive correlation with metallicity. The advantage of this approach is that it does not require any modeling of the Kepler survey completeness. A more difficult but ultimately more useful approach is to compute planet occurrence as a function of metallicity, which requires modeling survey completeness. This approach is taken in Section 6.

3.1. Properties of Planet Hosts

In Figure 3, we show the planet sizes and host star metallicities for the 2025 planet candidates in the CKS sample. Planets smaller than Neptune are found around stars of wide-ranging metallicities, while there is a deficit of planets larger than Neptune around stars with sub-solar metallicity.

Figure 3.

Figure 3. Panel (a) shows the sizes and host star metallicities for the  970  planets in the filtered CKS planet catalog ${ \mathcal P }$. We observe a clear deficit of planets larger than Neptune with sub-Solar metallicities. The Fulton et al. (2017) radius gap may be seen from ${R}_{P}$ = 1.5–2.0 ${R}_{\oplus }$. We show the mean host star metallicity for various size ranges of planets with the red lines. The vertical bars show the standard error of the mean. The purple lines show the 25% and 75% quantiles. Mean metallicity is roughly constant from 0.7 ${R}_{\oplus }$ to 2.0 ${R}_{\oplus }$, rises from 2.0 ${R}_{\oplus }$ to 4.0 ${R}_{\oplus }$, and is roughly constant from 4 ${R}_{\oplus }$ to 16 ${R}_{\oplus }$. Panel (b): same as (a) except showing orbital period on the x-axis. We observe a small 0.05 dex increase in mean metallicity with decreasing orbital period over P = 1–10 days.

Standard image High-resolution image

To investigate variation in average host star metallicities with planet sizes, we divided the metallicity measurements according to planet size. Bins span a factor of $\sqrt{2}$ in ${R}_{P}$ for planets smaller than 4 ${R}_{\oplus }$. Larger planets are placed into bins spanning a factor of 2, owing to lower numbers. Figure 3 shows the mean metallicities for these planet radius bins along with the 25% and 75% quantiles. We observe a gradual upward trend in mean host star metallicity from smaller to larger planets. Buchhave et al. (2012, 2014) observed a similar trend in smaller samples of planet hosts.

Figure 3 also shows host star metallicity as a function of orbital period. Unlike the ${R}_{P}$$[\mathrm{Fe}/{\rm{H}}]$, there are no large regions of the P$[\mathrm{Fe}/{\rm{H}}]$ plane clearly devoid of planets. After computing the mean metallicity in bins of P spanning 0.25 dex, we observe a slight increase in mean metallicity of about 0.05 dex with decreasing orbital period over P = 1–10 days. Mulders et al. (2016) observed a similar trend in a smaller sample of planet hosts.

3.2. Comparison to Field Stars

Here, we examine the effect of host star metallicity on the 2D distribution of planet size and orbital period. We divided ${ \mathcal P }$ into four bins of host star metallicity with boundaries at  −0.116,  +0.020, and  +0.131 dex. The boundaries of the bins equally divide the stars in ${ \mathcal S }$ (see Table 4). Figure 4 shows the distribution of planets in the P${R}_{P}$ plane for the different metallicity bins. If the occurrence of planets were independent of host star metallicity, then the distribution of planets in each plot would be indistinguishable. The panels of Figure 4 show clear differences, indicating that metallicity is associated with certain types of planets, and that the strength of that enhancement depends on both P and ${R}_{P}$.

Figure 4.

Figure 4. Orbital periods and radii of planets orbiting host stars belonging to different metallicity bins. Each metallicity bin captures an equal fraction (25%) of the parent stellar sample ${ \mathcal S }$ (see Table 4). In panel (a), blue points represent planets orbiting host stars with $[\mathrm{Fe}/{\rm{H}}]$ <  −0.116 dex, the lowest metallicity bin. The gray points show the full planet sample ${ \mathcal P }$ for context. At top right, we show fp, the fraction of planets belonging to this metallicity bin. Panels (b)–(d): same as (a) except for different metallicity bins. If planets formed with equal efficiency regardless of host star metallicity, each bin would have fp = 25% and the distribution of blue points would be indistinguishable from bin to bin. Planets around the lowest metallicity stars (a) are clearly confined to a restricted range of P${R}_{P}$ space compared to planets around the highest metallicity hosts (d), notably in the lower right envelope of longer periods and smaller sizes.

Standard image High-resolution image

To facilitate our investigation into the effect of metallicity across the P${R}_{P}$ plane, we define several regions. As a matter of convenience, we define a nomenclature for referring to these different regions. While the boundaries are matters of taste, we choose physically motivated boundaries, when possible. We consider four domains of planet size, defined below:

  • 1.  
    Jupiters. ${R}_{P}$ = 8–24 ${R}_{\oplus }$. The lower limit is motivated by the fact that planets larger than 8 ${R}_{\oplus }$ tend to have masses ranging from 100 to 10,000 ${M}_{\oplus }$, while planets smaller than 8 ${R}_{\oplus }$ tend to have lower masses ranging from 6 to 60 ${M}_{\oplus }$ (see, e.g., Petigura et al. 2017b, Figure 9). At ${M}_{P}$ ≈ 100 ${M}_{\oplus }$ electron degeneracy pressure begins contribute significantly to a planet's pressure support (Zapolsky & Salpeter 1969). Thus 8 ${R}_{\oplus }$ approximates the dividing line between planets with different pressure support and interior structures. The upper radius limit of 24 ${R}_{\oplus }$ is based on the known sizes of hot Jupiters. The most highly irradiated Jovians discovered to date can exceed 2 ${R}_{\mathrm{jup}}$ (e.g., WASP-79b; Smalley et al. 2012).12 ,13
  • 2.  
    Sub-Saturns. ${R}_{P}$ = 4–8 ${R}_{\oplus }$. Sub-Saturns are typically typically less massive the larger Jupiters. They are roughly 10× more rare than the smaller sub-Neptunes, suggesting a different formation pathway. A radius of 4 ${R}_{\oplus }$ approximates a breakpoint in the planet size distribution where planet occurrence rises rapidly with decreasing size (see Fulton et al. 2017).
  • 3.  
    Sub-Neptunes. ${R}_{P}$ = 1.7– 4.0 ${R}_{\oplus }$. The lower radius limit corresponds to a likely transition radius between rocky planets and planets that have envelopes that are an appreciable fraction of the total planet size. Among the observations supporting such a transition are measurements of planet bulk density from transits and radial velocities (Marcy et al. 2014; Weiss & Marcy 2014; Rogers 2015) and the observation of a gap in the radius distribution of planets by Fulton et al. (2017).
  • 4.  
    Super-Earths. ${R}_{P}$ = 1.0–1.7 ${R}_{\oplus }$. Planets that are smaller than sub-Neptunes. Few planets in this size range have well-measured masses, but those that do are often consistent with rocky compositions.

We also consider three domains of orbital period:

  • 1.  
    Hot Planets. P = 1–10 days. The upper limit of 10 days corresponds to a breakpoint in distribution of planet occurrence with orbital period (Howard et al. 2012; Fressin et al. 2013). Below $P\approx 10\,\mathrm{days}$, planet occurrence per $\mathrm{log}P$ is observed to decline, while planet occurrence is roughly constant for longer periods.
  • 2.  
    Warm Planets. P = 10–100 days. An intermediate range of orbital periods. While warm planets are intrinsically more common than hot planets, they represent about half of the total sample due to falling completeness and transit probability with increasing orbital period.
  • 3.  
    Cool Planets. $P=100\mbox{--}350\,\mathrm{days}$. The longest period planets included in our survey. There are very few planets in our sample with such long periods due to falling completeness and decreasing transit probability.

Here, we note some qualitative features in Figure 4. Stars of all metallicity bins host warm super-Earths and warm sub-Neptunes. Cool Jupiters are intrinsically rare at all metallicities, but they are present in all four metallicity bins. We observe a steady increase of hot Jupiters with increasing metallicity. Sub-Saturns of all orbital periods are almost completely absent in the lower two metallicity bins, which represent half of the parent sample ${ \mathcal S }$; they are almost exclusively found around high metallicity stars. While there are some examples of hot super-Earths in each metallicity bin, their numbers increase with increasing metallicity. Finally, there is almost a complete absence of hot sub-Neptunes in the lowest metallicity bins. Hot sub-Neptunes are more common with increasing metallicity.

In order to quantitatively assess the extent to which metallicity enhances the production of different types of planets, we compare the distribution of planet host metallicities to that of the field star population. In Table 5, we list the mean metallicities of the various planet subclasses as well as the standard error of the mean (SEM), which we compare to the mean field star metallicity, as measured from LAMOST spectra.

Table 5.  Comparison of Planet Host and Field Star Metallicities

Name P ${R}_{P}$ ${n}_{\star }$ $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ t-stat p-value Siga
  day ${R}_{\oplus }$   dex      
Hot Jupiters 1–10 8.0–24.0 14 +0.19 ± 0.04 4.5 <8 × 10−4 Y
Warm Jupiters 10–100 8.0–24.0 4 +0.06 ± 0.02 3.4 <6 × 10−2 N
Cool Jupiters 100–350 8.0–24.0 13 +0.06 ± 0.05 1.4 <3 × 10−1 N
Hot Sub-Saturns 1–10 4.0–8.0 7 +0.26 ± 0.04 6.5 <7 × 10−4 Y
Warm Sub-Saturns 10–100 4.0–8.0 19 +0.14 ± 0.02 5.6 <6 × 10−5 Y
Cool Sub-Saturns 100–350 4.0–8.0 7 +0.11 ± 0.06 1.9 <1 × 10−1 N
Hot Sub-Neptunes 1–10 1.7–4.0 108 +0.11 ± 0.02 7.0 <4 × 10−9 Y
Warm Sub-Neptunes 10–100 1.7–4.0 282 +0.04 ± 0.01 4.0 <2 × 10−3 Y
Cool Sub-Neptunes 100–350 1.7–4.0 29 −0.05 ± 0.03 −1.3 <3 × 10−1 N
Hot Super-Earths 1–10 1.0–1.7 181 +0.05 ± 0.01 4.7 <2 × 10−4 Y
Warm Super-Earths 10–100 1.0–1.7 132 −0.04 ± 0.02 −2.2 <1 × 10−1 N
Cool Super-Earths 100–350 1.0–1.7 4 −0.36 ± 0.03 −11.4 <2 × 10−3 Y
All Jupiters 1–350 8.0–24.0 31 +0.12 ± 0.03 4.2 <6 × 10−4 Y
All Sub-Saturns 1–350 4.0–8.0 33 +0.16 ± 0.02 7.2 <1 × 10−7 Y
All Sub-Neptunes 1–350 1.7–4.0 419 +0.05 ± 0.01 6.2 <7 × 10−7 Y
All Super-Earths 1–350 1.0–1.7 317 +0.01 ± 0.01 1.3 <8 × 10−1 N

Note. We inspected the metallicity distribution of various groups of planets defined by their sizes and orbital periods. For each group, we computed the mean metallicity $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ and the standard error of the mean. We compare these metallicities to the metallicities of Kepler field stars, as measured by LAMOST, using the student t-test. This test returns a t-statistic, which quantifies the statistical difference in mean metallicities, and a p-value, the probability that the two samples were drawn from distributions with the same mean.

aIs the difference between planet host and field star metallicities significant (i.e., is $p \mbox{-} \mathrm{value}$ < 10−2)?

Download table as:  ASCIITypeset image

We assess the significance of the difference between field star and planet host star metallicities using the student t-test, which evaluates the difference between the means of the two samples in units of SEM, the t-statistic. The t-test also returns a $p \mbox{-} \mathrm{value}$, which is the probability that field stars and planet host stars are drawn from distributions with the same mean value.

We may observe statistically significant differences in mean metallicity for two reasons: (1) intrinsic differences between planet host and field star metallicities or (2) residual offsets in the CKS and LAMOST metallicity scales. While we calibrated LAMOST metallicities to the CKS scale, we estimate that offsets of 0.01 dex may remain (see Section 2.4 and Appendix A). To account for the possibility of such residual offsets, we perform three t-tests for each sample where we shift the LAMOST metallicities by −0.01, 0.00, and +0.01 dex. Each of these different tests returns a different $p \mbox{-} \mathrm{value}$. We use the largest (most conservative) $p \mbox{-} \mathrm{value}$ to assess differences in mean metallicities. If the largest p-value is less than 0.01, we deem the metallicity difference to be significant.

Stars hosting Jupiter-size planets have enhanced metallicities, $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ =  +0.12  ±  0.04 dex. The hot Jupiters hosts, with mean metallicity of $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ =  +0.19  ±  0.04 dex, are significantly enhanced relative to field stars ($p \mbox{-} \mathrm{value}$ <  $8\times {10}^{-4}$). While the mean metallicities for the warm and cool Jupiter hosts are also enhanced, the small numbers of such planets prevent a high significance detection of a metallicity enhancement.

Of all the planet size classes studied, the sub-Saturn hosts have the highest mean metallicity, $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ = +0.16 ± 0.02 dex. Among the sub-Saturns, the hot sub-Saturns have the highest mean host star metallicity, $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ =  +0.26  ± 0.04 dex. We find that the hot and warm sub-Saturns hosts were significantly enhanced compared to field stars, while small numbers of detected cool sub-Saturns prevented a detailed comparison.

As a whole, the sub-Neptunes have a mean metallicity of $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ =  +0.05  ±  0.01 dex, close to the field star value. However, when split according to orbital period, we find that the hot sub-Neptunes have an enhanced mean metallicity, $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ =  +0.11  ±  0.02 dex. The large mean metallicity, combined with the large number of such planets (N =  108), results in a very significant detection of a metallicity enhancement ($p \mbox{-} \mathrm{value}$ < $4\times {10}^{-9}$). Due to their high detectability and high intrinsic occurrence, the warm sub-Neptunes have the largest total number of any P${R}_{P}$ subclass, N =  282. The large number of planets means that even small offsets in mean metallicities can be significant. While the mean metallicity of warm sub-Neptunes, $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ =  +0.04  ±  0.01 dex), was not as high as that of the hot sub-Neptunes, it was significantly elevated relative to field stars ($p \mbox{-} \mathrm{value}$ < $2\times {10}^{-3}$).

Finally, the super-Earth-size planets have the lowest mean metallicity of  +0.01  ±  0.01 dex, which is consistent with field star metallicity. However, as with the sub-Neptunes, the hot super-Earth hosts exhibit enhanced metallicity of  +0.05 ± 0.01 dex. The difference in mean metallicity (in dex) is not as large for the sub-Neptunes, so while the offset is still significant ($p \mbox{-} \mathrm{value}$ <  $2\times {10}^{-4}$ ), it is not as significant as for the hot sub-Neptunes. Despite the large number of warm super-Earths, we cannot detect a significant offset in mean metallicity.

One may wonder whether the mean metallicities of stars hosting warm super-Earths/sub-Neptunes could possibly be different from the mean metallicity of the Kepler field (−0.01 dex), given the high intrinsic occurrence of these planets. While numerous previous works have shown that warm super-Earths and sub-Neptunes are intrinsically common (see, e.g., Petigura 2013), neither class of planets is found around 100% of stars. For example, in Section 5 we show that there are $\approx 17$ warm super-Earths per 100 stars. Therefore, the mean metallicity of warm super-Earth hosts could be significantly different than that of field stars. We consider the following limiting case: Suppose that the process that produces warm super-Earths has a step function dependence on metallicity. Among Kepler targets, 17% of stars have $[\mathrm{Fe}/{\rm{H}}]$ > 0.16 dex. A universe where every star with $[\mathrm{Fe}/{\rm{H}}]$ > 0.16 dex produced one warm super-Earth, and every star with $[\mathrm{Fe}/{\rm{H}}]$ < 0.16 dex produced zero, would be consistent with the measured occurrence. By consulting the distribution of Kepler field star metallicities (see Section 2.4), we find that the average metallicity of all stars with $[\mathrm{Fe}/{\rm{H}}]$ > 0.16 dex is 0.25 dex. In this limiting case, the average metallicity of warm super-Earth hosts would be $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ = 0.25 dex, much higher than the observed value of  −0.04  ±  0.02 dex. That warm super-Earth and sub-Neptune hosts have mean metallicities nearly the same as that of field stars demonstrates qualitatively that their occurrence is not a steep function of metallicity. We show this quantitatively in Section 6.

In summary, close-in ($P\lt 10\,\mathrm{days}$) planets of all sizes have enhanced metallicity host stars. Warm super-Earth-size planets at intermediate distances (P = 10–100 days) orbit stars with a metallicity distribution similar to that of field stars. Larger planets in this period range have enhanced metallicities, and the sub-Neptunes and sub-Saturns have a significant metallicity enhancement. Finally, the number of detected cool planets (P = 100–350 days) of all sizes is too small to permit the detection of different host star metallicities.

4. Planet Occurrence: Methodology

In Section 3, we examined the number distribution of planets belonging to host stars of differing metallicities. We now address the underlying prevalence of planets. In this section, we describe our methodology for computing planet occurrence and fitting parameterized descriptions of planet occurrence to the observed data. Section 5 presents planet occurrence as a function of orbital period and planet size. Section 6 presents planet occurrence as a function of orbital period, planet size, and stellar metallicity.

4.1. Definitions

Adopting the notation of Youdin (2011), the probability that a star with properties ${\boldsymbol{z}}$ has a planet with properties ${\boldsymbol{x}}$ that lie in a volume of $d{\boldsymbol{x}}$ of phase space is

Equation (1)

In this paper, ${\boldsymbol{z}}$ is metallicity M = $[\mathrm{Fe}/{\rm{H}}]$ ,14 and ${\boldsymbol{x}}$ is some combination of $\mathrm{log}P$ and $\mathrm{log}{R}_{P}$.15 As a matter of convenience we express differential distribution as

Equation (2)

where C is a normalization constant and g is a shape function that may depend on stellar and/or planetary properties.

The number of planets per star (NPPS) having stellar properties ${\boldsymbol{z}}$ over a specified volume of planet properties X is

Equation (3)

The NPPS having stellar properties within a range of stellar properties Z is

Equation (4)

where j labels stars where ${z}_{j}\in Z$.

Note that our definition of planet occurrence is NPPS. As a matter of convenience, we often express this in units of planets per 100 stars. Some papers (e.g., Fischer & Valenti 2005) define planet occurrence as the fraction of stars with planets (FSWP). For a transit survey like Kepler, computing FSWP is challenging because one must compute the number of stars without planets. However, a transit survey cannot distinguish between stars without planets and stars without transiting planets. Converting between NPPS and FSWP requires detailed modeling of planet multiplicity and mutual inclinations and is beyond the scope of this work. For more discussion on these points, see Youdin (2011) and references therein.

4.2. Occurrence within a Cell

In this paper, we compute and display planet occurrence over rectilinear "cells" constructed from various combinations of $\mathrm{log}P$, $\mathrm{log}{R}_{P}$, and M. We express the size of these cells as ${\rm{\Delta }}\mathrm{log}P$, ${\rm{\Delta }}\mathrm{log}{R}_{P}$, ${\rm{\Delta }}M$, measured in dex. The occurrence within a cell is ${f}_{\mathrm{cell}}$, which depends on the number of independent trials ${n}_{\mathrm{trial}}$ that could have yielded a detected planet. In the limit where every star in ${ \mathcal S }$ has one transiting planet that is also detectable by the Kepler pipeline, ${n}_{\mathrm{trial}}$ = ${n}_{\star }$. In practice, ${n}_{\mathrm{trial}}$ must account for (1) non-transiting orbital inclinations and (2) lack of detectability due to insufficient S/N. For a given planet size and orbital period,

Equation (5)

where i labels each star in ${ \mathcal S }$, ${p}_{\mathrm{tr},i}$ is the transit probability, and ${p}_{\det ,i}$ is the probability that a transiting planet would be detectable by the Kepler pipeline. We represent this more compactly as

Equation (6)

where $\langle \cdot \rangle $ denotes the arithmetic mean.

Following Bowler et al. (2015), if we assume that planet occurrence is log-uniform within a cell, ${n}_{\mathrm{trial}}$ for a cell of size ${\rm{\Delta }}{\boldsymbol{x}}={\rm{\Delta }}\mathrm{log}P\times {\rm{\Delta }}\mathrm{log}{R}_{P}$ is

Equation (7)

For a given cell with ${n}_{\mathrm{trial}}$ trials, ${n}_{\mathrm{pl}}$ detected planets, and ${n}_{\mathrm{nd}}$ = ${n}_{\mathrm{trial}}$${n}_{\mathrm{pl}}$ non-detections, the likelihood of ${f}_{\mathrm{cell}}$ is given by the binomial distribution

Equation (8)

Equation (9)

where C is the following normalization constant:16

Equation (10)

If no transits are detected in a given cell, we may place an upper bound on the occurrence rate by calculating the maximum value of ${f}_{\mathrm{cell}}$ that yields zero planet detections (${n}_{\mathrm{pl}}=0$) with 90% probability. This is done by numerically finding an ${f}_{\mathrm{cell}}$ that satisfies

Equation (11)

When analyzing planet occurrence as a function of metallicity, we must consider the fraction of stars in ${ \mathcal S }$ that falls within a specified range of metallicity. Therefore, to compute ${f}_{\mathrm{cell}}$ for a cell bounded by $[{R}_{P,1},{R}_{P,2}]$, $[{P}_{1},{P}_{2}]$, and $[{M}_{1},{M}_{2}]$, we simply multiply ${n}_{\mathrm{trial}}$ from Equation (7) by the fraction of LAMOST stars with metallicities between ${M}_{1}$ and ${M}_{2}$. This treatment assumes that planet detectability does not depend on metallicity. We justify this assumption in Appendix B.

4.3. Completeness

To compute occurrence, we must estimate $\langle {p}_{\mathrm{tr}}{p}_{\det }\rangle $. The probability that a randomly inclined planet at a distance of a will transit with b < 0.9  is ${p}_{\mathrm{tr}}$ = 0.9  ${R}_{\star }$/a, assuming circular orbits. We compute ptr for each star using the tabulated values of ${R}_{\star }$ and ${M}_{\star }$ from the Q1–Q17 (DR25) stellar properties table (Mathur et al. 2017), along with Kepler's Third Law.

The probability of detecting a transiting planet with size ${R}_{P}$ and period P depends on the number of factors, the most significant of which is the S/N. We follow the methodology of Fulton et al. (2017) and compute an expected S/N using the following formula:

Equation (12)

Here, Tobs is the time the star was observed by Kepler, and $\sigma ({T}_{14})$ represents the photometric variability on transit timescales. The DR25 stellar properties table lists Tobs, ${R}_{\star }$, and photometric noise17 on 3, 6, and 12 hr timescales. We compute $\sigma ({T}_{14})$ at intermediate values of T14 through piecewise linear interpolation (or extrapolation) in log σ–logT14 space.

Characterizing ${p}_{\det }$ as a function of S/N has been addressed in several previous works (e.g., Fressin et al. 2013; Petigura et al. 2013; Christiansen et al. 2015). This is a challenging problem, which depends on how a complex, multi-stage transit pipeline performs in the face of correlated, non-stationary, and non-Gaussian photometric noise. We adopted the ${p}_{\det }$ (S/N) of Fulton et al. (2017), who used the transit injections of Christiansen et al. (2015) to derive the following relationship:

Equation (13)

where s is the S/N, k = 17.56, l = 1.00, and $\theta =0.49$. We illustrate $\langle {p}_{\det }\rangle $ and $\langle {p}_{\mathrm{tr}}{p}_{\det }\rangle $ as a function of P and ${R}_{P}$ in Figure 5.

Figure 5.

Figure 5. Average detectability of planets as a function of planet size and orbital period. Panel (a): the probability that a transiting planet would be detected averaged over all stars in our sample, $\langle {p}_{\det }\rangle $. Panel (b): the product of the transit and detection probability averaged over all stars in the sample $\langle {p}_{\det }{p}_{\mathrm{tr}}\rangle $. The effective number of stars from which the planet detections are drawn is simply ${n}_{\mathrm{trial}}$ = ${n}_{\star }\langle {p}_{\det }{p}_{\mathrm{tr}}\rangle $.

Standard image High-resolution image

4.4. Fitting the Occurrence Distribution

We often wish to characterize the occurrence distribution with parametric models. These models serve to quantify shapes and trends in the population of planets. We extend the maximum-likelihood method of Howard et al. (2012) to fit ${Cg}({\boldsymbol{x}},{\boldsymbol{z}};{\boldsymbol{\theta }})$ to the observed planet population, where ${\boldsymbol{\theta }}$ represents the vector of shape parameters in our model.

The occurrence within each cell ${f}_{\mathrm{cell}}$ gives an estimate of the differential occurrence rate via

Equation (14)

The log-likelihood of a given model with parameters $\{C,{\boldsymbol{\theta }}\}$ given the observed rate in a single cell labeled by i is

Equation (15)

Each cell is an independent constraint on ${Cg}({\boldsymbol{x}},{\boldsymbol{z}};{\boldsymbol{\theta }})$, so a fit to multiple cells requires maximizing the combined log-likelihood over all cells:

Equation (16)

When fitting parameterized models of occurrence in Sections 5 and 6, we use very small bins having ${\rm{\Delta }}\mathrm{log}P\,=0.05\,\mathrm{dex}$ and ${\rm{\Delta }}\mathrm{log}M=0.05\,\mathrm{dex}$. As a result, many bins have one or zero planet detections. The maximum-likelihood method is stable for small bin sizes because Equation (15) incorporates non-detections. We experimented with even finer bins, but did not observe significant changes in the results. After finding the max-likelihood model, we explore the credible range of models using Markov Chain Monte Carlo (MCMC) sampling.18

5. Planet Occurrence: Period and Radius

We first present planet occurrence on the P${R}_{P}$ plane, averaging over stellar metallicity. Following Howard et al. (2012), we divided the domain of orbital period and planet size into numerous cells spanning ${\rm{\Delta }}\mathrm{log}P\times {\rm{\Delta }}\mathrm{log}{R}_{P}=0.25\,\mathrm{dex}\,\times 0.15\,\mathrm{dex}$. For each cell, we computed ${f}_{\mathrm{cell}}$ according to the prescription from Section 4.2. We display ${f}_{\mathrm{cell}}$ as a checkerboard in Figure 6, where each cell in is shaded and annotated according to the occurrence of planets within the cell. We list the cell-by-cell rates, uncertainties, and upper limits in Table 8 in the Appendix.

Figure 6.

Figure 6. The domain of orbital period and planet size, which has been sub-divided into numerous sub-domains, "cells." Each cell is annotated with the average number of planets per 100 stars having properties within each cell. This quantity is also reflected through the shading of each cell. The number of planets has been corrected for the probability of transiting and for detection completeness.

Standard image High-resolution image

We also show occurrence as a function of orbital period for various size classes in Figure 7, computed over bins spanning ${\rm{\Delta }}\mathrm{log}P=0.25\,\mathrm{dex}$. Errorbars and upper limits are computed according to the methodology presented in Section 4.2. For super-Earths and sub-Neptunes, inspection of the binned occurrence rates reveals that ${df}/d\mathrm{log}P$ increases with P until a transition period P0, above which occurrence is nearly uniform in $\mathrm{log}P$. This "knee" points toward an important orbital distance in the formation or subsequent migration of planets. We characterized the period distribution with the following parameterization from Howard et al. (2012):

Equation (17)

Far from P0, this function reverts to a power law with the following indices:

Equation (18)

We fit this distribution using the methodology from Section 4.2 and list the associated parameters in Table 6. Our best-fit model and 1σ range of credible models are shown in Figure 7. As explained in Section 4.4, the max-likelihood fitting uses much finer bins than those displayed in Figure 7. The binned rates shown in Figure 7 serve to guide the eye and are not used in the fitting.

Figure 7.

Figure 7. Planet occurrence as a function of orbital period for various size classes. For example, green points show the number of super-Earths per 100 stars per 0.25 dex interval in the period. Downward arrows represent upper limits (90%). For the super-Earths and sub-Neptunes, we fit the occurrence using the power law and exponential cutoff model described in Section 5 (Equation (17)). The solid lines and bands show the best-fitting model and 1σ range of credible models, respectively. For super-Earths and sub-Neptunes at $P\lt 10\,\mathrm{days}$, planet occurrence increases like ${df}/d\mathrm{log}P\propto {P}^{\alpha }$, where α ≈  ${2.4}_{-0.3}^{+0.4}$ and α ≈  ${2.3}_{-0.2}^{+0.2}$, respectively. At longer orbital periods, occurrence is nearly uniform in $\mathrm{log}P$. A transition period P0 characterizes where the distributions changes slope, which occurs at P0 =  ${6.5}_{-1.2}^{+1.6}$  days and P0 =  ${11.9}_{-1.5}^{+1.7}$  days, respectively. The distributions of sub-Saturns and Jupiters are not well-described by this model. Their occurrence gradually increases over P = 1–300 days.

Standard image High-resolution image

Table 6.  Best-fit Parameters for Planet Period Distributions

Size Class ${R}_{P}$ $[\mathrm{Fe}/{\rm{H}}]$ ${\mathrm{log}}_{10}C$ β P0 γ
Super-Earth 1.0–1.7 all $-{0.32}_{-0.25}^{+0.32}$ $-{0.3}_{-0.2}^{+0.2}$ ${6.5}_{-1.2}^{+1.6}$ ${2.7}_{-0.2}^{+0.3}$
    <0 $-{0.11}_{-0.39}^{+0.51}$ $-{0.4}_{-0.4}^{+0.3}$ ${8.8}_{-1.8}^{+2.3}$ ${3.0}_{-0.3}^{+0.4}$
    >0 $-{0.35}_{-0.23}^{+0.32}$ $-{0.4}_{-0.3}^{+0.2}$ ${5.1}_{-0.8}^{+1.4}$ ${2.9}_{-0.3}^{+0.4}$
Sub-Neptune 1.7–4.0 all $-{0.28}_{-0.18}^{+0.18}$ $-{0.1}_{-0.1}^{+0.1}$ ${11.9}_{-1.5}^{+1.7}$ ${2.4}_{-0.2}^{+0.2}$
    <0 $-{0.75}_{-0.22}^{+0.27}$ $+{0.1}_{-0.2}^{+0.1}$ ${10.6}_{-1.8}^{+2.5}$ ${2.7}_{-0.4}^{+0.4}$
    >0 $+{0.09}_{-0.24}^{+0.29}$ $-{0.3}_{-0.2}^{+0.1}$ ${13.2}_{-2.1}^{+2.8}$ ${2.5}_{-0.2}^{+0.2}$

Note. Best-fit parameters associated with the occurrence model defined in Equation (17).

Download table as:  ASCIITypeset image

For super-Earths the transition period P0 is ${6.5}_{-1.2}^{+1.6}$  days. At shorter orbital periods, occurrence rises rapidly with increasing P, with ${df}\propto {P}^{\alpha }d\mathrm{log}P$, where α =  ${2.4}_{-0.3}^{+0.4}$. At longer orbital periods, ${df}\propto {P}^{\beta }d\mathrm{log}P$, where β =  $-{0.3}_{-0.2}^{+0.2}$. The transition period for sub-Neptunes is farther out at P0 =  ${11.9}_{-1.5}^{+1.7}$  days, but the power law indices are similar, with α =  ${2.3}_{-0.2}^{+0.2}$ and β =  $-{0.1}_{-0.1}^{+0.1}$. The distributions of sub-Saturns and Jupiters are not well-described by the power law cutoff model. Their occurrence gradually increases over P = 1–300 days.

Figures 6 and 7 are convenient tools for calculating planet occurrence over various domains of period and radius. However, several features in the planet population are hard to see in these plots due to the coarse bin sizes and the arbitrary location of the bin boundaries. Figure 8 shows a finer view of planet occurrence. We computed occurrence in bins of period and radius spanning 0.25 dex and 0.10 dex, respectively. We then shifted the bins in small steps of ${\rm{\Delta }}\mathrm{log}P$ and ${\rm{\Delta }}\mathrm{log}{R}_{P}$ to smoothly trace out planet occurrence in the P${R}_{P}$ plane. The shading in Figure 8 gives a bird's eye view of the prevalence of various types of planets.

Figure 8.

Figure 8. Panel (a) shows the planet sample ${ \mathcal P }$ and planet occurrence in the P${R}_{P}$ plane. The shading at each (P, ${R}_{P}$) point represents the number of planets per 100 stars within an interval centered at (P, ${R}_{P}$) that spans 0.25 dex in $\mathrm{log}P$ and 0.10 dex in $\mathrm{log}{R}_{P}$. The size of the interval is indicated at top right. For example, the darkest contour indicates that there are $\approx 4$ planets per 100 Sun-like stars having periods within 0.125 dex of 40 days and radii within 0.05 dex of 2.5 ${R}_{\oplus }$. Panel (b): same as (a), but without detected planets for clarity. Warm sub-Neptunes are the most abundant class of planets, and warm super-Earths are another common class of planets. The super-Earths and sub-Neptunes are separated by a diagonal gap of low planet occurrence, described by Fulton et al. (2017). Panels (c)–(d): same as (a)–(b), but with logarithmic shading to highlight domains of low occurrence. The hot Jupiter population is an "island," distinct from the rest of the planet population. There is a "desert" of hot planets having intermediate sizes.

Standard image High-resolution image

Panels (a) and (b) of Figure 8 show occurrence on a linear scale to highlight the most abundant types of planets (super-Earths and sub-Neptunes). We note the rapid increase in occurrence for ${R}_{P}\lesssim 4\,{R}_{\oplus }$, which has been noted by Howard et al. (2012) and numerous subsequent works. Recently, Fulton et al. (2017) re-examined the radius distribution of small planets using the CKS catalog. Fulton et al. (2017) also noted a gap in the radius distribution at ${R}_{P}$ = 1.7 ${R}_{\oplus }$. This gap was predicted by various groups who modeled the erosion erosion of envelopes (e.g., Lopez & Fortney 2013; Owen & Wu 2013; Jin et al. 2014; Chen & Rogers 2016). We also resolve the gap that separates the super-Earth and sup-Neptune populations.

Panels (c) and (d) of Figure 8 show occurrence on a logarithmic scale to highlight domains of low planet occurrence. Although hot Jupiters are rare, they constitute a distinct island in the P${R}_{P}$ plane, surrounded by a sea of still lower occurrence. Hot planets of intermediate sizes are very rare. This triangular "Hot Planet Desert" has been noted by Mazeh et al. (2016) and Dong et al. (2018), and is thought to be due to photo-evaporative envelope stripping.

We also observe an factor of ≈5–10 increase in occurrence of Jupiters from P = 100–300 days. Cumming et al. (2008) analyzed the planets detected by radial velocities from the Keck Planet Search and observed that giant planet occurrence increases by a factor $\approx 5$ from P = 100–300 days. This rise in the occurrence of Jovian planets between 100–300 days, observed in both the Kepler stellar population and among nearby stars, may be associated with an ice-line at ∼1 au.

The joint distribution of planets in the P${R}_{P}$ plane shown in Figure 8 is an important quantitative description of the population of planets. For example, this distribution is useful for studies predicting planet yields in future surveys (e.g., Sullivan et al. 2015). To facilitate such studies, we sample this distribution and provide the periods and radii for a representative population of planets in a sample of  100,000  Sun-like stars in Table 9. This synthetic population also offers an convenient way to compute integrated planet occurrence (but not uncertainties) over arbitrary bins of P and ${R}_{P}$.

6. Planet Occurrence: Period, Radius, and Stellar Metallicity

Here, we analyze planet occurrence as a function of period, radius, and stellar metallicity. As a first step, we divided the super-Earth and sub-Neptune populations into two groups, depending on whether their host stars had sub- or super-solar metallicities. We then repeated the analysis from Section 5, modeling the occurrence distribution of both groups according to Equation (17).

Figure 9 shows the occurrence of super-Earths and sub-Neptunes as a function of orbital period for sub- and super-solar metallicity hosts. The occurrence rates for $P\gt 10\,\mathrm{days}$ are generally consistent for the two metallicity bins. For $P\,\lt 10\,\mathrm{days}$, the occurrence rates of super-Earths/sub-Neptunes is ≈2–3 times higher for super-solar metallicity hosts compared to sub-solar metallicity hosts.

Figure 9.

Figure 9. Panel (a) is analogous to Figure 7, except showing the period distribution of super-Earths at different host star metallicities. For example, red points show the number of super-Earths per 100 stars with super-solar metallicity per 0.25 dex period interval. Hot super-Earths are more common around metal-rich stars. Panel (b) same as (a) except for sub-Neptunes. Hot sub-Neptunes are also more common around metal-rich stars.

Standard image High-resolution image

To quantify the metallicity correlation for our different planet classes, we modeled occurrence using the following parametric function:

Equation (19)

This model extends the following exponential model,

Equation (20)

which was used in Fischer & Valenti (2005) and in several subsequent works. Given that $M=\mathrm{log}({n}_{\mathrm{Fe}}/{n}_{{\rm{H}}})-\mathrm{log}{({n}_{\mathrm{Fe}}/{n}_{{\rm{H}}})}_{\odot }$, Equation (20) is equivalent to a power law relationship between planet occurrence and the number density of iron atoms in a star's photosphere relative to hydrogen,

Equation (21)

We fit the distribution defined in Equation (19) using the methodology from Section 4.4 and list the associated model parameters in Table 7.

Table 7.  Best-fit Parameters for Planet-metallicity Distributions

  ${\mathrm{log}}_{10}C$ α β
Hot Super-Earth $-{2.22}_{-0.09}^{+0.09}$ $+{1.7}_{-0.1}^{+0.1}$ $+{0.6}_{-0.2}^{+0.2}$
Warm Super-Earth $+{0.05}_{-0.23}^{+0.22}$ $-{0.6}_{-0.2}^{+0.2}$ $-{0.3}_{-0.2}^{+0.2}$
Hot Sub-Neptune $-{2.95}_{-0.16}^{+0.15}$ $+{2.2}_{-0.2}^{+0.2}$ $+{1.6}_{-0.3}^{+0.3}$
Warm Sub-Neptune $-{0.85}_{-0.13}^{+0.14}$ $+{0.2}_{-0.1}^{+0.1}$ $+{0.5}_{-0.2}^{+0.2}$
Hot Sub-Saturn $-{4.93}_{-0.76}^{+0.65}$ $+{2.2}_{-0.7}^{+0.8}$ $+{5.5}_{-1.5}^{+1.6}$
Warm Sub-Saturn $-{2.59}_{-0.53}^{+0.53}$ $+{0.5}_{-0.4}^{+0.3}$ $+{2.1}_{-0.7}^{+0.7}$
Hot Jupiter $-{3.32}_{-0.31}^{+0.29}$ $+{0.9}_{-0.4}^{+0.4}$ $+{3.4}_{-0.8}^{+0.9}$
Warm Jupiter ... ... ...

Note. Best-fit parameters associated the joint period-metallicity occurrence model defined in Equation (19).

Download table as:  ASCIITypeset image

For the hot super-Earths we found β =  $+{0.6}_{-0.2}^{+0.2}$, indicating a significant positive metallicity correlation. That $\beta \sim 1$ for hot super-Earths means that their occurrence is nearly proportional to the number of iron atoms in a star's photosphere, relative to hydrogen. Through our MCMC modeling, we found that the metallicity index β was only weakly correlated with the period index α. In contrast, α and the normalization constant C were highly covariant.

The metallicity correlation steepens for larger planets. For hot sub-Neptunes, β =  $+{1.6}_{-0.3}^{+0.3}$. For hot sub-Saturns and hot Jupiters, our planet sample ${ \mathcal P }$ contains only  7  and  14 planets, respectively. Small sample size leads to larger uncertainties on β, and for hot sub-Saturns and Jupiters, β = $+{5.5}_{-1.5}^{+1.6}$ and β =  $+{3.4}_{-0.8}^{+0.9}$, respectively. Despite these larger uncertainties, it is clear that hot sub-Saturns and Jupiters have significantly steeper metallicity dependencies than hot super-Earths and sub-Neptunes.

Our occurrence rate model depends on both P and M, and thus cannot be displayed on a 2D plot. For display purposes, we performed an integration from P = 1–10 days. Figure 10 shows the number of planets belonging to different size classes per 100 stars having P = 1–10 days for various metallicity intervals spanning ΔM = 0.2 dex. As in Figure 7, the binned rates serve to guide the eye and are not used in the fitting.

Figure 10.

Figure 10. Panel (a): Points show the number of planets per 100 stars for bins of host star metallicity spanning ΔM = 0.2 dex having P = 1–10 days. Triangles represent upper limits (90%). The colors correspond to different planet size classes. We modeled the observed occurrence rates with an exponential model, ${df}\propto {P}^{\alpha }{10}^{\beta M}{dM}$, where β characterizes the strength of a metallicity correlation (see Section 6, Equation (19)). The solid lines and bands show the best-fitting model and the $1\sigma $ credible range of models, respectively. Metallicity correlates with the occurrence of super-Earths (β =  $+{0.6}_{-0.2}^{+0.2}$), sub-Neptunes (β =  $+{1.6}_{-0.3}^{+0.3}$), sub-Saturns (β =  $+{5.5}_{-1.5}^{+1.6}$), and Jupiters (β =  $+{3.4}_{-0.8}^{+0.9}$). The strength of the correlation increases with planet size. Panel (b): same as (a) except for planets with P = 10–100 days. Warm super-Earths are not correlated with metallicity (β =  $-{0.3}_{-0.2}^{+0.2}$). We observe positive correlations for larger planets. For warm sub-Neptunes (β =  $+{0.5}_{-0.2}^{+0.2}$); for warm sub-Saturns (β =  $+{2.1}_{-0.7}^{+0.7}$). Comparing the two panels, the metallicity correlation is stronger for P < 10 days.

Standard image High-resolution image

We observe different metallicity dependencies for more distant planets. As shown in Figure 10, warm super-Earths are consistent with no metallicity dependence β = $-{0.3}_{-0.2}^{+0.2}$. Warm sub-Neptunes show a positive metallicity correlation, but their power law index β = $+{0.5}_{-0.2}^{+0.2}$ is significantly shallower than that of the hot sub-Neptunes. The warm sub-Saturns have a positive metallicity correlation of β = $+{2.1}_{-0.7}^{+0.7}$, again more shallow than that of the hot sub-Saturns. Finally, our sample contains too few warm Jupiters (four) to search for a metallicity correlation.

In summary, some but not all planet subclasses are associated with stellar metallicity. Warm super-Earths exhibit no metallicity dependence. The planet-metallicity correlation steepens with increasing size and decreasing orbital period, with the hot Jupiters and hot sub-Saturns showing the steepest metallicity dependencies. We discuss some physical interpretation for these trends in the following section.

7. Summary and Discussion

The Kepler survey has revealed many valuable insights regarding the prevalence of planets with P ≲ 1 year. While numerous previous works have addressed Kepler planet occurrence in the P${R}_{P}$ plane,19 we have leveraged the high-precision, high purity CKS catalog to produce a clearer picture of planet occurrence as a function of orbital period and size (Section 5; Figures 68). This complements the work of Fulton et al. (2017) that focused on super-Earth and sub-Neptune occurrence rates with P = 1–100 days. Our occurrence rates may be easily incorporated into yield calculations for future surveys sensitive to planets with $P\lesssim 1\,\mathrm{year}$. We found a hot Jupiter occurrence rate of  ${0.57}_{-0.12}^{+0.14}$ %, which we compare to results from RV surveys in Section 7.3. While hot Jupiters are rare, we find that they occupy a distinct island in P${R}_{P}$ space, surrounded by a sea of still lower occurrence.

Using the CKS catalog combined with LAMOST metallicities for a large sample of Kepler field stars, we have computed planet occurrence as a function of host star metallicity (Section 6, Figures 9 and 10). We observe a clear association between stellar metallicity and the prevalence of certain types of planets. Figure 11 summarizes our findings: stellar metallicity is associated with the occurrence of hot planets ($P\lt 10\,\mathrm{days}$) and with the occurrence of large planets (${R}_{P}$ > 1.7 ${R}_{\oplus }$, $P\lt 100\,\mathrm{days}$).

Figure 11.

Figure 11. Summary of the planet-metallicity correlation across the P${R}_{P}$ plane. For each planet subclass studied, we display the average host star metallicity $\langle [\mathrm{Fe}/{\rm{H}}]\rangle $ and the β index, which quantifies the strength of the planet-metallicity correlation.

Standard image High-resolution image

While the occurrence of some types of planets correlate strongly with metallicity, others exhibit only a moderate or negligible correlation. The interpretation of these metallicity trends is not yet clear. There are many ways in which differences in metallicity might alter the processes of planet formation and evolution: through differences in the surface density of solids in the protoplanetary disk, the growth rate and abundance of solid cores, the availability of gas, the efficiency of gas accretion onto protoplanets, or the rate of disk migration. There may also be others. We offer some interpretations of these trends in Sections 7.17.3. Finally, we consider future studies that could extend our understanding of the connection between planets and host star metallicity in Section 7.4.

7.1. Metallicity and the Prevalence of Large Planets

We found that metallicity is associated with the prevalence of large planets. This trend is consistent with the planet-metallicity correlation for Jovian-mass planets found by RV surveys (Santos et al. 2004; Fischer & Valenti 2005) and with previous analyses of the metallicities of Kepler planet hosts (e.g., Buchhave et al. 2012). We observed that for smaller planets, the metallicity correlation weakens, which is consistent with trends observed in previous RV studies (e.g., Sousa et al. 2008; Ghezzi et al. 2010) and metallicity studies of Kepler planet hosts (e.g., Buchhave et al. 2012, 2014). Our analysis builds upon these previous studies by providing planet occurrence as a function of metallicity based on the high-precision, high purity CKS catalog of planet radii and metallicities.

There are roughly 15 warm super-Earths per 100 stars over the metallicities ranging from −0.4 to +0.4 dex. If we assume that stellar metallicity traces disk solid surface densities from 0.10 to 0.40 au (which correspond P ≈ 10–100 day orbits), then warm super-Earths can form with moderate efficiency, even in disks with low solid surface densities. Moreover, a factor of 6 increase in solids does not enhance the likelihood of a star to host a warm super-Earth.

RV and TTV mass measurements of super-Earths typically find masses of 1–10 ${M}_{\oplus }$ (Marcy et al. 2014, see Figure 7 of Sinukoff et al. 2017 for an updated census). The bulk densities of these planets are usually consistent with combinations of rock and iron, but are inconsistent with even small envelope fractions of ${M}_{\mathrm{env}}/{M}_{P}\sim 1 \% $, which would significantly reduce the observed bulk densities.

Even though sub-Neptunes are significantly larger than super-Earths, they share several important characteristics. Sub-Neptunes are typically 5–15 ${M}_{\oplus }$, which overlaps with the super-Earth mass range. Sub-Neptune bulk densities require H/He envelopes that are 1%–10% of the planet's mass, which significantly enlarge the planets without significantly altering their masses. Therefore, the typical sub-Neptune contains a comparable amount of solid material as a typical super-Earth. Also, their overall occurrence rates are roughly comparable at 5%–10% per 0.25 dex period interval (Figure 9).

We see an important difference between super-Earths and sub-Neptunes when we compare their dependencies on stellar metallicity. There are comparable numbers of warm super-Earths and warm sub-Neptunes around stars having $[\mathrm{Fe}/{\rm{H}}]$ = [−0.4, −0.2] dex, about 20 per 100 stars. Unlike the warm super-Earths, the occurrence of warm sub-Neptunes increases with metallicity. There are about 40 warm sub-Neptunes per 100 stars having $[\mathrm{Fe}/{\rm{H}}]$ = [+0.2, +0.4] dex.

Perhaps the enriched disks of metal-rich stars form super-Earths more efficiently, but they easily acquire the few percent envelopes and are converted into sub-Neptunes. This could occur if super-Earths form more quickly in metal-rich disks, allowing for more time to accrete 1%–10% gaseous envelopes.

Such a timing argument was proposed by Dawson et al. (2015), to explain an apparent absence of warm super-Earths around the highest metallicity stars ($[\mathrm{Fe}/{\rm{H}}]$ > 0.18 dex) in the Buchhave et al. (2014) metallicity catalog.20 While only 12/132 (9%) of our warm super-Earths have $[\mathrm{Fe}/{\rm{H}}]$ > 0.18 dex, this is consistent with the fact that only 12% of field stars have $[\mathrm{Fe}/{\rm{H}}]$ > 0.18 dex. As shown in Figures 4 and 10, we do not observe such an absence of warm super-Earths, but instead find nearly equal occurrence of warm super-Earths around stars of wide-ranging metallicities. This underscores the importance of having accurate knowledge of the distribution of field star metallicities in planet-metallicity studies. Nevertheless, accelerated formation of cores in metal-rich disks may explain the rise in warm sub-Neptune occurrence with metallicity.

An alternative to the timing argument, discussed above, is that high disk metallicities may somehow increase the rate of envelope accretion. However, the gas accretion models of Pollack et al. (1996) and (more recently) Lee & Chiang (2015) predict the opposite effect: dusty envelopes should accrete more slowly, due to higher opacities and longer cooling timescales.

As we consider larger warm planets, the metallicity correlation becomes stronger. Sub-Saturns have a β index of $+{2.1}_{-0.7}^{+0.7}$. If disk metallicity assists with the accretion of gas through accelerated core formation, it may also explain the sub-Saturns. However, abrupt strengthening of the metallicity correlation above 4 ${R}_{\oplus }$ and the roughly order of magnitude decrease in the frequency of sub-Saturns compared to sub-Neptunes suggests the formation pathways of sub-Saturns and sub-Neptunes may be quite different.

Around 20 sub-Saturns have well-measured masses from either RVs or TTVs (see Petigura et al. 2017b for a recent compilation). For sub-Saturns, we observe an order of magnitude scatter in the observed masses at a given size, indicating a diversity in core and envelope masses. While some sub-Saturns have ≈5 ${M}_{\oplus }$ cores, similar to the super-Earths and sub-Neptunes, many have cores of ≈50 ${M}_{\oplus }$. In addition, the most massive sub-Saturns tend to be found around the most metal-rich hosts (Petigura et al. 2017b).

The existence of ≈50 ${M}_{\oplus }$ cores in planets with 20% envelope fractions challenges the classic core-accretion models of Pollack et al. (1996) that predict that cores larger than 10 ${M}_{\oplus }$ should undergo runaway accretion. Perhaps these massive sub-Saturn cores are the result of the late-stage mergers (or series of mergers) of 10 ${M}_{\oplus }$ cores. This formation scenario requires one or more closely spaced 10 ${M}_{\oplus }$ planets. As we have shown, the probability for a star to produce a 10 ${M}_{\oplus }$ core increases with metallicity. Therefore, the probability for a star to produce two 10 ${M}_{\oplus }$ cores likely increases with a steeper power law index. Thus, the production of sub-Saturns by collisions may explain the mass–metallicity dependence and the steeper relationship between metallicity and planet occurrence. This could also explain the mass–metallicity correlation. This theory also predicts that if late-stage mergers play a large role in the formation of sub-Saturns, they should produce relic eccentricities that are observable at later times.

Our planet sample includes only four warm Jupiters, which is insufficient to search for trends. Fischer & Valenti (2005) analyzed 1040 FGK-type stars from the Keck, Lick, and Anglo-Australian Telescope planet search programs and found a planet-metallicity correlation with a $\beta =2$ index. However, this sample included planets with P = 1–4000 days, with the bulk of the sample having $P\gt 300\,\mathrm{days}$ longer than the periods considered here.

7.2. Metallicity and the Prevalence of Short-period Planets

We found that metallicity is associated with the presence of short-period planets having P < 10 days. This is consistent with trends observed by Mulders et al. (2016) and Dong et al. (2018), who studied the host star metallicities for samples of 665 and 295 planets, respectively. Both studies used spectroscopically constrained metallicities from LAMOST. Our analysis incorporates the high-precision CKS metallicities for a larger sample of  970  planets.

While close-in planets, of all sizes, are rare compared to more distant planets, their frequency increases as a function of stellar metallicity. In Section 5, when considering just the period distribution of super-Earths and sub-Neptunes, we found that for $P\lt 10\,\mathrm{days}$, the occurrence of super-Earths and sub-Neptunes declines with decreasing period. The slope of this falloff is approximated by ${df}\propto {P}^{\alpha }d\mathrm{log}P$, with α values of ${2.4}_{-0.3}^{+0.4}$ and ${2.3}_{-0.2}^{+0.2}$, respectively.

In Section 6, when considering the joint P$[\mathrm{Fe}/{\rm{H}}]$ occurrence distribution, we found that both hot super-Earths and sub-Neptunes are correlated with host star metallicity. This is especially noteworthy for the hot super-Earths, given that the warm super-Earths exhibit no such correlation. For $P\lt 10\,\mathrm{days}$, the frequency of hot super-Earths grows from 4% for stars having $[\mathrm{Fe}/{\rm{H}}]$ = [−0.4, −0.2] dex to 10% for stars having $[\mathrm{Fe}/{\rm{H}}]$ = [+0.2, +0.4] dex (see Figure 10). We see an even steeper increase in the hot sub-Neptune rates from 1% to 8% for the same metallicity intervals. In Sections 7.2.1 and 7.2.2, we consider two possible formation pathways: (1) in situ models, where planets form near their final location, and (2) high eccentricity migration, where the semimajor axes of fully formed planets are altered by major scattering events. In Section 7.2.3, we consider some observational tests that may distinguish between these scenarios.

7.2.1. In Situ Formation

The prevalence of compact multi-planet systems of super-Earths/sub-Neptunes discovered by Kepler helped inspire in situ planet formation models (see, e.g., Hansen & Murray 2013 and Chiang & Laughlin 2013). Chiang & Laughlin (2013) introduced the minimum mass extrasolar nebula (MMESN) as a means of estimating the surface density profile ${\rm{\Sigma }}(a)$ of the protoplanetary disk. They took a sample of Kepler planets, estimated the mass in solids, and smeared those solids out over an area $2\pi {a}^{2}$. The MMESN assumes no large-scale migration of solid building blocks or planets. In the context of the MMESN framework, the falloff in super-Earth and sub-Neptune occurrence at $P\lt 10\,\mathrm{days}$ reflects a declining surface density profile. Lee & Chiang (2017) offered a physical explanation for declining density profiles. In their models, disks are truncated by interactions with the protostar magnetosphere at the co-rotation radius.

The existence of hot sub-Neptunes presents a challenge to strictly in situ models, because sub-Neptunes with P < 10 days are vulnerable to photo-evaporation. Owen & Wu (2017) performed a population synthesis starting with 3–10 ${M}_{\oplus }$ cores and a period distribution of ${df}\propto {P}^{1.9}d\mathrm{log}P$, which is similar to the power law index presented in this work. They then gave the simulated planets a range of envelope fractions from 1% to 30% and tracked the radii of these planets as they were subjected to photo-evaporation. The Owen & Wu (2017) simulations produced very few sub-Neptunes with $P\lt 10\,\mathrm{days}$.

In the Lee & Chiang (2017) model, hot super-Earth/sub-Neptune occurrence rates are set by stellar rotation on the pre-main sequence. It is not clear if stellar metallicity is correlated with pre-main-sequence rotation rates, and Lee & Chiang (2017) did not predict a metallicity dependence for hot super-Earths/sub-Neptunes.

Metallicity may still influence the vertical and radial profiles of the inner disk, even if there is no correlation with stellar rotation. The inner regions of high metallicity disks may have higher densities of free electrons, which could result in stronger coupling to stellar magnetic fields and change the accretion rate and density profile of the disk. The formation of hot super-Earths/sub-Neptunes may also involve significant radial transport of solids, either through the inward drift of "pebbles" that are no longer aerodynamically coupled to the gas disk (see, e.g., Lambrechts & Lega 2017, and references therein). The efficiency of this process is also governed by the radial and vertical density profiles.

Future theoretical modeling of protoplanetary disks is needed to investigate whether in situ models can reproduce the metallicity dependence observed here. In situ models tend to produce dynamically cool planetary systems, which can be corroborated by present-day eccentricity and obliquity measurements

7.2.2. High Eccentricity Migration

High eccentricity migration may explain the hot super-Earth/sub-Neptune period and metallicity distributions. Some of the hot super-Earths/sub-Neptunes may be the result of scattering events between multiple planets. Such encounters preserve the total orbital energy of both planets, but the semimajor axes of both planets may be significantly altered. Dawson & Murray-Clay (2013) proposed a similar mechanism to explain the fact that high eccentricity giant planets with a = 0.1–1.0 au, almost exclusively orbit stars with super-solar metallicities.

A formation pathway involving scattering may help to explain the existence of hot sub-Neptunes, which are at risk of photo-evaporative stripping (see Section 7.2.1). Scattering events may occur late enough such that a sub-Neptune is not subjected to the majority its star's XUV output ($t\gtrsim 100\,\mathrm{Myr}$).

For hot sub-Neptunes, the metallicity correlation is stronger (β =  $+{1.6}_{-0.3}^{+0.3}$) than for the warm sub-Neptunes (β = $+{0.5}_{-0.2}^{+0.2}$). This steeper relationship may support a scattering interpretation given that the probability of scattering depends on the probability of a disk producing two closely spaced planets of comparable mass. If high eccentricity migration is active, the present-day systems planetary systems should be dynamically hot, provided that star–planet interactions are not strong enough to circularize or realign orbits. Again, this may be corroborated with eccentricity and obliquity measurements.

7.2.3. Observational Tests

Additional clues to the formation of hot super-Earths/sub-Neptunes may lie in their orbital eccentricities and degree of alignment with stellar spin axes (obliquities). If planets initially form in the plane of their disks, but are then scattered by other planets, we predict relic eccentricities and obliquities. If these planets form in relative isolation, and are simply a product of the local disk density profile, their eccentricities and obliquities should be low.

Current state-of-the-art RV facilities have provided mass measurements of several dozen hot super-Earths and sub-Neptunes, but are not precise enough to measure orbital eccentricities well. Obliquity measurements via the Rossiter–McLaughlin technique have been made for only a handful of sub-Jovian-size planets (e.g., HAT-P-11, Winn et al. 2010; Hirano et al. 2011). As the precision and observing efficiency of RV facilities improve, we should be able to place tighter constraints on the orbits of these hot planets.

7.3. Hot Jupiters

Hot Jupiters represent an extreme of planet formation, and many theories have been proposed to explain their formation. A non-exhaustive list of theories includes star–planet Kozai (e.g., Wu & Murray 2003), planet–planet Kozai (e.g., Naoz et al.2011), smooth disk-driven Type-I migration (e.g., Ida & Lin 2008), high eccentricity migration, and in situ formation (e.g., Batygin et al. 2016). Hot Jupiters are also of interest because they offer a point of comparison between the Kepler population of planets and those around nearby stars.

We find a hot Jupiter rate of  ${0.57}_{-0.12}^{+0.14}$ %,21 which is about $1.5\sigma $ larger than 0.4 ± 0.1% reported by Howard et al. (2012). This difference cannot be due to differences the treatment of completeness; for hot Jupiters, ${p}_{\det }$ is nearly unity.

The larger occurrence rate presented here is likely due to the improved radius measurements in the CKS sample. Howard et al. (2012) included 13 hot planets with ${R}_{P}$ = 5.6–8.0 ${R}_{\oplus }$, slightly too small to be classified as a hot Jupiter by their criteria. With our improved CKS planet radii, we have found that these large, hot sub-Saturns are exceptionally rare. Our planet sample ${ \mathcal P }$ includes just 3 such planets, while Howard et al. (2012) included 13, even though their stellar parent is less than twice as large as ours ${n}_{{ \mathcal S }}$ (58,041 versus  33,020). Some of the hot sub-Saturns in Howard et al. (2012) were misidentified hot Jupiters.

A major challenge in comparing the hot Jupiter rates in the Kepler and the solar neighborhood is their low intrinsic occurrence. Even though our planet sample ${ \mathcal P }$ was drawn from a parent stellar sample ${ \mathcal S }$ containing  33,020  stars, ${ \mathcal P }$ contained just  14  detections due to low intrinsic occurrence and the requirement of transiting geometries. Even blind RV surveys which are not limited to transiting geometries must observe hundreds of stars to detect a few hot Jupiters. Another challenge is that RV surveys are typically not magnitude-limited samples like ${ \mathcal S }$. RV surveys often prioritize stars that are single, slowly rotating, and have stellar properties that fall within a preselected range of interest (e.g., M-stars or evolved stars). Wright et al. (2012) attempted to retroactively remove these selection effect from the California Planet Search database, to produce a magnitude-limited sample of stars that could be directly compared to the Kepler stellar sample. Wright et al. (2012) reported 10 hot Jupiters from a sample of 836 stars or a rate of 1.2 ± 0.4%, twice the rate presented here. However, the large fractional uncertainty in the Wright et al. (2012) measurement means that these two results differ by only 1.5σ.

In Section 6, we modeled the hot Jupiter occurrence as ${df}\propto {P}^{\alpha }{10}^{\beta M}d\mathrm{log}{PdM}$ and found a β index of $+{3.4}_{-0.8}^{+0.9}$. This strong association between hot Jupiters and metallicity has also been observed by RV stars. Fischer & Valenti (2005) studied the occurrence of Jupiter-mass planets in the CPS sample and modeled their occurrence according to ${df}\propto {10}^{\beta M}$ and β index of 2.0. Johnson et al. (2010) performed a similar analysis, considered an additional mass dependence ${df}\propto {M}_{\star }^{\alpha }{10}^{\beta M}$ and found a β index of 1.2 ± 0.2. It is worth recalling that both studies included long period planets ($P\gt 1\,\mathrm{year}$), which constituted the bulk of the sample. Guo et al. (2017) repeated the Johnson et al. (2010) analysis, but restricted the study to hot Jupiters and found β = 2.1 ± 0.7. While this result differs with our measurement of β by about 2σ, it is clear that hot Jupiters in the solar neighborhood and in the Kepler field are both strongly associated with metallicity.

In the previous sections, we have noted the general tendency for the metallically correlation to steepen with decreasing orbital period and increasing planet size, and have hypothesized some processes that could account for these trends such as accelerated core/envelope growth, higher disk densities, and planet–planet scattering. Given this general trend, it is perhaps not surprising that hot Jupiters would have the strongest metallicity correlation. However, given that the hot Jupiters occupy a distinct island on the P${R}_{P}$ plane, it is possible that their formation pathway is radically different than that of the other planet classes considered here.

7.4. Future Survies

Our target sample contains very few stars (16) with metallicities below −0.4 dex. As a result, we are insensitive to planet occurrence at low metallicities. Probing planet occurrence at low metallicities would shed light on some of the questions raised by this analysis, such as, What is the minimum metallicity needed to form a warm super-Earth?

Since metal-poor stars are rare, a blind survey like Kepler must target a large number of stars in order to capture sufficient numbers of low metallicity stars. There is some room for improvement with the existing Kepler sample. Our sample was drawn from ≈38,000 of the ≈150,000 stars observed by Kepler due to our magnitude limit. Characterizing the metallicities of Kepler planet hosts and field stars fainter than ${Kp}$ = 14.2 mag would enlarge the sample of metal-poor stars. However, given that these new stars would be between ${Kp}$ = 14–16 mag, their amenability to the detection of small planets is poor.

Upcoming missions like TESS (Ricker et al. 2015) and PLATO (Rauer et al. 2014) will survey more stars than Kepler and thus cast a significantly larger net for rare metal-poor stars. Gaia will also identify metal-poor stars across the sky. A targeted RV survey of low metallicity stars would also probe planet occurrence at low metallicities. Such a survey would not be restricted to transiting planets, and would need to survey fewer stars to gather statistically significant planet samples.

Finally, one could search for planets in globular clusters. Gilliland et al. (2000) conducted an important early transit search with HST, which targeted 47 Tucanae ($[\mathrm{Fe}/{\rm{H}}]$ = −0.78 dex). The expected yield was 17 hot Jupiters, but the survey yielded no detections, which was attributed to low metallicities. Recently, Masuda & Winn (2017) re-assessed the significance of the null result, using updated knowledge of the typical size, periods, and host star properties of hot Jupiters. Their revised yield calculation was 2 ± 1, and thus the null result does not require a metallicity effect. However, based on the steep dependence of hot Jupiter occurrence with metallicity, yields would likely be low. Future globular cluster surveys with higher precision or longer baselines would help constrain planet population at −1 dex. However, high stellar densities of globulars may complicate direct comparisons to field stars.

8. Conclusion

We have measured planet occurrence as a function of period, radius, and stellar metallicity within a magnitude-limited sample of Kepler target stars. We have leveraged precise planet radii and stellar metallicities from the CKS catalog of Kepler planet-hosting stars, as well as the metallicity distribution of Kepler field stars measured by LAMOST. In summary, for $P\lt 100\,\mathrm{days}$ the default planetary system contains either no planets detectable by Kepler (e.g., the solar system) or a system of one or more super-Earths/sub-Neptunes with P = 10–100 days. Stars with high metallicity are associated with some mechanism(s) that also allows for "misplaced" planets: sub-Saturns/Jupiters with P < 100 days and super-Earths/sub-Neptunes with P < 10 days. We have noted the general features of planet occurrence as a function of P, ${R}_{P}$, and $[\mathrm{Fe}/{\rm{H}}]$, and have offered some speculation regarding their physical origins. Detailed population synthesis models that treat the disk profile, growth of cores, migration, dynamical instabilities, and photo-evaporation must produce these features.

The CKS project was conceived, planned, and initiated by A.W.H., G.W.M., J.A.J., H.T.I., and T.D.M. A.W.H., G.W.M., J.A.J. acquired Keck telescope time to conduct the magnitude-limited survey. We thank the many observers who contributed to the measurements reported here.

We thank the referee, Lars Buchhave, for his detailed and thoughtful comments on the techniques and interpretations. We also thank Konstantin Batygin, Brendan Bowler, Ian Crossfield, and Eve Lee for enlightening conversations that improved the final manuscript.

Kepler was competitively selected as the tenth NASA Discovery mission. Funding for this mission is provided by the NASA Science Mission Directorate. We thank the Kepler Science Office, the Science Operations Center, the Threshold Crossing Event Review Team (TCERT), and the Follow-up Observations Program (FOP) Working Group for their work on all steps in the planet discovery process, ranging from selecting target stars and pointing the Kepler telescope to developing and running the photometric pipeline to curating and refining the catalogs of Kepler planets.

The Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope, LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

E.A.P. acknowledges support from Hubble Fellowship grant HST-HF2-51365.001-A, awarded by the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., for NASA under contract NAS 5-26555. L.M.W. acknowledges support from Gloria and Ken Levy and from the Trottier Family. T.D.M. acknowledges NASA grant NNX14AE11G.

This work made use of NASA's Astrophysics Data System Bibliographic Services.

Finally, the authors wish to recognize and acknowledge the very significant cultural role and reverence that the summit of Maunakea has always had within the indigenous Hawaiian community. We are most fortunate to have the opportunity to conduct observations from this mountain.

Software: All code used in this paper is available at https://github.com/California-Planet-Search/cksmet/. We made use of the following publicly available Python modules: astropy (Astropy Collaboration et al. 2013), isoclassify (Huber et al. 2017), lmfit (Newville et al. 2014), matplotlib (Hunter 2007), numpy/scipy (van der Walt et al. 2011), pandas (McKinney 2010), and pinky (https://github.com/pgromano/pinky).

Appendix A: Calibration of LAMOST Metallicities

This paper relies on CKS metallicities for the planet-hosting stars and LAMOST metallicities to characterize the metallicity distribution of Kepler field stars. Differences in metallicity scales often result from different spectral data sets or spectroscopic pipelines. The CKS and LAMOST surveys used different spectra, line-lists, and model-atmospheres. Here, we assess the agreement between the CKS and LAMOST metallicity scales, correct for a small offset, and report the uncertainty in our calibration.

There are  476  stars observed by both CKS and LAMOST. Figure 12 shows the difference between LAMOST and CKS metallicities (${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]$) as a function of CKS metallicity, ${Kp}$, and the S/N of the LAMOST spectra. On average, the LAMOST metallicities are 0.04 dex lower, and we observe a small metallicity-dependent systematic trend. The dispersion of ${\rm{\Delta }}[\mathrm{Fe}/{\rm{H}}]$ increases for fainter stars or decreasing LAMOST S/N (the CKS spectra have homogeneous S/N). This indicates that the precision of the LAMOST metallicities declines with decreasing S/N.

Figure 12.

Figure 12. Difference between CKS and LAMOST $[\mathrm{Fe}/{\rm{H}}]$ for  476  stars in common as a function of CKS metallicity (a), ${Kp}$ (b), and the reported S/N of the LAMOST spectra (c). We observe a small metallicity-dependent systematic difference, which we calibrate out in Appendix A.

Standard image High-resolution image

We derived a correction Δ that calibrates the LAMOST metallicities onto the CKS scale via ${\mathrm{LAMOST}}_{\mathrm{cal}}\,={\mathrm{LAMOST}}_{\mathrm{raw}}+{\rm{\Delta }}$. The correction is a linear function of the following form:

Equation (22)

The coefficients are chosen such that they minimize the rms difference between the calibrated LAMOST and CKS parameters. Before fitting, we removed  17  stars where the CKS and LAMOST metallicities differed by more than 0.2 dex. These outliers are likely due to rare failure modes of the LAMOST pipeline.22

We estimated the uncertainties on the best-fit parameters using bootstrap resampling (with replacement), as described in Press (2002). The best-fit coefficients are c0 =  0.037  ±  0.002  and c1 =  −0.015  ±  0.001. We show a one-to-one comparison of the CKS and LAMOST metallicities (without the  17  outliers) in Figure 13. After placing the LAMOST metallicities on the CKS scale, there is no residual mean offset (by construction) and a rms dispersion of 0.05 dex.

Figure 13.

Figure 13. Comparison of CKS and LAMOST metallicities. (Left) Uncalibrated LAMOST metallicities as a function of CKS metallicities for stars in common. The green dashed line represents equality. On average, the LAMOST metallicities are 0.05 dex lower than the CKS values. (Right) comparison of CKS and LAMOST metallicities after removing a linear systematic trend.

Standard image High-resolution image

Our calibration amounts to a correction of  +0.110 ± 0.007 dex at $[\mathrm{Fe}/{\rm{H}}]$ = −0.5 dex and  −0.036 ± 0.007 dex at $[\mathrm{Fe}/{\rm{H}}]$ = +0.5 dex. The uncertainty associated with our correction at −0.5 and +0.5 dex sets an upper bound on the residual systematic offset between the LAMOST and CKS metallicity scales relevant to this paper, which treats planet hosts having metallicities between −0.5 and +0.5 dex. We estimate that the CKS and the calibrated LAMOST metallicities are consistent to 0.01 dex.

To verify that the main results of this paper are not sensitive to zero-point offsets between the CKS and LAMOST metallicities scales, we performed a parallel analysis after perturbing our calibrated LAMOST metallicities by 0.01 dex. We compared the slopes of the metallicity distributions for different planet classes, which is quantified by the β index in Equation (19). Adding 0.01 dex to the LAMOST metallicities increases ${n}_{\mathrm{trial}}$ in the high metallicity bins, which results in lower measured occurrence at high metallicities. For low metallicity bins, ${n}_{\mathrm{trial}}$ decreases, resulting larger measured occurrence. The net effect is that the slope of the metallicity distribution becomes less steep (i.e., smaller β indices). (Subtracting 0.01 dex from the calibrated LAMOST metallicities has the opposite effect and results in larger beta-indices.)

Figure 14 is analogous to Figure 11, but we created it after shifting the LAMOST metallicities by +0.01 dex. Comparing the β indices, these two figures summarize the effects of a possible offset in metallicity scales. For all planet classes, β decreases, as expected. For example, for warm super-Earths, β decreases from −0.3 ± 0.2 to −0.4 ± 0.2; for warm sub-Neptunes, β decreases from +0.5 ± 0.2 to +0.4 ± 0.2. For all planet classes, the change in β is smaller than our adopted $1\sigma $ uncertainties from our maximum-likelihood fitting, which only incorporates counting statistics. Therefore, we conclude that the uncertainties on the β indices due to our LAMOST-CKS calibration are smaller than those due to counting statistics. Thus, the conclusions of this paper are not sensitive to errors in our LAMOST-CKS metallicity calibration.

Figure 14.

Figure 14. Same as Figure 11, but after adding 0.01 dex to the LAMOST metallicities to simulate the effect of residual offsets between the CKS and LAMOST metallicity scales. For all planet classes, the change in β is smaller than the adopted uncertainty on β, which incorporates counting statistics alone. The uncertainties on β are therefore dominated by counting statistics, rather than uncertainties in our CKS-LAMOST metallicity calibration.

Standard image High-resolution image

Appendix B: Metallicity and Planet Detectability

In Section 3, we compared the properties of planets belonging to host stars of different metallicities, and in Section 6, we computed planet occurrence as a function of host star metallicity. For both calculations, we made the assumption that planet detectability is not a strong function of host star metallicity. Here, we check the validity of such an assumption.

Metallicity could correlate with planet detectability through a correlation with stellar size or noise properties. As a direct measure of the possible dependence of planet detectability with metallicity, we computed the single transit signal-to-noise ratio (S/N1) of a 3-hour transit of putative 1 ${R}_{\oplus }$ planet transiting each star in the LAMOST sample. S/N1 was computed according to

Equation (23)

where ${R}_{P}$ = 1 ${R}_{\oplus }$ and ${R}_{\star }$ was computed using LAMOST ${T}_{\mathrm{eff}}$, $\mathrm{log}g$, and $[\mathrm{Fe}/{\rm{H}}]$ measurements and the publicly available isoclassify package (Huber et al. 2017).23 As Figure 15 shows, we do not observe a significant dependence of planet detectability with stellar metallicity.

Figure 15.

Figure 15. Planet detectability vs. metallicity. The y-axis shows the signal-to-noise ratio of a putative 1 ${R}_{\oplus }$ planet, which has a transit duration of three hours. We do not observe a significant trend in planet detectability with metallicity.

Standard image High-resolution image

Appendix C: Planet Occurrence: Period and Radius

Here we provide two tables to supplement Section 5, which treats planet occurrence as a function of period and radius. Table 8 lists the occurrence measurements displayed in Figure 6 along with upper limits. Table 9 is a sampling of the occurrence distribution shown in Figure 8. We sample the occurrence the where the following conditions are met:

  • 1.  
    P = 1–300 days (i.e., the range of periods shown in Figure 8).
  • 2.  
    ${R}_{P}$ = 0.5–32 ${R}_{\oplus }$ (i.e., the range of planet sizes shown in Figure 8).
  • 3.  
    $\mathrm{log}({R}_{P}/{R}_{\oplus })\gt 0.15\,\mathrm{log}(P/1\,\mathrm{day})$, which ensures that pipeline completeness, ${p}_{\det }\,\gt \,0.25$.

The integrated occurrence within this domain is  110.7  planets per 100 stars. We simulated the periods and radii of a population of  110,733  planets in a sample of  100,000  Sun-like stars by drawing  110,733  (P, ${R}_{P}$) pairs according to their measured occurrence rates using the Python package pinky. These samples are listed in Table 9.

Table 8.  Planet Occurrence

${R}_{P,1}$ ${R}_{P,2}$ P1 P2 ${n}_{\mathrm{pl}}$ ${p}_{\det }$ ${n}_{\mathrm{trial}}$ ${f}_{\mathrm{cell}}$
1.00 1.41 1.00 1.78 7 0.95 6551.6 ${0.12}_{-0.04}^{+0.05}$
1.00 1.41 1.78 3.16 15 0.93 4369.1 ${0.36}_{-0.08}^{+0.10}$
1.00 1.41 3.16 5.62 40 0.89 2874.8 ${1.41}_{-0.21}^{+0.23}$
1.00 1.41 5.62 10.00 48 0.84 1847.5 ${2.63}_{-0.36}^{+0.39}$
1.00 1.41 10.00 17.78 51 0.76 1144.3 ${4.50}_{-0.59}^{+0.64}$
1.00 1.41 17.78 31.62 23 0.66 673.5 ${3.50}_{-0.65}^{+0.76}$
1.00 1.41 31.62 56.23 10 0.53 370.5 ${2.87}_{-0.78}^{+0.95}$
1.00 1.41 56.23 100.00 4 0.39 187.4 ${2.49}_{-0.97}^{+1.26}$
1.00 1.41 100.00 177.83 0 0.27 86.4 $\lt 2.61$
1.00 1.41 177.83 316.23 0 0.16 36.3 ...

Note. Planet occurrence computed over various intervals in the P${R}_{P}$ plane spanning 0.25 dex and 0.15 dex, respectively. Each bin boundary is given by $[{R}_{P,1}$, ${R}_{P,2}]$ and [P1, P2] respectively. We list the number of planets per bin ${n}_{\mathrm{pl}}$, the pipeline detectability ${p}_{\det }$, and the number of effective trials ${n}_{\mathrm{trial}}$. The number of planets per 100 stars per bin is given by ${f}_{\mathrm{cell}}$, which is also shown graphically in Figure 6. We report 90% upper limits on ${f}_{\mathrm{cell}}$ when there are no planets in a bin, and we do not report ${f}_{\mathrm{cell}}$ when ${p}_{\det }\lt \,0.25$.

Only a portion of this table is shown here to demonstrate its form and content. A machine-readable version of the full table is available.

Download table as:  DataTypeset image

Table 9.  Simulated Planet Properties

Planet P ${R}_{P}$
  days ${R}_{\oplus }$
0 45.194 1.185
1 94.327 0.964
2 56.235 1.100
3 44.367 2.254
4 26.778 0.827
5 67.644 6.516
6 28.391 1.366
7 236.817 7.830
8 336.385 11.736
9 22.677 2.457

Note. Simulated periods and radii of  110,733  planets in a population of  100,000  stars based on the measured occurrence rates from Kepler (Section 5). This table may be used to compute yield simulations for future surveys or integrated occurrence values over arbitrary bins of P and ${R}_{P}$.

Only a portion of this table is shown here to demonstrate its form and content. A machine-readable version of the full table is available.

Download table as:  DataTypeset image

Footnotes

  • Kepler magnitude; ${Kp}\approx V-0.4\,\mathrm{mag}$ for G2 stars.

  • 10 

    Alphabetical by author: Adams et al. (2012, 2013), Baranec et al. (2016), Cartier et al. (2015), Dressing et al. (2014), Everett et al. (2015), Gilliland et al. (2015), Horch et al. (2012, 2014), Howell et al. (2011), Law et al. (2014), Lillo-Box et al. (2012, 2014), Wang et al. (2015a, 2015b).

  • 11 
  • 12 

    Exoplanet Archive (Akeson et al. 2013).

  • 13 
  • 14 

    We use M interchangeably with $[\mathrm{Fe}/{\rm{H}}]$ for compactness.

  • 15 

    In this paper, $\mathrm{log}$ only refers to log10. Natural logs are always expressed as $\mathrm{ln}$.

  • 16 

    Here, the factorials have been replaced with gamma functions via ${\rm{\Gamma }}(x+1)=x!$, and $(n+1)$ is a normalization constant.

  • 17 

    Combined Differential Photometric Precision (CDPP; Jenkins et al. 2010).

  • 18 

    We used the affine invariant sampler of Goodman & Weare (2010), as implemented in Python by Foreman-Mackey et al. (2013).

  • 19 

    A non-exhaustive list includes Youdin (2011), Catanzarite & Shao (2011), Traub (2012), Howard et al. (2012), Fressin et al. (2013), Dong & Zhu (2013), Petigura et al. (2013), Foreman-Mackey et al. (2014), Burke et al. (2015), Dressing & Charbonneau (2015), and Mulders et al. (2015).

  • 20 

    The Dawson et al. (2015) definition of warm super-Earths was slightly different than ours: ${R}_{P}\lt 1.5\,{R}_{\oplus }$, $P\lt 15\,\mathrm{days}$.

  • 21 

    The rate of  ${0.57}_{-0.12}^{+0.14}$% is slightly higher than a simple sum of the most likely rates from each hot Jupiter bin from Figure 6 due to the asymmetric uncertainties from low counts per bin.

  • 22 

    To verify that our calibration does not depend sensitively on our choice to remove outliers, we left them in but solved for the coefficients that minimized the sum of the absolute differences, a metric which is resistant to outliers. The coefficients agreed to 1.5σ or better.

  • 23 
Please wait… references are loading.
10.3847/1538-3881/aaa54c