About 30% of Sun-like Stars Have Kepler-like Planetary Systems: A Study of Their Intrinsic Architecture

, , , , and

Published 2018 June 18 © 2018. The American Astronomical Society. All rights reserved.
, , Citation Wei Zhu et al 2018 ApJ 860 101 DOI 10.3847/1538-4357/aac6d5

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/860/2/101

Abstract

We constrain the intrinsic architecture of Kepler planetary systems by modeling the observed multiplicities of the transiting planets (tranets) and their transit timing variations (TTVs). We robustly determine that the fraction of Sun-like stars with Kepler-like planets, ηKepler, is 30 ± 3%. Here, Kepler-like planets are planets that have radii Rp ≳ R and orbital periods P < 400 days. Our result thus significantly revises previous claims that more than 50% of Sun-like stars have such planets. Combined with the average number of Kepler planets per star (∼0.9), we obtain that on average each planetary system has 3.0 ± 0.3 planets within 400 days. We also find that the dispersion in orbital inclinations of planets within a given planetary system, σi,k, is a steep function of its number of planets, k. This can be parameterized as ${\sigma }_{i,k}\propto {k}^{\alpha }$ and we find that −4 < α < −2 at the 2σ level. Such a distribution well describes the observed multiplicities of both transits and TTVs with no excess of single-tranet systems. Therefore we do not find evidence supporting the so-called "Kepler dichotomy." Together with a previous study on orbital eccentricities, we now have a consistent picture: the fewer planets in a system, the hotter it is dynamically. We discuss briefly possible scenarios that lead to such a trend. Despite our solar system not belonging to the Kepler club, it is interesting to notice that the solar system also has three planets within 400 days and that the inclination dispersion is similar to Kepler systems of the same multiplicity.

Export citation and abstract BibTeX RIS

1. Introduction

The term "planet occurrence rate" has two different interpretations: the average number of planets per star and the fraction of stars with planets. These two quantities are different, unless all planetary systems have only one planet. With transiting planets (tranets; Tremaine & Dong 2012) from surveys such as the Kepler mission (Borucki et al. 2010), one can constrain the first but not the second (Youdin 2011), unless assumptions of the intrinsic architecture (e.g., the orbital inclination distribution and/or the intrinsic multiplicity function) are made. This is because, to determine the average number of planets per star, one needs to compute the probability that individual planet transits, which only involves the orbital period of the planet (or more precisely, the ratio of stellar radius to orbital separation, R/a). The distribution of orbital periods can be reconstructed from the orbital periods of observed tranets. However, to determine the fraction of stars with planets, one needs to compute the probability that a given star has at least one tranet. This involves the orbital inclinations of all planets around this star, in addition to the orbital periods of these planets. The distribution of planetary inclinations cannot be reconstructed from the tranet sample because, by definition, all tranets have ∼90° inclinations.

The commonly accepted result says that more than 50% of Sun-like stars have Kepler-like planets (Fressin et al. 2013; Petigura et al. 2013; Winn & Fabrycky 2015). However, they used the transit probability of the innermost planet (as in Fressin et al. 2013, or the most easily detected one as in Petigura et al. 2013) as the probability that at least one planet transits. Therefore, their estimates are only valid under the assumption that all detected planets are in multi-planetary systems on coplanar orbits.

Either using the combined constraints of Kepler and Radial Velocity (RV) data (Figueira et al. 2012; Tremaine & Dong 2012) or transit duration distributions normalized by orbital velocities (Fang & Margot 2012; Fabrycky et al. 2014), multi-tranet systems are found to be on average nearly coplanar. However, these methods cannot be applied to the Kepler single-tranet systems, which contribute over half of the detected tranets.

Lissauer et al. (2011) modeled the observed multiplicity function of Kepler and found that single-tranet systems are in excess to a single simulated underlying planet population. This is sometimes called the "Kepler dichotomy" (Johansen et al. 2012; Ballard & Johnson 2016). However, Tremaine & Dong (2012) showed that modeling the observed multiplicity function from Kepler data alone cannot arrive at a reliable conclusion on the intrinsic multiplicity function due to its degeneracy with inclination distribution.

The transit timing variation (TTV) technique can help break the degeneracy between intrinsic multiplicity function and inclination distribution (e.g., Xie et al. 2014). Although TTV is the behavior of the transiting planet, it can reveal the existence of the non-transiting companion (Agol et al. 2005; Holman & Murray 2005). If there is indeed a large population of intrinsic singles, then transiting planets in the single-tranet systems should have a considerably smaller probability to show TTV signals than the transiting planets in multi-tranet systems. However, this is not supported by the large and uniform TTV catalogs. For example, Holczer et al. (2016) found that of the total 260 Kepler planets that showed TTV signals, 121 were in single-tranet systems. The larger TTV catalog of Ofir et al. (2018), which is more sensitive to smaller TTV amplitudes, gives a similar result. Both strongly indicate that transiting planets in transit singles and transit multiples have a similar probability of showing TTV signals, and therefore that there is no large population of intrinsic singles.

Another piece of evidence against the assumption that all Kepler planets are coplanar comes from the study of the distribution of planet eccentricities. Using the distribution of transit durations, Xie et al. (2016) found that single tranets have on average substantially larger eccentricities than multiple tranets. Because the dispersions of orbital eccentricities and inclinations are generally expected to be correlated (e.g., Ida et al. 1993), this result suggests that systems with fewer number of planets may have larger mutual inclinations.

If a significant fraction of planets are in multi-planet systems with larger mutual inclinations than previously thought, then the probability that one star is seen to have at least one tranet increases. Therefore, the total fraction of Sun-like stars with at least one Kepler-like planet will decrease.

For similar reasons, the statistical studies based on the RV samples also overestimated the fraction of stars with planets by using the probability to detect the most detectable planet for the probability to detect at least one planet (Cumming et al. 2008; Mayor et al. 2011). Although this overestimation is less severe for giant planets because of their low multiplicity rate, it can significantly reduce the fraction of stars with lower mass planets (such as super Earths).

In this study, we combine the information of transiting planets and their non-transiting companions as inferred by TTVs to constrain the intrinsic architecture of planetary systems. We focus on Sun-like stars in this paper. In Section 2, we construct the transit and TTV multiplicity functions based on a homogeneous sample. We then forward model these functions to constrain the intrinsic architecture in Section 3. Our results are presented in Section 4 and are discussed in more details in Section 5.

2. Kepler-LAMOST Sample

To select Sun-like (FGK-type dwarf) stars for our study, we rely on the spectroscopic data from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST, also known as Goushoujing Telescope, Cui et al. 2012; Zhao et al. 2012), which surveyed over 30% of all Kepler targets by 2017 (DR4) with no bias toward planet hosts (De Cat et al. 2015; Ren et al. 2016). The derived stellar parameters are accurate at least for main-sequence stars, as previous studies (Dong et al. 2014; Xie et al. 2016) have shown. The sample selection is similar to Xie et al. (2016) and Dong et al. (2018). In short, we find 30,759 stars with an effective temperature Teff in the range 4700–6500 K and a stellar surface gravity log g > 4.0 (in cgs unit), based on the stellar parameters derived by the LAMOST official pipeline (LASP, Luo et al. 2015; Xie et al. 2016). We then cross-match this stellar catalog with the planet candidate table from Kepler data release 23 (Mullally et al. 2015) and find 1635 KOIs. Then we remove KOIs that meet the following criteria:

  • 1.  
    Identified by Mullally et al. (2015), Coughlin et al. (2016), and Thompson et al. (2018) as false positives; 484 are removed.
  • 2.  
    KOIs with transit S/N < 7.1 according to Mullally et al. (2015); 108 are removed.
  • 3.  
    KOIs with P > 400 days; 34 are removed.
  • 4.  
    KOIs with Rp > 20 R; here Rp is computed using LAMOST stellar parameters. 108 are removed.
  • 5.  
    KOIs that are in single-tranet systems and have large False Positive Probabilities (FPP > 68%, Morton et al. 2016); 74 are removed.

The last criterion is not applied to the multi-tranet systems, which overall have very low false positive rates (Lissauer et al. 2012; Rowe et al. 2014). As Morton et al. (2016) pointed out, their FPPs for multi-tranet systems could have been inflated by the effect of unidentified TTVs. Indeed, Kepler-23b (Ford et al. 2012) and Kepler-50b (Steffen et al. 2013) are both confirmed planets, but have FPP > 0.68 according to Morton et al. (2016). This criterion is applied to the single-tranet systems because of their overall high false positive rates and low TTV fractions. Of the 74 single tranets removed, only two have TTV signals according to Holczer et al. (2016), the inclusion or exclusion of which does not affect our results.

In the end, we have 827 planets (or planet candidates) around 589 stars. The transit multiplicity function, denoting the number of systems as a function of number of tranets in each system, is (N1, N2, N3, N4, N5, N6) = (432, 99, 42, 11, 3, 2), and includes no system with more than six tranets. We show in Figure 1 the radii and orbital periods for tranets in our sample. The observed transit multiplicity function is illustrated in Figure 2.

Figure 1.

Figure 1.  Radii and orbital periods of transiting planets (tranets) in our Kepler-LAMOST sample. Tranets in different multiples are shown with different symbols and colors. We also over plot the average efficiency of the Kepler detection pipeline (Burke et al. 2015) as well as the positions of solar system planets (Mercury, Venus, and Earth). The vertical dashed line indicates the period boundary (400 days).

Standard image High-resolution image
Figure 2.

Figure 2. Transit and TTV multiplicity functions constructed based on our sample. Here the TTV multiplicity means the number of systems with j transiting planets and at least one of them showing TTV signal.

Standard image High-resolution image

To find out how many of these tranets show TTV signals, we cross-match with the TTV catalog of Holczer et al. (2016), which was produced based on a uniform search among over 2600 KOIs with relatively high S/N. We find that there are (23, 12, 7, 4, 0, 1) systems in our sample that have (1, 2, 3, 4, 5, 6) tranets and at least one of the tranet shows TTV signals. We dub this the TTV multiplicity function, and also show it in Figure 2.

In our analysis that follows, we will make use of all the transiting planets in our sample, which includes the subsample exhibiting TTV signals. In what follows, we argue that the subsample exhibiting TTVs and those that do not are drawn from a common population, validating our usage of both subsets within a common statistical framework. Based on whether one system has single or multiple tranets and whether any of the tranets show TTV signals, we can divide the whole sample into four subsamples: transit singles, transit multis, TTV singles, and TTV multis. In the transit singles (transit multis) subsample, we do not exclude planets/stars in the TTV singles (TTV multis) subsample. Because the transiting planets outnumber the TTV planets by an order of magnitude, the inclusion or exclusion of the ones with TTVs does not make any noticeable difference. Figure 3 shows the cumulative distributions of planetary (radius and orbital period) and stellar (radius, mass, [Fe/H], and Teff) properties of planets/stars in these subsamples, and Table 1 provides the two-sample Kolmogorov–Smirnov (KS) test p values between selected subsamples. There are a number of notable features. First, transit singles and transit multis are statistically similar in almost every index, in particular of the stellar parameters, suggesting that most of the transit singles are likely drawn from the same underlying population and gone through similar formation processes as the transit multis. This is an indication that a substantial fraction of the transit singles are in fact intrinsic multiples, a conclusion that we come to endorse later in the paper.

Figure 3.

Figure 3. Cumulative distributions of planetary and stellar parameters for four subsamples of planets in the Kepler-LAMOST sample.

Standard image High-resolution image

Table 1.  The Two-sample KS Test p Values for Different Combinations of Subsamples and Different (Planetary and Stellar) Parameters

Input Subsamples Rp P R M [Fe/H] Teff
Transit singles and Transit multis 0.055 0.003a 0.80 0.52 0.66 0.55
Transit singles and TTV singles 0.09 0.15 0.96 0.15
Transit multis and TTV multis 0.10 0.08 0.99 0.63
TTV singles and TTV multis 0.08 0.09 0.014b 0.005b 0.95 0.074

Notes.

aThis small p value is likely due to the geometric effect. See the Appendix for more discussions. bGiven that TTV singles are statistically similar to transit singles and that TTV multis are statistically similar to transit multis, these two small p values are likely due to a random sampling effect.

Download table as:  ASCIITypeset image

Another notable feature is that the TTV planets prefer to have slightly larger planetary radii and are at slightly longer periods than the rest. This is expected. To enable TTV detections, the planet transits tend to be deeper and have longer periods (Lithwick et al. 2012; see also Equation (14)). However, the stellar properties of systems with and without TTVs (i.e., TTV singles versus transit singles and TTV multis versus transit multis) are statistically similar since their two-sample KS test p values given in Table 1 are all above the standard threshold (0.05). Therefore, there is no reason to suspect that planets in the TTV sample are drawn from a different population than the tranets are. In this work, we explicitly assume that the TTV planets, despite their relative proximity to mean-motion resonances (MMRs), are not special and that their abundances can be used to constrain the overall planet population. Besides the similarity in stellar parameters, another supporting piece of evidence is that, as new techniques are invented to detect lower amplitude TTVs, more systems appear to show TTV signals (Ofir et al. 2018). The transition from showing and not showing TTVs is smooth rather than sharp.

Figure 2 shows that, although both the transit and TTV multiplicity functions are monotonically decreasing (subject to statistical noises) as the transit multiplicity increases, the TTV multiplicity function has a weaker dependence on the transit multiplicity. This is because TTV is relatively insensitive to the inclination variations. Although such a feature prevents using TTV as a characterization technique to precisely constrain the mutual inclination values (e.g., Hadden & Lithwick 2017), it indeed helps to use TTV as a detection technique to probe planets with a broader range of inclination values than the transit technique. The different slopes of the two multiplicity functions are the key to uncover the inclination distribution in multi-planet systems.

3. Forward Modeling the Observed Multiplicity Functions

The transit and TTV multiplicity functions, as illustrated in Figure 2, are both monotonically decreasing (subject to Poisson noises) as the intrinsic multiplicity k increases, but they behave quantitatively differently, with the TTV multiplicity function less dependent on k. Below we show that the transit and TTV multiplicity functions can be simultaneously well described when the inclination dispersion of the k-planet system is a power-law function of the intrinsic multiplicity k. Using this relation, we can constrain the intrinsic multiplicity vector F ≡ (f1, f2, ..., fk), where fk is the fraction of Sun-like stars with k Kepler-like planets. This section describes the model we use to fit the observed transit and TTV multiplicity functions.

3.1. Notations

To facilitate further discussions, we introduce a few mathematical notations here. Following Tremaine & Dong (2012), we use a matrix G to quantify the detection probability of planetary systems in transit surveys. Each element gjk denotes the probability that one k-planet system is seen to have j (j ≥ 1) tranets. Thus we have

Equation (1)

If the parameter space that is of interest can fit in at most K planets, then G should be a K × K upper-triangular matrix.6

We use symbol ${ \mathcal N }$ for the number of stars in the sample and the vector N = (N1, ..., NK) for the observed transit multiplicity function. With the intrinsic multiplicity vector F, the expectation of the transit multiplicity function can be given as

Equation (2)

The second equality in the summation form has used the fact that gjk = 0 if k < j. The fraction of stars with planets, Fp, and the average number of planets per star, ${\bar{n}}_{{\rm{p}}}$, are given by

Equation (3)

respectively. The ratio of these two quantities, ${\bar{n}}_{{\rm{p}}}/{F}_{{\rm{p}}}$, gives the average number of planets per planetary system, which we call the average multiplicity.

We also introduce the matrix T to quantify the detection probability of TTVs. Each element tjk represents the probability that one k-planet system has j tranets and at least one of them shows detectable TTV signals. The TTV multiplicity function is given by M = (M1, ..., MK) and the expectation of this is given by

Equation (4)

3.2. Model Ingredients

In our model, whether a planet transits or not is determined by the transit parameter epsilon ≡ R/a and its orbital inclination Ip. We do not take into account the minor impact of the planet size. Below we describe the distributions of epsilon and Ip. We also describe the criteria we use in generating multi-planet systems and detecting TTV signals.

3.2.1. Distribution of Transit Parameters

Following Tremaine & Dong (2012), we model the distribution of transit parameter epsilon as

Equation (5)

which is essentially a broken power law but with smooth transition at epsilon0. Instead of using the values for epsilon from transit modelings that are not well constrained for some tranets, we re-compute them based on the orbital periods and LAMOST stellar parameters (R and M). In this way this parameter epsilon is better constrained and its lower and upper boundaries are compatible with our sample selection criteria (Section 2). This reconstructed epsilon distribution using all tranets in our sample is shown as black dots in Figure 4, in which we also show the distribution from only transit singles for a reference.

Figure 4.

Figure 4. Distributions of the transit parameter epsilon (≡ R/a) using all tranets and only those in transit singles. The gray dashed line marks a logarithmically flat distribution after the correction of the geometric transit probability.

Standard image High-resolution image

We then model this distribution with the smoothed broken power-law form (Equation (5)). After correcting for the geometric transit probability (∝epsilon), we determine the underlying epsilon distribution to be

Equation (6)

for 0.004 < epsilon < 0.6, and zero elsewhere. This yields a logarithmically flat distribution below epsilon0, a result in agreement with previous studies (e.g., Dong & Zhu 2013; Petigura et al. 2013).

3.2.2. Distribution of Planetary Inclinations

For multi-planet systems, the planetary inclination relative to the observer, Ip, depends on the inclination of the system invariable plane, I, the planet inclination with respect to this invariable plane, i, and a nuisance parameter ϕ (i.e., the phase angle)

Equation (7)

The distribution of I is isotropic (∝sin I for 0° ≤ I ≤ 180°) and the distribution of ϕ is random between 0° and 360°. The inclination i quantifies the flatness of the multi-planet system and we assume that it is related to the number of planets in the system k. We parameterize this dependence as a power law between the dispersion of i (or more accurately, sin i) and k, and choose the normalization at k = 5:

Equation (8)

It is written in this form, so that the normalization factor, ${\sigma }_{i,5}$, can be determined separately from the distribution of transit duration ratios of planet pairs in five-planet systems (Section 3.3). For such high-multiple planetary systems, the observed multiplicity very likely reflects the intrinsic multiplicity. We decide to use k = 5 for the normalization term for two reasons. First, there are only a few k ≥ 6 planetary systems found by Kepler, the number being so small that the mutual inclination dispersion cannot be well constrained. Second, although there are more four-planet systems than five-planet systems, the fraction of contamination from intrinsically higher multiplicities is also larger for four-planet systems than for five-planet systems. The power-law index α quantifies the steepness of this inclination dispersion function and can be constrained from the transit and TTV multiplicity functions.

With this inclination dispersion σi,k, the planetary inclination with respect to the invariable plane is then modeled as a Fisher (1953) distribution (Fabrycky & Winn 2009; Tremaine & Dong 2012),

Equation (9)

The parameter κk is related to the dispersion parameter σi,k via

Equation (10)

This Fisher distribution provides a smooth transition from an isotropic distribution (κn ≪ 1) to a Rayleigh distribution (κn ≫ 1). The latter one is commonly used for compact multi-planet systems (e.g., Fabrycky et al. 2014).

The inclination dispersion σi,k, by its definition given by Equation (8), has a maximum value of $\sqrt{2/3}$, which can be achieved only when the distribution of i becomes isotropic. For any given ${\sigma }_{i,5}$, the upper bound on the inclination dispersion therefore sets a limit on α (and vice versa).

3.2.3. Stability Criterion

We describe the stability criterion used in generating multi-planet systems. For intrinsic multiples (k ≥ 2), we inject planets one by one. The transit parameter epsilon of the first planet is randomly drawn from the distribution specified by Equation (6), and then the orbital period is derived via

Equation (11)

Throughout our simulations, we fix ρ = ρ. This is because the distribution of epsilon has absorbed the variance of ρ and therefore there is no need to assume a separate distribution for ρ. For any additional planet, the transit parameter and the orbital period are randomly assigned in a similar way but with the restriction that the new planet must be far away from any previously injected planets such that the system remains dynamically stable. For the latter, we use the Deck et al. (2013) stability criterion, which requires that for any given planet pair,

Equation (12)

The critical value is derived by assuming a characteristic planet-to-star mass ratio q = 10−4. The actual choice of this characteristic value has a very marginal impact on the modeling output, primarily because of the weak dependence on q. Furthermore, even though the chosen q value is larger than the typical planet-to-star mass ratio (∼10−5) of Kepler planets, it is extremely rare to have a period ratio below 1.3 in actual observations (Lissauer et al. 2011; Fabrycky et al. 2014).

3.2.4. TTV Detection Criteria

For any system with at least one tranet, we determine whether there is detectable TTV signal. Previous studies (e.g., Holczer et al. 2016; Hadden & Lithwick 2017) have shown that in systems with detected TTVs, it is almost always the case that the TTV signal comes from first-order MMRs between two neighboring planets. We use this empirical result and only consider the closest one (if the tranet is innermost or outermost) or two planets in the TTV detection. We only consider the TTV signals from first-order (J:J − 1 = 2:1, 3:2, 4:3, and 5:4) MMRs. Higher order MMRs are too weak and the additional first-order MMRs are either not allowed by or are too close to the stability limit. We consider the TTV signal to be detectable if all the following conditions are met for at least one of the chosen first-order MMRs:

  • 1.  
    The orbital period of the tranet is less than 200 days.7
  • 2.  
    The super period of the planet pair
    Equation (13)
    is in the range 100–3000 days. Here, Pin and Pout are the orbital periods of the inner and outer planets, respectively.
  • 3.  
    The TTV amplitude indicator
    Equation (14)
    where P is the orbital period of the tranet and Δ is the fractional separation to period commesurability (Lithwick et al. 2012).
    Equation (15)

The first two criteria are used to mimic the conditions in actual TTV detections (Holczer et al. 2016). The threshold used in the third criterion, 1.3 × 103 days, is approximately the median value of P/Δ in identified TTV pairs of Holczer et al. (2016). For the characteristic mass ratio (q = 10−4), this corresponds to a 30 minutes TTV amplitude, which is also the median amplitude of all TTV planets in Holczer et al. (2016). Our results are insensitive to the numerical threshold adopted here, as we will show in Section 5. Specifically, reducing (or increasing) the threshold value on the right-hand side of Equation (14) will allow more (or fewer) pairs to become TTV eligible. However, since this raises the fraction of TTV sample uniformly across all systems, this normalization change does not affect our results.

3.3. Constraining σi,5 from Transit Duration Ratios

We acquire external constraints on the normalization factor ${\sigma }_{i,5}$ of the inclination dispersion relation (Equation (8)). This is necessary because otherwise we would end up with a strong correlation between ${\sigma }_{i,5}$ and α.

Because we are constraining ${\sigma }_{i,5}$ separately, we are not limited to the planetary systems in our current sample. In fact, our sample only contains 3 five-tranet systems and these provide $3\times {C}_{5}^{2}=30$ planet pairs. Instead, we find 15 five-tranet systems whose hosts are Sun-like stars from the California Kepler Survey (Petigura et al. 2017), which give us 150 planet pairs to constrain the inclination dispersion ${\sigma }_{i,5}$.

We adopt a similar approach as Fabrycky et al. (2014) to constrain the inclination dispersion. Each observed planet pair gives a quantity (Steffen et al. 2010)

Equation (16)

where Tdur is the transit duration (from first to fourth contact) and P is the orbital period. The subscripts "in" and "out" denote the values of the inner and outer planets, respectively. With Kepler's third law, one can easily see that this parameter ξ is essentially the ratio of orbital-velocity normalized transit durations, which is most sensitive to the inclination i and is marginally dependent on other parameters such as the orbit eccentricity e (Fabrycky et al. 2014). We can also write ξ in terms of the planet-to-star radius ratio r and the transit impact parameter b

Equation (17)

Because both Tdur and P are much better measured than r or b in observations, the expression given by Equation (16) is therefore used in constructing the distribution of ξ from the data. For the 15 five-planet systems, we use the values of Tdur and P from the the most recent Kepler data release (DR25; Thompson et al. 2018). We then compute the ξ values for individual planet pairs and show the distribution as the black histogram on the left panel of Figure 5.

Figure 5.

Figure 5. Left panel: distributions of the weighted transit duration ratio ξ from a selected sample of 15 five-tranet systems and our best-fit model. Right panel: likelihood as a function of the inclination dispersion ${\sigma }_{i,5}$. The 1–3σ regions are marked out with different colors.

Standard image High-resolution image

We then model this ξ distribution and attempt to constrain the inclination dispersion ${\sigma }_{i,5}$. We ignore the dependence of ξ on the eccentricity e and simply assume circular orbits for all planets in such five-planet systems. While simplifying the modeling, this still remains a very reasonable assumption. First, planets in such high-multiple and stable systems are not expected to have large eccentricities. Second, Fabrycky et al. (2014) showed that this ξ parameter alone could not constrain e very well, and that the correlation between inclination dispersion and eccentricity is weak.

For a given value of ${\sigma }_{i,5}$, we produce the ξ distribution following the method of Fabrycky et al. (2014). First, we randomly draw an impact parameter for the outer planet, bout,sim, uniformly from the range [0, bout,max], where bout,max is the impact parameter the planet would require in order that the total S/N ($\sqrt{{b}_{\mathrm{out},\max }/{b}_{\mathrm{out}}}$ times the actual S/N) would be equal to the S/N threshold (7.1). Then we randomly draw bin,sim from a normal distribution with mean ${b}_{\mathrm{out},\mathrm{sim}}{({P}_{\mathrm{in}}/{P}_{\mathrm{out}})}^{2/3}$ and dispersion ${\sigma }_{i,5}$/(din + rin), where d ≡ R/a is the stellar radius scaled to the planet–star separation and is taken as the value from DR25. Such a normal distribution in impact parameters reproduces a Rayleigh distribution with dispersion ${\sigma }_{i,5}$ in inclinations. If $| {b}_{\mathrm{in},\mathrm{sim}}| \leqslant {b}_{\mathrm{in},\max }$, we consider this simulated planet to be detectable. Otherwise we repeat the previous step until the above condition is met. Once a simulated planet pair is generated, we compute ξ using Equation (17). We repeat the whole process and generate 250 simulated pairs for each planet pair in the sample. The ξ distribution for the given ${\sigma }_{i,5}$ is then generated from the ensemble of all simulated pairs.

We then compute the likelihood for the actual ξ distribution to be drawn from the simulated one. We note that this is more accurate than the approach of Fabrycky et al. (2014), which used the p value of Kolmogorov–Smirnov test. We compute the likelihood as

Equation (18)

where Psim(ln ξ) is the probability distribution of simulated ξ in logarithmic space, ξj is the observed value of ξ of the jth planet pair, and σlnξ,j is the fractional uncertainty of ξi. We compute σlnξ,j based on the measured uncertainties on Tdur,in and Tdur,out.

We repeat the Monte Carlo simulation and compute the likelihood ${ \mathcal L }$ for 100 values of ${\sigma }_{i,5}$ equally spaced from 0fdg1 to 4fdg0, and show the results on the right panel of Figure 5. As the scatter plot shows, the likelihood reaches its maximum around ${\sigma }_{i,5}$ = 0fdg8. The simulated ξ distribution for this best-fit ${\sigma }_{i,5}$ is also shown on the left panel of Figure 5. To find the different confidence levels of ${\sigma }_{i,5}$, we use a spline function to smooth the $\mathrm{ln}\,({ \mathcal L }/{{ \mathcal L }}_{\max })$ versus ${\sigma }_{i,5}$ scatter plot, and find the 1σ, 2σ, and 3σ confidence intervals, defined as $\mathrm{ln}({ \mathcal L }/{{ \mathcal L }}_{\max })\geqslant -{n}^{2}/2$ to be 0fdg65–0fdg96, 0fdg53–1fdg16, and 0fdg42–1fdg44 for n = 1, 2, and 3, respectively.

For the subsequent modeling of the power-law index α and intrinsic multiplicity vector F, we will only consider values of ${\sigma }_{i,5}$ from the 3σ confidence interval. The smoothed $\mathrm{ln}\,({ \mathcal L }/{{ \mathcal L }}_{\max })$ is also used as our prior on ${\sigma }_{i,5}$ unless specified otherwise.

3.4. Monte Carlo Simulations

We use Monte Carlo simulations to compute the G and T matrices for a grid of ${\sigma }_{i,5}$ and α. We note that Tremaine & Dong (2012) provided an analytical formalism to compute the G matrix. However, the time-limiting factor is always the computation of the T matrix, which may not be done in the analytical way. We compute the T matrix within a Monte Carlo, and this automatically produces the G matrix.

For the intrinsic singles (k = 1), there is no TTV signal (t11 = 0), and only one parameter (g11) needs to be calculated. This can be done analytically (Tremaine & Dong 2012): ${g}_{11}=\langle \epsilon \rangle =0.03$.

For intrinsic multiples (k ≥ 2), we first randomly draw the inclination of the invariable plane, I, from an isotropic distribution and then inject planets one by one with the stability criterion given in Section 3.2.3 imposed. For each planet, we assign randomly a phase parameter, ϕ, from the uniform distribution and the planet inclination with respect to the invariable plane, i, from the Fisher distribution (Equation (9)), whose parameter κk is determined by Equations (8) and (10). The inclination of the planet with respect to the line of sight is then given by Equation (7). Finally, whether a planet is a tranet or not is determined by the transit criterion

Equation (19)

For any system with at least one tranet, we invoke the TTV criteria in Section 3.2.4 to determine whether there is detectable TTV signals.

With the above procedures, we are able to generate a k-planet system, compute the number of tranets, and determine whether any of the tranets shows detectable TTV signals. Given that we only have up to six tranets in one system, our simulations run up to six-planet systems. The impact of higher multiples to our results will be discussed in Section 5.1. We repeat the whole process and generate a large number of planetary systems, until each element in the T matrix is determined to <2% precision.

3.5. Modeling the Observed Multiplicity Functions

For a given intrinsic multiplicity vector F and matrices G and T, the probability to see the transit and TTV multiplicity functions as shown in Figure 2 is given by

Equation (20)

where ${\bar{N}}_{k}$ and ${\bar{M}}_{k}$ are given by Equations (2) and (4), respectively. Note that here we have extended the transit multiplicity function down to k = 0, with ${f}_{0}=1-{\sum }_{k\geqslant 1}\,{f}_{k}$. In practice, we ignore the constants in the log of the above likelihood and try to maximize the following quantity to find the best model parameters (${\sigma }_{i,5}$, α, and F):

Equation (21)

Given the large number of dimensions, we choose the Markov Chain Monte Carlo (MCMC) algorithm that is implemented in emcee (Foreman-Mackey et al. 2013) as the optimization method. This way we also obtain the posterior distributions of Fk (k = 1, ..., 6) for given sets of ${\sigma }_{i,5}$ and α.

4. Results

4.1. Planetary Inclination Dispersion

We find the maximum likelihood values for each grid point in the (${\sigma }_{i,5}$, α) plane following the procedure detailed in the previous section. After imposing the prior probability on ${\sigma }_{i,5}$ from the transit duration ratios (Figure 5), we can determine the nσ contours, where n (=1, 2, 3) refers to the number of σ and the contour is defined by $\mathrm{ln}({ \mathcal L }/{{ \mathcal L }}_{\max })=-{n}^{2}/2$. Consequently, we can construct the posterior distribution of the power-law index α in a similar way. These results are shown in Figure 6.

Figure 6.

Figure 6. Posterior distributions of the parameters quantifying the inclination dispersion function. The left panel shows the 1–3σ contours in the power-law index α vs. normalization factor ${\sigma }_{i,5}$ plane. The 3σ contour does not fully reach the α = −4 line at the rightmost end because the lower limit of α at ${\sigma }_{i,5}$ = 1fdg35 is slightly above −4. The right panel shows the $\mathrm{ln}\,{ \mathcal L }$ as a function of the power-law index α. The black dots are the calculations from grid points and the black curve is the smoothed function. Here 1–3σ regions are also indicated.

Standard image High-resolution image

Figure 6 indicates that the power-law index α is well constrained to be close to its lower bound −4, which is given by the normalization factor ${\sigma }_{i,5}$ ≈ 0fdg8 and ${\sigma }_{i,2}\leqslant \sqrt{2/3}$ (Section 3.2). Specifically, α < −3 at the 1σ level, α < −2 at the 2σ level, and α ≥ 0 can be securely excluded. Therefore, the more planets a system has, the smaller the planetary inclination dispersion is, and this dispersion is a steep function of the intrinsic multiplicity.

To understand what information is driving the constraint on α, we show in Figure 7 the best-fit models with fixed values of α. As this figure shows, a steep inclination dispersion function is required primarily because of the large number of TTV singles, which contribute nearly half of systems with TTVs. Although our sample only contains 23 TTV singles, the ratio between numbers of TTV singles and TTV multiples remains essentially the same even if a much larger TTV catalog (Holczer et al. 2016) or a different TTV catalog (Ofir et al. 2018) is used. Therefore, our constraints on the power-law index parameter is robust.

Figure 7.

Figure 7. Observed multiplicity functions (with error bars) and the best-fit models with different values of the power-law index α. The numbers on the top indicate the actual numbers in individual bins. We use solid lines for models of the transit multiplicity function and dashed lines for models of the TTV multiplicity function.

Standard image High-resolution image

Figure 7 seems to suggest that there may be more TTV singles than even the steepest model curve (α = −4) can account for. However, with our current sample size it is not statistically significant that would require a more complicated model than our current one.

4.2. Intrinsic Multiplicity Vector

We stack the Markov chains from all MCMC runs and discard entries that are more than 3σ away (Δχ2 > 9) from the best one. With this combined Markov chain, we can then investigate the constraints on individual components of the intrinsic multiplicity vector F.8

The left panel of Figure 8 shows the full posterior distributions of individual components fk, which is the fraction of Sun-like stars with k (1 ≤ k ≤ 6) Kepler-like planets. As we can see, the majority of the components are not constrained very well because of the strong degeneracies between neighboring components. However, it is notable that the first component, f1, is consistent with zero. That is, there can be effectively zero intrinsic singles and nearly all the transit singles are in fact intrinsic multiples with the additional planets non-transiting. This is a result we have discussed qualitatively in Section 1.

Figure 8.

Figure 8. Constraints on the individual components fk of the intrinsic multiplicity vector F. The left panel shows the full posterior distributions and the right panel shows the reported values and associated uncertainties.

Standard image High-resolution image

For future practical usage (such as to predict the yield of future transit missions), we nevertheless report measurements and uncertainties of individual components fk. This is done by taking the median, 16%–84%, and 5%–95% of the posterior distributions. The result is shown in the right panel of Figure 8. Note that this plot seems to suggest a sharp drop in occurrence rate from low multiples (k ≤ 4) to high multiples (k ≥ 5), but this feature is artificial and comes purely from the way these values are derived.

4.3. Overall Planet Occurrence Rates

Although the individual components of the intrinsic multiplicity vector F are not well constrained, the overall occurrence rates, meaning the total fraction of stars with planets Fp and the average number of planets per star ${\bar{n}}_{{\rm{p}}}$, are found to be well constrained. This is not surprising for ${\bar{n}}_{{\rm{p}}}$, because this quantity only depends on the distribution of the transit parameter epsilon (Youdin 2011; Tremaine & Dong 2012). In fact, as we prove in the Appendix,

Equation (22)

That is, the average number of planets per star is given by the total number of tranets, the total number of stars, and the average probability that one planet transits. For our sample, the above equation gives ${\bar{n}}_{{\rm{p}}}=0.90\pm 0.03$, which agrees with our model outputs. Our constraint on ${\bar{n}}_{{\rm{p}}}$ also agrees with previous studies (e.g., Fressin et al. 2013; Petigura et al. 2013).

As the left panel of Figure 9 indicates, the total fraction of Sun-like stars with Kepler-like planets is also well constrained. To avoid the confusion with the general fraction Fp (for arbitrary planet sizes and orbital distances), we introduce ηKepler for this specific fraction.9 The posterior distribution gives

Equation (23)

This is a factor of ∼2 lower than previous estimates (Fressin et al. 2013; Petigura et al. 2013; Winn & Fabrycky 2015). With the determinations of ${\bar{n}}_{{\rm{p}}}$ and ηKepler, we also find that on average each Kepler-like planetary system has 3.0 ± 0.3 planets, as shown in the right panel of Figure 9. Our work is the first to determine this average multiplicity.

Figure 9.

Figure 9. Constraints on the fraction of Sun-like stars with Kepler-like planets (left panel) and the average number of planets per Kepler planetary system (right panel). The black dots are the (normalized) $\mathrm{ln}\,{ \mathcal L }$ values from the model, and the black curve is the smoothed distribution. The vertical line and vertical bands indicate the best values and 1σ uncertainties.

Standard image High-resolution image

How could the total fraction be constrained so well even though the individual components fk were not? We will provide the explanation in Section 5.1, but the conclusion is that this results from some property of the matrix G that relates the nature of the transit probabilities. Therefore, our result that only 30% of Sun-like stars host Kepler-like planets is robust and, in particular, it is not sensitive to the details of TTV multiplicity function or our TTV modelings.

5. Discussion

5.1. The Determination of ηKepler

Here, we answer the question from Section 4.3: why could ηKepler be constrained so well?

Following Section 3.1, the total number of transiting systems reads

Equation (24)

Unfortunately, the quantity ${\sum }_{j=1}^{k}{g}_{{jk}}$ (i.e., the probability to see at least one tranet) is not a constant. Otherwise the determination of Fp would be straightforward. However, unless the period distribution of planets in multiple systems is dramatically different from the period distribution of planets in single systems, the probability of having at least one tranet out of k (k ≥ 1) planets is no less than the probability of seeing one tranet in the single-planet systems. Mathematically, this implies ${\sum }_{j=1}^{k}{g}_{{jk}}\geqslant {g}_{11}$. Therefore, the above equation gives an upper limit on Fp,

Equation (25)

For our sample, this gives ηKepler ≤ 62%.

A second relevant quantity is the number of transit singles,

Equation (26)

Again, the quantity g1k, the probability to see one tranet in a k-planet system, is not conserved: the more planets a system has and the hotter (i.e., larger inclination dispersion) the system is, the larger this quantity will be.

However, we find that the combination of the two quantities, ${g}_{1k}+{\sum }_{j=1}^{k}{g}_{{jk}}$, remains roughly constant for broad ranges of ${\sigma }_{i,5}$, α, and k. This can be seen in Figure 10, which illustrates the dependence of this quantity on various model parameters. Here, because only the G matrix is involved, we use the deterministic approach of Tremaine & Dong (2012) to compute G. This approach requires truncating the Legendre series at a certain threshold lmax. Unlike Tremaine & Dong (2012), who used lmax = 50, we choose a much higher threshold, lmax = 3000, which is necessary in order to have the elements of G for flat and multi-planet systems (i.e., upper-left corner of the left panel of Figure 10) converge.

Figure 10.

Figure 10. Left panel: gray contours show the values of the quantity ${g}_{1k}+{\sum }_{j=1}^{k}{g}_{{jk}}$ for different combinations of k and σi,k. The black square with an error bar is our determination of the normalization factor ${\sigma }_{i,5}$. The blue curves are the inclination dispersion functions with different values of α. Right panel: different curves are the quantity ${g}_{1k}+{\sum }_{j=1}^{k}{g}_{{jk}}$ as a function of k for different values of power-law index α.

Standard image High-resolution image

The conservation of the quantity ${g}_{1k}+{\sum }_{j=1}^{k}{g}_{{jk}}$ means that the more planets a system has and the hotter the system is, the larger this joint probability is, and the two separate probabilities, g1k and ${\sum }_{j=1}^{k}{g}_{{jk}}$, compensate each other if the number of planets increases while the inclination dispersion decreases. Figure 10 also suggests that the weighted mean

Equation (27)

so that the total fraction of stars with planets can be given by

Equation (28)

With numbers from our sample, this estimator directly gives ηKepler = 30%, as long as the following three conditions are met.

First, the k = 1 term does not provide a significant contribution to the weighted mean. That is, there is no large population of intrinsic singles. This has been discussed qualitatively in Section 1 and here we provide a simple quantitative argument. Of all the tranets in our sample, 432 and 395 are in transit singles and transit multis, respectively. Of the subset of those that show detected TTV signals, there are 23 in transit singles and 26 (excluding the double counting from tranet pairs both showing TTVs) in transit multis. For the transiting planets in transit multis, the probability of showing TTV is 26/395 = 6.6%. For those in transit singles, the same probability is 23/432 = 5.3% < 6.6%, which we interpret as the blending of intrinsic singles in the transit singles. Therefore, one finds that at most 19% (or 82 systems) of the transit singles are intrinsic singles. With the mean transit probability ($\langle \epsilon \rangle =0.03$) and the total number of surveyed stars (30759), the fraction of intrinsic singles (f1) is at most 9%. Although this upper limit is derived based on a subset of the Kepler catalog and a specific TTV table (Holczer et al. 2016), the conclusion remains the same even if one uses the multiplicity fraction of the overall Kepler catalog or a different TTV table (Ofir et al. 2018). Even with this upper limit for f1, the weighted mean of ${g}_{1k}+{\sum }_{j=1}^{K}{g}_{{jk}}$ only varies by 10%.

Second, the more planets there are in the system, the smaller the planet inclination dispersion is. Together with previous investigations of the multi-tranet systems (Lissauer et al. 2012; Tremaine & Dong 2012; Fabrycky et al. 2014), the result that there is no large population of intrinsic singles suggests that a significant fraction of planetary systems must have large planet–planet mutual inclinations. Although it is in principle possible that these are high multiples, it is much more likely that such systems with high mutual inclinations are low multiples, and indirect evidence from the planet eccentricity study also supports this (Xie et al. 2016).

Finally, there is no large population of very high (k ≥ 7) multiples. Given the previous result, these k ≥ 7 multiples should be very flat. With our inner (∼1 day) and outer (400 days) period boundaries, the weighted quantity ${f}_{1k}\,+{\sum }_{j=1}^{K}{f}_{{jk}}\lesssim 0.25$, and the probability to see seven tranets is at least R/(1.06 au) = 0.0044. Therefore, the fact that we do not have any 7-tranet system in a sample of 30,759 stars sets an upper limit on the fraction of systems with k ≥ 7, f≥7 < 2.2% (95% confidence level). Including these very high multiples in the calculation of the weighted mean changes it by 10% in the opposite direction as the intrinsic singles does.

The above arguments explain why the total fraction can be constrained very well even though the individual components cannot, and more importantly, confirm that our determination of the total fraction of Sun-like stars with Kepler-like planets, ηKepler, is fairly robust, and does not depend on the details of our modeling or the TTV multiplicity function. Furthermore, because the average number of planets per star, ${\bar{n}}_{{\rm{p}}}$, is determined independently from the inclination distribution (Equation (22)), the average multiplicity is also robustly measured.

Equation (28) also points toward a robust and straightforward way to determine the total fraction of stars with planets, which has practical applications. As Zhu et al. (2016) noted, the fraction of stars with planets is better than the average number of planets per star for the purpose of quantifying the correlation between planet formation efficiencies and stellar properties (such as metallicity). It is nevertheless the latter that has been broadly used.

5.2. Only 30% of Sun-like Stars Host Kepler Planets

In Section 1 we explained qualitatively why previous studies (Fressin et al. 2013; Petigura et al. 2013) overestimated ηKepler. Now we explain it in a more quantitative way. We focus here on the transit studies, but the conclusion should apply to RV studies (e.g., Mayor et al. 2011) as well given that the same statistical approach was used.

There are two primary differences in our study and previous studies: the parameter space under investigation and the statistical method. Here, we discuss the impact of the former. We use Fressin et al. (2013) as the example of previous studies. Fressin et al. (2013) took into account both the geometric transit probability and the pipeline detection efficiency, and concluded that 52% of Sun-like stars should have at least one planet with Rp > 0.8 R and P < 85 days. Our result that only 30% of Sun-like stars host Kepler planets comes from studying the Kepler planets as a whole and only accounting for the geometric transit probability. However, the parameter space we study is inclusive of the parameter space investigated in Fressin et al. (2013). Specifically, the pipeline detection efficiency would only increase the number of systems with planets of Rp > 0.8 R and P < 85 days by 10%. Such a small change is far from what is needed to explain the discrepancy between our work and Fressin et al. (2013).

All previous studies on the fraction of stars with planets used the probability to detect the most detectable planet for the probability to detect at least one planet. In those transit studies (Fressin et al. 2013; Petigura et al. 2013), by reducing the number of tranets in all systems to one, they assumed that the resulting average number of planets per star should be the total fraction of stars with planets. Using the distribution of the transit parameter epsilon determined by transiting planets in transit singles (the red curve in Figure 4),10 we find that the probability that a typical planet transits is g11 = 0.025. According to Equation (22), the resulting average number of planets per star is 0.77. Taking this value for ηKepler would mean an overestimation by a factor of 2.6. Note that this value (77%) also exceeds the upper limit (62%) we derived in Section 5.1 under the very general condition.

5.3. Inclination Dispersions and Multiplicities: Comparisons with Theories

The bulk of the Kepler planets are the so-called super Earths (planets with radii between Earth and Neptune) and the majority of such planets discovered by Kepler reside within ∼100 days. Such super Earths are absent in our own solar system. We find that the average number of such planets a system hosts is ∼3, and that the dispersion of planetary inclinations is a steep function of the intrinsic multiplicity. As Figure 11 shows, a system with k ≥ 5 planets within 400 days is pretty flat, with an inclination dispersion within 1°. But as the number of planets reduces, the system puffs up and the planets can be mutually inclined up to ∼10. An isotropic distribution for planets in two-planet systems is not completely ruled out by the data. Together with the statistical result of eccentricities from transit durations (Xie et al. 2016), we now have a consistent picture that systems with fewer planets are dynamically hotter. In the following, we briefly cast these results in light of formation theories that have been proposed.

Figure 11.

Figure 11.  Blue curves show the inclination dispersion as a function of intrinsic multiplicity, with slightly different colors for different values of power-law index α. Here we only show the 2σ range −4 ≤ α ≤ −2. The black square with an error bar indicates the normalization factor we obtained from the transit duration ratios of five-tranet systems. The orange vertical band shows the 1σ range of the average multiplicity, with the central vertical line indicating the best value. The gray dashed line is the upper limit of inclination dispersion, which can be achieved if the inclination follows an isotropic distribution. We also mark the position of our solar system with "S" in this plot. Throughout our simulations, we always use the radian values of the inclination dispersion, and they are also easy to be connected to the orbital eccentricities (Xie et al. 2016). However, for discussions of the inclination itself, especially at small values, we also indicate the scales in degree.

Standard image High-resolution image

The formation of super Earths remain unresolved. In the standard core accretion theories (Ida & Lin 2004; Mordasini et al. 2009), in which the cores of these planets are formed at large distances when the gas disk is still fully present, these planets are migrated rapidly to near the inner edge of the proto-planetary disks. It is thought that MMRs naturally get set up between planets that are migrating with different speeds, preventing them from being engulfed by the host stars. However, such a story predicts an abundance of MMRs, in contrast with the observed period ratios in which MMRs feature minimally (Lissauer et al. 2011; Fabrycky et al. 2014). While multiple scenarios have been proposed to break the planets out of MMRs (Delisle et al. 2014; Goldreich & Schlichting 2014; Chatterjee & Ford 2015; Liu et al. 2017), it is unclear that the overall period distribution can be explained. It is also unclear in such a framework how to account for the diverse multiple systems and their current dynamical states (but see Izidoro et al. 2017).

An alternative scenario, first proposed by Hansen & Murray (2013), favors in situ formation, in which these planets are locally assembled.11 This is similar to the conventional story for the formation of terrestrial planets (e.g., Kokubo & Ida 1998, 2000; Raymond et al. 2004). Using N-body simulations, Hansen & Murray (2013) studied planet assembly, starting from a large number of proto-planets with a total mass of 20 M within 400 days. They further assumed that all impacts are fully accretional and that there is no external source of dissipation (by gas or by planetesimals). They presented a number of statistical properties for the resulting systems, which we proceed to compare against here.

First, Hansen & Murray (2013) found that the median number of planets in a system is 4, an average eccentricity dispersion of σe = 0.11, and an average inclination dispersion σi ∼ a few degrees. These are somewhat similar to our findings of 3 planets per system, and $\langle {{kf}}_{k}{\sigma }_{i,k}\rangle \sim 4^\circ $ ($\langle {f}_{k}{\sigma }_{i,k}\rangle \sim 6^\circ $) with 2 ≤ k ≤ 7. Second, Hansen & Murray (2013) noted that their simulations predicted too few single-tranet systems. This remains a problem even after this work. The mismatch seems to be mainly due to the differences in their predicted multiplicities: f1 = f2 = 0 and f3 ≪ f4, f5 in Hansen & Murray (2013), compared to fk ∼ 5%–10% for k = 1, 2, 3, 4 and much smaller for k > 4 in our work (Figure 8).

We show that lower k systems are dynamically hotter (also see Xie et al. 2014, for the eccentricity aspect). What is the reason behind this?

One possibility is that this is a natural outcome of the stability requirement. One naively expects that a more packed system (higher k) has to have a lower dispersion to avoid dynamical instability. From Pu & Wu (2015), the critical number of mutual Hill radii that a system can have is ${K}_{\mathrm{crit}}=2+k+27/5\times {\sigma }_{i,k}{[3/(2q)]}^{1/3}$, with q = Mp/M. Here, we have assumed that σe ∼ 2σi and that the dependence of Kcrit on σi,k obtained for k = 5 can be applied to smaller k values.12 Further assuming that the systems extend one decade (∼0.1–1 au) in semimajor axis, we then find that the total number of Hill spheres is ln (10)[3/(2q)]1/3, and that the average separation (in unit of Hill radii) between planets in a k-planet system is ln (10)[3/(2q)]1/3/k. By equating this average separation and the critical number Kcrit, we get that the critical inclination dispersion is σi,k ∼ 24°/k − 0fdg8(1 + k/2) (with q = 10−5). This boundary follows a decreasing trend with k, which is similar to but less steep than our result in Figure 11. However, given the approximate nature of the arguments above, we are unable to quantitatively compare this prediction with our result, specifically at k = 2, where the extrapolation from larger k likely breaks down. Further theoretical work will be required to address whether stability is responsible for the observed inclination dispersion trend.

It is also possible that the correlation between σi,k and k reflects the formation process via giant impacts. Unfortunately, Hansen & Murray (2013) did not explicitly present their eccentricity/inclination dispersions as functions of intrinsic multiplicities, which would have made a pivotal comparison against our Figure 11. An analytic version of this formation process was put forward by Tremaine (2015), which used the ergodic approximation and dynamical stability. This model predicts that the dispersion in eccentricities ∝1/k, which is similar to the stability argument given above if we assume σi ∝ σe. In practice, this model does not provide any prediction for the inclinations as it has assumed coplanar orbits.

Some theoretical works suggest that distant giant planets could drive up the mutual inclinations of the inner planetary systems and/or decrease their multiplicity by dynamical instabilities (Johansen et al. 2012; Huang et al. 2017; Lai & Pu 2017; Mustill et al. 2017; Pu & Lai 2018). If this is the primary channel for the observed features, one would expect that the stellar hosts of single tranets (low j and/or large σi) should reflect the well-known giant planet-metallicity correlation (e.g., Fischer & Valenti 2005). However, as the lower middle panel of Figure 3 shows, the metallicity distribution of the stars with single tranets is similar to that of stars with multiple tranets. A similar result is found by looking at the sample of single-tranet systems with TTVs, which should be indicative of multi-planet systems with large inclination dispersions. Therefore, these observations indicate that the perturbation by the unseen distant giant planet is likely not the primary cause of the observed inclination dispersions.

Alternatively, host stars with a large quadrupole moment can also excite planet–planet mutual inclinations (Spalding & Batygin 2016; Spalding et al. 2018). Unfortunately, our current sample does not have enough hot (Teff > 6200 K) stars, for which this mechanism is expected to be most efficient, to test this hypothesis.

5.4. Solar System versus Kepler Systems

We have determined the intrinsic architecture of Kepler planetary systems and now we discuss briefly how our solar system fits in this revised picture.

First of all, with ηKepler = 30%, our solar system belongs to the majority of Sun-like stars that do not have any planets detectable by Kepler. As was first pointed out by Chiang & Laughlin (2013), the Kepler systems contain more solid mass in their inner regions than is expected from a minimum-mass solar nebula. The typical mass for Kepler planets is ∼3 M (Marcy et al. 2014; Fulton et al. 2017; Hadden & Lithwick 2017; Owen & Wu 2017). So these systems contain from 3 to 18 M of solid masses, in contrast to our own 2 M (within 400 days). As far as the inner (≲1 au) planetary system is concerned, it seems plausible that the difference arises because our system is a slightly low-metallicity version of the typical Kepler system.13

Even though none of the solar system planets are detectable by Kepler, it is interesting to note that our own system shares at least two similarities with the Kepler systems, as has been shown in Figure 11. First, similar to the typical Kepler system, we also have three planets within 400 days. Second, if only considering these three planets (Mercury, Venus, and Earth), the average inclination relative to the invariable plane, 3fdg5, is consistent with the inclination dispersions of 3-planet Kepler systems. It is important to note, however, that the higher mass Kepler systems are typically thought to form in a substantially shorter amount of time (prior to the dispersal of the natal disk) as compared to the solar system terrestrial planets (Morbidelli et al. 2012; Chiang & Laughlin 2013). It is therefore unclear whether the similarities between the solar system and Kepler systems are purely coincidental or are representative of a more fundamental behavior of general planetary systems.

We thank the anonymous referee for comments. We would also like to thank Andy Gould for comments on an earlier version of the manuscript. This paper includes data collected by the Kepler mission. Funding for the Kepler mission is provided by the NASA Science Mission directorate. This paper also uses data from the LAMOST survey. Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope, LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences. C.P. acknowledges support from the Jeffrey L. Bishop Fellowship and from the Gruber Foundation Fellowship. S.D. acknowledges Project 11573003 supported by National Natural Science Foundation of China (NSFC).

Appendix: Simple Estimation of Average Number of Planets Per Star

As Youdin (2011) pointed out, the average number of planets per star ${\bar{n}}_{{\rm{p}}}$ is directly given by the the total number of tranets ${\sum }_{j}{{jN}}_{j}$, the total number of stars ${ \mathcal N }$, and the average transit parameter $\langle \epsilon \rangle $. See also our Equation (22). With our mathematical notations, this is equivalent to

Equation (29)

In other words, the number of tranets one expects to see in a k-planet system is proportional to k, regardless of the details of gjk. This can be proved mathematically using the expression of G in Tremaine & Dong (2012). Below we provide another simple and robust proof and discuss the associated assumption.

The transit probability is essentially the fractional area on a unit sphere where transit happens (epsilon > cos Ip). This can be shown graphically by projecting the 3D sphere onto a 2D plane, as done in Figure 12 for 1-planet and 2-planet cases. See also Brakensiek & Ragozzine (2016) for the 3-planet case. For k = 1, Equation (29) reduces to the definition of $\langle \epsilon \rangle $. For k = 2, with the notations for different areas in Figure 12, the left-hand side of Equation (29) is (A1 + A2 + A3 + A4 + A5 + A6) + 2(A7 + A8) = (A1 + A7 + A2 + A8 + A3) + (A4 + A7 + A5 + A8 + A6) = g11(p1) + g11(p2), where p1 and p2 denote the planet 1 and 2, respectively. Therefore, the left-hand side equals the right-hand side as long as the transit parameters (or approximately the separations) of the two planets are statistically no different. This is the only assumption that goes into Equation (29). The k ≥ 3 cases can be easily proved similarly.

Figure 12.

Figure 12. Schematic views of the transit probabilities for 1-planet (left panel) and 2-planet (right panel) cases. The unit sphere is now projected onto a plane following the standard Mollweide projection. The shaded regions indicate the positions of the observer where the transit happens. The width of the band corresponds to the transit parameter epsilon. In the 2-planet case, the overlapping regions are the positions where the observer would see both planets transit. We use symbols Ai (i = 1, ..., 8) to denote the area of individual strips.

Standard image High-resolution image

Is the separation distribution of planets in singles different from the separation distribution of planets in multiples? If using tranets, one indeed sees a difference between these two distributions. For example, see the upper middle panel of Figure 3 as well as Figure 4. However, this is not super surprising, as the detections of tranets around the same star are correlated due to the geometric effect. Even so, the difference in these two distributions is not very prominent. Perhaps a better sample to use is the sample of planets from RV observations. As Tremaine & Dong (2012) discussed, the distributions of the semimajor axes of RV planets are statistically the same in single- and multiple-planet systems. See in particular the lower left panel of their Figure 2.

Footnotes

  • Note that the Tremaine & Dong (2012) extended their notations to j = k = 0 and thus their G matrix had (K + 1) × (K + 1) dimensions.

  • Although Holczer et al. (2016) included P < 300 days in their initial selections, they also required at least six transits observed in the Kepler window.

  • We confirm that this approach produces very similar but smoother posteriors than directly using the shape of the χ2 (i.e., $\mathrm{ln}\,{ \mathcal L }$) curve.

  • This notation follows the well-accepted term η.

  • 10 

    Because transit singles outnumber the transit multiples significantly. The epsilon distribution will be essentially the same regardless of whether the innermost tranets (Fressin et al. 2013) or the most detectable tranets (Petigura et al. 2013) of transit multiples are included.

  • 11 

    Here, migration by the gas disk is artificially suppressed. One possibility for this is late assembly (Lee et al. 2014).

  • 12 

    The extrapolation might break down at k = 2 as Kcrit is somewhat insensitive to changes in mutual inclinations for ≲40° (Petrovich 2015).

  • 13 

    The solar system has giant planets beyond 1 au, which is atypical for Sun-like stars with solar metallicity. If the outer planetary system is also taken into account, our solar system no longer belongs to the majorities (Zhu & Wu 2018).

Please wait… references are loading.
10.3847/1538-4357/aac6d5