IS THE GALACTIC BULGE DEVOID OF PLANETS?

Matthew T. Penny; Calen B. Henderson; Christian Clanton

doi:10.3847/0004-637X/830/2/150

1. INTRODUCTION

The process of planet formation is complex and the abundance and architectures of planetary systems will likely depend on numerous factors. The prime example of such a dependence is the strong correlation of the abundance of giant planets with the metallicity of the host star (e.g., Santos et al. 2004; Fischer & Valenti 2005). This suggests that the formation of giant planets depends on the amount of solids in the protoplanetary disk (Ida & Lin 2004). A correlation of the abundance of giant planets with host mass (Johnson et al. 2010), though less secure (see, e.g., Lloyd 2013), also points to a similar conclusion, or at least to a dependence on total disk mass. It is also possible that extrinsic factors such as the presence of a binary companion (e.g., Kaib et al. 2013), the density of nearby stars (e.g., de Juan Ovelar et al. 2012), or the intensity of the ambient radiation field (e.g., Thompson 2013) may affect planet formation during the protoplanetary disk phase or later by dynamical interactions. In order to understand the magnitude of each possible effect on planet abundance, it will be necessary to search for planets around stars that span a broad range of fundamental properties and formation environments.

With the advent of modern, high-cadence gravitational microlensing surveys it is now becoming possible to measure the abundance of planets of roughly the mass of Neptune to Jupiter at the orbital distances at which such planets are thought to form (see Gaudi 2012 for a detailed review). So far, studies have focused on measuring the overall abundance and mass function of planets in these orbits (Gould et al. 2010b; Sumi et al. 2010; Cassan et al. 2012; Shvartzvald et al. 2016), but microlensing light curves combined with various auxiliary observations can also yield either measurements of or constraints on the distance to the planet hosts. This is particularly interesting for microlensing detections, because microlensing events can be caused by stars at any distance between the Earth and the Galactic bulge, where most of the stars that act as sources reside. Therefore, the distributions of host age, metallicity, and kinematic population membership (e.g., disk or bulge) will vary as a function of distance. If the planet abundance is affected by any of these parameters it may be possible to constrain this dependence. Measuring the relative abundance of planets between the disk and bulge is an especially interesting prospect because it is extremely difficult to probe with any other technique. For example, Clarkson et al. (2008) attempted to address this question using 16 planet candidates discovered by the Sagittarius Window Eclipsing Extrasolar Planet Search (SWEEPS) carried out using the Hubble Space Telescope, but because of a potentially large false-positive fraction they could not draw firm conclusions. Our sample of 31 microlensing planet hosts is already significantly larger, and future microlensing samples will be larger still and significantly cleaner and easier to model. Additionally, observations from future space-based microlensing surveys will enable the discovery of large samples of transiting planets (McDonald et al. 2014), whose populations can be determined using similar methods to those of Clarkson et al. (2008).

Our aim is to investigate what information the distribution of planet host distances inferred by microlensing planet searches might contain about the process of planet formation. The sample we have to work with is far from ideal for this task. It has been constructed through a heterogeneous mix of survey strategies that aim to maximize the number of planet detections, but that do not always allow an easy assessment of the detection sensitivity owing to human intervention in the observing schedule and the large, heterogeneous network of follow-up telescopes that are necessary to achieve significant planet sensitivity in many cases. This has made it extremely difficult to measure the abundance of microlensing planets from microlensing surveys conducted so far: the surveys themselves were until recently not able to observe at high enough cadence to find most planets blind, and follow-up resources needed to be directed in a manner that was independent of the presence of a planet.

These considerations would seem to render our task hopeless with the current data set. However, while the presence or not of a potential planetary signature in a microlensing event may affect the way in which it is observed, there is little to no information on the host distance, which could affect the decision process of an observer. This is not to say that the detectability of a planet in a microlensing event is independent of its host distance, just that there is unlikely to be anything that would make the detectability as a function of host distance difficult to model. This means that if we can construct a reasonable model both of planet detectability as a function of host distance and of the distribution of lens stars (i.e., the potential planet hosts) then we can reasonably compare the distribution of actual planet host distances to that we expect from the model. More specifically, we can test the hypothesis that the distribution of actual distances of planet host stars is drawn from the same distribution of distances of host stars that we produce from our model.

In this paper we construct a simulated sample of microlensing planet host stars with detected planets and compare the resultant (expected) distribution of host distances with the observed distribution. We begin by describing how distances are measured to microlensing planet hosts in Section 2. In Section 3 we describe the selection of the sample of observed microlensing planet hosts, and in Section 4 we describe the model with which we build the comparison distribution. In Section 5 we present the results of our comparisons of the model and observed distance distributions. In Section 6 we discuss our results and their potential causes, and in Section 7 we present our conclusions.

2. MEASURING DISTANCES

Before discussing the selection of our observed sample of planet hosts it is necessary to first introduce the methods by which lens (host) distances are measured in microlensing events. In most microlensing events that involve a single point lens the only constraint on the lens distance is provided by the event timescale,

$\begin{eqnarray}&&{t}_{{\rm{E}}}=\displaystyle \frac{{\theta }_{{\rm{E}}}}{{\mu }_{\mathrm{rel},\oplus }},\end{eqnarray} \tag{ 1 }$

where ${\theta }_{{\rm{E}}}$ is the angular Einstein radius and ${\mu }_{\mathrm{rel},\oplus }$ is the relative proper motion between the lens and source, measured in the geocentric reference frame.⁶ The angular Einstein radius

$\begin{eqnarray}&&{\theta }_{{\rm{E}}}=\sqrt{\kappa M{\pi }_{\mathrm{rel}}},\end{eqnarray} \tag{ 2 }$

is determined by the lens mass M and the relative lens–source parallax ${\pi }_{\mathrm{rel}};$ κ = 8.144 mas ${M}_{\odot }^{-1}$ is a constant.

From an observer's standpoint, ${\pi }_{\mathrm{rel}}$ is the fundamental quantity related to the location of the lens, and is a measure of the proximity of the lens to the source,

$\begin{eqnarray}&&{\pi }_{\mathrm{rel}}=\displaystyle \frac{{\rm{a}}{\rm{u}}}{{D}_{{\rm{l}}}}-\displaystyle \frac{{\rm{a}}{\rm{u}}}{{D}_{{\rm{s}}}},\end{eqnarray} \tag{ 3 }$

where ${D}_{{\rm{l}}}$ and ${D}_{{\rm{s}}}$ are the lens and source distances, respectively. For most microlensing events, the source lies in the bulge, and it is reasonable to assume that ${D}_{{\rm{s}}}$ is known to a precision better than 20%. This means that ${\pi }_{\mathrm{rel}}$ is usually a good indicator of the lens distance. Unfortunately, many papers that estimate lens distances do not quote an estimate for ${\pi }_{\mathrm{rel}}$ and do not contain enough information to calculate it, so instead we will work with the estimates of lens distance ${D}_{{\rm{l}}}$ to investigate the distribution of lens locations.

It is possible to measure ${\pi }_{\mathrm{rel}}$ and hence obtain a good estimate of the lens distance if two potentially observable quantities are measured: ${\theta }_{{\rm{E}}}$ and the microlensing parallax

$\begin{eqnarray}&&{\pi }_{{\rm{E}}}=\displaystyle \frac{{\pi }_{\mathrm{rel}}}{{\theta }_{{\rm{E}}}}.\end{eqnarray} \tag{ 4 }$

A detailed derivation of microlensing mass and distance measurements using standard notation is given by Gould (2000), Gould & Horne (2013), and Calchi Novati & Scarpetta (2016). In all but one planetary microlensing event to date ${\theta }_{{\rm{E}}}$ has been measured from finite-source effects (Witt & Mao 1994): these resolve the source star, which provides an angular ruler with which to measure ${\theta }_{{\rm{E}}}$ , and hence also ${\mu }_{\mathrm{rel},\oplus }$ . Measuring ${\pi }_{{\rm{E}}}$ is more difficult, and until recently has required the detection of subtle distortions to the light curve of the microlensing event that result from the acceleration of the Earth in its orbit (Gould & Loeb 1992). The chance of detecting these distortions is greatly increased if the event has a long timescale and if the projection of the Einstein ring from the source onto the solar system is small. These conditions end up strongly favoring nearby lenses. A much less biased way to measure ${\pi }_{{\rm{E}}}$ requires observation of a microlensing event from a spacecraft well separated from the Earth (Refsdal 1966; Gould 1994); this has recently come to fruition using Spitzer (Calchi Novati et al. 2015; Shvartzvald et al. 2015; Udalski et al. 2015c; Yee et al. 2015; Street et al. 2016) and will be further advanced with Campaign 9 of the K2 mission (Howell et al. 2014; Henderson et al. 2015).

If it is not possible to measure the lens distance through measurements of both ${\theta }_{{\rm{E}}}$ and ${\pi }_{{\rm{E}}}$ , it is possible to estimate the distance through a Bayesian analysis by applying what constraints happen to be available together with priors on the distribution of lenses and sources from a Galactic model, as well as often unstated but implied flat priors such as the abundance of planets as a function of host star mass, metallicity, and orbital architecture. In the absence of constraints on the lens distance, these Bayesian estimates will tend to cluster at the peak of the prior probability distribution. For this reason, most authors caution against using Bayesian distance estimates for the kind of study we are conducting. As there is currently no other way to access distant planets, we choose to commit this sin and throughout remain wary of its consequences (see Section 6.2 for a discussion of its impact).

3. DATA

We select our sample of planet hosts from all planetary microlensing events detected in seasons up to and including 2014 and with a publication date of 2015 or earlier. Our only cut is to exclude systems with mass ratios $(q\equiv {M}_{{\rm{p}}}/M)$ larger than 0.03. This cut prevents a strong distance bias that exists for large planets that orbit very low-mass stars, such as OGLE-2011-BLG-420 (Choi et al. 2013). If the system has a mass ratio that overlaps that possible for stellar binaries then it is likely, first, that it will not be analyzed in detail, and second, that unless its parallax is measured, it will not be recognized as a planet.

Table 1 lists the distances and relative proper motions of the microlensing planet hosts in our sample and the method by which their distance was measured. Where there is more than one possible solution we use the solution favored by the authors, or where they are agnostic we choose the solution with the lowest χ². Often the choice of solution has very little impact on the distance estimate. Where the method column lists ${\pi }_{{\rm{E}}}{\theta }_{{\rm{E}}}$ , the host distance was measured by combining measurements of parallax and finite-source effects. Where method is listed as Bayes, ${D}_{{\rm{l}}}$ was estimated using a Bayesian analysis. Many of these Bayesian estimates incorporate constraints from adaptive optics (AO) imaging.

Table 1. Microlensing Planet Host Stars

Event	D_l	μ_rel	${\theta }_{{\rm{E}}}$	$\mathrm{log}q$	ℓ	b	Method	References
	(kpc)	(mas yr⁻¹)	(mas)		(deg)	(deg)
O03-235	${5.8}_{-0.7}^{+0.6}$	3.1 ± 0.5	0.52 ± 0.08	−2.4	+2.20	−3.69	Bayes	1, 2
O05-071	3.2 ± 0.4	4.3 ± 0.2	0.85 ± 0.05	−2.1	−4.42	−3.79	π_Eθ_E	3, 4
O05-169	4.1 ± 0.4	7.7 ± 0.8	1.00 ± 0.22	−4.2	+0.68	−4.74	Bayes	5, 6, 7
O05-390	6.6 ± 1.0	6.8 ± 0.7	0.20 ± 0.03	−4.1	−0.27	−2.36	Bayes	8
O06-109	1.5 ± 0.1	4.3 ± 0.1	1.50 ± 0.04	−3.3	−0.21	−1.89	π_Eθ_E	9, 10
M07-192	${0.70}_{-0.12}^{+0.21}$	5.1 ± 1.4	0.56 ± 0.06	−4.4	+4.03	−3.39	π_Eθ_E	11, 12, 13
O07-368	${5.9}_{-1.4}^{+0.9}$	3.5 ± 0.6	0.53 ± 0.08	−4.0	−1.65	−3.69	Bayes	14
M07-400	${5.8}_{-0.8}^{+0.6}$	8.2 ± 0.5	0.32 ± 0.02	−2.6	+2.38	−4.70	Bayes	15
O08-092	8.1 ± 2.0	3.2 ± 0.2	0.34 ± 0.02	−3.6	−4.75	−3.34	other	16
M08-310	7.7 ± 2.0	5.1 ± 0.3	0.15 ± 0.01	−3.5	−4.09	−4.56	Bayes	17
O08-355	6.8 ± 1.1	3.1 ± 0.4	0.28 ± 0.03	−1.9	−0.08	−3.45	Bayes	18
M08-379	${3.3}_{-1.2}^{+1.3}$	7.6 ± 1.6	0.88 ± 0.19	−2.2	+0.38	−3.11	Bayes	19
M09-266	3.0 ± 0.3	5.9 ± 0.3	0.98 ± 0.04	−4.2	−4.93	−3.58	π_Eθ_E	20
M09-319	${6.1}_{-1.2}^{+1.1}$	7.5 ± 0.7	0.34 ± 0.03	−3.4	+4.20	−3.01	Bayes	21
M09-387	5.7 ± 1.3	2.8 ± 0.3	0.31 ± 0.03	−1.9	−3.44	−4.09	Bayes	22
M10-328	0.81 ± 0.10	5.7 ± 0.7	0.98 ± 0.12	−3.6	−0.16	−3.21	π_Eθ_E	23
M10-353	6.4 ± 1.1	6.0 ± 2.6	0.19 ± 0.09	−2.9	+3.60	−2.90	Bayes	24
M10-477	2.3 ± 0.6	10.3 ± 0.8	1.38 ± 0.11	−2.7	+0.05	−5.09	Bayes	25
O11-0251	2.6 ± 0.6	4.3 ± 1.6	0.75 ± 0.28	−2.7	+0.67	+2.33	π_Eθ_E	26
M11-262	7.2 ± 0.8	11.6 ± 0.9	0.12 ± 0.01	−3.4	−0.37	−3.92	Bayes	27
O11-265	4.4 ± 0.5	2.9 ± 0.3	0.42 ± 0.04	−2.4	+2.70	−1.52	π_Eθ_E	28
M11-293	7.7 ± 0.4	4.3 ± 0.3	0.26 ± 0.02	−2.3	+1.52	−1.65	Bayes	29, 30
M11-322	${7.6}_{-0.9}^{+1.0}$	5.0 ± 1.7	>0.085	−1.5	+3.63	−2.81	Bayes	31
O12-0026	4.1 ± 0.3	3.5 ± 0.3	0.91 ± 0.09	−3.9	+0.19	+3.06	π_Eθ_E	32
O12-0406	5.0 ± 0.3	3.0 ± 0.3	0.53 ± 0.05	−2.2	−0.46	−2.22	π_Eθ_E	33, 34
O12-0563	${1.3}_{-0.8}^{+0.6}$	6.4 ± 0.6	1.36 ± 0.13	−3.0	+3.31	−3.25	Bayes	35
M13-220	4.5 ± 2.0	12.6 ± 0.9	0.46 ± 0.03	−2.5	+1.50	−3.76	other	36
O13-0341	1.2 ± 0.1	10.2 ± 0.8	0.93 ± 0.07	−4.3	−0.05	−1.68	π_Eθ_E	37
O13-0723	0.49 ± 0.04	3.8 ± 0.3	0.70 ± 0.06	−4.2	−0.02	+2.83	π_Eθ_E	38
O14-0124	4.1 ± 0.6	2.3 ± 0.7	0.84 ± 0.26	−3.2	+2.34	−2.92	π_Eθ_E	39
MOA-bin-1	${5.1}_{-1.9}^{+1.2}$	8.8 ± 1.4	0.77 ± 0.11	−2.3	−0.11	−1.48	Bayes	40

Note. Estimates of lens distance, proper motions, mass ratios, and galactic coordinates for our sample of microlensing planet hosts. The method column is explained in the text. Event names of the format SURVEY-YYYY-BLG-NNN(N) are abbreviated as SYY-NNN(N) with O = OGLE and M = MOA, e.g., OGLE-2014-BLG-0124 becomes O14-0124. Events with joint survey names are listed only by the first name.

References. (1) Bennett et al. 2006; (2) Bond et al. 2004; (3) Dong et al. 2009b; (4) Udalski et al. 2005; (5) Bennett et al. 2015; (6) Batista et al. 2015; (7) Gould et al. 2006; (8) Beaulieu et al. 2006; (9) Bennett et al. 2010; (10) Gaudi et al. 2008; (11) Kubas et al. 2012; (12) Gould et al. 2010a; (13) Bennett et al. 2008; (14) Sumi et al. 2010; (15) Dong et al. 2009a; (16) Poleski et al. 2014; (17) Janczak et al. 2010; (18) Koshimoto et al. 2014; (19) Suzuki et al. 2014; (20) Muraki et al. 2011; (21) Miyake et al. 2011; (22) Batista et al. 2011; (23) Furusawa et al. 2013; (24) Rattenbury et al. 2015; (25) Bachelet et al. 2012; (26) Kains et al. 2013; (27) Bennett et al. 2014; (28) Skowron et al. 2015; (29) Batista et al. 2014; (30) Yee et al. 2012; (31) Shvartzvald et al. 2014; (32) Han et al. 2013b; (33) Tsapras et al. 2014; (34) Poleski et al. 2014; (35) Fukui et al. 2015; (36) Yee et al. 2014; (37) Gould et al. 2014; (38) Udalski et al. 2015a; (39) Udalski et al. 2015c; (40) Bennett et al. 2012.

Download table as: ASCII Typeset image

For two of the events included in our sample, the authors did not compute Bayesian estimates of the lens distance: OGLE-2008-BLG-092 (Poleski et al. 2014) and MOA-2013-BLG-220 (Yee et al. 2014). It is important to include these events, because excluding events without parallax measurements would bias the sample toward more nearby lenses. Poleski et al. (2014) argue that their lens is in the bulge due to the small relative proper motion and perform a Monte Carlo simulation to estimate a distance of 8.1 kpc, which we adopt. Yee et al. (2014) use blended flux constraints and proper motions to argue that their lens resides in the disk, but only provide an approximate upper limit to the distance of 6.5 kpc; we adopt a distance of 4.5 kpc, which corresponds approximately to the peak of the distance distribution for the disk in our model, which is described in the next section. In both cases we arbitrarily adopt an error bar of 2.0 kpc in order to be able to run a bootstrap analysis in Section 6.

4. MODEL

For our model we adopt the existing simulation of the KMTNet microlensing survey by Henderson et al. (2014), hereafter H14. The simulation assumes that the Galaxy follows the model of Han & Gould (2003), and tracks whether stars belong to the bulge (which follows a G2 model of Dwek et al. 1995 as described by Han & Gould 1995a, 2003) or the disk (which follows a model of Bahcall 1986 as described by Han & Gould 1995b). Source and lens stars are drawn from the Galactic model and assigned a weight proportional to their contribution to the event rate, i.e., proportional to ${\mu }_{\mathrm{rel},\odot }{\theta }_{{\rm{E}}}$ . Here, ${\mu }_{\mathrm{rel},\odot }$ is the relative proper motion between lens and source measured in the heliocentric frame, because the simulations of H14 do not consider the Earth's orbital motion. Given the Earth's instantaneous velocity of 2π au yr⁻¹, the heliocentric and geocentric proper motions can differ significantly, but for the purposes of this paper it is rare for a large relative proper motion in one frame to be small in the other frame. For the rest of this paper then, we will treat heliocentric and geocentric velocities as equivalent when comparing model to data. This is not ideal, but we have no other option because the simulations did not output enough information to reconstruct the geocentric proper motion, and it is not possible to compute the heliocentric proper motion of the observed events unless parallax is measured or the source and lens star are resolved with high-resolution observations long after the microlensing event (see Yee et al. 2015 and Bennett et al. 2007, 2015; Batista et al. 2015 for discussions of each option, respectively). We will therefore use ${\mu }_{\mathrm{rel}}$ to refer to relative proper motions from here on, be they heliocentric or geocentric, and assume that their difference does not significantly affect our results.

A planet is assigned to each lens, with mass ${M}_{{\rm{p}}}$ and semimajor axis a chosen from a grid, and observations of the planetary microlensing event are simulated. To match observational constraints on the mass function of planets where microlensing is sensitive, we weight each event by a factor ${M}_{{\rm{p}}}^{0.73}$ (Cassan et al. 2012). The simulated data are fit with a microlensing event that involves a single point mass, and if the χ² of this fit exceeds the χ² of the data related to the underlying planetary microlensing event by 160, the event is flagged as a planet detection. Further details of the simulation can be found in H14.

KMTNet plans to observe a small number of fields continuously at high cadence, which is a different observing strategy than was used to find most of the planets in our sample. Most were found using lower-cadence survey observations that covered a larger region of the Galactic bulge, in combination with follow-up observations of individual events, though some of the later discoveries can be classified as survey-only detections. Without using extremely complicated simulations, we are forced to assume that the observing strategy used will not significantly bias the distance distribution of our sample. This may or may not be the case, but what is perhaps more likely to affect the distance distribution is the area over which the planets are searched for. The four-field survey planned for KMTNet is far smaller than the area over which our sample is distributed. However, H14 also simulated a larger-area survey at lower cadence in order to optimize the number of fields observed. We choose to use the widest, 13-field simulation that H14 used; this covers a large fraction of the area covered by the OGLE-III survey, with fields between ℓ = 7° and ℓ = −8° and b = −2° to b = −6° as well as a couple of fields in the northern bulge. Somewhat surprisingly, we did not find a significant variation with field location of the relative ratio of planet hosts in the bulge to those in the disk, apart from in the most outlying southern fields. With 13 fields, the cadence of observation was 32.5 minutes.

For this simulation, H14 use a relatively sparsely sampled grid of planet mass versus semimajor axis, with masses of 1, 10, and $100\ {M}_{\oplus }$ and $\mathrm{log}a=0.15,0.40$ , and 0.65; however, our early work showed that our results were not changed significantly when we used their more densely sampled fiducial simulations that cover a smaller area at higher cadence.

As we are interested in the role of metallicity, we assign each lens star a metallicity according to the prescription of Clanton & Gaudi (2014). Metallicities for disk stars are drawn from the distributions of [Fe/H] as a function of Galactocentric radius and distance from the plane as measured by APOGEE (Hayden et al. 2014), and bulge stars are assigned a metallicity drawn from the sample of bulge dwarf metallicities collected by Bensby et al. (2013).

5. THE DISTANCE DISTRIBUTION OF PLANET HOSTS

With our sample and model defined we can begin to investigate whether the observed data match our expectations from the model. We can imagine a sophisticated procedure to incorporate all possible information on each planet host and assess the likelihood of a given model in a Bayesian framework. In the future, with more uniform samples of planet hosts, this level of detail will be appropriate, but given the non-uniformity of our sample and the difficulty in understanding its selection effects, we select a simpler frequentist hypothesis testing approach as a first attempt.

The top panel of Figure 1 shows the differential distribution of distance estimates of planet hosts together with the distribution expected from our model. The model (normalized to the total number of hosts) underpredicts the number of hosts in each distance bin less than 6 kpc, whereas beyond 6 kpc it generally overpredicts. We have also plotted the contribution of disk hosts and note that the shape of the disk-only host distribution could potentially offer a better fit to the data.

Shown in the lower panel of Figure 1 are the same data in cumulative form. We computed the Anderson–Darling (AD) test⁷ statistic ${A}^{2}=6.661$ by comparing the model⁸ to the data; this has a p-value of $5.0\times {10}^{-4}$ . Comparing only the disk hosts from the model yields ${A}^{2}=1.468$ , which has a p-value of $0.18$ . The AD test therefore rejects the hypothesis that the distance estimates for microlensing planet hosts were drawn from the same distribution as our model, but does not reject the hypothesis that the data were drawn from our simulated distribution including only disk hosts.

5.1. A Paucity of Bulge Planets?

Rather than ask the binary question of whether or not there are planets in the bulge, we can ask are there fewer planets in the bulge than the disk? We have weighted the bulge hosts in our model by an additional factor ${f}_{\mathrm{bulge}}$ , the abundance of planets around bulge hosts relative to the abundance around disk hosts, and repeated the AD test at various values of ${f}_{\mathrm{bulge}}$ . We plot the results in Figure 2 as the dark blue solid line.⁹ A² rises as ${f}_{\mathrm{bulge}}$ increases, and crosses the p = 0.05 and p = 0.01 lines at ${f}_{\mathrm{bulge}}=0.32$ and $0.54$ , respectively. The KS test statistic behaves similarly, but crosses the p = 0.01 line at ${f}_{\mathrm{bulge}}=0.93$ . This result suggests that the abundance of planets in the bulge relative to the disk is less than $0.54$ at 3σ. Before we can draw this conclusion however, we must more critically examine our assumptions regarding both the data and our model and also reject any other potential explanations.

**Figure 2.** Anderson–Darling (solid) and Kolmogorov–Smirnov (dot-dashed, right axis) test statistics plotted as a function of ${f}_{\mathrm{bulge}}$ , the abundance of planets in the bulge relative to the disk. Horizontal lines are plotted at labeled p-values for the AD test. The KS test axis is plotted such that there is an approximately linear mapping between AD and KS p-values in the range 0.001–0.2. Lightly shaded lines weight the planet abundance by the joint mass and metallicity correlation found by Johnson et al. (2010).
Download figure:
Standard image High-resolution image

**Figure 2.** Anderson–Darling (solid) and Kolmogorov–Smirnov (dot-dashed, right axis) test statistics plotted as a function of ${f}_{\mathrm{bulge}}$ , the abundance of planets in the bulge relative to the disk. Horizontal lines are plotted at labeled p-values for the AD test. The KS test axis is plotted such that there is an approximately linear mapping between AD and KS p-values in the range 0.001–0.2. Lightly shaded lines weight the planet abundance by the joint mass and metallicity correlation found by Johnson et al. (2010).
Download figure:
Standard image High-resolution image

5.2. Relative Proper Motions

Given the difficulty in measuring the distance of planet hosts, there are a number of ways in which the distribution of distance estimates could be altered in a way that we have not modeled. Relative proper motions are measured routinely in planetary microlensing events, and stellar kinematics are correlated with population, so we can use the distribution of measurements of proper motionas a consistency check on our findings from the previous subsections. Figures 3 and 4 show the results of the same exercise performed with measurements of the relative proper motion.

**Figure 3.** Distribution of relative proper motions for microlensing events with planets. See Figure 1 for the description of the plot elements.
Download figure:
Standard image High-resolution image

**Figure 4.** AD and KS test statistics as a function of ${f}_{\mathrm{bulge}}$ for the ${\mu }_{\mathrm{rel}}$ distribution. See Figure 2 for a description of the plot elements.
Download figure:
Standard image High-resolution image

The distribution of proper motions should be much more robust to catastrophic errors than the distance distribution, but due to the significant overlap of the distributions of proper motions of bulge and disk stars, it will have less statistical power to disentangle the contributions of each population. Recall also that we are not comparing relative proper motions in the same frame, so it is possible that we are biasing the result.

The lack of statistical power is borne out, with a large range of ${f}_{\mathrm{bulge}}$ producing AD test statistics with large p-values. However, the AD statistic does rise toward smaller values of ${f}_{\mathrm{bulge}}$ , reaching $2.704$ at ${f}_{\mathrm{bulge}}=0$ , which corresponds to $p=0.039$ and indicates tension with the hypothesis of no bulge planets. However, the distribution of proper motions does not rule out all of the space with ${f}_{\mathrm{bulge}}\lt 0.54$ that is consistent with the distance distribution.

5.3. Other Physical Explanations?

To investigate whether other physical factors could cause the observed distribution of distance estimates, we considered toy models of the planet abundance f as a function of host metallicity [Fe/H], host mass M, and planet semimajor axis a:

$\begin{eqnarray}&&f\propto {10}^{\alpha [\mathrm{Fe}/{\rm{H}}]},\quad f\propto {M}^{\beta },\quad f\propto {a}^{\gamma },\end{eqnarray} \tag{ 5 }$

where

$\begin{eqnarray}&&f=\displaystyle \frac{{d}^{3}N}{d[\mathrm{Fe}/{\rm{H}}]{dMda}},\end{eqnarray} \tag{ 6 }$

following a long tradition of using power laws to model the distribution of exoplanet abundances (e.g., Stepinski & Black 2000; Lineweaver & Grether 2003; Fischer & Valenti 2005; Johnson et al. 2010). Figure 5 plots the AD and KS test statistics as a function of each parameter α, β, and γ. If a value of α, β, or γ different from our fiducial value of 0 significantly reduces the value of the AD test statistic, then this would indicate that, rather than a different abundance of planets in the disk and bulge, a dependence of planet abundance on intrinsic stellar properties or system architecture may be responsible for the discrepancy that we see between the model and the observational data.

**Figure 5.** AD and KS test statistics as a function of power-law slope parameters for toy models (Equation (5)) of the planet abundance as a function of metallicity (top), host mass (middle), and semimajor axis (bottom) for the distributions of both distance estimate ${D}_{{\rm{l}}}$ (blue), and relative proper motion ${\mu }_{\mathrm{rel}}$ (red).
Download figure:
Standard image High-resolution image

**Figure 5.** AD and KS test statistics as a function of power-law slope parameters for toy models (Equation (5)) of the planet abundance as a function of metallicity (top), host mass (middle), and semimajor axis (bottom) for the distributions of both distance estimate ${D}_{{\rm{l}}}$ (blue), and relative proper motion ${\mu }_{\mathrm{rel}}$ (red).
Download figure:
Standard image High-resolution image

A value of $\alpha =0.9$ improves the model's match to the distribution of distance estimates, but only to a value of ${A}^{2}=5.263$ with $p=2.2\times {10}^{-3}$ . Interestingly, negative values of $\alpha \leqslant -1$ are rejected by the AD test with p ≤ 0.01 for the ${\mu }_{\mathrm{rel}}$ distribution as well as the ${D}_{{\rm{l}}}$ distribution. Were there a strong negative correlation of planet abundance with metallicity, the fraction of stars in the low-metallicity tail of the bulge's broad metallicity distribution would vastly overproduce planets compared to the disk, to the extent that it would even affect the distribution of relative proper motions of planetary microlensing events.

Allowing non-zero values of either β and γ does not significantly reduce A² (though implausibly strong negative correlations do slightly increase A²), which implies that these parameters do not significantly affect the distribution of distance estimates. Note, however, that our model has some shortcomings for both these parameters. First, for both the disk and bulge H14 do not simulate events with host stars more massive than $1\,{M}_{\odot }$ . This cutoff is reasonable for the older bulge, where stars more massive would have evolved off the main sequence, but for the younger disk more massive main-sequence stars will still be present. If there were a dependence of planet abundance on host mass, the relatively longer lever arm of the mass distribution in the disk could have an effect on the distance distribution. Second, H14 only explores a coarse grid of semimajor axis, so it is not clear whether a smooth distribution of semimajor axis would change the result. Because the Einstein ring radius ${r}_{{\rm{E}}}$ depends on ${D}_{{\rm{l}}}$ and the sensitivity of microlensing to planets is strongly peaked around separations of $1{r}_{{\rm{E}}}$ , a dependence of planet abundance on semimajor axis would affect the distance distribution of planet hosts.

Given that radial velocity surveys have found a correlation between both host mass and metallicity, we can ask how such a correlation would change our findings from Section 5.1. With no correlation with mass or metallicity (α = β = 0), we found 95% limits of $0.02\leqslant {f}_{\mathrm{bulge}}\leqslant 0.32$ from AD tests on the distributions of ${D}_{{\rm{l}}}$ and ${\mu }_{\mathrm{rel}}$ . Weighting by the trend of Johnson et al. (2010) between mass, metallicity, and planet abundance (α = 1.2, β = 1.0) produces a qualitatively similar model distribution to our fiducial model, but the number of disk hosts at distances of 3–5 kpc is enhanced at the expense of bulge planets between 6 and 9 kpc. This enhancement of planets in the inner disk is due to the radial metallicity gradient from APOGEE rising above the mean metallicity of the bulge in this region. The AD and KS statistics as a function of ${f}_{\mathrm{bulge}}$ , including the mass and metallicity correlations, are plotted in Figures 2 and 4 with lighter shades. The 95% confidence limits in this case are $0.06\leqslant {f}_{\mathrm{bulge}}\leqslant 0.32$ . The 1% upper limits on ${f}_{\mathrm{bulge}}$ increase from $0.54$ in the no-correlation case to $0.7$ with the observed correlations. Note that ${f}_{\mathrm{bulge}}$ has a slightly different interpretation in the two scenarios; with no mass or metallicity correlations, ${f}_{\mathrm{bulge}}$ can be interpreted as the ratio of planets per star in the bulge relative to that in the disk, but with the correlations, ${f}_{\mathrm{bulge}}$ is only the contribution to this ratio that has not been accounted for already by mass and metallicity correlations.

6. DISCUSSION

So far, while acknowledging potential problems with the distance estimates, we have treated them as if they were correct and investigated the consequences. Now the time has come to examine the distance estimates more critically and ask whether they themselves are causing the apparent lack of planets in the bulge. We must also examine the validity of our methods and the accuracy of our model.

An obvious criticism of our work so far is that we have not accounted for the uncertainties in the distance estimates (or proper motions) when computing p-values. If we were to do so, the p-values would increase. To compute accurate p-values we would need to bootstrap-resample from the posterior probability distributions of each distance estimate, but these distributions are rarely made available with their papers and are sometimes not even plotted. As an inferior substitute to the posteriors (especially when some are bimodal), we assumed that the quoted uncertainties on each measurement were Gaussian (with asymmetric error bars being treated as two half-Gaussians with equal probability), and resampled the ${D}_{{\rm{l}}}$ distribution 10⁴ times with ${f}_{\mathrm{bulge}}=1;$ distances that were drawn as negative were redrawn until they were positive. 95% of the resamplings had a p-value less than $1.7\times {10}^{-3}$ , compared to the p-value we measured for the actual sample of $5.0\times {10}^{-4};$ and only three of the resamplings had p > 0.01. This strongly suggests, but is not conclusive given our simplistic handling, that proper consideration of the posterior distributions would not explain the apparent lack of planets in the bulge.

6.1. Problems with the Model?

It is certain that our model is inaccurate at some level. The Galactic model of Han & Gould (2003) used by H14 is relatively old now in comparison to many new results on the structure and kinematics of the bulge and disk over the intervening time (e.g., Clarkson et al. 2008; McWilliam & Zoccali 2010; Nataf et al. 2010; Wegg & Gerhard 2013). Additionally, the disk of Han & Gould (2003) does not have a central hole, so one might consider disk stars near the Galactic center to belong to the bulge. But the effect of a lack of a disk hole might be cancelled by not also accounting for stars that were born in the disk and later swept up by the bar. Despite these problems, it is difficult to imagine how the various inaccuracies in the model could combine to form an error of at least a factor of three (for our 95% confidence limits accounting for the mass and metallicity correlations of Johnson et al. 2010) in the relative microlensing event rates between the disk and bulge that would be necessary to explain the disagreement between model and data.

It is possible that the simulations of H14 do not accurately reflect the process by which most of the planets were found, and therefore alter the distribution of distances. The simplest way in which such an error may manifest itself is by an additional dependence of planet detection efficiency on the parameters of the microlensing events. We can again use toy power-law models to test how a term unaccounted for in the detection efficiency behaves. The event timescale ${t}_{{\rm{E}}}$ and impact parameter ${u}_{0}$ are the most likely culprits, but we also test s and q, and a derived parameter $| s-1/s|$ , which describes the approximate position of the planetary caustics and has a minimum at s = 1 where microlensing is most sensitive; the results are shown in Figure 6.

**Figure 6.** As in Figure 5, but for microlensing parameters that could affect planet detectability.
Download figure:
Standard image High-resolution image

An additional weighting of ${t}_{{\rm{E}}}^{\sim 1}$ improves the match for the ${D}_{{\rm{l}}}$ distribution (p = 0.023), but at the cost of increased tension with the ${\mu }_{\mathrm{rel}}$ distribution (p = 0.013), and would not completely solve the problem. Such a weighting is a potentially plausible explanation for the lack of planets at large ${D}_{{\rm{l}}}$ because bulge–bulge lensing events tend to have shorter timescales, but longer timescales allow for more time to organize follow-up observations of high-magnification events, which the survey-only simulations of H14 would not capture. Applying a ${t}_{{\rm{E}}}^{+1}$ weighting also brings the median of the distribution of timescales of model events to 39.5 days (instead of 23.5 days for the unweighted distribution), which is more in line with the median timescale of the sample, which is 40 days.

Despite high-magnification events being over-represented in the observed sample relative to the simulations, there is no improvement offered by an inverse scaling with the impact parameter $| {u}_{0}|$ . A power law in s does not provide any improvement. A strong negative power of $\,| s-1/s{| }^{-1\to -2}$ does provide an improvement, which would imply that there was a selection effect that favored planets closer to the Einstein ring. This might be a plausible explanation if many putative discoveries of planets in high-magnification events are not published because of potential degeneracy with stellar mass binaries; alternatively, planets with large $| s-1/s|$ may be missed because follow-up observations tend to cover only the higher-magnification portions of light curves. Finally, an inverse power law with mass ratio ${q}^{\lesssim -0.5}$ could explain the differences between the data and the model, but this seems implausible because we already account for the measured planetary mass function of Cassan et al. (2012), and this would require that the mass function be roughly twice as steep. Furthermore, a slope twice as steep as that of Cassan et al. (2012) is inconsistent by more than 3σ with the constraints on the slope of the planetary mass function reported by Clanton & Gaudi (2016), which were derived from a joint analysis of results from microlensing, radial velocity, and direct imaging surveys. Finally, we also tried increasing the Δχ² threshold for the model and found no significant change in the test statistics until small-number statistics in the model became an issue.

6.2. Problems with the Distance Estimates?

It is also possible that the distance estimates we use are the cause of the disagreement between the data and the model. As mentioned in Section 2, the authors who compute Bayesian estimates of the distance and mass would caution us not to use them for statistical studies. However, we should also cast a critical eye upon the parallax measurements that are generally thought to be accurate.

The constraints that shape the posterior distribution of the Bayesian distance estimate come primarily from the measurements of the angular Einstein radius and proper motion that are routine for most planetary microlensing events.¹⁰ ${\theta }_{{\rm{E}}}$ and ${\mu }_{\mathrm{rel}}$ are derived from measurements of the crossing time of the source radius t_*, ${t}_{{\rm{E}}}$ , and the source color. t_* can essentially be "read off" from the duration of finite-source effects in the light curve. Though there is some sensitivity to the choice of limb darkening parameters, it should be determined very robustly given sufficient coverage over the finite-source features. ${t}_{{\rm{E}}}$ and the source color can be smoothly degenerate with blending, which means that uncertainties should account for this degeneracy; however, discrete degeneracies between different models in the q–s plane can cause catastrophic errors in ${\mu }_{\mathrm{rel}}$ and ${\theta }_{{\rm{E}}}$ . Bayesian distance estimates can incorporate the multimodal posteriors that such a discrete degeneracy would produce, but our analysis cannot. Some events in the observed sample are affected by such a discrete degeneracy (e.g., MOA-2007-BLG-192, Bennett et al. 2008), but they are in the minority.

According to our model, the distribution of distances as a function of ${\theta }_{{\rm{E}}}$ is monotonic, and roughly linear in $\mathrm{log}({\theta }_{{\rm{E}}}/\mathrm{mas})$ with a slope of ∼10 kpc dex⁻¹ and an rms scatter of ∼1.3 kpc in each 0.1 dex bin (see also Figure 7 below); the dominant contributor to this scatter is the mass distribution of the hosts. Other Galactic models used as priors in Bayesian distance estimates will likely have similar properties. Often ${\theta }_{{\rm{E}}}$ measurements are limited by a systematic error in the conversion of the source color to an angular source size (e.g., Yoo et al. 2004) with a magnitude of ∼0.05–0.1 in $\mathrm{ln}({\theta }_{{\rm{E}}}/\mathrm{mas})$ . This implies that, with only a ${\theta }_{{\rm{E}}}$ measurement, it should be possible to localize the distance of most lenses to within ∼2 kpc or less. Therefore, we conclude that while the Bayesian estimates of mass are likely to be unreliable, and are likely to conform to our prior on the mass, the Bayesian distance estimates are likely to be much more useful.

**Figure 7.** Top row and bottom right panel: projections of the ${\theta }_{{\rm{E}}}$ – ${\mu }_{\mathrm{rel}}$ – ${D}_{{\rm{l}}}$ parameter space for a realization of our model (small red points for bulge hosts and blue for disk hosts) and our data sample (large black points for parallax and finite-source distance measurements and gray points for Bayesian estimates). Events in the sample with distance estimates ${D}_{{\rm{l}}}\lt 2\,\mathrm{kpc}$ are individually labeled. In the ${\theta }_{{\rm{E}}}$ – ${D}_{{\rm{l}}}$ panel; small black points show what ${\theta }_{{\rm{E}}}$ would be if all hosts had the same mass ( $1\,{M}_{\odot };$ hosts in the model have masses between 0.08 and $1\,{M}_{\odot }$ ). Bottom left: cumulative distribution of relative proper motions for detected planet hosts in our model for all hosts (gray), hosts between 1 and 2 kpc (red), and hosts closer than 1 kpc (black).
Download figure:
Standard image High-resolution image

The precision of the Bayesian distance estimates is certainly enough to conduct the present study. Even if one were suspicious of the exact form of the distance distribution that Bayesian estimates use as priors, with such uncertainties a more crude test that asks whether the number of hosts with distances less than 5 kpc (beyond which bulge hosts should begin to contribute) is consistent with the model assuming binomial statistics, should be robust. When this test is performed, the model predicts $9.0\pm 2.5$ hosts closer than 5 kpc, and the model weighted by the trends in mass and metallicity abundance of Johnson et al. (2010) predicts $9.4\pm 2.6$ hosts. There are $17$ hosts closer than 5 kpc in our sample, implying that our fiducial model and the model weighted by the trends in mass and metallicity are inconsistent with the data at 3.2σ and 3.0σ respectively.

The final ingredient to examine is the parallax distance measurements. As can be seen in Figure 1, all of the $12$ hosts in events with parallax measurements are within 5 kpc, which is not surprising given that there is a strong bias toward nearby lenses as discussed in Section 2. Of the $12$ hosts, four are between 4 and 5 kpc, five are closer than 2 kpc, and three are closer than 1 kpc. By itself, this may not be too surprising given the observational biases. However, if in contrast to the rest of the paper we assume that the fiducial model is accurate, the number of hosts at extremely small distances (<2 kpc) is extremely surprising. For the number of hosts in our sample, our model would predict $0.26$ hosts in the 0–1 kpc bin and $1.24$ in the 1–2 kpc bin, compared to the three hosts in each (including the Bayesian estimated hosts). In other words, if all the hosts in the first two bins are in the correct bins, the model would predict that the sample would contain $61.9$ planet hosts, twice the number that are actually in the sample! Moving four or five out of the six events with ${D}_{{\rm{l}}}\lt 2\,\mathrm{kpc}$ to the 6–8 kpc range (where there is the largest deficit of hosts relative to the model) would remove all tension with the model.

So, could the parallax distance estimates be wrong? As described in Section 2, these require both parallax and finite-source measurements, but as discussed earlier in this section, the finite-source measurements, which yield ${\theta }_{{\rm{E}}}$ , should be fairly robust. Until recently, the only way to measure the microlensing parallax ${\pi }_{{\rm{E}}}$ was via orbital parallax.¹¹ Orbital parallax is an often subtle effect that arises from the departure of the apparent trajectory of the source from linear motion caused by the Earth's acceleration in its orbit. ${\pi }_{{\rm{E}}}$ is actually a vector quantity ${{\boldsymbol{\pi }}}_{{\rm{E}}}$ , with magnitude ${\pi }_{{\rm{E}}}={\pi }_{\mathrm{rel}}/{\theta }_{{\rm{E}}}$ and the same direction as the vector of relative proper motion. Often only the so-called parallel component of the vector (in the direction parallel to the direction of the Earth's acceleration, which causes an asymmetric distortion to the light curve) is well measured, with the perpendicular component (which causes a symmetric distortion) being degenerate with the impact parameter and blending. In addition, orbital parallax is also degenerate with the orbital motions of the lens (Batista et al. 2011; Skowron et al. 2011) and the source (xallarap, e.g., Poindexter et al. 2005), which can cause similar apparent changes to the source trajectory. The magnitude of the parallax distortions and their duration (covering the whole magnified portion of the light curve) can also make them vulnerable to corruption by long-term systematic trends in photometry.

The events with ${D}_{{\rm{l}}}\lt 2\,\mathrm{kpc}$ are OGLE-2006-BLG-109, MOA-2007-BLG-192, MOA-2010-BLG-328, OGLE-2012-BLG-0563, OGLE-2013-BLG-0341, and OGLE-2013-BLG-0723; we will use abbreviated names (see the notes for Table 1) for events in our sample from here on. We have highlighted them on ${\theta }_{{\rm{E}}}$ – ${\mu }_{\mathrm{rel}}$ , ${\theta }_{{\rm{E}}}$ – ${D}_{{\rm{l}}}$ , and ${\mu }_{\mathrm{rel}}$ – ${D}_{{\rm{l}}}$ diagrams in Figure 7, which visually compares our host sample to our model. Model points were drawn with replacement from the model, and displaced randomly from their actual values (in order to better visualize point densities) by $\mathrm{log}({\theta }_{{\rm{E}}}/\mathrm{mas})\pm 0.1$ , $\mathrm{log}({\mu }_{\mathrm{rel}}/\mathrm{mas}\,{\mathrm{yr}}^{-1})\pm 0.1$ , and ${D}_{{\rm{l}}}\pm 0.25\,\mathrm{kpc}$ , where the ± sign indicates the range of a uniform distribution.

Recalling that ${\theta }_{{\rm{E}}}$ and ${\mu }_{\mathrm{rel}}$ should be relatively robustly measured, in the ${\mu }_{\mathrm{rel}}$ – ${\theta }_{{\rm{E}}}$ panel of Figure 7 the nearby hosts do not look to be especially unusual; they cluster toward larger ${\theta }_{{\rm{E}}}$ , but are generally within the cloud of model points, with O06-109 (labeled a) being the most extreme. However, in the ${D}_{{\rm{l}}}$ – ${\theta }_{{\rm{E}}}$ and ${D}_{{\rm{l}}}$ – ${\mu }_{\mathrm{rel}}$ plots, at least three of the nearby hosts (b, c, and f, or M07-192, M10-328, and O13-0723, respectively) stand out from the cloud of model points; these three hosts are the ones within 1 kpc. O06-109 lies at the edge of the model cloud in the ${D}_{{\rm{l}}}$ – ${\mu }_{\mathrm{rel}}$ plot and O13-0341 lies at the edge in the ${D}_{{\rm{l}}}$ – ${\theta }_{{\rm{E}}}$ plot. O12-0563, being a Bayesian distance estimate, unsurprisingly lies within the cloud of points.

The majority of the scatter in ${\theta }_{{\rm{E}}}$ in the model is caused by the range of lens masses. This can be seen by comparing the ${\theta }_{{\rm{E}}}$ scatter in the ${D}_{{\rm{l}}}$ – ${\theta }_{{\rm{E}}}$ plot between the fiducial red and blue points and the black points, which are the same realization with ${\theta }_{{\rm{E}}}$ plotted as if every star had the same mass of $1\,{M}_{\odot }$ . From this it can easily be seen that the small distance estimates imply extremely small estimates of host mass. The host masses in our model range from 0.08 to $1\,{M}_{\odot }$ . The authors of papers on M07-192, M10-328, and O13-0723 estimate host masses of 0.084, 0.11, and $0.031\,{M}_{\odot }$ , respectively (Kubas et al. 2012; Furusawa et al. 2013; Udalski et al. 2015a). These events have relatively normal angular Einstein radii and relative proper motions, so the low mass and distance estimates are being driven entirely by the parallax measurements. Given that at least some of the events closer than 2 kpc occupy a region of parameter space that our model predicts should be empty, it is worth looking at each one individually in more detail to see if we can plausibly move them to a larger distance and bring them in line with our model's expectations.

O06-109 (a) was an extremely complex event with five distinct features from two planets (Gaudi et al. 2008). Parallax and finite-source effects combined to give mass and distance estimates of $0.51\,{M}_{\odot }$ and 1.51 kpc. The event also had observable terrestrial parallax (with ${\rm{\Delta }}{\chi }^{2}\sim 12$ , Bennett et al. 2010), and H = 17.2 mag of blended light coincident with the source in AO imaging matches essentially exactly that which is expected from the parallax solution. Given that terrestrial parallax cannot be mimicked by xallarap or orbital motion and the bright and unmistakable blended light, we conclude that there is very little chance that the O06-109 distance estimate is significantly in error.

M07-192 (b) was a high-magnification event with poor coverage over its peak, and only four data points cover the planetary deviation. As such, there were many models (16) that could fit the data well, but these all had similar planetary mass ratios, with the only significant difference being two classes of models with ${\theta }_{{\rm{E}}}$ differing by a factor of ∼2 (Bennett et al. 2008); even if the solution with the smaller ${\theta }_{{\rm{E}}}$ were chosen, the distance to the host would be less than 2 kpc, but the relative proper motion would be reduced to 2–3 mas yr⁻¹. Bennett et al. (2008) found xallarap models that fit the data, but argued that these had a low probability relative to the parallax model. From AO imaging, Kubas et al. (2012) find that there is a K_s = 19.2 star blended with the source, but do not measure its color very precisely. Kubas et al. (2012) do not question the parallax measurement of Bennett et al. (2008) and so interpret the blended light as being due to a 0.08 ${M}_{\odot }$ star nearby, but the K_s magnitude they measure is compatible with a 0.7 ${M}_{\odot }$ star at ∼7 kpc, and would only cause some tension with their 0.37 mag $J-{K}_{{\rm{s}}}$ error bar. Bennett et al. (2008) estimate that the a priori probability of the parallax signal being caused by xallarap is 0.32%, accounting for the distribution of binary periods, but not accounting for the increased stellar density in the bulge relative to the nearby disk. They guess that the increased density in the bulge would increase this probability by a factor of about 7, but do not rigorously compute this number. Given the difficulty in computing such a probability, it might be reasonable to assume that it might actually be significantly larger.

M10-328 (c) was a low-magnification event with a ∼5 day long planetary deviation that was poorly covered due to bad weather, except for the exit of the caustic over a cusp (Furusawa et al. 2013). The authors find a xallarap model that fits better than the parallax model by Δχ² ∼ 5, but argue that the parallax interpretation is most likely because an unconstrained xallarap fit with a period fixed to 1 yr yields an orbital orientation that is coincident with the Earth's relative to the event's line of sight. We agree that this would seem to rule out xallarap as an impostor for parallax. Additionally, the large angular Einstein radius of the event would imply that, if the host were in the bulge, it would be $\sim 1\,{M}_{\odot }$ , and therefore bright unless it were a stellar remnant. It therefore seems very unlikely that we could plausibly place this host in the bulge. The only remaining hope of doing so is that the poor coverage of the planetary anomaly and the combination of a cusp crossing with orbital motion on the single exit of the caustic result in a larger than reported uncertainty on the angular Einstein radius or a misestimation of it.

O12-0563 (d) was a high-magnification event that was densely covered over the peak (Fukui et al. 2015). Surprisingly, despite the source not crossing the central caustic, the authors claim that they are able to measure ${\theta }_{{\rm{E}}}$ to be 1.36 mas to a precision of ∼10%. While formally the authors measure parallax, they argue that the detection is likely due to systematics, and so we do not count it among the events with parallax detections. AO imaging shows blended light of K_s = 17.7. The authors do not consider a xallarap model. The large ${\theta }_{{\rm{E}}}=1.36$ mas of the event would by itself exclude the possibility of a bulge lens, unless it were a neutron star or black hole. Even if ${\theta }_{{\rm{E}}}$ were overestimated, it would need to be severely overestimated to allow us to move the lens into the bulge.

O13-0341 (e) is a triple-lens event with a very small planetary signal (the dip) that by itself might be overlooked (Gould et al. 2014). However, the well-covered crossing of a large, binary-star central-caustic is also better fit if a planet with the same properties as would cause the dip is included in the model. A low-amplitude bump during an earlier observing season identifies the binary as a wide binary, and so the planet orbits one of the stars. The parallax signal in the light curve is strong. The authors do not mention whether they considered a xallarap model, but the large proper motion and large angular Einstein radius place the event in a region of parameter space that is almost entirely occupied by disk stars.

O13-0723 (f) is a second triple-lens planetary event discovered in 2013. It also seems to have a strong parallax signal (Udalski et al. 2015a). Again, the planet orbits only one of the binary stars, but in this case the argument for this scenario over a circumbinary planet is based upon an argument that the parameters of the circumbinary model require fine tuning. The authors do not mention whether they considered a xallarap model, but unlike O13-0341, the location of the event in the ${\mu }_{\mathrm{rel}}$ – ${\theta }_{{\rm{E}}}$ plane is occupied by both bulge and disk stars and such a model should be considered.¹²

While both O13-0341 and O13-0723 seem to have strong parallax signals, it is unsettling, to say the least, that the only two planets discovered in binary-star microlensing events with caustic crossings also have extremely small estimates of host distance (and mass).¹³ There have been other binary microlensing events with small estimates of mass and distance, e.g., OGLE-2009-BLG-151/MOA-2009-BLG-232 (0.39 kpc, $0.025\,{M}_{\odot }$ total mass), OGLE-2011-BLG-0420 (1.99 kpc, $0.034\,{M}_{\odot }$ , Choi et al. 2013), OGLE-2012-BLG-0358 (1.73 kpc, $0.024\,{M}_{\odot }$ , Han et al. 2013a), OGLE-2013-BLG-0578 (1.16 kpc, $0.16\,{M}_{\odot }$ , Park et al. 2015), and MOA-2011-BLG-149 (1.07 kpc, $0.16\,{M}_{\odot }$ , Shin et al. 2012). For these binaries it is easy to dismiss the unusually small masses and distances as a publication bias: were they further away, parallax would not be measurable, which would mean their low mass would go unnoticed, and there would be nothing interesting to write a paper about. However, this bias is less prominent for planetary microlensing events because all planetary discoveries are publishable, though there could be a delay in publishing "less interesting" ones without parallax measurements, which would bias planets in binary detections to smaller distances. It is perhaps more likely that planets in binaries would suffer a bias that works in the opposite direction, due to the possibility that parallax effects, in addition to the triple-lens nature, would make the events more difficult to model and hence delay publication.

One might think that there is another possible bias that could result in us detecting planets in binary systems only when they are nearby. That is, if the binary star prevents the formation or survival of planets on wider orbits, then it will only be possible to detect the surviving, closer-in planets in events with small physical Einstein ring radii ${r}_{{\rm{E}}}$ . However, ${r}_{{\rm{E}}}\propto \sqrt{{D}_{{\rm{l}}}({D}_{{\rm{s}}}-{D}_{{\rm{l}}})}$ , and so there are lenses with small physical Einstein ring radii near their sources in the bulge as well as near us, the observer; there should be many more planets discovered in binary systems in the bulge than in the nearby disk. We are therefore forced to ask the question: is there something going wrong with the modeling of parallax in triple-lens microlensing events (and maybe in binary-star events too)? Given the complexity of triple-lens models it seems plausible that something could be going wrong, but it is difficult to guess at what this might be, especially in the case of O13-0341 where the parallax, angular Einstein radius, and relative proper motion seem to tell a consistent story.

It is possible to test the predictions of the parallax models to some degree. They predict the direction of the relative source–lens proper motion, and after ∼10 years the lens and source may separate enough for this direction to be measured. An expedited method to test the validity of the parallax models would be to model only the ground-based data for upcoming binary microlensing events that are observed by Spitzer (e.g., Udalski et al. 2015c; Zhu et al. 2015b) or K2 Campaign 9 (Howell et al. 2014; Henderson et al. 2015), and then test the prediction of orbital parallax with the much more robust Spitzer or K2 satellite measurements of parallax. Udalski et al. (2015c) performed this test for a planetary event and found that the orbital and Spitzer-measured satellite parallaxes agreed, but the binary event modeled by Zhu et al. (2015b) did not have a detectable signature of orbital parallax, and alas Spitzer did not observe the past events whose parallax solutions we are worried about.

While we can offer no convincing evidence to do so, potentially two of the six hosts with ${D}_{{\rm{l}}}\lt 2\,\mathrm{kpc}$ (M07-192 and O13-0723) could plausibly be moved to larger distances if xallarap is mimicking parallax in either event or if there were errors in the parallax modeling of O13-0723. It is worth noting that Poindexter et al. (2005) found that five out of 22 single-lens events with parallax-like signals that they modeled were more likely to have been caused by xallarap. We might therefore expect that ∼20% of our $12$ parallax events (i.e., two or three) would actually be caused by xallarap. In fact, O07-368 should also be added to our parallax-like sample, because it has a reasonable parallax solution, but Sumi et al. (2010) find that it is better fit and more likely caused by xallarap. If the parallax solution of O07-368 had been believed blindly without considering the possibility of xallarap, the event would also have been in our subsample with suspiciously small distance estimates of less than 2 kpc. If we remove M07-192 and O13-0723 from our sample, AD tests like those we performed in Section 5.1 have p-values greater than 0.01 for ${f}_{\mathrm{bulge}}\lt 0.96$ , and so would go most of the way toward explaining the discrepancy between the model and the observed data.

6.3. Combining Corrections

We have found that various factors can help to bring the model distribution toward consistency with the null hypothesis, namely that the planet abundance in the bulge is no different from that in the disk, but do not go all the way. We now investigate whether combining all the factors can make the model consistent. We assume that planet abundance is a function of both host mass and metallicity with the same dependence as found by radial velocity surveys (Johnson et al. 2010, $\propto M{10}^{1.2[\mathrm{Fe}/{\rm{H}}]}$ ). We also assume that the simulations of H14 do not fully capture the increased sensitivity to planets in longer-timescale events for planets detected by follow-up observations, and so weight the simulated planet detections by ${t}_{{\rm{E}}}^{1}$ . Finally, we remove the event O13-0723 from our sample of distance estimates, due to a non-planetary model being found that better explains the data on the event (Han et al. 2016), which was reported after this paper was submitted.

Figure 8 shows the AD statistic as a function of ${f}_{\mathrm{bulge}}$ for the model with all of the combined corrections, for the distributions of both distance and relative proper motion. For ${f}_{\mathrm{bulge}}=1$ , the model is much more consistent with both the ${D}_{{\rm{l}}}$ and ${\mu }_{\mathrm{rel}}$ distributions, with p > 0.05 in both cases. The full range of ${f}_{\mathrm{bulge}}$ allowed with a threshold of p > 0.01 for both ${D}_{{\rm{l}}}$ and ${\mu }_{\mathrm{rel}}$ distributions is ${f}_{\mathrm{bulge}}\lesssim 1.7$ . This might suggest that the puzzle has been solved. However, there is still some tension with the ${f}_{\mathrm{bulge}}=1$ scenario, with significantly smaller values of ${f}_{\mathrm{bulge}}$ providing better fits to the ${D}_{{\rm{l}}}$ and ${\mu }_{\mathrm{rel}}$ distributions. We expect that further events with erroneous parallax measurements implying very small distance estimates are likely to be the cause of this remaining tension.

7. CONCLUSIONS

Using the Anderson–Darling test, we have found that the distribution of distance estimates for the hosts of planets discovered in microlensing surveys is inconsistent with the distribution expected from a simulation of a planetary microlensing survey that incorporates a reasonable Galactic model. The model can be brought into consistency with the observed distribution if the relative abundance of planets in the Galactic bulge is reduced relative to that in the disk by a factor ${f}_{\mathrm{bulge}}\leqslant 0.54$ at 99% confidence (model-dependent).

Does this mean that the Galactic bulge is devoid of planets? We cannot completely rule out this conclusion, but there are a number of lines of reasoning that suggest it is unlikely. First, the distribution of relative proper motions of the hosts would be in tension with a model that was completely devoid of bulge planets. Second, incorporation of a plausible mass and metallicity dependence in the planet abundance increases the upper limit on ${f}_{\mathrm{bulge}}$ to $0.7$ . Third, the inclusion of an additional dependence of planet detection efficiency on some microlensing parameters that is not captured by our model can increase the consistency of the model with the data, especially the event timescale. Finally, some of the inconsistency of the model with the data might be removed if a relatively small set of the distance estimates are wrong. Specifically, we suspect that maybe one third of events with parallax distance estimates smaller than 2 kpc might actually have other signals such as xallarap mimicking parallax signals and thus causing completely erroneous distance estimates. A recent paper by Han et al. (2016) has confirmed our suspicions for one event.

Our study has shown that in principle it is possible to infer relative properties of the Galactic distribution of planets (such as the relative abundance of planets in the disk versus the bulge) without the need to compute the absolute abundances of planets in each population as advocated by Calchi Novati et al. (2015), Yee et al. (2015), and Zhu et al. (2015a). Studies similar to ours will be complementary to the space-based parallax programs that are now underway with Spitzer and are forthcoming with K2 (Howell et al. 2014; Henderson et al. 2015). The space-based parallax campaigns will enable precise parallax distance measurements for almost all planetary microlensing events they observe, but these will be limited to relatively small samples of planets. We have pioneered an approach that, while model-dependent, can be used with the large samples of microlensing planets that will be found by second-generation microlensing planet searches such as OGLE-IV (Udalski et al. 2015b), MOA-II (Sako et al. 2007), and KMTNet (Kim et al. 2010; H14); these samples will have selection effects that are much easier to model than those of our current sample. Such samples are beginning to be assembled (Shvartzvald et al. 2016; D. Suzuki et al. 2016, in preparation). Finally, the huge samples of planets that will be found by Euclid (Penny et al. 2013) and WFIRST (Spergel et al. 2015) should make it possible to probe the distribution of various planet populations (e.g., Jupiters compared to Earths) as a function of their host's location within the Galaxy.

We thank Radek Poleski, Wei Zhu, Andy Gould, Dave Bennett, and the referee for their suggestions. Work by M.T.P. was performed in part under contract with the California Institute of Technology (Caltech)/Jet Propulsion Laboratory (JPL) funded by NASA through the Sagan Fellowship Program executed by the NASA Exoplanet Science Institute. Work by C.B.H. was supported by an appointment to the NASA Postdoctoral Program at the Jet Propulsion Laboratory, administered by Oak Ridge Associated Universities through a contract with NASA.

IS THE GALACTIC BULGE DEVOID OF PLANETS?

Article metrics

Permissions

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

ABSTRACT

1. INTRODUCTION

2. MEASURING DISTANCES

3. DATA

4. MODEL

5. THE DISTANCE DISTRIBUTION OF PLANET HOSTS

5.1. A Paucity of Bulge Planets?

5.2. Relative Proper Motions

5.3. Other Physical Explanations?

6. DISCUSSION

6.1. Problems with the Model?

6.2. Problems with the Distance Estimates?

6.3. Combining Corrections

7. CONCLUSIONS

Footnotes

IS THE GALACTIC BULGE DEVOID OF PLANETS?

Article metrics

Permissions

Share this article

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

ABSTRACT

1. INTRODUCTION

2. MEASURING DISTANCES

3. DATA

4. MODEL

5. THE DISTANCE DISTRIBUTION OF PLANET HOSTS

5.1. A Paucity of Bulge Planets?

5.2. Relative Proper Motions

5.3. Other Physical Explanations?

6. DISCUSSION

6.1. Problems with the Model?

6.2. Problems with the Distance Estimates?

6.3. Combining Corrections

7. CONCLUSIONS

Footnotes