This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy. Close this notification
Skip to content
Invited Review The following article is Open access

Solar System Physics for Exoplanet Research

, , , , , , , , , , , , , , , and

Published 2020 September 3 © 2020. The Astronomical Society of the Pacific.
, , Citation J. Horner et al 2020 PASP 132 102001 DOI 10.1088/1538-3873/ab8eb9

1538-3873/132/1016/102001

Abstract

Over the past three decades, we have witnessed one of the great revolutions in our understanding of the cosmos—the dawn of the Exoplanet Era. Where once we knew of just one planetary system (the solar system), we now know of thousands, with new systems being announced on a weekly basis. Of the thousands of planetary systems we have found to date, however, there is only one that we can study up-close and personal—the solar system. In this review, we describe our current understanding of the solar system for the exoplanetary science community—with a focus on the processes thought to have shaped the system we see today. In section one, we introduce the solar system as a single well studied example of the many planetary systems now observed. In section two, we describe the solar system's small body populations as we know them today—from the two hundred and five known planetary satellites to the various populations of small bodies that serve as a reminder of the system's formation and early evolution. In section three, we consider our current knowledge of the solar system's planets, as physical bodies. In section four we discuss the research that has been carried out into the solar system's formation and evolution, with a focus on the information gleaned as a result of detailed studies of the system's small body populations. In section five, we discuss our current knowledge of planetary systems beyond our own—both in terms of the planets they host, and in terms of the debris that we observe orbiting their host stars. As we learn ever more about the diversity and ubiquity of other planetary systems, our solar system will remain the key touchstone that facilitates our understanding and modeling of those newly found systems, and we finish section five with a discussion of the future surveys that will further expand that knowledge.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Prior to the discovery of the first planets around other stars (Gamma Cephei Ab and HD 114762b, initially thought to be brown dwarfs (Campbell et al. 1988; Latham et al. 1989); the planets orbiting pulsars PSR 1257+12 (Wolszczan & Frail 1992) and PSR B1620-26 (Thorsett et al. 1993); and 51 Pegasi, the first planet around a Sun-like star (Mayor & Queloz 1995)), our ideas on how planetary systems formed and evolved were based purely on observations of the solar system. A wide range of scenarios had been proposed, of varying complexity, all of which attempted to explain the minutiae of our solar system.

For many years, it was believed that the solar system was formed as a result of an encounter between the Sun and a passing star, which was thought to have pulled a tongue of material either from our star, or that which it encountered, that subsequently fragmented to form the planets (e.g., Lyttleton 1936a; Woolfson 1964). Such a scenario would suggest that the formation of planetary systems should be an incredibly rare and violent process, since the likelihood of passing stars experiencing such close encounters (nearly colliding with one another) is almost vanishingly small.

The theory which held sway at the start of the epoch of exoplanetary discovery argued instead that the planets had formed from a primordial circumstellar disk around the Sun, and that their formation was essentially a gentle, leisurely process (e.g., Edgeworth 1949; Swedenborg 1734; Safronov 1969; Lissauer 1993; Laplace 1796).14 The expectation was that, if we were we to discover planets around other stars, those planetary systems would strongly resemble our own. The fact that those first detected systems (and, indeed, the vast majority of those discovered since) were dramatically different to our own led to a thorough re-examination of how planetary systems could form.

Since the discovery of the first planets orbiting other stars, the fields of exoplanetary and solar system science have remained largely separate—one dealing with a small amount of information about a large number of systems, the other dealing with our almost overwhelmingly detailed knowledge of our own solar system. In an attempt to help remedy the disconnect between the two fields, in this work, we present a review of our current knowledge and understanding of the solar system.

In the sections that follow, we detail the various populations of objects known in the solar system and describe the theories that attempt to explain how they arrived in their current locations. We include some of the more speculative ideas within solar system astronomy, in order to ensure that this review is as clear and comprehensive as possible. Once all these concepts have been discussed, we move on to a discussion of our burgeoning knowledge of exoplanetary systems, with a particular focus on highlighting the similarities and differences between those systems and our own. Since this is by nature a lengthy review, spanning a wide variety of topics, we here present a short bulleted summary of the various topics described in this work, together with pointers to the relevant section, and indications of any exoplanetary systems to which those ideas may apply (where appropriate).

  • 1.  
    In Section 2, we describe the solar system in its entirety, as known today.
  • 2.  
    In Section 3, we introduce the solar system's planets, describing them as physical objects—from their atmospheres to our current understanding of their interiors and compositions.
  • 3.  
    In Section 4, we give a brief overview of the key processes thought to have occurred in the solar system's formation. That section comprises the following subsections:
    • (a)  
      In Section 4.1, we describe the pivotal role played by giant impacts in the formation of the solar system we observe today.
    • (b)  
      In Section 4.2, we discuss the impact rate within our solar system in more general terms.
    • (c)  
      In Section 4.3, we consider the migration, both inward and outward, of the giant planets of the solar system.
    • (d)  
      In Section 4.4, we discuss the ongoing debate over the origin of water on the Earth.
    • (e)  
      In Section 4.5, we discuss theories that invoke the presence of planet-mass objects moving sufficiently far from the Sun that they have, to date, escaped detection.
    • (f)  
      In Section 4.6, we consider the formation and evolution of the Oort cloud, which might provide evidence on the type of cluster environment in which the Sun formed.
    • (g)  
      In Section 4.7, we detail the formation and distribution of dust in the solar system.
    • (h)  
      In Section 4.8, we discuss the recent discoveries of rings orbiting some of the solar system's small bodies
    • (i)  
      In Section 4.9, we describe the various non-gravitational effects that can significantly affect the orbits of both dust and kilometer-sized bodies, before moving on to detail the various mechanisms by which material is lost from the solar system's small body reservoirs.
  • 4.  
    In Section 5, we turn our attention to exoplanetary science, highlighting how our knowledge of the solar system can (and does) inform our understanding of other planetary systems. That section comprises the following subsections:
    • (a)  
      In Section 5.1, we describe the known demographics of exoplanets.
    • (b)  
      In Section 5.2, we discuss how our solar system would look as an exoplanetary system.
    • (c)  
      In Section 5.3, we discuss debris and dust in exoplanetary systems.
    • (d)  
      In Section 5.4, we discuss the comparative analysis of planetary systems.
    • (e)  
      In Section 5.5, we discuss super-Earths and sub-Neptune—two classes of planet that are not found within the solar system.
    • (f)  
      In Section 5.6, we discuss the atmospheres of exoplanets, and their relation to those in the solar system.
    • (g)  
      In Section 5.7, we describe research into the dynamics of exoplanetary systems.
    • (h)  
      In Section 5.8, we describe current and future exoplanet surveys and missions.
  • 5.  
    Finally, in Section 6, we draw together our conclusions.

2. Our Solar System—A Plethora of Small Bodies

The solar system contains a wide variety of objects, many of which have unusual or unexpected features that hold information on their formation, and the formation of the solar system itself (de Pater & Lissauer 2015). Aside from the Sun, the most massive objects in the solar system are the planets15—the gas giants, Jupiter and Saturn; the ice giants, Uranus and Neptune; and the terrestrial planets,16 Mercury, Venus, Earth and Mars (see Table 1). Six of these eight planets host one or more natural satellites—with over two hundred distributed between them (the great majority of which orbit either Jupiter or Saturn). These satellites in turn range in size from the tiny (kilometer-sized, or smaller) to those that are almost planet-like (the Moon, the Galilean satellites of Jupiter, Saturn's Titan, all of which are comparable in diameter to Mercury, albeit significantly less massive). The satellites of the outer planets are often broken down into two distinct classes (e.g., Kuiper 1951a, though he counted our Moon as a third class by itself).

Table 1. The Key Physical Parameters of the Planets, and the Number of Satellites They Possess

Namea (au)ei (°)M/MEarthR/REarthNSata
Mercury0.3870.2067.000.05530.3830
Venus0.7230.006753.390.8150.9500
Earth1.000.01640.002501.001.001
Mars1.520.09351.850.1070.5322
Jupiter5.200.004851.3031811.079 (71)
Saturn9.580.005092.4995.29.1482 (58)
Uranus19.20.004610.76914.53.9827 (9)
Neptune30.20.01061.7817.13.8614 (8)

Notes.  The masses (M) and mean radii (R) were taken from the JPL's solar system dynamics pageb on 2020 May 20th. The osculating orbital elements (semimajor axis, a, eccentricity, e, and inclination, i) were obtained using the JPL HORIZONS web-interfacec, and are valid on 2020 May 20th. For ease of reading, all values are given to three significant figures—the actual values are known to a far greater level of precision. The number given in parentheses in the NSat column details how many of the planet's satellites are classified as irregular.

aThe number of satellites for each planet was taken from the JPL Planetary Satellite Discovery Circumstances page (at http://ssd.jpl.nasa.gov/?sat_discovery), which gives the details of each known satellite's discovery circumstances. The site was accessed on 2019 July 26th, at which time it had last been updated on 2020 May 20th. bhttp://ssd.jpl.nasa.gov/?planet_phys_par, accessed on 2020 May 20th. chttp://ssd.jpl.nasa.gov/horizons.cgi#top, accessed on 2020 May 20th.

Download table as:  ASCIITypeset image

The regular satellites orbit close to their host planet, moving on prograde orbits that have both low inclinations (with respect to the planet's equatorial plane) and low eccentricities (e.g., Mosqueira & Estrada 2003a, 2003b; Sasaki et al. 2010). The irregular satellites, by contrast, are scattered much more widely around their host planets, and move on orbits that can be both prograde and retrograde (e.g., Nesvorný et al. 2003; Jewitt & Haghighipour 2007; Holt et al. 2018). These objects often move on highly inclined and eccentric orbits, and are typically (with the exception of Neptune's giant moon Triton) relatively small. Indeed, the largest irregular satellite after Triton is Saturn's Phoebe, which is just over 200 km in diameter (Thomas 2010).

In addition to the major planets, five objects are currently classified as dwarf planets17 by the International Astronomical Union—namely Ceres, Pluto, Haumea, Makemake and Eris. These five objects reside in two distinct reservoirs (the Asteroid belt and the trans-Neptunian disk), each of which contains vast numbers of smaller objects. It is likely that, as more observations are made to determine the size and shape of known trans-Neptunian objects, and astronomical surveys delve farther into the outer reaches of our solar system, the number of known dwarf planets will increase significantly (for an up-to-date tally, we direct the interested reader to the Minor Planet Center website18).

The two reservoirs mentioned above, in addition to the Oort cloud (a third reservoir which stretches out to halfway to the nearest star), are dynamically stable on timescales comparable to the age of our solar system, and as such, contain a vast number of objects (with the Asteroid belt, the least populous, hosting an estimated 106 objects of diameter 1 km or greater; e.g., Tedesco & Desert 2002). In addition to these three famous reservoirs, two additional regions of stability are thought to each house at least as many objects as are present in the Asteroid belt19—the Jovian and Neptunian Trojans (e.g., Jewitt et al. 2000; Sheppard & Trujillo 2006). Trojans are objects moving in essentially the same orbit as their host planet, which liberate around the L4 and L5 Lagrange points of that planet's orbit (Figure 1), such that, on average, they remain either 60° ahead, or 60° behind the planet in its orbit.

Figure 1.

Figure 1. The locations of the five Lagrange points in the case of the restricted three-body problem, originally plotted for Horner & Lykawka (2011). Solid lines connect areas of equal gravitational potential. All of the Lagrange points are equilibrium points—and hence are regions in which objects can move on orbits that are more dynamically stable than might otherwise be expected. L1, L2 and L3 are similar to saddle points—although they are local plateaus in the gravitational potential, a small displacement is enough to carry an object away from the region of relative stability, back to a chaotic orbit. By contrast, L4 and L5 are broad regions in which an object can remain dynamically stable on long timescales. The populations of Jovian and Neptunian Trojans, which are thought to number at least a million objects of diameter 1 km or greater, are concentrated within these tadpole-like regions ahead and behind the location of their host planet in its orbit. The figure shows the potentials for a scenario where the primary:secondary mass ratio matches that between the Sun and Jupiter—i.e., where the primary is approximately 1050 times the mass of the secondary.

Standard image High-resolution image

In Figure 2, we plot the instantaneous locations of all known bodies in the inner solar system, at the locations they would have occupied at 00:00:00 UT on 2000 January 1st. The two clouds of Jovian Trojans are clearly seen sharing Jupiter's orbit, marked in purple, as are the Hilda asteroids (in red), trapped in 3:2 mean-motion resonance with Jupiter,20 which trace a triangular pattern with the vertices located at the Jovian L3, L4 and L5 Lagrange points. The most prominent feature, however, is the Asteroid belt, which can be clearly seen, with its members colored blue. The belt stretches from just beyond the orbit of Mars to a loosely defined outer edge well within the orbit of Jupiter, and is thought to contain more than a million objects of diameter greater than 1 km (e.g., Tedesco & Desert 2002). To date, more than 700,000 asteroids have been discovered21—the vast majority of which (545,135)22. have given numerical designations to denote that their orbits are considered to be well constrained on the basis of observations over an extended period of time. The overwhelming majority of those objects lie within the Asteroid belt—as can be seen in Figures 2 and 3.

Figure 2.

Figure 2. The distribution of objects in the inner six au of the solar system, shown in Cartesian coordinates (In the Appendix, we present a series of figures in black and white showing the same information as that contained in the population figures within this section of the paper. Those figures show the individual populations one at a time, rather than all together, and are suitable for printing out in black-and-white, and for those who might otherwise struggle to differentiate between the variety of colors required to show all populations in a single figure.). The right panel shows a "top-down" view of the system, while the left panel shows a "side-on" view. The data plotted were taken from the JPL HORIZONS database (The JPL Horizons database is a tool that can be used to generate ephemerides for all known solar system bodies. The web interface to the database can be found at https://ssd.jpl.nasa.gov/horizons.cgi, though this offers a limited interface to the full data set. As noted on the website, full access is available through a dedicated telnet interface, at horizons.jpl.nasa.gov, port 6775.), and the positions of the objects are shown at epoch 2000 January 1 00:00:00 UT. The various small body populations are color coded as follows: the near-Earth asteroids (NEAs) in green (with the sub-populations distinguished by distinct shares, as follows: Atiras in aquamarine, Atens in chartreuse, Apollos in sea green and Amors in dark green); main-belt asteroids in blue, the Hilda asteroids in red, the Jovian Trojans in purple, the Centaurs in brown, long-period comets in gray, Jupiter family comets in olive and Halley-type comets in cyan. The locations of the five innermost planets (Mercury, Venus, Earth, Mars and Jupiter) are marked in orange, with their orbits shown in white. The large number of objects moving on highly eccentric orbits that extend toward the 11 O'clock position (top) and almost vertically down from the plane of the system (bottom plot) are the sungrazing comets (e.g., Marsden 1967, 1989), thousands of which have been discovered in recent years by the Solar Heliospheric Observatory, SOHO (e.g., Battams & Knight 2017). We note that individual plots for each of the various small body populations are provided in the Appendix, for the interested reader. The "arc" feature of Jupiter family comets at the outer edge of the asteroid belt (top left of the belt) is the result of the fragmentation of comet 73P/Schwassmann–Wachmann 3 in 1995 (e.g., Crovisier et al. 1996). Almost 70 fragments of that comet are now known, and in the 24 yr since the parent comet disintegrated, those fragments have begun to disperse around the parent's orbit (In 2022, the debris from the disintegration of 73P/Schwassmann–Wachmann will come very close to Earth, potentially causing a significant meteor storm from the τ Herculid meteor shower (e.g., Wiegert et al. 2005). A second potential storm could occur in 2049, when the debris once again comes particularly close to the Earth.).

Standard image High-resolution image
Figure 3.

Figure 3. The observed distribution of the small bodies in the inner solar system, as a function of their semimajor axis, eccentricity, and inclination. The various small body populations are plotted in different colors, using the same color scheme as for Figure 2 (near-Earth asteroids in green, with the sub-populations as follows: Atiras in aquamarine; Atens in chartreuse, Apollos in sea green and Amors in dark green. The Main-Belt Asteroids are shown in blue, the Hildas in red, Jovian Trojans in purple, and the Jupiter family comets in olive). The inner planets (Mercury, Venus, Earth, Mars and Jupiter—orange) are also indicated. The important role of mean-motion resonances can be clearly seen in the main belt, in the form of gaps where few objects are found, and beyond, in the form of the Hilda and Jupiter Trojan populations, constrained to narrow bands in semimajor axis. The various individual populations of object plotted here are also plotted, individually, in the Appendix, as a resource for the interested reader (Figures A1A8).

Standard image High-resolution image

When the instantaneous locations of the asteroids are plotted within the solar system (as can be seen in Figure 2), the bulk appear to be constrained within a puffed-up torus of material that initially appears to be remarkably unstructured. However, when one instead plots the orbital elements of the objects in the inner solar system (Figure 3), it becomes obvious that the region beyond the orbit of Mars is actually highly structured. The main Asteroid belt is threaded by narrow regions where few asteroids are found—features known as the Kirkwood gaps, which mark the location of mean-motion resonances between the asteroids and Jupiter. Exterior to the main concentration of objects in the Asteroid belt, the Hildas (in red) and Trojans (in purple) are narrow bands of objects, trapped in mean-motion resonance with Jupiter (the 3:2 and 1:1 resonances, respectively).

Figure 4 shows a more detailed view of the main Asteroid belt. By zooming in in this fashion, the complex structure of the belt is more clearly visible. The tendency for the asteroids to move on orbits that avoid certain semimajor axes (the locations of mean-motion resonance with Jupiter) can clearly be seen. Similarly, certain regions of inclination space appear favored, and some regions contain objects spread over a far wider range of orbital eccentricities than others. This complexity is the result of the interplay between mean-motion resonances, secular resonances, and also the ongoing collisional attrition of the belt, with collisional families of asteroids causing certain regions to be more densely populated than their neighbors (a topic we will revisit in more detail in Section 4.1).

Figure 4.

Figure 4. The observed distribution objects in the Asteroid belt, plotted in semimajor axis-eccentricity space (top) and semimajor axis-inclination space (bottom). The data were taken from the JPL Horizons database, In this plot, The Apollo NEO's are indicated by sea green, and the Amor NEOs by dark green. Main belt asteroids, including Mars crossing asteroids, are indicated in blue. By zooming in on the belt in this manner, a vast amount of fine structure is visible—including the impact of secular and mean-motion resonances throughout the belt, as well as the collisional disruption of asteroids creating asteroid families.

Standard image High-resolution image

The trans-Neptunian region contains a vast number of objects, ranging in size up to the dwarf planets (such as Pluto and Eris), which are shown (along with those objects whose objects cross those of the outer planets) in Figure 5. The stable populations of objects beyond Neptune's orbit are typically broken down into a number of sub-categories. The Edgeworth–Kuiper Belt (named for two of the scientists to discuss its potential existence—see Edgeworth 1943, 1949; Kuiper 1951b), or Classical belt, stretches between the approximate location of the Neptunian 2:3 mean-motion resonance and its 1:2 resonance (located at ∼39.4 and ∼47.8 au, respectively). Objects in the Edgeworth–Kuiper Belt typically move on orbits with inclinations less than 30°, and are themselves often broken down into two sub-populations, which seem to display somewhat different physical characteristics, and are known as the "hot" and "cold" populations (e.g., Levison & Stern 2001; Elliot et al. 2005; Petit et al. 2011; Fraser et al. 2014). The bulk of such objects known to date move on orbits characterized by both low eccentricities and low inclinations (typically of order a few degrees), and are observed to be characteristically red in color. These objects are known as the cold classical component of the disk. The "hot" component contains objects with significantly greater orbital inclinations, and often somewhat higher eccentricities. These objects are often observed to be somewhat bluer than their "cold" counterparts, which might be evidence that they experience more frequent and energetic collisions (a result of their dynamically more excited state), and are hence less "weathered" (Fraser & Brown 2012).

Figure 5.

Figure 5. The distribution of bodies in the solar system. Top: the solar system as seen face-on (left) and seen edge-on (right), showing the distribution of objects in Cartesian coordinates. The positions of the objects are those they occupied on 2000 January 1st. The middle and lower panels show the orbital elements of the objects in semimajor axis–eccentricity (ae) and semimajor axis–inclination (ai) space, respectively. The objects in the inner solar system are colored as in Figure 3. Outer solar system objects are also colored according to their classification. As was the case in the inner solar system, populations of resonant objects stand out in the middle and lower plots as concentrations of objects centered on a specific semimajor axis. Centaurs (brown) are shown between Jupiter and Neptune. The Neptune Trojans (orange–red) can be seen at 30 au, and the Plutinos (deep pink) at 39.5 au, just interior to the objects that make up the classical Edgeworth–Kuiper Belt (orchid, between ∼40 and 48 au). To higher eccentricities, the Scattered Disk objects (maroon) can be seen spreading outward in a curved population in the middle plot—objects whose perihelia fall between ∼30 and 40 au that move on eccentric, chaotic orbits. Two cometary populations are shown, the Jupiter family comets (olive) and the Halley type comets (cyan).The various populations shown here are also plotted, individually, in the Appendix (Figures A9A18).

Standard image High-resolution image

In addition to the Edgeworth–Kuiper Belt, the trans-Neptunian population contains two further types of object that are dynamically stable on timescales comparable to the age of the solar system—the resonant and detached populations. The resonant objects (e.g., Gladman et al. 2012) move on orbits trapped within one of Neptune's MMRs—for example the so-called Plutinos (named for the dwarf planet Pluto, the largest member; e.g., Yu & Tremaine 1999; Chiang & Jordan 2002) are trapped within Neptune's 2:3 MMR. These resonant objects can move on orbits that are significantly more excited than those of the Classical population, with some displaying particularly large inclinations and eccentricities. Indeed, many of the Plutinos move on orbits that cross that of Neptune. The nature of the resonance in which they are trapped ensures that they never encounter the planet, however—whenever they cross its orbit, it is located well ahead, or well behind the crossing point. The detached population move on orbits with semimajor axes greater than the location of Neptune's 1:2 resonance, and have perihelion distances (closest approach to the Sun) sufficiently far from Neptune that it cannot significantly perturb their orbits over the age of the solar system. Typically, detached objects pass perihelion at least 40 au from the Sun (Gladman et al. 2008), although there are a few exceptions whose perihelia lie at slightly smaller radii.

The final population of trans-Neptunian objects is known as the Scattered Disk (Duncan & Levison 1997). Scattered Disk objects typically move on dynamically excited orbits, with perihelia closer to Neptune than the location of its 2:3 resonance. As such, they can experience dynamical perturbations from that planet sufficient to result in significant variations in their orbits over the age of the solar system. Though many of these objects are nominally dynamically unstable on a wide range of timescales, the lifetime of the population is such that the great majority would be expected to remain as Scattered Disk objects for timescales comparable to the lifetime of our solar system. Trans-Neptunian objects that are highly dynamically unstable due to strong interactions with Uranus or Neptune are often described as scattering objects. Gladman et al. (2008) classify such objects as being those that experience such interactions within a 10 Myr window of orbital evolution, while the remaining unstable TNOs are classified as scattered objects.

The Oort cloud (sometimes called the Öpik–Oort cloud, named for the researchers who first proposed its existence; Öpik 1932; Oort 1950), the third large reservoir of small bodies in the solar system, stretches approximately halfway to the nearest star, and is thought to contain up to 1011 cometary nuclei of diameter 1 km or greater. Objects within the cloud itself are much too far from the Sun to be detected directly, but the existence of the cloud is inferred from the ongoing flux of material sourced from that region to the inner solar system—a group of objects known as the Oort Cloud Comets. The Oort cloud is sufficiently far from the Sun that its members are only tenuously bound to our star, and are continually tugged and tweaked by the effects of the galactic tide and the gravitational influence of passing stars. These continual perturbations act to nudge new comets toward the inner solar system, replacing those that are removed from the system by ejection, collision with an object in that region, and fragmentation.

In addition to the dynamically stable reservoirs of debris detailed above, the solar system is also cluttered with innumerable unstable objects—including the Jupiter-family, Halley-type, and long-period comets, the Centaurs, and the near-Earth asteroids (e.g., Wiegert & Tremaine 1999; Bottke et al. 2002; Morbidelli et al. 2002a; Horner et al. 2003; Jewitt 2009). By nature, these populations are highly dynamically unstable, and would typically decay to nothing on timescales far shorter than the age of the solar system. As such, they must be continually replenished in order to support the supposedly steady-state populations we observe today.

The near-Earth asteroids (NEAs) are a population of primarily rocky and metallic objects moving on short-period orbits that either cross, or approach, the orbit of the Earth, and their distribution is shown in Figure 6. The aphelia of the orbits of near-Earth asteroids (the point at which they are furthest from the Sun) either lie within the main Asteroid belt, or significantly closer to the Sun. The population of NEAs are often broken into three sub-populations. The Aten asteroids are those objects whose semimajor axes are less than 1 au, and so have orbital periods less than one year. A small fraction of these objects (∼2%) follow orbits entirely within that of the Earth (e.g., Bottke et al. 2002)—this subset of the Aten population are sometimes referred to as Apohele asteroids or Atiras, to distinguish them from the other Atens. The Apollo asteroids have semimajor axes greater than 1 au, and move on orbits that cross that of the Earth. The Amors move on orbits with perihelia greater than that of the Earth, meaning that they do not, currently, cross our planet's orbit. The orbits of NEAs are unstable, and they have typical dynamical lifetimes orders of magnitude shorter than the age of the solar system. The population is continually replenished by a slow trickle of fresh objects from the main Asteroid belt, typically as a result of the perturbations of distant Jovian MMRs (e.g., Wetherill 1988; Morbidelli et al. 2002a), coupled with the influence of the non-gravitational Yarkovsky effect, which results in the gradual orbital drift of objects in the size range ∼1 m to ∼10 km, a size range typical of the fragments generated by collisions between asteroids. We discuss the Yarkovsky effect in more detail in Section 4.9.2, below.

Figure 6.

Figure 6. The orbital element distribution of the known near-Earth asteroids, in semimajor axis-eccentricity space (top) and semimajor axis-inclination space (bottom). The four sub-groups within the near-Earth asteroid population are shown in different shades of green, to match the color scheme used in Figures 25. The Atira asteroids are shown in aquamarine, the Atens in chartreuse, Apollos in sea green and Amors in dark green. Plots for the individual sub-populations (and all other populations of solar system small body) are given in the Appendix. The impact of observational bias is clearly seen here, particularly in the upper panel. Objects are easier to detect when closer to Earth than farther away—and the smallest (but most numerous) objects can only be discovered during close approaches to our planet. For that reason, the greatest population in ae space is bounded by lines of constant perihelion = 0.9833 au (curving outward to the right) and aphelion = 1.0167 au (moving inwards toward higher eccentricities), which we show in white in the top panel. The wedge bounded by these two lines contains those objects that can reach a heliocentric distance at a distance within the bounds set by Earth's perihelion and aphelion distances and can therefore experience very close encounters with our planet.

Standard image High-resolution image

The Jupiter-family comets are a population of objects that typically move on orbits with aphelia located very close to the orbit of Jupiter. In the same way that the bulk of the near-Earth asteroid population originated in the Asteroid belt, the primary parent population for the Jupiter-family comets are the dynamically unstable Centaurs (which move on orbits which cross those of the outer planets, having perihelia located between the orbits of Jupiter and Neptune). The Centaurs, in turn, are thought to have their origins primarily in the trans-Neptunian population, with the main sources thought to be the Scattered Disk (e.g., Duncan & Levison 1997; Di Sisto & Brunini 2007; Volk & Malhotra 2008), the Edgeworth–Kuiper Belt (e.g., Levison & Duncan 1997; Lowry et al. 2008; Volk & Malhotra 2013), the Plutinos (e.g., Di Sisto et al. 2010) and the Neptune and Jovian Trojans (e.g., Horner & Lykawka 2010a; Horner et al. 2012c; Di Sisto et al. 2019), although it seems likely that there is at least some contribution from objects originating in the inner Oort cloud (Emel'yanenko et al. 2005; Brasser et al. 2012; de la Fuente Marcos & de la Fuente Marcos 2014). de la Fuente Marcos 2014b. Both the Centaurs and Jupiter-family comets are dynamically unstable on timescales much shorter than the age of the solar system, and so must be continually replenished from those reservoirs in order to maintain the populations we see today23 (e.g., Duncan et al. 1988; Duncan & Levison 1997; Levison & Duncan 1997; Horner et al. 2004a, 2004b; Pál et al. 2015; Grazier et al. 2018, 2019).

The Halley-type comets are cometary bodies that move on orbits that take tens or hundreds of years to complete. In comparison to the Jupiter-family and long-period comets, their orbits are remarkably stable, and they can remain on the same planet-crossing orbit for many revolutions around the Sun. There remains significant debate as to their origin (see e.g., Emel'yanenko & Bailey 1998; Wiegert & Tremaine 1999; Levison et al. 2001, 2006; Emel'yanenko et al. 2007; Wang & Brasser 2014; Nesvorný et al. 2017). Some studies have suggested that the amount of time it would take for a comet to evolve onto such a stable orbit from the short-period comet population is so great that they should have faded (through loss of volatiles) or disintegrated long before they can attain such orbits. By contrast, if the Halley-types are primarily captured from the long-period, isotropic comet flux, similar studies have suggested that their number should be far greater than we currently observe—implying that most Halley-type comets must fade below detectability remarkably rapidly, either through devolatilisation or disintegration. Compared to the populations of Jupiter-family comets, Centaurs, and long-period comets, very few Halley-types are known, but they remain an enigma that awaits a solution.

The long-period comets are cometary (icy) bodies moving on orbits that take many hundreds, or even millions of years to complete. Their aphelia lie far beyond the orbit of Neptune, and many are observed on their first passage through the inner solar system. Their origin is the Oort cloud—they are a continual trickle of material being removed from that vast reservoir by the galactic tide, passing stars, and even close passages of giant molecular clouds. Many long-period comets are ejected from the solar system on their first pass through its inner reaches, as a result of distant perturbations by the planets (primarily Jupiter). Those long-period comets that are observed on their first passage through the inner solar system are often called "Oort cloud comets" or "Dynamically New" comets, to distinguish them from those comets that move on shorter, more tightly bound orbits that must be the result of modification through the course of previous perihelion passages.

The Main Belt Comets are an unusual population of small bodies that exhibit both cometary and asteroidal properties. Trapped on dynamically stable orbits in the Asteroid belt, they seem likely to be primordial objects that formed in the belt, and have remained there ever since (e.g., Haghighipour 2009). The first such object to be identified was the asteroid 7968 Elst–Pizarro, which was observed to sport a short tail in 1996 September (e.g., Boehnhardt et al. 1996). Further activity was observed at subsequent perihelion passages (e.g., Hsieh et al. 2010), even though 7968 Elst–Pizzaro's orbit keeps it well within the bounds of the asteroid belt. In recent years, several more Main Belt Comets have been identified, and a variety of different mechanisms have been proposed to explain the observed activity (such as the collision of a small asteroid with the observed Main Belt Comet, liberating dust; the release of dust as a result of the rapid or unstable rotation of the object; the sudden exposure of previously buried volatile material; or even the rotational disruption of a larger parent body; e.g., Prialnik & Rosenberg 2009; Jewitt et al. 2011, 2013; Hirabayashi et al. 2014). Based on the number of Main Belt Comets observed by the Pan-STARRS1 survey telescope, Hsieh et al. (2015) suggest that, at any given epoch, there should be roughly 140 Main Belt Comets among the population of the outer part of the Asteroid belt, which suggests that the next generation of survey telescopes may well reveal significantly more about this fascinating new population of solar system small bodies.

In much the same way that asteroids are given a numerical designation once their orbits are sufficiently well constrained by ongoing observation, periodic comets that have been seen at multiple apparitions are also "numbered." To date, 375 numbers have been given to periodic comets with well determined orbits—split between Encke-type comets, Halley-type, Jupiter-Family Comets, and active Centaurs such as 95P/Chiron.24 However, several of those comets have fragmented, with the fragments carrying the same name and number as their parent body, but with the addition of a letter at the end of their name to distinguish between them—these fragments bring the total number of cometary bodies with designated numbers to 455, as of 2019 February 5th.

The naming convention for numbered comets is to give the number, followed by a letter (typically P, for "Periodic"), followed by a/, then the name (or names) of those who either discovered the comet, were the first to calculate its orbit, or successfully identified that several cometary apparitions over the years were actually the same object. Several of the numbered comets have disintegrated or been lost—for these comets, the letter "P" is typically replaced with "D." The naming convention for non-periodic comets (or those that have only been seen at a single apparition) takes the form C/1995 O1 Hale–Bopp—the C denotes a cometary body, 1995 is the discovery year, O1 indicates that the comet was the first discovered in the 15th/24th of the year, and Hale–Bopp denotes the discoverers (in this case Alan Hale and Thomas Bopp). If an object once thought to be cometary is later determined to be asteroidal, the "C" at the beginning of the designation is replaced with an "A," while if no reliable orbit can be determined for an object, the "C" is replaced with an "X." Finally, the discovery of the first interstellar objects passing through the solar system, in 2017, led to the introduction of the prefix "I" (for interstellar). That object, now known as 1I/'Oumuamua, is unlikely to be the last interstellar interloper discovered traversing the solar system, and was remarkably well studied (e.g., Bannister et al. 2017; Jewitt et al. 2017; Meech et al. 2017; Fitzsimmons et al. 2018), despite only remaining visible for a short period of time due to its high velocity. Indeed, while this paper was under review, a second interstellar interloper, comet 2I/Borisov, was discovered, passing through perihelion beyond the orbit of Mars on 2019 December 8th. Like 1I/'Oumuamua, 2I/Borisov generated a wealth of global interest, and has already been the subject of a number of research studies (e.g., Fitzsimmons et al. 2019; Jewitt & Luu 2019; Opitom et al. 2019; Guzik et al. 2020; Hallatt & Wiegert 2020; McKay et al. 2020).

Table 2 presents a summary of the number of objects currently known across all the small body populations discussed above, along with the approximate boundaries that delineate membership of those populations. Where they are available in the literature, we have also detailed the estimated number of objects in those populations whose diameter exceeds a certain value (1 km, by default, unless otherwise stated). We also distinguish between populations that are held to be dynamically stable on long (Gyr) timescales and those that are unstable. Finally, for the unstable populations, we also name the accepted source population—as we described above.

Table 2. The Populations of Small Bodies Within the Solar System

PopulationInner Edge (au)Outer Edge (au)NknownNest (>1 km)StabilityMain Parent Population
NEA—Atena < aEarth1538aUnstableAsteroid Belt
NEA—Apolloq < QEarth10151aUnstableAsteroid Belt
NEA—Amorq > QEarth8538aUnstableAsteroid Belt
NEA—Totalq ∼ 1.320227a990bUnstableAsteroid Belt
JFCsq ∼ 4, 2 ≤ TJc ≤ 3664d(>) 300 > 2.3 kmeUnstableCentaurs
Encke-Type + Main Belt Cometsq ∼ 4 TJ > 3f58dStable
Halley Typeq ∼ 4 TJ = 2 P ∼ 200 yr105d(>) 100 >2.3 kmgUnstableDebated
Long Period CometsP > 200 yrhq ∼ 42764d UnstableOort Cloud
Centaursq ∼ 4q ∼ 32525i + 71j= 596∼44300kUnstableDebated—TNO
Asteroid Belt∼1.3 a > 2∼4 a < 4755510l∼106mStable
Earth Trojans∼1∼11nUnstableNEA
Mars Trojans∼1.524∼1.5249nUnstableAsteroid Belt
Jovian Trojans∼4.8∼5.68151n>106oStable
Uranus Trojans∼19.2∼19.21n UnstableCentaurs
Neptunian Trojans∼29.5∼30.528n106–107pStable
Scattered Diskq ∼ 32q ∼ 40619i + 509j= 1128>109q∼StableOther TNOs
Detached Objectsq ∼ 403iStable
Resonant TNOsvariousvarious670r>50000 (>100 km)sStable
Classical EK belt∼40∼501339j>109tStable
Oort Cloud∼200,0000>1011uStable

Notes. Boundaries given are typically somewhat flexible, and are based primarily on the work of Horner et al. (2003) and Gladman et al. (2008). Some objects fall into multiple categories at once—for example, if an object currently classified as a near-Earth asteroid were observed to be outgassing, it would also be categorized as a cometary body, most likely in the Jupiter family. Another example of such a multiply categorized object is the Centaur Chiron, which is also counted in many lists of short-period comets, since it displays a coma around perihelion. The distances quoted in the inner and outer edge columns refer to semimajor axes unless stated otherwise. In those columns, and throughout this work, a refers to an object's semimajor axis, in au, P its orbital period, q its perihelion distance, and Q its aphelion distance.

aThe number of objects in these categories was taken from the Minor Planet Center's "Unusual Minor Planets" page (https://www.minorplanetcenter.net/iau/lists/Unusual.html), on 2020 May 20th.. bHarris & D'Abramo (2015). cTJ is the object's Tisserand Parameter, with respect to Jupiter, given by ${T}_{J}=\displaystyle \frac{{a}_{J}}{a}+2\cos i\sqrt{\displaystyle \frac{a\left(1-{e}^{2}\right)}{{a}_{J}}}$, where a is the object's semimajor axis, aJ the semimajor axis of Jupiter, i the object's orbital inclination, and e its eccentricity. For more information on the Tisserand parameter, we direct the interested reader to page 3 of Horner et al. (2003), and references therein. dThe number of objects in these categories was calculated using the boundaries denoted in the table, applied to an ASCII table of cometary orbital parameters downloaded from the JPL Solar System Dynamics page (https://ssd.jpl.nasa.gov/?sb_elem) on 2020 May 20th. eBrasser & Wang (2015); this estimate considers cometary nuclei with diameter >2.3 km, and perihelion distance <2.5 au, and hence represents a lower limit in terms of the criteria described in this table. fEncke-type comets are those whose aphelion is interior to that of Jupiter, sufficiently displaced from that planet's orbit that their Tisserand Parameter with respect to Jupiter is greater than three. Such objects are therefore included here, paired with the Main Belt Comets, rather than as a Jupiter-Family Objects, despite the disparate formation mechanisms proposed for the two classes of object. gWang & Brasser (2014) ; this estimate considers cometary nuclei with diameter >2.3 km, and perihelion distance <1.8 au, and hence is a lower limit in terms of the criteria in this table. hHistorically, the threshold that divided the long and short-period comets was set at 200 yr. However, that definition became somewhat blurred in 2002 by the discovery of comet 153P/Ikeya–Zhang, which follow up observations revealed was identical to a comet observed in 1661, as well as those seen in 877 and 1273 AD (e.g., Hasegawa & Nakano 2003). With an outbound orbital period of 366.5 yr, Ikeya–Zhang holds the record for the longest orbital period of any periodic comet to have been observed at more than one apparition, and blurred the admitted arbitrary dividing line between those comets considered "periodic" (or short-period) and the single-apparition long-period comets. iThe number of objects in these categories was calculated based on the Minor Planet Center's list of Centaurs and Scattered Disk objects (https://www.minorplanetcenter.net/iau/lists/Centaurs.html), accessed on 2020 May 20th. jThe number of objects in these categories was calculated using the parameters in this table, applied to the Minor Planet Center's list of "Trans-Neptunian Objects" (https://www.minorplanetcenter.net/iau/lists/t_tnos.html), on 2020 May 20th. We note that more than 700 objects in that list would be classified as Scattered Disk objects or Centaurs, following the classification scheme detailed in the table, giving the alternate number detailed above. It seems that this dichotomy is born of the manner by which the Minor Planet Center divides information. between the Centaurs and Scattered Disk object list (mentioned in footnote i) and the trans-Neptunian object list. The former contains those objects with semimajor axes less than 30 au, or greater than 50 au, while the latter contains all those objects with 30 au < a < 50 au. kHorner et al. (2004a). lThe number of Main Belt Asteroids was calculated using ASCII tables of numbered and unnumbered asteroids downloaded from the JPL Solar System Dynamics page on 2020 May 20th, and applying the cuts detailed in the table. mTedesco & Desert (2002). nThe number of objects in these categories was taken from the Minor Planet Center's "Trojans" page (https://www.minorplanetcenter.net/iau/lists/Trojans.html), on 2020 May 20th. oYoshida & Nakamura (2005). pSheppard & Trujillo (2006). qGomes et al. (2008). rThe Minor Planet Center does not maintain statistics on the number of TNOs that are trapped in mean-motion resonance. However, Dr Wm Robert Johnston maintains an archive of TNOs, with orbital parameters taken from MPC circulars, in "Johnston's Archive" (http://www.johnstonsarchive.net/astro/tnoslist.html). The number of resonant TNOs given in this work was therefore taken from that list, which, at the time of writing, was last updated on 2019 December 27th. Of these 837 objects, 462 are Plutinos (in Neptune's 2:3 resonance). sGladman et al. (2012); note, the value given here is for objects with D > 100 km. tDurda & Stern (2000). uEmel'yanenko et al. (2007).

Download table as:  ASCIITypeset images: 1 2

3. The Planets

Even though the solar system's six innermost planets have been known since time immemorial, aside from the Earth, the detailed study of the planets as physical objects had to wait for the dawn of the Space Age. In the past fifty years, we have sent robot envoys to all of the planets—although the ice giants (Uranus and Neptune) have, to date, only seen a single fleeting visit—the fly-by of the Voyager 2 spacecraft (in 1986 and 1989, respectively; e.g., Smith et al. 1986, 1989). In this section, we provide a brief overview of the current state of our knowledge of the planets as physical objects—from their atmospheres to their bulk composition and interiors.

3.1. Mercury

Mercury, the closest planet to the Sun, is a shattered husk of a world. In terms of its uncompressed bulk density, it is the densest of the planets25—the result of a composition that is dominated by metals, rather than silicates or volatiles. From the Earth, Mercury's surface features remain elusive—a result of the planet's small size, and the challenges involved in observing it (since it always remains with some 28° of the Sun in the sky). It was long thought to be trapped in 1:1 mean-motion resonance with the Sun—spinning once on its axis for every orbit—a theory finally that was finally disproved as a result of radar observations of the planet, carried out in 1965 (e.g., McGovern 1965; Pettengill & Dyce 1965). Rather than spinning once per orbit, those observations revealed that Mercury instead spins on its axis once every 59 days, while it orbits the Sun in just under 88 days. As a result, the planet is trapped in a 3:2 spin–orbit resonance—meaning that it completes three full rotations in the time it takes to orbit the Sun twice (e.g., Liu & O'Keefe 1965). No other object is known to be trapped in such a resonance—and it seems likely that the resonance nature of the planet's rotation is maintained, in part, as a result of the planet's eccentric orbit. Were Mercury's orbit circular, it seems likely that the tidal interaction between it and the Sun would continue to slow its rotation, driving it toward an eventual 1:1 spin–orbit resonance. However, as Mercury's orbit is eccentric, the strength of the tides raised upon Mercury by the Sun vary markedly through the course of each orbit—strongest at perihelion, and weakest at aphelion.

The result of Mercury being trapped in this spin–orbit resonance is that the Solar day on Mercury's surface (i.e., the time between two consecutive sunrises for most locations on the planet) is twice the length of Mercury's year. More peculiar still, however, is the combination of Mercury's spin and its eccentric orbit. As the velocity of Mercury in its orbit around the Sun reaches its peak, at perihelion, the easterly motion of the Sun across the sky that results from the planet's motion exceeds the westerly motion that results from Mercury's constant rotation. As a result, the motion of the Sun across Mercury's sky appears to slow as the planet approaches perihelion, then reverses through the perihelion passage (with the Sun moving from west to east). Finally, as Mercury recedes from perihelion, the situation reverses, and the Sun resumes its normal east-to-west motion across the sky. As a result, from some locations on Mercury's surface, the Sun would appear to rise in the east in the morning, slow down, reverse direction, and set once again, before rising a second time and moving across the sky as normal! At other locations, the same kind of behavior would be seen at sunset—the Sun would sink below the western horizon, before rising again, moving west-to-east. It would then slow, turn, and resume its motion, setting once again and ushering in a lengthy 88 day night!

Another result of this peculiar rotation is that there are two "heat poles" on the surface of Mercury—locations where, every second orbit, the Sun reverses direction while close to the zenith. Not only is the Sun overhead (or close to) at perihelion—hence maximizing the instantaneous flux falling on the surface—but it remains there for an extended period of time, thanks to the retrograde motion induced by the speed of the planet's perihelion passage. A huge impact feature, one of the largest impact basins in the solar system, lies close to one of these two "heat poles," with its name (Caloris Planitia, or the Caloris basin; e.g., Gault et al. 1977) intended to invoke the fact it is likely to be one of the hottest locations on the planet, at local noon. The basin shows evidence of ancient volcanism that dates from some time after the basin's formation (e.g., Thomas et al. 2014), indicating that Mercury remained volcanically active until less than a billion years ago.

For many years, it was believed Mercury had no atmosphere—but the first spacecraft to visit Mercury, Mariner 10, revealed that the planet has an extremely tenuous exosphere (e.g., Broadfoot et al. 1974, 1976)—a result confirmed by the more recent Messenger spacecraft (e.g., Solomon et al. 2008; Zurbuchen et al. 2008). That exosphere is not considered bound to Mercury—rather, it is continually generated and denuded as a result of the interaction between the Solar wind and the planet's surface, which leads to a continual erosion and ejection of ionized material from that surface. As such, Mercury's tenuous exosphere consists primarily of a mix of Solar helium, and native sodium, potassium and oxygen ions, with other ionized species forming smaller components—as detailed in Zurbuchen et al. (2008). Interestingly, there is evidence that the impact of meteoroids upon Mercury's surface plays an important role in the generation of its exosphere (e.g., Killen & Hahn 2015, who suggest that passages of the planet through the Taurid meteor stream could lead to annual enhancements in the amount of Calcium found in Mercury's exosphere, and Pokorný et al. 2017, who invoke the seasonal variation in the impact flux at Mercury as a potential origin for the observed dawn-dusk asymmetry in the planet's exosphere).

The Mariner 10 spacecraft gave us our first close-up look at the solar system's innermost planet. In addition to revealing Mercury's tenuous exosphere, it also revealed that Mercury has a significant intrinsic magnetic field, enough to generate a magnetosphere and bow shock (e.g., Ness et al. 1974, 1975), which might indicate the presence of an internal dynamo, and hence that Mercury's core is still in the process of freezing (e.g., Stevenson et al. 1983). The structure of that magnetic field was revealed in more detail by the MESSENGER spacecraft, which made three flybys of the innermost planet in 2008 and 2009, before moving into orbit around the planet in 2011 March (e.g., Solomon et al. 2007, 2008). The spacecraft revealed that Mercury's magnetic field is tilted by ∼5° from the planet's rotation axis (e.g., Anderson et al. 2008, 2010), is approximately 1% the strength of the Earth's magnetic field (at the planet's surface) and is centered on a point that is displaced by just over 400 km to the north of the planet's center (Alexeev et al. 2010). For a detailed review of our knowledge of Mercury's magnetic field following the MESSENGER mission, we direct the interested reader to Anderson et al. (2010).

Mariner 10 also provided our first images of Mercury's surface, and first measurement of the planet's mass and density. Through those flybys, the spacecraft imaged around 45% of the planet's surface, revealing a scarred, cratered world, similar in appearance to the Moon (e.g., Murray et al. 1974). In addition to the abundant impact features of all sizes, more recent observations carried out by MESSENGER have revealed that some ∼27% of Mercury's surface is covered by smooth planes, most of which are thought to be volcanic in origin (e.g., Denevi et al. 2013). In the region surrounding the Caloris basin, MESSENGER revealed a region of hummocky plains, covering approximately 2% of the planet's surface, which might be the result of eject from the impact that formed the basin (Denevi et al. 2013). The imagery returned of Mercury also revealed unusual thrust fault features, known as "lobate scarps," whose origins likely reflect the contraction of the planet as it cooled, following its formation (e.g., Solomon 1977; Watters et al. 1998), with a potential contribution coming from changes in the planet's oblateness as its spin slowed to the current ∼59 day period (e.g., Melosh 1977).

Based on the observations made by Mariner 10, researchers were able to determine that Mercury is differentiated, with a large iron–nickel core that is markedly larger and more massive (as a fraction of the planet's mass) than the cores of the other terrestrial planets (e.g., Murray et al. 1974; Siegfried & Solomon 1974; Gault et al. 1977). These results were refined considerably as a result of the MESSENGER mission—which enabled the construction of a detailed model of Mercury's gravity field (Smith et al. 2012). Those observations revealed that the planet's crust varies in thickness as a function of latitude—being thickest near the equator, and thinnest near the poles. The distribution of mass in Mercury's interior was found to be consistent with the planet having an iron-rich liquid outer core—a result that ties in nicely with the idea that Mercury's magnetic field is generated by an interior dynamo, in much the same way as that of the Earth. As we discuss later (in Section 4.1.1), Mercury's anomalously massive core, and high bulk density, are key pieces of evidence that suggest the planet was once victim to a celestial hit-and-run collision, which shattered the proto-Mercury, and stripped it of much of its silicaceous mantle material (e.g., Cameron et al. 1988; Benz et al. 2007).

For a detailed review of our knowledge of Mercury prior to the MESSENGER mission, we direct the interested reader to the book "Mercury," published as part of the Space Sciences Series of ISSI (Balogh et al. 2008), while Rothery (2015) provides a detailed overview of our understanding of the planet, following the new data delivered by MESSENGER. In the coming years, our knowledge and understanding of the solar system's smallest planet will likely be revolutionized once again, with the arrival of the BepiColombo mission (e.g., Benkhoff et al. 2010). Launched in 2018 October, BepiColombo is following a convoluted path to Mercury—and is scheduled to move into orbit around the iron planet in 2025 December. Before this, the spacecraft will use a mixture of gravitational assists (through flybys of the Earth, Venus, and Mercury itself) and Solar-electrical propulsion to modify its orbit, and gradually reduce its encounter velocity, with respect to Mercury, to one sufficiently slow as to effect the orbit insertion. The spacecraft will then split into two components—the Mercury Planetary Orbiter, and the Mercury Magnetospheric Orbiter (Mio), which will observe Mercury until at least mid-2027. For more details on the goals of the BepiColombo mission, we direct the interested reader to Benkhoff et al. (2010).

3.2. Venus

Venus is the second planet from the Sun, and one of the solar system planets known since antiquity. It is typically the third brightest object visible from Earth, after the Sun and Moon, and is frequently visible soon after sunset or before sunrise as the "evening star" or "morning star." The apparent brightness of Venus largely stems from the combination of its proximity to the Sun and the Earth, the reflecting/scattering properties of its atmosphere, and its size.

Due to its similarity in size to the Earth (0.95 Earth radii), Venus is frequently referred to as our planet's "twin" or "sibling." Furthermore, the mass of Venus is 82% of that of the Earth, which results in the planet having a similar bulk density to our own (5.204 g cm−3 compared to Earth's 5.514 g cm−3). It is therefore considered reasonable to assume that Venus has both a comparable bulk composition, and underwent a similar formation process to the Earth (e.g., Zharkov 1983). Up until as recently as the early 1960s, the Venusian surface was theorized to have a range of temperate surface environments, including steamy jungle landscapes. Ground-based microwave observations (Mayer et al. 1958) combined with data from the NASA Mariner (e.g., Barath et al. 1964) and Soviet Venera space programs (e.g., Avduevskij et al. 1971; Keldysh 1977) finally revealed the extent to which the surface of our sister planet diverged from a temperate Earth-like environment (e.g., Sagan 1962; Ronca & Green 1970; Marov 1978).

The first suggestions that Venus might not be a true twin to the Earth came in the early part of the 20th Century. Pioneering spectroscopic observations by Slipher (1903) strongly suggested that the planet's rotation must be far slower than that of the Earth. Indeed, his observations found no evidence of any rotation—leading Slipher to conclude "A glance at the table will show that the errors of observation were small and that there is no evidence of a short rotation period for the planet. A rotation period of twenty-four hours would incline the planetary lines one-third of a degree, a quantity quite large in comparison with the errors of observation." Confirmation of Venus' incredibly slow rotation came with the advent of radar observations, in the 1960s (Goldstein & Carpenter 1963), which revealed both that the planet rotates incredibly slowly (with a period of slightly more than 243 days), and that that rotation is retrograde in nature (i.e., occurs in the opposite direction to its orbital motion).26 As a result, Venus is the planet with the slowest rotation (both in terms of rotational velocity at the planet's equator, and rotation period) and the greatest axial tilt (with a value of around 177°). As we discuss in Section 4.1.1, a number of explanations have been put forward to explain Venus' peculiar rotation—ranging from torques exerted on the planet by its thick atmosphere (e.g., Dobrovolskis & Ingersoll 1980) to the chaotic evolution of the spin under the gravitational influence of the other planets (Correia & Laskar 2001), or even the effects of a giant collision, early in the planet's youth (Dormand & McCue 1987; McCue & Dormand 1993). Regardless of the true origin of the planet's slow spin, it places the planet in an unusual situation—unlike the Earth and Mars, whose atmospheres rotate along with their host planets, the atmosphere of Venus instead super-rotates (aside from the layers adjacent to the planet's surface)—with the upper layers of the atmosphere rotating with a period of approximately four days (e.g., Schubert et al. 1980).

The atmosphere of Venus is dominated by CO2 (96.5%), with the remainder consisting mostly of N2 (3.5%). Other trace atmospheric constituents include SO2 and clouds of H2SO4 that extend from 25 to 50 km altitudes above the surface. The thickness of the atmosphere creates enormous pressure at the planet's surface, with an average surface pressure of 92 bars, equivalent to the pressure at an ocean depth of almost 1 km. The scattering properties of the thick atmosphere produce a relatively high Bond albedo of 0.76 (Haus et al. 2016). Venus receives 1.91 times the incident flux received at Earth, but less than 3% of that incident flux penetrates to the Venusian surface. However, the Venusian climate is locked in a post-runaway greenhouse state, with an average surface temperature of 735 K (Walker 1975).

The present state of the Venusian atmosphere is presumed to have been caused through the planet's passage through a threshold whereby the insolation exceeded the outgoing thermal radiation limit for a moist atmosphere (Komabayasi 1967; Ingersoll 1969; Nakajima et al. 1992; Goldblatt & Watson 2012; Goldblatt et al. 2013). Moreover, it is possible that Venus may have retained substantial surface liquid water until as recently as a billion years ago (Way et al. 2016), although it has also been argued that the planet's water may never have condensed after its accretion (Hamano et al. 2013). The evidence for a previously habitable Venus is critically important both for the general evolution of terrestrial planets and the studies of Venus that are applicable to exoplanets (Kane et al. 2018, 2019b).

The interior and geology of Venus retain numerous outstanding questions, including the size and state of the core, refractory elemental abundances, seismic activity and density variations with depth, and convection within the mantle with relation to current surface geology. Pioneer Venus produced one of the first complete topographical maps of Venus via radar mapping (Pettengill et al. 1980), shown in Figure 7. The map reveals the complexity of the surface, including the dominant highlands of Ishtar Terra in the northern hemisphere and Aphrodite Terra along the equator (Ivanov & Head 2011).

Figure 7.

Figure 7. Topographical map of the Venus surface, based on Pioneer Venus orbiter observations. Credit: NASA Ames Research Center, US Geological Survey, Massachusetts Institute of Technology, and the NASA Space Science Data Coordinated Archive.

Standard image High-resolution image

Images from the JAXA Akatsuki spacecraft show strong evidence for stationary waves in the upper atmosphere, shown in Figure 8, where the wave features are centered above the Aphrodite Terra highland (Fukuhara et al. 2017b). Such atmospheric phenomena are explained by stationary gravity waves caused by deep atmosphere wounds over the highland regions, and emphasize the importance of interactions between planetary atmospheres and surface topography. Indeed, topographical maps of the Venusian surface yield the appearance of analogues to continents and ocean basins. However, unlike Earth, the surface of Venus does not represent a broad distribution of ages (Head 2014).

Figure 8.

Figure 8. Stationary wave above the Aphrodite Terra highland, based on Akatsuki spacecraft observations. Credit: JAXA. Image from Fukuhara et al. (2017a); images a–e were obtained using the LIR instrument, which observes at 10 μm, while image f was taken by the UVI instrument at a wavelength of 283 nm.

Standard image High-resolution image

Analyses of surface crater counts have shown that the average age of the surface is ∼750 Ma (Schaber et al. 1992; McKinnon et al. 1997), which is less than 20% of the total age of the planet. The precise mechanisms through which resurfacing scenarios could have occurred is still a matter of some debate (Bougher et al. 1997) and include catastrophic overturn (Parmentier & Hess 1992) and mantle convection mechanism changes (Herrick 1994) as possible explanations. Answering such questions regarding the interior and atmosphere will shed new light on the evolution of habitable terrestrial planets.

3.3. Earth

The Earth is, of course, by far the best studied of the planets, and, as a result, is the one whose nature is the best understood. As the cradle of life, our planet remains the only known inhabited world—a fact that helps to drive the global community's fascination with the search for another "Earth-like" world—a phrase which means different things to different people. While it is beyond the scope of this review to describe all of the contemporary research into our terrestrial home, we here provide a general overview of our planet's primary physical characteristics, and describe those that mark the Earth as distinct from its neighbors in the solar system.

The Earth is the largest and most massive of the solar system's four terrestrial planets. As our home, it is by far the best studied of all of the solar system's planets—although thanks to our ongoing exploration using robotic spacecraft, it is fair to say that we have better imagery of the surfaces of the Moon or Mars than we do of the bottom of Earth's oceans.

The Earth is accompanied by a single natural satellite—the Moon—whose unusually large size and peculiar composition hint at a violent origin (as we discuss in more detail in Section 4.1.1). The Moon is gradually receding from the Earth, as a result of their mutual tidal interaction—a process that is also slowing the Earth's spin, as angular momentum is transferred between the two bodies. As a result, in the past the Earth–Moon distance was markedly smaller, and our planet's spin faster than that we observe today. In a recent study, de Winter et al. (2020) used core samples taken through a fossilized bivalve (torreites sanchezi) that lived 70 million years ago to show that the year, at that time, was 372 days long. Since it is unreasonable to assume that the Earth's orbital semimajor axis was markedly larger at that time, those results suggest that the length of the Solar day on Earth at the time was 23 hr 31 minutes—some 29 minutes shorter than that we observe today. As well as offering a fascinating insight into life in the late Cretaceous, that work offers a promising new technique that could allow researchers to accurately track the evolution of the Earth's spin over time. Such work is particularly important, since it seems likely that, shortly after the formation of the Moon, the Earth may have spun with a period of just a few hours (e.g., Canup 2012; Ćuk & Stewart 2012; Wisdom & Tian 2015).

Unlike Venus and Mars, the Earth has an appreciable magnetic field, one that is approximately one hundred times stronger than that of Mercury. Earth's strong magnetic field is thought to be produced by a deep internal dynamo generated within the liquid outer core of our planet (e.g., Buffett 2000). This magnetic field produces a large magnetosphere, which acts to deflect the majority of the Solar wind. Those charged particles that succeed in penetrating the Earth's magnetic field follow the field lines down to hit Earth's atmosphere in its polar regions, driving continual aurora at high magnetic latitudes. At times of particularly energetic Solar activity, those aurorae are driven equator-wards, with the most energetic events causing aurorae that can be seen close to the equator. It has been argued that the presence of Earth's strong magnetic field has played an important role in ensuring the ongoing habitability of our planet—with the atmospheric degradation experienced by Mars usually held up as an example of what can happen to the atmosphere of a terrestrial planet without the twin protections of a strong magnetic shield, and outgassing and recycling of the atmosphere driven by plate tectonics. For an overview of the impact of plate tectonics and the Earth's magnetic field on our planet's habitability (along with the various other factors that may have combined to make our planet a suitable host for life), we direct the interested reader to Horner & Jones (2010a), and references therein.

The orientation of Earth's magnetic field is far from fixed—with the location of the magnetic north pole on Earth's surface having drifted markedly over the past three centuries. In the mid-1800s, the northern magnetic pole was located more than twenty degrees from the geographical north pole. In recent years, the orientation of Earth's magnetic axis has been shifting rapidly, with the spin and magnetic poles currently separated by less than four degrees (Chulliat et al. 2019). On longer timescales, there is evidence that Earth's magnetic field can vary markedly in strength, and its orientation can reverse. Such reversals have been demonstrated in numerical models of the Earth's dynamo (Kuang & Bloxham 1997) and, indeed, the periodic reversals in Earth's magnetic field provide an elegant piece of evidence for the ongoing process of plate tectonics on Earth. When iron-bearing rock forms, it freezes in place a record of the orientation of Earth's magnetic field. It was observed that changes in the remnant magnetism in rocks across regions signified that not only had the Earth's magnetism changed during geological time, but also gave large clues as to how the continents had moved relative to each other (Runcorn 1956).

The dynamics of the Earth's surface reflects its interior structure, the deepest layers of which having been probed by seismic investigations. Indeed, the Earth is almost unique among the planets in that it has been subject to a detailed seismic study of its interior, although the NASA InSight mission, currently on the surface of Mars, is aiming to remedy this, with the goal of using seismic observations to investigate the red planet's interior, as well as the flux of impacts it experiences (e.g., Banerdt et al. 2012; Dauber et al. 2018). In the case of the Earth, our ability to continually monitor and study seismic activity has allowed the interior of our planet to be studied in remarkable detail. By monitoring the propagation of seismic waves generated by earthquakes through our planet we have been able to discern that the Earth has two-component core, a liquid outer core (Oldham 1906) and a solid inner core (Lehmann 1936). Surrounding this is a plastically deforming mantle that comprises most of the volume of the planet, and itself can be divided into several layers that are characterized by changes in mineralogy (e.g., Tackley et al. 1994). It is the heat flow within the mantle, through convection, that is the main driving force for movement of Earth's crust, in the form of plate tectonics—a process that is notably absent on the surfaces of the other terrestrial planets. Interestingly, it has been argued that the Earth can only support plate tectonics as a result of abundant water trapped within the mantle. Indeed, the lack of global plate tectonics on Venus, whose size, mass, and composition are similar to those of the Earth, has often been attributed to the lack of water on the planet's scorching surface (e.g., Nimmo & McKenzie 1998; O'Neill et al. 2007). Even water, however, might not have been enough on its own to allow Earth to begin and sustain plate tectonics—with a recent study suggesting that modern tectonic processes might well have been initially triggered by the impacts that bombarded the Earth in the billion years after its formation (e.g., O'Neill et al. 2019).

The Earth's atmosphere is deep and complex. By volume, when dry, it comprises approximately 78% nitrogen, 21% oxygen, and 0.93% argon, 0.04% carbon dioxide,27 and traces of a wide variety of other elements and compounds. In the lower atmosphere, water vapor provides an additional component, though the amount of H2O can vary quite dramatically, from almost none in deserts and at times of particularly low humidity, to being super-saturated. The Earth's atmosphere is noticeably stratified, and is typically broken down into several major layers. Closest to the ground is the troposphere, within which the great bulk of the planet's weather and clouds can be found. Within the troposphere, in general, the temperature decreases with increasing altitude. The second layer of Earth's atmosphere, the stratosphere, sees this trend reverse, with temperature once again increasing as one rises farther above Earth's surface. This temperature inversion may well have played a major role in helping to ensure that the Earth has not become dehydrated over the aeons since it formed, by serving as a "cold trap." Water vapor which rises too high in the atmosphere will condense out, freezing, effectively forcing it to fall back to lower altitudes. As a result, almost all of our planet's water vapor remains trapped beneath this layer—well below the altitudes at which solar ultraviolet radiation (most of which is blocked by the ozone layer) would dissociate that water vapor, allowing its constituent hydrogen to escape from our planet. Above the stratosphere, the Earth's atmosphere begins to cool once again—a region known as the mesosphere, which extends to an altitude of approximately 80 km above the ground. Then, once again, the temperature of the atmosphere begins to climb, rapidly—a region known as the thermosphere. Beyond the thermosphere lies the exosphere—the region where the Earth's atmosphere gradually thins until it becomes indistinguishable from the interplanetary medium.

Perhaps the most striking feature of the Earth is the presence of liquid water across our planet's surface. Indeed, oceans, rivers, and lakes cover something like 70% of the Earth's surface—a stark contrast to the surfaces of the other terrestrial planets. The origin of that water remains heavily debated—a topic we discuss in more detail in Section 4.4. While it has been suggested that both Mars and Venus were once warm and wet, like the Earth, our planet is the only one of the three whose climate has remained suited to the presence of widespread surface liquid water. Over the course of the Earth's history, our planet's climate has remained remarkably stable—a fact made even more remarkable by the fact that, over the same time period, the Sun's luminosity has increased by an estimated ∼30%. The fact that the Earth's climate was clement at a time when the Sun was just 70% as luminous as it is today led to a conundrum known as the "faint young Sun problem." For many years, the Earth's early warm climate appeared to be at odds with the fact that the young Sun was so faint. For a detailed overview of the "faint young Sun problem," we direct the interested reader to Feulner (2012), and references therein.

Despite its apparent stability on billion years timescales, the climate of the Earth does vary to some degree on shorter timescales. The most famous of these periodic oscillations are known as the Milankovitch cycles—and are the result of the continual perturbation of the Earth by the gravitational influence of the other planets in the solar system. For a more detailed discussion of the Milankovitch cycles and the history of the Earth's climate variability, we direct the interested reader to Horner et al. (2020), and references therein.

3.4. Mars

Mars, the outermost of the terrestrial planets, is probably the most studied object in the solar system, aside from Earth. It is markedly smaller than the Earth (with an equatorial radius of just 3396 km, compared to Earth's 6378 km), with mass just 0.107 times that of our planet. It is also significantly less dense than the other terrestrial planets, with a bulk density of 3.934 g cm−3, which recent work has suggested might be the result of the planet being starved of material during its formation as a result of the migration of Jupiter (e.g., Brasser et al. 2016), as we discuss in section four. Unlike the other planets visible with the unaided eye, which typically appear to have a vaguely yellowish hue, Mars appears a striking red—the result of the iron oxide that dominates the planet's surface.

During the 19th Century, telescope observations of Mars began to reveal a world that displays some similarities to the Earth. Like Earth, Mars has polar caps whose size waxes and wanes with the passing seasons. While Earth's polar caps are dominated by water ice, we now know that those on Mars are a combination of a "bedrock" made of water-ice and carbon dioxide ice, with their expansion and contraction caused primarily by seasonal deposits of carbon dioxide ice—the result of the winter temperatures at the poles being far colder than those on the Earth. Those observations also revealed dark areas on Mars' surface, which some considered to potentially be areas of vegetation (e.g., Hollis 1908), though we now know that they are instead highland regions, and bedrock scoured clear of Mars' ubiquitous reddish dust. Toward the end of the 19th Century, speculation on the possibility that there could be widespread life on Mars—potentially even advanced, technological life—reached fever pitch (e.g., Flammarion 1892; Comstock 1902; Lowell 1908), with speculation fueled by the observation of "canali" (or channels) on the planet's surface. The concept of the Martian Canals was born, primarily through a mistranslation of the Italian "canali" (e.g., Green 1879; Maunder 1888; Lowell 1895; Hamilton 1916). As telescopes improved, however, it gradually became accepted that these "canali" were actually optical illusions/artefacts (e.g., Evans & Maunder 1903)—though debate over the topic continued to rage through to the dawn of the space age (e.g., Webb 1955, and references therein). Nevertheless, through the first half of the twentieth century, it became clear that the modern Mars is far from an ideal location for widespread, complex life (e.g., Tombaugh 1950, who wrote "It appears likely that Mars has always had a thin atmosphere, very little water, and a very dry climate"). The first images returned of Mars from space drove the final nail into that particular coffin—revealing an arid, cratered, cold and desolate world (e.g., Chapman et al. 1968), and confirming the lack of canals on the planet's surface (e.g., Sagan 1975).

Over the past few decades, the idea that Mars might once have supported life, or at least supported conditions suitable for the development of life, has played an important role in directing our ongoing exploration of the red planet. The first spacecraft to successfully visit Mars was the American Mariner 4, which flew rapidly past the planet in 1965 July, returning a small amount of data and imagery (e.g., Fjeldbo et al. 1966; Chapman et al. 1968; Siscoe et al. 1968). The first mission to orbit Mars, the American Mariner 9, in 1971 November (e.g., Hanel et al. 1972; McCauley et al. 1972; Conrath et al. 1973; Sagan et al. 1973), was quickly followed by the first successful soft landing on the surface of the red planet, by the Russian mission Mars 3 (which failed some 20 s after reaching Mars' surface). The Viking missions, in the mid-1970s, were a huge success—both Viking 1 and Viking 2 comprised a Mars orbiter and a lander—with the two landers surviving and operating on the surface of Mars for several years (six for Viking 1, and three for Viking 2). The Viking landers provided our first in situ measurements of Mars surface, yielding a vast amount of useful data (e.g., Hess et al. 1977; Toulmin et al. 1977; Adams et al. 1986, and many others). The landers carried a suite of three biological experiments, designed to search for any evidence of life in the soils at their landing site—the results of which were inconclusive, at best (e.g., Klein et al. 1976; Levin & Straat 1976; Klein 1977, 1978).

After the success of the Viking landers, our exploration of Mars underwent a hiatus of almost two decades, before undergoing a renaissance in the second half of the 1990s. In the 21st Century, Mars has been peppered with a series of landers and rovers, with new missions departing to the red planet with every opposition (with the mean synodic period of Mars being approximately 780 days). The Mars rovers Spirit and Opportunity were a particular highlight of the 21st Century exploration of Mars. The two rovers landed on Mars in 2004 January, for a mission initially scheduled to last just 90 Sols (1 Sol is one Martian day, 24h 37m). Both rovers vastly outlived their expected lifetimes—Spirit remained active for more than six years, while Opportunity's journey across Mars was finally brought to an end as a result of a catastrophic loss of power resulting from an intense Martian dust storm on 2018 June 10th, after a remarkable 5352 Sols. In that time, Opportunity covered a distance of more than 45 km—meaning that it holds (by a comfortable distance) the record for the greatest distance traveled over the surface of another world. At the same time, a series of Mars orbiters have returned exquisite images of the planet's surface—with the result that the surface of Mars has been more closely studied than that of the Earth (given the challenges inherent in imaging the bottom of the Earth's oceans).

The result of all that work is that Mars is better studied and characterized than any other object in the solar system (except the Earth). The earliest missions to Mars revealed that the planet has essentially no magnetic field (e.g., Smith et al. 1965), though parts of the planet's crust retain evidence of ancient magnetism (Acuña et al. 1999; Langlais et al. 2004), evidence supported by measurements of the Martian meteorite ALH84001 (Weiss et al. 2002). Based on that evidence, it seems likely that the young Mars had a molten, iron-rich core, which allowed the planet to generate a strong magnetic field. Unlike the Earth, which has maintained its core dynamic until the current day, Mars lost its dynamo, and thence its magnetic field, after a few hundred million years—potentially as a result of the planet cooling more rapidly than the Earth (e.g., Stevenson 2001, 2003; Williams & Nimmo 2004). The surface of Mars shows that, unlike the Earth, the planet does not undergo plate tectonics. Indeed, the lack of plate motion has allowed a cluster of vast shield volcanoes to grow above a hotspot, forming the largest volcanoes in the solar system (the largest of which, Olympus Mons, is almost 26 km tall, as measured from its base, and has a surface area only marginally smaller than that of France; e.g., Mouginis-Mark 2018). For more information on Mars' volcanic history, see Werner (2009), and references therein.

There is abundant evidence that Mars was once warm and wet, with an ocean potentially filling the planet's vast northern lowlands (e.g., Helfer 1990; Clifford & Parker 2001; Carr & Head 2003; Rodrigeuz et al. 2015). This in turn suggests that the planet's atmosphere must once have been much thicker and warmer than that we see today (e.g., Ramirez 2017, and references therein). It seems likely that the lack of a global magnetic field for the bulk of Mars' evolution has allowed Mars' atmosphere to be denuded by the influence of the Solar wind. The lack of plate tectonics on the red planet has also played a role in the loss of its original thick atmosphere, since without plate tectonics, the planet lacks a mechanism by which any gas chemically trapped in the planet's surface rocks could be recycled (e.g., Tomkinson et al. 2013). For a more detailed discussion on the loss of Mars' atmosphere, we direct the interested reader to Horner & Jones (2010a), and references therein. Interestingly, recent radar observations of Mars' south polar cap have revealed that there may exist a permanent reservoir of liquid water on modern Mars—in the form of large lakes buried beneath the ice of Mars' southern polar cap (Orosei et al. 2018). It seems likely that future Mars mission will look to explore this in greater detail, since such deeply buried lakes in Antarctica are known to teem with life (e.g., Christner et al. 2014). If life ever did find a foothold on Mars, then it seems plausible that it could still survive, buried deep beneath the Martian polar ice.

Tied to these ideas of Mars' ancient oceans, it is worth noting that Mars' exhibits a remarkably varied range of surface features—from the vast Valles Marineris (a valley over 4000 km long, 200 km wide, and up to 7 km deep) to the vast volcanoes of the Tharsis bulge. Perhaps the most interesting feature of Mars' global topology, however, is known as the Martian Dichotomy. Aside from the vast impact basin Hellas, Mars' southern hemisphere consists of highlands, which are heavily cratered, and considered to be ancient terrain. The northern hemisphere, by contrast, consists of smooth lowlands, with few if any scars. The Dichotomy is show below, in Figure 9, based on data from the Mars Orbiter Laser Altimeter on Mars Global Surveyor. That dichotomy may well be the scar left behind by a cataclysmic impact on the planet (e.g., Andrews-Hanna et al. 2008), as we discuss in more detail in Section 4.1.1.

Figure 9.

Figure 9. The topography of Mars, based on data taken by the MOLA instrument on Mars Global Surveyor. The shield volcanoes of the Tharsis bulge can be seen close to the equator, to the left, with the Hellas basin the deep round feature in the southern hemisphere between longitudes 60° and 90°. The most striking feature, however, is the Martian Dichotomy—the difference between the smooth northern lowlands and the ancient, cratered, southern highlands. Image credit NASA/JPL/USGS; image is public domain.

Standard image High-resolution image

The robotic exploration of Mars is scheduled to continue over the coming years. Of particular interest is the NASA InSight mission, which landed on Mars in 2018 November. InSight carries a sensitive seismometer, and a thermal probe, which together will provide a wealth of fresh insights on the nature of Mars' interior—from the thickness of the planet's crust, and new information on the radius and density of the planet's core (e.g., Folkner et al. 2018), to the rate at which heat flows outward from the planet's interior (e.g., Spohn et al. 2018), and even new data on the frequency with which Mars experiences impacts from cometary and asteroidal bodies (e.g., Teanby 2015; Dauber et al. 2018).

Early highlights from the InSight mission include the discovery that the local magnetic field around InSight's location is an order of magnitude stronger than that estimated from instruments in orbit around Mars, which is thought to be the result of magnetized rocks buried beneath the surface near the lander, and the finding that that field is time variable, likely as a result of activity high in the planet's atmosphere (e.g., Banerdt et al. 2020; Johnson et al. 2020). Perhaps the highest profile results from InSight to date have come from the SEIS instrument, designed to measure Mars' seismic activity (Lognonné et al. 2019). That instrument has proved hugely successful in detecting Marsquakes, having cataloged several hundred to date. The largest of those quakes occurred in the Cerberus Fossae region, revealing that the faulting observed in that area remains geologically active today (e.g., Witze 2019; Banerdt et al. 2020; Giardini et al. 2020), and it will be fascinating to see to what degree that activity continues in the coming years. For all that Mars is by far the best studied celestial body, it seems certain that the coming decade will deliver a wealth of new surprises, as our ongoing exploration allows us to learn ever more about the red planet.

3.5. Jupiter and Saturn: The Gas Giants

The gas giant planets Jupiter and Saturn contain most of the planetary mass in solar system (at 318 and 95.2 MEarth, respectively; see Table 1). Although they are similar in diameter (with Saturn's equatorial diameter being just 17% smaller than that of Jupiter), Jupiter is more than three times the mass of Saturn, and as a result, it is bulk density is markedly higher (1.326 g cm−3 versus 0.687 g cm−3). In addition to comprising the bulk of the mass of the solar system (aside from the Sun), Jupiter and Saturn also comprise the majority of the angular momentum budget of the entire system—a fact that points to the importance of the giant planets and their evolution on the architecture of planetary systems.

Both planets maintain systems of rings composed of rocky and ice particles of varying size. The presence of rings around Saturn has been known since the seventeenth century (as described in Section 4.1.1)—in part as a result of the planet's axial tilt of almost 27°.28 When they are close to edge on, the rings disappear from the view of all but the largest telescopes—and if Saturn's axial tilt were as low as that of Jupiter (a meagre 3fdg13), it seems quite likely that they would have evaded such early telescopic discovery. In contrast to Saturn's magnificent ring system, Jupiter's rings were not discovered until the Voyager 1 spacecraft visited the planet in 1979 March (see Miner et al. 2007, and references therein). Recent observations have revealed that such ring systems may well be transient companions for the giant planets, providing evidence that Saturn's ring may well become entirely depleted as a result of an ongoing process of "ring rain" within the next 300 million years (O'Donoghue et al. 2019). The question of whether Saturn's rings are primordial or transient has implications for the origin of the system, as we discuss in Sections 4.1.1 and 4.1.3.

Both Jupiter and Saturn host large numbers of natural satellites (with, between them, 161 of the 205 planetary satellites discovered to date), some of which display active volcanic (i.e., Io) or cryovolcanic (i.e., Enceladus) activity. Over the past five decades, Jupiter and Saturn have been investigated in exquisite detail by a series of spacecraft: Pioneer 10 and 11 (e.g., Carlson & Judge 1974; Smith et al. 1974; Gehrels et al. 1980; Null et al. 1981), Voyagers 1 and 2 (e.g., Broadfoot et al. 1979; Smith et al. 1979, 1981, 1982), Galileo (e.g., Carlson et al. 1996; Niemann et al. 1998; Gautier et al. 2001; Vasavada & Showman 2005), Ulysses (e.g., Balogh et al. 1992; Stone et al. 1992; Grün et al. 1993), Cassini-Huygens (e.g., Porco et al. 2005; Dougherty et al. 2009; Fletcher et al. 2010; Kanani et al. 2010), New Horizons (e.g., Baines et al. 2007; Gladstone et al. 2007; Reuter et al. 2007; Spencer et al. 2007), and Juno (e.g., Bolton et al. 2017; Connerney et al. 2017; Wahl et al. 2017).

Jupiter and Saturn are composed of mostly hydrogen and helium at a roughly Solar composition ratio, but both planets are enriched in heavy elements relative to Solar (see Atreya et al. 2016, for a discussion). After diatomic hydrogen, methane and ammonia are the two most abundant molecular species in the atmospheres of Jupiter and Saturn. Global cloud layers consisting of ammonia ice, ammonium hydrosulfide (NH3SH), and water (H2O) are present in both atmospheres, although at different pressure levels between 0.1 and 10 bars (see e.g., Ragent et al. 1998; West et al. 2009). Jupiter's global clouds are clearly visible when imaged in true (or enhanced color) images, such as that shown in Figure 10 (taken by the Juno spacecraft). The clouds in Jupiter's and Saturn's atmospheres have been used to measure these planets' zonal winds, although the generation of those winds and their connection to deeper layers of the atmosphere or interior is still an area of active research (e.g., Kong et al. 2018; Galanti et al. 2019). Jupiter and Saturn both have warm stratospheres, within which temperature inversions are driven by the deposition of solar radiation. This radiation drives photochemistry in both atmospheres that generates hazes (e.g., Irwin et al. 1998; Fouchet et al. 2009), which can be seen to reflect solar radiation near 2 microns in Saturn (as can be seen in Figure 11, below). Above the region dominated by neutral species, the extended atmospheres of both Jupiter and Saturn include ionospheres and magnetospheres. These magnetospheres are dynamic (e.g., Mitchell et al. 2009; Connerney et al. 2017) and produce auroral emission (visible in Figure 11) caused by interactions with the solar wind as well as material originating on the surfaces, or in the interiors, of the planet's satellites (e.g., Vasavada et al. 1999; Clarke et al. 2009; Mura et al. 2017).

Figure 10.

Figure 10. Color-enhanced image of Jupiter taken by the JunoCam onboard the Juno Spacecraft. The cloud bands and various storms are clearly visible. Image credit: NASA/JPL-Caltech/SwRI/MSSS/Kevin M. Gill (https://www.jpl.nasa.gov/spaceimages/details.php?id=PIA22946).

Standard image High-resolution image
Figure 11.

Figure 11. Saturn in false color as viewed from the Visual and Infrared Mapping Spectrometer onboard the Cassini Spacecraft. The blue represents reflected solar radiation at 2 microns, the green represents auroral emission between 3 and 4 microns, and the red represents deep thermal emission at 5 microns. Image credit: NASA/JPL/ASI/University of Arizona/University of Leicester (https://photojournal.jpl.nasa.gov/catalog/PIA13403).

Standard image High-resolution image

Both Jupiter and Saturn exhibit differential rotation, with their equatorial regions rotating more rapidly than the poles.29 As a result of their rapid rotation (with periods of order ten hours), the two planets are noticeably oblate.30 Within the interiors of Jupiter and Saturn, molecular hydrogen transitions to liquid metallic hydrogen (H+) at a pressure of roughly 2 Mbar (e.g., Fortney et al. 2018). The depth at which this transition occurs in differs between the two planets, as a result of the significant difference in their mass. Deeper still, Jupiter and Saturn contain cores of rock and ice. Previous measurements of the nondimensional moments of inertia of the two giant planets have led to the suggestion that Saturn is more centrally condensed than Jupiter (e.g., Helled 2011; Guillot & Gautier 2015). Indeed, recent high-precision measurements of the gravitational moments of Jupiter from the Juno spacecraft have revealed that Jupiter's core is diluted, and potentially extends to a large fraction of the planet's radius (Wahl et al. 2017). Such diffusion between the core and the envelope of giant planets has been predicted by previous high-pressure equation of state experiments (Wilson & Militzer 2012), but there are also suggestions that this dilution of the Jovian core could be evidence that Jupiter was once victim to a giant collision (e.g., Liu et al. 2019; see Section 4.1.1). Jupiter's core is thought to contain between seven and twenty-five Earth masses of heavy elements (Wahl et al. 2017), while Saturn's core is thought to contain between five and twenty MEarth of such material (Fortney et al. 2018)—findings that offer strong support to their idea that the two planets formed through a process of core accretion.31

Both gas giants are still in the process of cooling from their formation, and this thermal radiation can be readily detected with observations at 5 microns (see Figure 11). However, research suggests that the thermal evolutionary histories of Jupiter and Saturn are markedly different. Homogeneous evolutionary models tend to accurately reproduce the current luminosity of Jupiter, but substantially underpredict that of Saturn (see e.g., Fortney & Nettlemann 2010 for a review on this topic). This discrepancy has presented a longstanding, critical issue to the fundamental understanding of the formation and evolution of the entire class of gas giant objects. Helium phase separation and subsequent "rain out" onto Saturn's core is often used to invoke an extra source of energy within Saturn (e.g., Stevenson & Salpeter 1977a, 1977b). Over forty years after the introduction of this theory, results from the Cassini mission seem to have provided verification (Koskinen & Guerlet 2018). However, modeling and observational efforts will likely continue to explore this topic into the future.

3.6. Uranus and Neptune: The Ice Giants

A whole generation of space scientists were inspired when Voyager 2 flew by Neptune in 1989 August, sending us the first images of its dark blue disk. Four years earlier, the same spacecraft had flown past Uranus, and still over 30 yr later remains the only spacecraft to have visited our solar system's ice giants. Many of the observations made by Voyager 2 remain to date the best information we have to constrain our understanding of the ice giants. Based on Voyager 2 results, both Uranus and Neptune have been the subjects of books within the University of Arizona's Space Science Series—"Uranus" (Bergstralh et al. 1991) and "Neptune and Triton" (Cruikshank et al. 1995). However, over the last 30 yr, Earth and near-Earth observations have made a significant impact to our understanding of these worlds. Observations from facilities such as the Very Large Telescope, the Atacama Large Millimeter/submillimeter Array (ALMA), and the Hubble Space Telescope (HST), have shown both the ice giants to be much more dynamic worlds than the Voyager 2 data had suggested.

Why the name "Ice Giants"? This moniker has come from the average densities of Uranus and Neptune, which "weigh" in at 1.27 and 1.65 g cm−3 respectively. This early observation led to the conclusion that these planets were markedly enriched with the "heavier" elements of oxygen, carbon, sulfur and nitrogen compared to the composition of Jupiter and Saturn, whose overall makeup reflects more closely that of our Sun (as discussed above). The widely accepted view is that these elements will be in the form of ices; with H2O, CH4, H2S and NH3 combined making up ∼70% of the mass of both Uranus and Neptune, and that it is the physical properties of these materials that will govern the interactions on these planets.

Though Uranus and Neptune are often collectively written about as "the Ice Giants," it should be pointed out that there are some large differences between these worlds. The starkest of these is the negligible heat flux exhibited flowing from Uranus' interior, as observed by Voyager 2, making its upper layers the coldest planetary location within our solar system. Added to this is the 98° obliquity of Uranus, which results in the planet experiencing extreme seasonal change. In contrast, Neptune has very strong self-luminosity (2.61 times more the solar influx; Pearl & Conrath 1991), and has been continually observed to display distinct meteorological features. As a result, it has been suggested that the two planets represent end-members of a "Neptune and sub-Neptune" class of astronomical object (Fulton & Petigura 2018).

Uranus and Neptune both host a suite of satellites, many of which are discussed in subsequent sections. Uranus's larger, regular satellites (including Titania, Oberon, Ariel and Umbriel) have orbits which are coplanar to Uranus' rotation, tilted to the plane of the rest of the solar system.32 Thirteen of these regular satellites have, to date, been discovered orbiting interior to the orbit of Miranda, the innermost of the five moons known to orbit the planet at the dawn of the Space Age. Ten of those satellites were discovered during the Voyager 2 flyby of the planet (Smith et al. 1986; Jacobson 1998), with an eleventh, Perdita, being discovered more than a decade later using data obtained during the flyby (e.g., Karkoschka 2001). The other two new regular satellites (Cupid and Mab) were discovered in 2003 using the HST (Showalter & Lissauer 2003, 2006). In the last three decades, ground-based observations have revealed a secondary satellite system orbiting Uranus, comprising (at the time of writing) nine much smaller irregular satellites largely moving on retrograde orbits (see e.g., Gladman et al. 1998b, 2000; Sheppard et al. 2005).33

Neptune's satellite system is dominated by its largest moon, Triton, which stands out as it is the only large satellite within our solar system to move on a retrograde orbit about its planet. It is now considered that the most likely explanation for the origin of Triton is that it was once a binary trans-Neptunian object, moving on an orbit not unlike that of the dwarf planet Pluto, and that it was gravitationally captured by Neptune (Agnor & Hamilton 2006) as a result of an encounter with the planet that tore the binary asunder. That capture event is thought to have had catastrophic implications for the rest of Neptune's satellites at the time. We discuss the origin of Triton in more detail in Section 4.1.3.

Both Uranus and Neptune rotate with similar periods, roughly intermediate between those of the gas giants and the Earth. Uranus rotates once every 17 hr, 14 minutes, while Neptune spins slightly faster, with a period of 16 hr, 6 minutes. These similar rotation periods, coupled with the limited available measurements of the two planet's gravity fields, have led to a common interior structure being proposed for both planets. This is based on a three-layer model (Guillot 1999), comprising of an upper layer of ices mixed with molecular hydrogen and helium, that transitions to an icy mantle of water, methane and ammonia—potentially mixed with silicates. The very center of the ice giants is thought to comprise a dense core of silicates and metals, though some have suggested that carbon could condense to diamond under these conditions (Benedetti et al. 1999).

Our abiding image of Uranus is of a pale turquoise gas world devoid of the tumultuous storms that are enduring on the other gas giants. This imagery, taken by Voyager 2 as it flew past Uranus in 1985 is perhaps unrepresentative, as it is of the planets' Northern hemisphere summer. Due to its extreme obliquity, Uranus is subject to extreme seasons, and during a given hemisphere's winter the majority of it will not see the Sun for 20 yr. On the progression to equinox in 2007, a noticeable increase of cloud activity was observed in Uranus' atmosphere (de Pater et al. 2015), and this continues to be monitored in the hope it will reveal much about Uranus' atmospheric dynamics. Neptune, in contrast, has shown constant atmospheric activity since being first observed by Voyager 2 on its 1989 flyby. This activity is mostly observed in the infra-red and at the planet's mid-latitudes, however a bright equatorial storm tracked by professional and amateur observers through 2017–2018 is challenging our emerging understanding of Neptune's atmosphere (Molter et al. 2019).

Both Uranus and Neptune exhibit magnetic fields that are anomalous to those observed elsewhere in the solar system. Unlike the magnetic fields of Earth, Jupiter and Saturn; those of Uranus and Neptune are not dipolar or axially symmetric. It was suggested (Hubbard & MacFarlane 1980) that to generate fields of this nature a type of thin-shell dynamo would need to arise. Calculations have shown that this is possible (Stanley & Bloxham 2004), but for this to occur a convective region in the interior must occur. This is at odds with many of the current interior models of the two planets, which have assumed purely conductive interiors for Uranus and Neptune.

With the great success of the Galileo, Cassini, and Juno missions in exploring the solar system's gas giants, there is a current ground-swell of support for a similar flagship mission to explore Uranus and Neptune. Our current lack of detailed information about our own ice giants is particularly disappointing given the great number of potential "ice-giant" type planets that have been detected orbiting other stars in recent years (as we discuss in Section 5). While those planets orbit at much smaller radii than Uranus and Neptune, the best laboratories for the detailed study of such planets are, at least in the relatively near future, those in our own planetary system. If we are to understand these far away worlds at all we need first to well constrain our own neighbors Uranus and Neptune.

4. The Formation and Evolution of the Solar System

Our understanding of the formation and evolution of our solar system has been shaped by the almost overwhelming amount of information we have about it. Prior to the discovery of the first exoplanets, all models of planetary formation were based on our own solar system, which is now actually thought to be somewhat unusual as planetary systems go (e.g., Wittenmyer et al. 2011b, 2016; Bryan et al. 2019). For example, "super-Earths" (planets of ∼1.2–1.9 Earth radii) are the most common type of planet as revealed by the Kepler mission (Howard et al. 2012; Zhu et al. 2018; Hsu et al. 2019), yet are conspicuously absent in our solar system.

Current thinking holds that our solar system formed from a collapsing giant molecular cloud. It has been suggested that that collapse may have been triggered by the shock wave from a nearby supernova (e.g., Cameron & Truran 1977; Boss & Keiser 2010; and many others). Key evidence for the influence of a nearby supernova comes from the study of the composition of meteorites, which reveal the existence of short-lived radioactive nuclei in the material from which they formed. As Boss & Vanhala (2000) state:

"Because the half-lives of these nuclides are so short, this evidence requires that no more than about a million years elapsed between their nucleosynthesis and their inclusion in cm-sized solids in the solar nebula. This abbreviated time span can be explained if these nuclides were synthesized in a stellar source such as a supernova, and were then transported across the interstellar medium by the resulting shock wave, which then triggered the gravitational collapse of the presolar molecular cloud core."

Essentially, then, the idea is that the same explosion both triggered the collapse of the pre-solar cloud, and polluted it with large amounts of heavy elements. It should be noted, however, that there are some problems with the invocation of a single nearby supernova as the sole source of these short-lived radioactive nuclei, as is discussed in review by Pfalzner et al. (2015). Instead, those authors describe how any contribution to the solar system's budget of radioactive nuclei resulting from nearby supernova(e) was likely supplemented by an additional dose of 26Al, injected through the wind of a massive nearby star (e.g., Arnould et al. 2006; Tatischeff et al. 2010). In order that the solar system's birth cluster be large enough that there be a reasonable likelihood of such pollution, Pfalzner et al. (2015) suggest that the cluster was relatively massive—comprising at least 2000 stars.

An alternative explanation for the unusually high abundance of 26Al in the early solar system was proposed by Dwarkadas et al. (2017), who note that, while the protoplanetary nebula from which the solar system formed was enriched in 26Al, the abundance of 60Fe (compared to 56Fe) appears to have been somewhat depleted compared to the norm in the galaxy at the time. To explain this discrepancy, Dwakadas et al. propose that the formation of the solar system was not triggered by a nearby supernova, but was instead triggered by the outflow from a nearby Wolf-Rayet star. If the solar system formed on the edge of a bubble blown clear by such a star, it would naturally be inoculated with an appreciable quantity of 26Al, without simultaneously being polluted by 60Fe.

In contrast to these ideas of a peculiar/rare confluence of events leading to the formation of the solar system, Young (2016) argues that no such unusual processes (nearby supernovae, or AGB stars) need to be invoked to explain the solar system's initial budget of short-lived radionuclides—instead suggesting that "The radiochemistry of the early solar system is therefore unexceptional, being the consequence of extensive averaging of solids from molecular clouds." More recently, Bartos & Marka (2019) have proposed that the early solar system was inoculated by short-live radio isotopes as a result of a relatively nearby binary neutron-star merger. Had such an event occurred in the ∼100 Myr prior to the solar system's formation, it could account for the abundances of radioactive "r-process" isotopes in the early solar system—isotopes with halflives of less than 100 Myr. In particular, they suggest that single merger event could be responsible for the bulk of the early solar system's curium and plutonium, and proposed that that event might have occurred approximately 80 million years before the solar system formed, at a distance of approximately 300 parsecs.

As the proto-Solar nebula collapsed, it is thought to have flattened out into a dynamically cold protoplanetary disk, with the great bulk of the material moving on near-circular, low-inclination orbits. Within the disk, the temperature varied greatly—with the inner regions, near our proto-Sun, fiercely hot, while the outer reaches were cold. Beyond a distance of ∼3 au, temperatures were sufficiently low for water to form icy grains, which meant that beyond this "ice-line," significantly more solid material was available to speed the accretion of the planets (e.g., Dodson-Robinson et al. 2009). Over time, the solid material in the nebula experienced collisions, causing the agglomeration of planetary embryos.34 The more material that was available, and the greater the density of that material, the faster these embryos grew. Beyond the ice-line, this core-accretion process occurred sufficiently quickly that the cores were able to become massive enough that they could capture gas from the proto-planetary nebula, leading to them undergoing "runaway growth" (e.g., Pollack et al. 1996). In this model, the growth of the giant planets (Jupiter, Saturn, Uranus and Neptune) can be broken down into three key stages.

  • (1)  
    The accretion of solid material from the disk starts very slowly, but as the mass of the planetesimal increases, the rate at which material is accreted undergoes a runaway growth, then falls away again once the planet has swept up all the material in its immediate "feeding zone" (e.g., Lissauer 1987). This process involves the formation of a number of "embryos" or "oligarchs"—the largest objects in the feeding zone—that can undergo violent giant collisions with one another. Eventually, one mass comes to dominate a given region—an object often described as a "protoplanet."
  • (2)  
    Once the feeding zone is cleared, the protoplanet undergoes a very slow continued accretion, as small amounts of material (both gas and solids) are captured from the edges of the feeding zone. This stage tends to determine the evolutionary timescale of giant planet formation (e.g., Pollack et al. 1996).
  • (3)  
    Eventually the mass of the protoplanet reaches a critical value (roughly ten times the mass of the Earth, e.g., Mizuno 1980), at around the time its gaseous and solid mass are equivalent. At this point, it undergoes a process of runaway gas accretion, which only ceases once the gas in the disk is cleared out by the Sun—the end of the lifetime of the proto-planetary disk (typically believed to occur around 2–3 Myr after the start of the formation process, though some discs are thought to survive for up to ∼10 Myr, or more; e.g., Li & Xiao 2016; Murphy et al. 2018, while those in clusters might be more short-lived; e.g., Concha-Ramírez et al. 2019).

A more recently developed hypothesis for the early part of this process is the so called "Pebble Accretion" model (Lambrechts & Johansen 2012; Levison et al. 2015a, Levison et al. 2015b; Chambers 2016). One of the issues with the "classical" formation theory detailed above is that the rate at which material is accreted into planetesimals is often too slow to produce a massive enough object to act as the nucleus of a gas giant planet before the gas disk dissipates. Historically, this has proven to be a major flaw in the classical accretion process (Goldreich et al. 2004). A solution to this problem has planetesimals accrete "pebbles," centimeter to meter sized objects, from the debris disk (Lambrechts & Johansen 2012). If the pebbles form in conjunction with the scattering of the smaller planetesimals in the disk (Levison et al. 2015b), then the cores of gas giants can form in a realistic number and timeframe. For a detailed overview of current theories of terrestrial planet formation, we direct the interested reader to Izidoro & Raymond (2018).

Once the gas has been removed from the system, the growth of the giant planets all but stops, with the only ongoing mass gain coming from collisions with dynamically unstable small bodies (such as the Shoemaker–Levy 9 impact on Jupiter, in 1994; e.g., Zahnle & Mac Low 1994; Hammel et al. 1995; Noll et al. 1995). As the giant planets interacted with the small body reservoirs (in particular the trans-Neptunian disk), the transfer of material from those reservoirs around the solar system (with, ultimately, the bulk being ejected from the system,35 or colliding with one or other of the planets) caused the planets to continue the migration that would have been observed during their formation, albeit at a much, much slower rate. Indeed, some authors (e.g., Gomes et al. 2005; Tsiganis et al. 2005) have suggested that such migration could lead to a period of significant destabilization of the outer solar system, and potentially have been the cause of the putative Late Heavy Bombardment of the terrestrial planets (e.g., Bottke & Norman 2017). On the other hand, it has recently been suggested that the instability of the giant planets probably occurred much earlier than previously thought, perhaps within the first 100 Myr of solar system evolution (Kaib & Chambers 2016; Quarles & Kaib 2019). It has even been suggested that Uranus and Neptune formed closer to the Sun than Saturn, and were ejected to their current orbits in that process (e.g., Thommes et al. 2002), or that the solar system initially contained at least one additional ice giant that was ejected during such a period of system instability (e.g., Batygin et al. 2012; Nesvorný & Morbidelli 2012; Cloutier et al. 2015). Such a planet could still lurk in the solar system's outer reaches, a proposal that has recently driven wide discussion of, and a number of searches for, the proposed "Planet Nine" (e.g., Batygin & Brown 2016a; Beust 2016; Brown & Batygin 2016, 2019; Batygin & Morbidelli 2017; Batygin et al. 2019).

An even more speculative idea is that Jupiter undertook a dramatic excursion through the inner solar system, with its orbital radius decreasing from >5 au to <2 au before increasing again to its current position within the solar system. This model, which has become known as the "Grand Tack" (e.g., Walsh et al. 2011; Nesvorný 2018), has been invoked to explain the mixing of icy and rocky bodies in the asteroid belt. In addition, it offers a plausible explanation for the relatively low mass of Mars, suggesting that the alleged mass-deficit is the result of proto-Mars having been a stranded protoplanetary embryo that was starved of material as a result of the dynamical clearing of material from its zone of capture by the migrating Jupiter (e.g., Brasser et al. 2016).

Regardless of the fine details of the formation of the outer solar system, there is significant evidence that the giant planets migrated over significant distances (both inwards and outwards; e.g., Malhotra 1995; Levison et al. 2007; Lykawka 2012; Nesvorný & Vokrouhlický 2016; Nesvorný 2018) to reach their current locations. The bulk of that evidence comes from the distribution of the solar system's small bodies, which will be discussed in some depth in Section 4.3.

The formation of the terrestrial planets, far interior to the ice line, is thought to have been a slower and more stochastic process. Without the extra solid mass afforded by ices in the outer solar system, none of the embryos in the inner regions grew sufficiently rapidly to become massive enough to capture a large gaseous atmosphere prior to the removal of the proto-planetary nebula by the youthful Sun, or to become super-Earths. Instead, the terrestrial planets are thought to have grown more slowly, with the final stages of that formation taking between 10 and 100 Myr to reach their conclusion (e.g., Chambers 2004). The initial growth proceeded much as described above, for the giant planets, with a process of pebble accretion aiding the growth of a number of embryos which became the dominant large bodies in the inner reaches of the proto-planetary nebula (e.g., Izidoro & Raymond 2018).36

Once formed, these embryos dynamically interacted with one another, with giant collisions between embryos being the primary route by which they gained mass. This highly stochastic process eventually resulted in the terrestrial planets we observe today, each of which have features that have been explained as being the result of giant collisions between embryos toward the end of planetary formation (e.g., Benz et al. 1986, 1987, 1989, 2007; Dormand & McCue 1987; Raymond et al. 2014; Canup & Salmon 2018; Lykawka & Ito 2019).

The formation of the planetary satellites appears to have occurred through a number of routes. For the regular satellites of the giant planets, current theories suggest that those planets developed their own small sub-nebulae, within which the satellites formed in much the same manner as the terrestrial planets within the main proto-planetary nebula (e.g., Nesvorný et al. 2002; Mosqueira & Estrada 2003a, 2003b; Mosqueira et al. 2010; Sasaki et al. 2010). Some recent studies take this theory one step further, and have generations of satellites forming in the sub-nebulae of the outer planets, migrating inwards, and being devoured by their hosts. Only the final generation of satellites formed in this way would have survived to the current day (e.g., Canup & Ward 2009).

By contrast, the irregular satellites of the outer planets are thought to have formed elsewhere and then been captured, either through collisions (e.g., Goldreich et al. 1989; Woolfson 1999; Koch & Hansen 2011), three-body encounters between the host planet and binary planetesimals (e.g., Agnor & Hamilton 2006; Vokrouhlický et al. 2008), three-body encounters involving two of the giant planets and the captured object (e.g., Nesvorný et al. 2007), or through gas-drag (e.g., Ćuk & Burns 2004). No one mechanism is sufficient to explain all the observed properties of all irregular satellites (e.g., Jewitt & Haghighipour 2007), and so these objects are thought to be representative of the different processes that occurred during the final stages of planetary formation. The Moon is different again, thought to have formed when the proto-Earth was involved in a giant collision with a Mars-sized embryo, toward the latter stages of its accretion (e.g., Benz et al. 1986; Reufer et al. 2012). In recent years, a similar process has been suggested to explain the origin of Mars' two unusual satellites, Phobos and Deimos—albeit invoking a somewhat less catastrophic collision (e.g., Craddock 2011; Canup & Salmon 2018).

The formation of our solar system left behind large populations of small bodies, whose orbits and composition provide valuable data that are used to help untangle the detail of the formation process (e.g., Malhotra 1993; Gomes 1997; Lykawka & Mukai 2008; Minton & Malhotra 2009; Lykawka 2012; Nesvorný & Vokrouhlický 2016). In the coming subsections, we present detailed reviews of many of the processes thought to have occurred during planetary formation, together with those that continue to the current day, and highlight a number of theories put forward to explain the various features of our system as a whole.

4.1. Giant and Cataclysmic Impacts

When we look around the solar system today, many objects bear witness to the giant collisions that occurred during the latter stages of planetary formation. From the impact that created the Moon (e.g., Benz et al. 1986; Reufer et al. 2012) to that which has been invoked to explain the capture of Neptune's giant moon Triton (e.g., Goldreich et al. 1989; Woolfson 1999), it seems that cataclysmic impacts and collisions between planet-sized bodies were a regular occurrence as the planets formed, and played a pivotal role in shaping the solar system as we know it.

4.1.1. Giant Impacts and the Planets

A number of the planets in our solar system have features that are best explained as having resulted from giant impacts thought to have occurred during the final stages of planet formation.

Mercury: The planet Mercury has long been recognized as being both unusually dense, and unexpectedly small (as discussed in section three). Its uncompressed mean density of ∼5.3 g cm−3 (Cameron et al. 1988) is significantly greater than the value for the next densest planet (the Earth, with ∼4.45 g cm−3; Lewis 1972). This increased density must be the direct result of Mercury containing significantly more iron, as a fraction of its total mass, than any of the other terrestrial planets. Indeed, Benz et al. (2007) state that the expected silicate-to-iron ratio of Mercury is typically thought to lie in the range 30:70–50:50, though some authors have suggested that the true silicate content could be even less than the lower extreme stated here, if the interior contains a significant amount of sulfur (e.g., Harder & Schubert 2001).

Over the years, a number of models have been put forward to explain the strange properties of Mercury, but the leading contender (e.g., Benz et al. 1988, 2007; Chau et al. 2018) involves the collision of a primordial "super-Mercury" with a smaller, but still planet-sized object, which occurred toward the latter stages of planetary formation. Through the use of smoothed-particle hydrodynamics (hereafter SPH) modeling, Benz et al. (2007) suggest that, prior to the impact, the large proto-planet moving on an orbit similar to that occupied by our Mercury was at least twice the mass of the current planet, and was made of the same materials as the other terrestrial planets—a mix of silicates and iron. At the time of the impact, the proto-planet was differentiated, such that the bulk of the planet's iron had settled to the core, which was surrounded by a mantle rich in silicates. At this point, the proto-planet was struck by another proto-planetary body, and severely disrupted. The outer layers were stripped away, and dispersed into the solar system by the force of the collision, leaving behind the bulk of the core. Over time, some small fraction of the ejected material re-accreted, but the great majority was lost, leaving the iron-rich, silicate-poor, over-dense and under-sized husk that we observe today.

More recent work by Chau et al. (2018) delves deeper into the nature of Mercury's origin, investigating a variety of collision scenarios, from a single giant impact to a series of smaller collisions. They find that it is possible to replicate Mercury's current composition by invoking a range of initial scenarios—a high speed head-on collision with a very massive impactor, a glancing "hit-and-run" collision between a massive proto-Mercury (4–5 times the mass of the current planet) and another body, and even a series of smaller, less catastrophic collisions (as is also discussed by Asphaug & Reufer 2014). In a similar vein, Jackson et al. (2018) investigated the leading giant impact scenarios that have been proposed to explain the origin of Mercury. They suggest that all models that invoke a single, high velocity impact should be disfavored, and that a multiple hit-and-run scenario seems to be the most likely explanation for the formation of Mercury.

Recent studies investigating the formation of the terrestrial planets through N-body simulations have found it hard to replicate the initial conditions that would be needed for the collisional scenarios described above. However, those models were also unable to explain Mercury's current mass, orbit, and high core mass fraction (Clement et al. 2019b; Lykawka & Ito 2019)—leaving the full picture of Mercury's formation and evolution open to further debate. What remains clear, however, is that the best explanation for Mercury's peculiarities is an extreme collisional past, which serves as a reminder that the final stages of terrestrial planet formation were a chaotic and violent time.

Venus: The second planet out from the Sun, Venus, also has a few unusual features (e.g., Kane et al. 2019b). While the majority of the planets spin (relatively rapidly) in the same direction as they orbit the Sun (counter-clockwise, as viewed from above the Sun's north-pole, Venus actually spins, very slowly, in the other direction. With respect to the background stars (the sidereal day), Venus takes some 243 days to spin once on its axis (a period longer than that of the planet's orbit, 225 days). However, because it spins in the opposite direction, the Solar day (the time between successive sunrises or sunsets) is around 117 days. Although, in the past, suggestions were made that this unusual spin had its origin in an ancient collision between Venus and another body (e.g., Dormand & McCue 1987; McCue & Dormand 1993), the preferred current theories for the unusual spin invoke other mechanism. Dobrovolskis & Ingersoll (1980) proposed that Venus' current slow spin could principally be the result of atmospheric torques upon the planet's surface, driven by the heating of the planet's dense lower atmosphere by Solar insolation.37 Such a torque would, over long time periods, have led to the gradual slow-down of Venus' rotation, resulting in a spin state that they suggest may be "a steady state among tides in the atmosphere, tides in the solid body, and possibly the influence of the Earth." More recently, Correia & Laskar (2001) and Correia et al. (2003) proposed that the chaotic dynamical evolution of Venus' spin under the gravitational influence of the other planets might explain its peculiarities. Neither of these mechanisms would require a giant collision involving Venus at some point in the past, though of course it should be noted that neither would rule out the occurrence of such a collision.

Interestingly, the idea that a giant impact might have played some role in the formation of Venus has not totally gone away. Over the years, a number of studies have suggested that the interior of Venus is significantly drier than that of the Earth. Zolotov et al. (1997) state that the bulk-silicate-Venus "plausibly contains only 1% of the amount of water in the bulk silicate Earth," a deficiency that is widely accepted in the solar system science community, and one that has been used to explain the current lack of widespread plate tectonics on that planet (e.g., Ward & Brownlee 2000; O'Neill et al. 2007). In order to explain the dehydration of Venus, Davies (2008) suggests that the planet may once have experienced a giant collision with a large planetary embryo. This, he suggests, would have helped the planet to outgas the bulk of the water it accreted as it formed, resulting in it being significantly dryer than the other terrestrial planets. More recent work (Gillmann et al. 2016) revealed that a large, late impact could readily facilitate the loss of Venus' water. However, if the timing were different, a similar impact could instead help to generate a dense, volatile-rich atmosphere for the planet—leading to the kind of hot-house conditions observed on the planet today. Indeed, Lykawka & Ito (2019) suggest that Venus would likely have experienced a number of giant impacts through the latter stages of its formation. Venus' desiccated nature might, instead, reflect differences in the delivery of water to the terrestrial planets (as we will discuss in more detail in Section 4.5), or may simply be the result of the dehydration of the planet's surface (such that all water out-gassed is lost, stripped from the atmosphere by the Solar wind, effectively drying the planet from the outside in; e.g., Kulikov et al. 2006; Dubinin et al. 2011), it is certainly interesting that the idea of such a giant collision keeps recurring in studies of that planet.

Earth: Possibly the most famous theorized giant impact is that which is thought to have created the Earth's Moon (e.g., Benz et al. 1986, 1987, 1989; Stevenson 1987; Cameron & Benz 1991; Canup & Asphaug 2001; Wada et al. 2006; Halliday 2008; Reufer et al. 2012). It is proposed that a large, Mars-sized object (often called Theia) struck the proto-Earth at some point up to ∼100 Myr after the planets began to form, with recent work suggesting that the impact most likely occurred while the Earth's surface was still covered by a global magma ocean (Hosono et al. 2019). The low-velocity, oblique impact severely disrupted both the Earth and the impactor, emplacing a large amount of debris into orbit around the Earth—debris that primarily comprised the upper layers of the Earth and the impacting body. This debris then accreted to form the Moon, at that time located just beyond the Roche limit (the distance at which tidal forces would disrupt a strengthless body).

Since its formation, the Moon has receded from the Earth as a result of their tidal interaction, which has also acted to slow the rotation of the Earth, and has locked the Lunar rotation such that one face perpetually points toward Earth (a 1:1 spin–orbit resonance). The theory explains why the Earth and Moon have significantly different bulk densities. Had the Earth and Moon simply accreted together, or were the Moon captured having formed independently, it would be expected that they would both have roughly similar fractions of iron and silicates. Instead, the Earth is observed to have a large iron core, and a density of some ∼5.5 g cm−3 (corresponding to an uncompressed density of ∼4.45 g cm−3, as discussed above), while that of the Moon is just 3.346 g cm−3 (Table 3.13, Wieczorek et al. 2006). It also explains the high angular momentum of the Earth–Moon system, and the particularly large mass of the Moon relative to that of the Earth—both factors which suggest that the system had an unusual formation.

In order to explain the low collision velocity required by SPH simulations to form the Moon, Belbruno & Gott (2005) suggest that the impactor formed as a Trojan-companion to the Earth, librating around our planet's L4 or L5 Lagrange point. Such a configuration can be stable for a lengthy period of time (indeed, both Jupiter and Neptune still host significant populations of Trojans at the current day, as will be discussed later, particularly in Section 4.3.2), so it is perfectly reasonable to assume that such a body could survive and continue to grow for a hundred million years prior to escaping from the resonant orbit (potentially through dynamical interaction with other growing embryos, or distant perturbations from other planets) and colliding with the Earth. If the impactor formed at the same heliocentric distance as the Earth, this would fit well with the observed oxygen isotope abundances in the Moon and Earth, which are essentially identical (e.g., Wiechert et al. 2001; Young et al. 2016). Had the impacting body formed elsewhere in the solar system, it would have brought material with measurably different abundances into play, and there would be no guarantee that both Earth and Moon would receive just the right dose that the two bodies, post-impact, would look the same. The idea of a local origin for Theia found further support in the work of Quarles & Lissauer (2015), who performed a series of integrations of an inner solar system that contained, initially, five terrestrial planets. They found that a variety of scenarios could lead to a Moon-forming impact, with Theia starting life on a variety of orbits in the general vicinity of 1 au, without requiring Theia to form in 1:1 resonance with the Earth. The scenarios the resulted in planetary systems most similar to our own (with the terrestrial planets on orbits with relatively low eccentricities) came when the proto-Earth and Theia began with similar semimajor axes.

While the search for the most promising impact scenario for the formation of the Moon remains an active topic of research, recent work has identified subtle but statistically significant differences between the oxygen isotope abundances of the Earth and Moon—at the ∼12 parts per million level (Herwartz et al. 2014)—potentially revealing the isotopic signature of Theia for the very first time.

The debris that was ejected from the Moon-forming collision above the escape velocity of the Earth–Moon system would have taken a long time to disperse. A significant amount of that debris would have collided with the Earth and the other terrestrial planets over the millions of years following the collision (∼20% falling back to Earth, and ∼17% accreted by Venus, for example, e.g., Jackson & Wyatt 2012), and the impact would have resulted in a markedly increased debris and dust environment in the inner solar system that would have lasted for tens of millions of years (e.g., Wyatt & Jackson 2016).

Mars: Mars, too, has been proposed as the recipient of at least one giant impact. A key feature of that planet is the clear dichotomy between the terrain in the Southern Highlands, and that in the Northern Lowlands. The origin of the feature has long been the subject of debate, with models suggesting that it might be the result of either a giant impact, or convection and overturn in the planet's mantle (e.g., Wilhelms & Squyres 1984; McGill & Squyres 1991; Roberts & Zhong 2006). In order to address this question, Andrews-Hanna et al. (2008) studied the morphology of the feature, examining satellite measurements of Martian gravity and the topography of the boundary between the two distinct zones. They were able to remove the effects of the Tharsis bulge, a volcanic region of crustal uplift featuring the solar system's largest volcanoes (including Olympus Mons), and revealed that, once that feature was removed, the crustal dichotomy itself could be modeled as a large elliptical region that, in turn, is best explained as being the scar left over from a giant impact that occurred during the latter stages of planetary formation.

Following that work, Marinova et al. (2008), performed detailed SPH modeling to study whether it was possible that an impact sufficiently large to create such a vast basin (the largest impact basin known to date in our solar system) would actually leave such a scar (since the energy involved in such a collision could easily melt the entire crust of the planet, obliterating any evidence that it had happened). They present a number of scenarios that appear to fit the observations, suggesting that the basin-forming impact likely involved an oblique (rather than face-on) low velocity collision (6–10 km s−1) between the planet and an impactor of diameter 1600–2700 km. Given the extreme age of the feature, such an explanation seems perfectly reasonable, as one would expect there to be a significant number of such potential impactors left over at around the epoch that planet formation was coming to a close, and so this collisional origin for the crustal dichotomy is currently considered the leading explanation.38 It has even recently been suggested that the peculiar Martian moons, Phobos and Deimos, could have been produced in such a giant impact—a theory which may solve the long standing question of the origin of those peculiar tiny satellites (e.g., Craddock 2011).

The most recent studies of such a collision suggest that, rather than the impactor being of comparable size to the Red Planet, the formation of Mars' two satellites might only require a collision with an impactor some 10−3 times Mars' mass (Canup & Salmon 2018). Such a low relative mass would suggest an impactor diameter between that of the asteroids Vesta (∼525 km) and Ceres (946 km). Interestingly, that model suggests that such an impact could have created several small satellites, accreted from a circum-planetary disk resulting from the initial collision. In that scenario, Phobos and Deimos would have been the outermost of those new satellites, with the others destroyed relatively quickly (on timescales less than a million years) after spiralling back in to Mars as a result of their tidal interaction with the planet. The authors note that the scale of impact required for the formation of Phobos and Deimos would potentially be compatible with the formation of Mars' largest known impact basins—Utopia (diameter ∼3300 km) and Hellas (diameter ∼2300 km).

Jupiter: One of the key predictions of the Core Accretion model is that the giant planets should have relatively dense core of solid material—with masses of several to ten times that of the Earth (e.g., Mizuno 1980; Bodenheimer & Pollack 1986; Pollack et al. 1996; Inaba et al. 2003; Hubickyj et al. 2005). It is only once the giant planets had accreted that much solid matter that their gravitational pulls would be sufficient to begin to capture the hydrogen and helium that made up the bulk of the protoplanetary nebula. However, it has historically proven challenging to determine the true scale and nature of the cores of Jupiter and Saturn (e.g., Guillot et al. 1997; Saumon & Guillot 2004; Hori et al. 2008; Nettelmann 2011; Lozovsky et al. 2017). NASA's Juno mission (e.g., Matousek 2007; Bolton et al. 2017) has attempted to provide an answer to that question for the giant planet Jupiter. Juno moves on a highly eccentric and inclined orbit that, at perijove, skims Jupiter's cloud tops. Moving on such an orbit has allowed Juno to probe the structure of Jupiter's deep interior—mapping the distribution of mass deep within the giant planet (e.g., Militzer et al. 2016; Folkner et al. 2017; Wahl et al. 2017).

The first results from Juno have found strong evidence for a massive core deep within Jupiter—containing between 7 and 25 times the mass of the Earth (e.g., Wahl et al. 2017). Surprisingly, however, those results also reveal that Jupiter's core is somewhat larger (and therefore less dense, or more "dilute") than had been expected (e.g., Wahl et al. 2017; Debras & Chabrier 2019). To explain this otherwise anomalous mass distribution, Liu et al. (2019) recently proposed that Jupiter's diluted, extended core, may well be the result of a giant head-on collision between the young proto-Jupiter and a massive planetary embryo, itself around ten times the mass of the Earth. Such an impact could shatter the proto-Jupiter's dense, compact, massive core, dispersing its material through the lower reaches of the planet, and resulting in the degree of dilution observed in the Juno data. As more data comes in from the ongoing Juno mission, it seems likely that the precision with which Jupiter's inner regions can be mapped will increase still further, and it will be interesting to see whether those results continue to match well with the violent collision hypothesis proposed by Liu et al. (2019).

Saturn: The first evidence that Saturn's spin axis is tilted, compared to the plane of its orbit, came with the discovery of its rings, in the 17th Century. Galileo's observations of Saturn in 1610 suggested that the planet was triple—he wrote (in translation) "This discovery is that Saturn is not single but a composite of three, which seem to touch each other and never change their relative position and never move among themselves nor change: they are placed in a line parallel to the Zodiac; the one in the middle being about three times larger than the two lateral ones, and being situate in this manner, oOo."39 By the end of 1612, however, the two smaller bodies accompanying Saturn appeared to vanish—much to Galileo's consternation. Forty-seven years later, Huygens (1659) published the solution to Galileo's quandary. Based on his own observations, made with a telescope he constructed with his brother, Huygens discovered Saturn's giant satellite, Titan, and revealed that the "smaller bodies" were in fact a thin ring of material, in orbit around the planet. The ring and the satellite moved in the same plane, tilted to the plane of Saturn's orbit by more than 20°. The cause of Galileo's vanishing "companions," therefore, was the tilt of Saturn's spin axis—which caused the rings to wax and wane over the years, as Saturn moved around the Sun. During the summer and winter for Saturn's northern hemisphere, Saturn's rings were face on—clearly visible, and the source of the optical illusion of a "triple planet" observed by Galileo. Around the Saturnian equinoxes, however, the rings approach edge-on, and all but disappear as seen from Earth.

In many ways, Saturn's rings are the most obvious indication of a planet's axis tilt in the solar system—since they can readily be seen even through binoculars, and they reveal that the giant planet's axis is tilted by almost 27° to the plane of its orbit. Lissauer & Safronov (1991) considered the degree to which accretive collisions between protoplanets and large planetesimals would impact the final spin of the planet formed through those collisions. They found that such collisions would, on average, serve to hasten the spin of the youthful planet, as well as acting to increase their obliquities. Their results added weight to the idea the Moon could have formed in a giant collision between the proto-Earth and another body, and supported the idea that Mercury was a collisionally shattered husk. Building on those ideas, they considered whether such impacts could be the cause of Saturn's observed obliquity. They suggested that, if Saturn's obliquity were the result of its collisional history, during its formation, then it most likely must have accreted at least one planetesimal several times more massive than the Earth.

The origin of Saturn's obliquity remains to be definitively determined, however. Ward & Hamilton (2004) noted an alternative mechanism that could drive Saturn's spin axis from a negligible tilt to its current value. They noted that the precession period for Saturn's spin axis is strikingly similar to the precession period of Neptune's orbital plane (which is slightly tilted compared to the solar system's invariable plane, at an inclination of 0fdg72). They note that Neptune's orbit precesses with a period of ∼1.87 million years—a period which would, in the past, have been faster. Under the assumption that, when the Edgeworth–Kuiper Belt was more massive, Neptune's orbit would have precessed faster than Saturn's spin axis, the authors noted that, as Neptune's orbit precessed ever more slowly, it would eventually reach the same period as Saturn's spin axis. They proposed that Saturn's axial tilt would have been pumped to its current value as a result of the planet's spin moving into this secular spin–orbit resonance with Neptune's orbital precession—and note that, during occupation of that secular resonance, Saturn's axis might well have liberated with an amplitude of up to 31°. In a follow-up work, Hamilton & Ward (2004) used numerical integrations to demonstrate that such resonant excitation of Saturn's axial tilt could occur, and that it would not be disrupted by the perturbations exerted upon Saturn by the other planets in the solar system.

More recently, Brasser & Lee (2015) considered the same mechanism, in the context of the migration of the giant planets, to see whether the obliquities of Saturn (tilted by ∼27°) and Jupiter (tilted by just ∼3°) were compatible with a variety of migration models. They found that it was challenging to explain the obliquities of both Jupiter and Saturn as a result of migration—with even their best simulations only yielding appropriate values for the orbital and obliquity evolution of the two giant planets some 0.3% of the time. While it was possible to tilt Saturn's axis as a result of a resonance crossing during migration, then, such a hypothesis was hard to fit together with the other constraints on the final architecture of the solar system. As a result, the true story of Saturn's obliquity remains an open question—and it seems likely that future observations (similar to those being obtained by the Juno mission, currently in orbit around Jupiter) may be required to shed light on the true origin of the ringed planet's tilt. Indeed, during the last few months of the Cassini mission, the spacecraft moved on an ever more eccentric orbit, with a pericenter moving closer and closer to the giant planet. That evolution facilitated Juno-like measurements, offering a wealth of new information on Saturn's interior structure, from the fact that Saturn's differential atmospheric rotation must extend to at least 9000 km depth (Iess et al. 2019), to a confirmation that the results are consistent with Saturn having an increased central density (indicative of the presence of a core; e.g., Movshovitz et al. 2020). As researchers continue to analyze the wealth of data returned by Cassini, it seems likely that the interior structure of the planet will become more clear cut, and that the answer to the question of Saturn's tilt may finally find a definitive answer.

Uranus: The discovery of Uranus by Sir William Herschel, in 1781, marked a watershed in solar system astronomy. Prior to that date, our solar system was thought to have just six planets, all of which had been known since prehistory. The new planet was observed widely, and enthusiastically, by astronomers around the globe, and Herschel followed his discovery, six years later, with the detection of the first two of the planet's moons (Herschel 1787). In the letter through which he announced the discovery of these satellites, Herschel states:

"Their orbits make a con[s]iderable angle with the ecliptic; but to a[ss]ign the real quantity of this inclination, with many other particulars, will require a great deal of attention, and much contrivance: for, as aftimations by the eye cannot but be extremely fallacious, I do not expect to give a good account of their orbits until I can bring [s]ome of my micrometers to bear upon them; which, the[s]e la[s]t nights, I have in vain attempted, their light being [s]o feeble as not to [s]uffer the lea[s]t illumination, and that of the planet not strong enough to render the [s]mall [s]ilk-worm's threads of my delicate micrometers vi[s]ible."

Herschel's surprise at the inclination of these satellites' orbits was understandable—our Moon orbits the Earth within around 5° of the ecliptic (our planet's orbital plane). Similarly, the regular satellites of the other planets that were known in Herschel's day (4 around Jupiter, 5 around Saturn) all orbit a degree or less from the equatorial plane of their host planets, which, in turn, lie close to the plane of the ecliptic. The fact that this new planet's moons instead orbit far from that plane was therefore understandably surprising. It turns out, in fact, that the spin-axis of Uranus is tilted almost perpendicular to the plane of its orbit, with an axial tilt of approximately 98°. The planet's regular satellites and ring system orbit about the planet's equator, and so the entire system is tilted over dramatically compared to the other planetary systems. In order to explain this, it has been suggested that the planet was struck, during or after its formation, by an Earth-sized protoplanet (e.g., Safronov 1966; Cameron 1975; Slattery et al. 1992; Brunini 1995; Parisi & Brunini 1997; Kegerreis et al. 2018). Such a collision would tip the rotation axis of the planet over from that with which it initially formed. The planet's regular satellites, which orbit very close to the plane of its equator, presumably formed after the giant collision that tilted the planet, either from material ejected from the impactor, or from the planet itself as a result of the collision (Mousis 2004). Although several alternative explanations for the planet's tilt have been suggested over the years (such as the dynamical influence of an unusually massive Uranian satellite, which would have since escaped the system; Boué & Laskar 2010, or the tidal capture and disruption of a massive object; Singer 1975), the giant impact hypothesis seems a remarkably good fit to the known data.

4.1.2. Giant Impacts, the Asteroid Belt, and the TNOs

The evidence of the violent collisional history of our solar system is not limited to the planets themselves. Obviously, the larger the object, the larger the collision must be to significantly disrupt it. It is no surprise, then, that when we look at the small body populations of the solar system, examples abound of objects that at some point suffered giant collisions. Indeed, it is likely that those known are only the tip of the iceberg. In this subsection, we present just a few exemplar cases that reveal the violent history of the solar system's small body reservoirs.

The asteroid belt is a collisionally ground reservoir, which is generally thought to contain only a small fraction of the mass it had at the birth of the solar system (as discussed by e.g., Weidenschilling 1977; O'Brien et al. 2007; Morbidelli et al. 2009; Bottke et al. 2015).40 Within the belt, tens of asteroid families have been identified, thought to be the results of catastrophic collisions between asteroids (e.g., Marzari et al. 1995a, 1998; Milani et al. 2014; Nesvorný et al. 2015; Cellino et al. 2019). Unlike the giant impacts that are thought to have affected the planets, the collisional grinding of the asteroid belt is an ongoing process, with new families having been created at all points in the solar system's history (e.g., the Karin family, a sub family of the older Koronis family, which is thought to have formed just ∼5.8 Myr ago; Nesvorný et al. 2002). Over time, the members of these families diffuse through the belt, to a greater or lesser extent, depending on a variety of dynamical and non-gravitational effects, to the extent that some of the families that must have been created by the oldest collisions can no longer be detected (e.g., Hanuš et al. 2019). Figure 12 shows the distribution of known collisional families across the asteroid belt.

Figure 12.

Figure 12. The distribution of collisional families in the inner solar system. The blue dots show the orbital elements of all known asteroidal bodies in the inner solar system. Overlaid on that distribution, in yellow, are shown all asteroids identified as belonging to one or other of the system's myriad collisional families—with such membership determined based on the work of Nesvorný et al. (2015). The five orange dots show the orbital elements of the five innermost planets, Mercury, Venus, Earth, Mars and Jupiter, for scale.

Standard image High-resolution image

The X-type asteroids in the inner main belt have recently been shown to comprise, among their number, at least two such widely dispersed collisional families (Delbó et al. 2019). The eldest of these, whose largest member may be the asteroid (689) Zita, has a wide distribution of orbital elements, with members spread across the entirety of the inner belt. That family was only identified after the authors considered the correlation between the size of a given asteroid and the rate at which it will migrate through the belt under the influence on non-gravitational forces (e.g., Vokrouhlický et al. 2006, Milani et al. 2014; Spoto et al. 2015; Bolin et al. 2017). The wide distribution of the members of the Zita family indicates that that family may even have formed prior to the early planetary instabilities invoked by the Grand Tack and Nice Models of planetary migration (Delbó et al. 2017). The second family identified in the inner belt by Delbó et al. (2019) is the Athor family, after asteroid (161) Athor, has an estimated age of approximately 3 Gyr age, and while still widely dispersed, clusters more closely in element space than the Zita family, potentially as a result of its formation coming after periods of instability in the outer solar system.

Some of the fragments ejected in these asteroid–asteroid collisions are ejected onto orbits that evolve, under the influence of both non-gravitational forces (see Section 4.9) and distant gravitational perturbations from the planets, to regions of instability in the belt, and are then injected into the inner solar system, to become the near-Earth asteroids (e.g., Morbidelli 1999; Morbidelli et al. 2002a, 2002b; Morbidelli & Vokrouhlický 2003; Bottke et al. 2015; Granvik et al. 2017). Such material forms a significant fraction of the ongoing meteoritic flux at the Earth, and certain classes of meteorites have been strongly linked to specific asteroid families, or even individual asteroids (e.g., the basaltic achondrite meteorites and asteroid 4 Vesta, Binzel & Xu 1993; Asphaug 1997; Drake 2001; Marchi et al. 2012).

It has been proposed (Bottke et al. 2007) that the fragmentation of the parent body of the Baptistina family, around 160 Myr ago, was the source of the object that collided with the Earth 65 Myr ago, creating the Chicxulub impact crater (e.g Hildebrand et al. 1991; Morgan et al. 1997; Dressler et al. 2003), and potentially causing the Cretaceous/Tertiary (K/T) mass extinction (e.g., Alvarez et al. 1980; Smit 1999; Schulte et al. 2010), in which the dinosaurs went extinct. It should be noted, however, that the direct link between the extinction and the impact is still the subject of some debate (e.g., Reddy et al. 2009), with more recent work supporting the idea that the extinction was in fact the result of a double-whammy—the combined effects of an enormous ongoing volcanic extrusion of flood basalts, forming the Deccan Traps in India, and the impact that created the Chicxulub impact crater (e.g., Peterson et al. 2016). It is even possible that the scale of the eruption that formed the Deccan Traps was influenced by the Chicxulub impact, with Renne et al. (2015) finding that as much as 70% of the total extruded material was erupted within 50,000 yr of the impact.

Even though the known trans-Neptunian population is far smaller than that of the asteroid belt (an artifact of its greater distance from the Sun—we remind the reader that the estimated population of the trans-Neptunian region is far greater than that of the asteroid belt; e.g., Durda & Stern 2000; Gomes et al. 2008), it is already known to contain a number of examples of giant collisional processes. The dwarf planet Pluto is accompanied by five satellites (Charon, Hydra, Nyx, Kerberos and Styx; Showalter & Hamilton 2015), the orbits of which are best explained by a giant-impact formation mechanism like that invoked for the Earth–Moon system (e.g., McKinnon 1989; Canup 2005; Stern et al. 2006; Canup 2011)—a theory supported by the more recent observations performed by the New Horizons mission, which visited the dwarf planet in 2015 (e.g., McKinnon et al. 2017; Sekine et al. 2017).41

Eris, another of the trans-Neptunian region's dwarf planets, has a satellite, Dysnomia, which may also have been produced in this fashion (Greenberg & Barnes 2008), while a number of the smaller trans-Neptunian objects have similar satellite systems which might have formed in this way (e.g., the Orcus–Vanth system, Brown et al. 2010). However, recent observations of both the Orcus–Vanth and Eris–Dysnomia systems have cast some doubt on the origin of their satellites (Brown & Butler 2018). Those observations reveal that the two satellites in question (Vanth and Dysnomia) have low albedos, similar to those observed for other objects of a similar size in the Edgeworth–Kuiper Belt—rather than the high albedos that might be expected had those moons formed through accretion from an icy disk around their parent bodies resulting from a giant collision. As Brown & Butler (2018) note, further study is needed to definitely determine the origin of the satellites orbiting these two large trans-Neptunian objects.

The dwarf planet Haumea has two satellites, thought to have formed in a giant impact (e.g., Brown et al. 2006), and its own ring system (Ortiz et al. 2017; Winter et al. 2019). In addition, it is the largest object in the first detected collisional family in the trans-Neptunian region (e.g., Brown et al. 2007; Ragozzine & Brown 2007; Schlichting & Sari 2009; Leinhardt et al. 2010; Lykawka et al. 2012; Volk & Malhotra 2012; Vilenius et al. 2018; Proudfoot & Ragozzine 2019). It is almost certain that, over the coming years, more collisional families will be found in the trans-Neptunian region (Marcus et al. 2011). However, since the orbital velocities of objects in that region are far smaller than the orbital velocities of the asteroids, the ejecta from any given collision will disperse far more widely through the region (since the ejecta velocities will be a far greater fraction of the orbital velocity).42 In addition, as Marcus et al. (2011) note, the slow collisional velocities may well be "insufficient to disrupt the largest (and most visible) bodies"—with collisions on such objects likely resulting in either accretion, or a hit-and-run scenario for large impact angles (as described in Agnor & Asphaug 2004). This, coupled with the fact that TNOs are significantly fainter than main belt asteroids, means that such detections will be significantly more difficult—and it is actually quite remarkable that the first TNO family should have been found so soon after the discovery of the first members of the population.

4.1.3. Giant Impacts and Planetary Satellites

Even within the satellite systems of the solar system, there is ample evidence for the key role played by giant impacts in shaping the evolution of the system. A number of the satellites bear the scars of impacts that were close to totally disrupting the body—the largest crater on Saturn's moon, Mimas, for example, Herschel, is around a third of the diameter of the moon, and the collision which formed it left fractures that can be found all around the satellite. Were the impactor slightly more massive, or traveling a little more quickly, it could well have totally disrupted the moon (e.g., Beatty et al. 1981).

Staying within the Saturnian system, giant impacts have been proposed as the source of the unusual ridge which encircles another of Saturn's moons, Iapetus. The ridge, which runs along the satellite's equator, is one of the most unusual looking features of the solar system, leaving the moon resembling a cricket ball with a raised seam. In this case, however, the seam is over 10 km high, and some 20 km wide, and stretches almost completely around the circumference of the satellite. Over the years, a number of explanations have been suggested for the ridge, ranging from it being a relic of the satellite having once been significantly oblate (e.g., Castillo-Rogez et al. 2007), to the collapse of a circum-Iapetan ring (Ip 2006; Stickle & Roberts 2018), or even the contraction of the moon's interior as it cooled following its formation (e.g., Sandwell & Schubert 2010). Dombard et al. (2010, 2012) have proposed that the feature could be the result of a giant impact upon the satellite. They suggest that a giant impact upon the satellite led to the formation of a moon, in orbit around Iapetus, in much the same way that models suggest our own Moon, and Pluto's Charon (among others) were formed. Such an object would accrete on an orbit close to the equatorial plane of the satellite, and then undergo tidal decay, with its orbit shrinking until it passed within the Roche limit of Iapetus, at which point the tides raised upon it by the large satellite would tear it into fragments, which would then slowly de-orbit to the surface of Iapetus at subsonic speeds. This would allow sufficient material to fall to create the ridge while simultaneously explaining the fact the ridge lies on the satellite's equator. More recently, Leleu et al. (2018) suggest that the merger of two similarly sized objects could explain not only Iapetus' unusual equatorial ridge and oblate shape, but that similar processes could well be behind the peculiar shapes of several of Saturn's other innermost satellites—Pan, Atlas and Prometheus.

A giant collision leading to the disruption of a massive, close-in Saturnian satellite has also been proposed as the origin of the planet's splendid ring system (e.g., Ip 1988). More recently, Charnoz et al. (2009) take this idea one step further, suggesting that Saturn's rings were formed during the putative Late Heavy Bombardment (which we discuss in more detail in Section 4.2.1). In that work, they suggest the bombardment would also have led to significant disruption of the closer in Saturnian satellites, with all smaller than Mimas being destroyed, and then re-accreting after the bombardment finished—which in turn suggests that, if that theory is to be believed, the giant Herschel crater could be the relic of an impact during that epoch that almost managed to disrupt Mimas. Instead of invoking a giant collision, Canup (2010) suggests that the rings could instead have been formed by the tidal stripping of an inward-spiraling Titan-sized satellite, during the final stages of the planet's formation. If such a satellite were to migrate inwards through a circum-planetary disk surrounding Saturn, Canup found that Saturn could strip the mantle and crust from such an object, while leaving its core relatively intact, fated to spiral inwards and be devoured by the planet. Given the size of such a satellite, it would be reasonable to expect it to be fully differentiated, and hence the material stripped away in this manner would be a good match to the observed composition of Saturn's rings—which are thought to be between 90% and 95% water ice. However, recent results based on analysis of data obtained during the final phase of the Cassini mission have cast doubt on both these proposed origins for the Saturnian ring system. Rather than being an ancient feature, dating back to the solar system's youth, that data instead suggests that the rings are far younger than previously thought—having formed in just the last 10–100 million years (Iess et al. 2019). Indeed, it seems likely that the rings are actually a transient feature—with observations suggesting that they may well decay within the next 300 million years (e.g., O'Donoghue et al. 2019). Such a recent origin is still entirely compatible with the idea that the rings were created in a catastrophic collision—and the transient nature of the Saturnian ring system seems in keeping with the fact that the other three giant planets, Jupiter, Uranus and Neptune, each also have ring systems—albeit ones far less spectacular than those that gird Saturn.

In orbit around Uranus, the small satellite Miranda proved the highlight of the Voyager 2 flypast for many astronomers. Unlike the other Uranian satellites imaged, which were heavily cratered, pictures of Miranda revealed an incredibly unusual terrain—as can be seen in Figure 13.

Figure 13.

Figure 13. Uranus' icy moon Miranda, as imaged by the Voyager 2 spacecraft on 1986 January 24. Image courtesy of NASA/JPL-Caltech (public domain image). At the bottom of the image can be seen Verona Rupes, vertical cliffs that may stand 20 km tall. Just below the center is the Inverness Corona—a feature often called the Chevron, while the two other unusual coronae terrains, Arden and Elsinore, are located on the left and right-hand sides, respectively.

Standard image High-resolution image

Some of the features looked as though large "blocks" had been dropped into the planet, with adjacent regions appearing greatly distinct from one another. Some of the surface appeared "normal," with the usual craters, while other regions exhibited stretch marks, and enormous cracks stretched across other parts of the satellite. One particularly fascinating feature was the region known as Verona Rupes, a giant vertical cliff face that may be as much as twenty kilometers in height—the tallest cliff in the solar system. In order to explain these features, it has been suggested that Miranda was formed through the re-accretion of debris left over when an earlier moon in its orbit was collisionally shattered (e.g., McKinnon 1988; Marzari et al. 1998; Movshovitz et al. 2015). Although this theory remains the most favored in discussions of Miranda's formation, some authors (e.g., Marcialis & Greenberg 1987; Janes & Melosh 1988; Pappalardo et al. 1997; Hammond & Barr 2014) have suggested that other mechanisms (such as convection-driven resurfacing) could create the observed features, removing the need for their cataclysmic origin.

The families of irregular satellites in orbit around the outer planets are another example of the effects of giant collisions. In much the same way as the asteroid families mentioned in Section 4.1.2, distinct families of irregular satellites can be found around the giant planets, each of which is thought to be linked to the collisional disruption of a larger parent object, followed by gentle orbital dispersion (e.g., Sheppard & Jewitt 2003; Nesvorný et al. 2004), a finding supported by recent cladistical analysis of the satellite systems (Holt et al. 2018). Despite this collisional erosion of the irregular satellite population, it has been noted that the number of irregular satellites larger than a given size appears to be roughly constant between the outer planets (e.g., Jewitt & Sheppard 2005). Jewitt & Haghighipour (2007) present a detailed review of our understanding of these unusual objects, and describe the various models that have been proposed for their origin in some detail. Future observations of these satellites, and the discovery of further members of the population, will not only help to constrain the mechanism by which they were captured, but will also help to constrain the exact processes involved in the formation of their host planets themselves.

Neptune's gigantic irregular satellite Triton is an oddity among the planetary satellites. The seventh largest satellite in the solar system (with a diameter of 2706 km), it is by far the largest and most massive irregular satellite known, with a diameter an order of magnitude larger than Saturn's Phoebe, the next largest irregular. It also orbits remarkably close to its host planet, compared to the other irregulars, with an orbital radius of 3.5 × 105 km placing it slightly closer to Neptune than our Moon is to the Earth (by contrast, the next closest irregular satellite known is Uranus' Francisco, with an orbital radius of just over 4 × 106 km). Triton's unusual properties mark it as having a markedly different origin to the other irregular satellites, and many theories have been suggested over the years to explain its current orbit. It is noteworthy that, aside from Triton, Neptune lacks any sizable satellites—of the thirteen moons in the Neptunian system, Triton contains over 99% of the total mass. The second largest of Neptune's moons, Proteus, has dimensions of just 424 × 390 × 396 km (Stooke 1994), and the total mass of the regular satellite system of Neptune is far, far smaller than that of any of the other giant planets. In addition, the orbit of the irregular satellite exterior to Triton, Nereid, is remarkably eccentric.

All this evidence points to an unusual and possibly catastrophic origin for Triton (e.g., Banfield & Murray 1992). It is widely acknowledged that Triton is most likely a captured object, rather than one which formed in its current orbit, but the mechanism by which it was captured is still under some debate. Early theories (e.g., Lyttleton 1936b) attempted to link the origin of Triton to that of Pluto. This was based on the observational similarities of the two objects, the fact that Pluto's orbit crosses that of Neptune, and on estimates that both objects were of order 1/10th the mass of the Earth. After a lengthy discussion of hypothetical encounters between Neptune, Pluto and Triton, Lyttleton (1936b) highlighted a scenario by which such an encounter could capture Pluto to a prograde orbit around Neptune while simultaneously causing the direction of Triton's orbit to reverse, and also become prograde. Noting that dynamics is a time-reversible process, Lyttleton then proposed that Triton and Pluto could initially have been large prograde satellites of the giant planet. At some point, such an encounter, operating in reverse, could have occurred, leading to the ejection of Pluto to its current orbit, and could have been the origin of Triton's current retrograde orbit. The idea that Pluto was once a Neptunian satellite has played an important role in many theories of the origin of Triton—from the proposed disruption of the Neptunian satellite system by an encounter with a massive trans-Neptunian planet (Harrington & van Flandern 1979), to suggestions that Triton was an unbound interloper that was captured by Neptune through a close encounter with proto-Pluto (suggesting also that the approach was so close that the proto-Pluto was disrupted in the process, leading to the formation of its largest moon, Charon; Farinella et al. 1979). It should be noted, however, that none of these theories are currently considered as serious answers to the question of either Triton or Pluto's origin—but they still serve to highlight how our knowledge of Pluto and Triton has changed over the past century, and how those changes have informed our theories on their origins.

As our knowledge of both Triton and Pluto increased, it was realized that their masses were so low that such scenarios were implausible, unless the two objects had been involved in an actual physical collision (e.g., McKinnon 1984). Once this was realized, studies instead typically concentrated on mechanisms through which Triton could be captured alone—although some authors (e.g., Goldreich et al. 1989; Woolfson 1999, 2013) continue to consider the statistically improbable idea that Triton was captured as the result of such a giant collision. The model currently accepted by the majority of researchers, however, considers that Triton was likely a binary dwarf planet (like Pluto, or Eris) that approached Neptune sufficiently closely that it was captured by the planet as a result of the disruption of the binary pair—with the other member of the system continuing on an unbound orbit (e.g., Agnor & Hamilton 2006). Rufu & Canup (2017) considered the effect that such a capture event would have on a pre-existing satellite system, and found that the interactions between a newly captured Triton and a satellite system comparable to that seen orbiting Uranus could readily result in a system like that seen around the modern-day Neptune. In addition, they note that the interactions involved in such a scenario offer a mechanism by which the orbital eccentricity of Triton could be reduced on a timescale short enough to prevent the ejection of satellites like Nereid, which might otherwise be lost from the system as a result of scattering by Triton should its orbit solely be circularized by tidal interactions with Neptune.

4.2. The Impact Rate of Smaller Bodies in the Solar System

At sizes smaller than those at which the impactor would destroy the target body, impacts become ever more frequent. While the epoch of giant collisions, at least from the point of view of the planets, passed in the early days of the solar system, smaller impacts have continued right up to the current day. Although such impacts are no doubt of interest when one considers the potential habitability of Earth-like planets (e.g., Horner & Jones 2008a, 2010a, 2011, 2012), the effects of such impacts, taken individually, would be so minor as to be undetectable in other planetary systems. Indeed, the first confirmed impacts to be observed on any solar system body only came in the last thirty years43—the Shoemaker–Levy 9 impacts on Jupiter in 1994 (e.g., Hammel et al. 1995; Asphaug & Benz 1996), the two further impacts upon that planet observed in 2009 and 2010 (e.g., Sánchez-Lavega et al. 2010; Orton et al. 2011), and the impact of meteoroids observed on the surface of the Moon (e.g., Bellot Rubio et al. 2000; Ortiz et al. 2002; Madiedo et al. 2014, 2019).

Despite this, the study of these impacts in our solar system has provided a wealth of information on the processes by which the objects within formed and evolved. Studies of the chemistry and origin of meteorites reveal the timescales on which asteroids accreted and differentiated (e.g., Whitby et al. 2000; Kleine et al. 2002, 2005), as well as providing strong evidence that our solar system was seeded with short-lived radionuclides by a nearby supernova at some point during its formation (e.g., Lattimer et al. 1978; Hoppe et al. 1996; Yin et al. 2002; Mostefaoui et al. 2005). In addition to these results, which clearly provide valuable data to be incorporated in models of planetary formation, the two main areas of small-impact research that probably have the most impact on our studies of exoplanets are the proposed Late Heavy Bombardment of the terrestrial planets, and the suggested role of giant planets in shielding terrestrial planets from potentially catastrophic fluxes of small bodies.

4.2.1. The Late Heavy Bombardment

The theory of the Late Heavy Bombardment suggests that, approximately 700 million years after the formation of the solar system, there was a dramatic increase in the rate at which the terrestrial planets were bombarded by asteroids and comets. Evidence for the theory first came from the analysis of impact melt dates in Apollo Lunar samples (Tera et al. 1974) that showed a predominance of dates in the range 3.8–4.0 Gyr which were interpreted as a "Terminal Lunar Cataclysm." This cataclysm hypothesis was, however, criticized as "a misconception" by Hartmann (1975). The controversy has continued ever since.

Supporters of the cataclysm hypothesis (e.g., Ryder 1990, 2002; Stoffler & Ryder 2001) derive dates for the best-studied major Lunar impact basins in the very tight range from 3.85 Gyr (for Imbrium) to 3.92 Gyr (for Nectaris). Since relative dating of Lunar basins based on stratigraphic analysis (Wilhelms 1987) places 11 of the major basins at ages between Nectaris and Imbrium, this implies a very short period of intense bombardment. They also argue (Ryder 2002) that extrapolating the impact rates back to the Moon's formation, as a steady decline, would imply accretion of more than a Lunar mass of material.

However, an alternative view is that the impact history of the Moon is consistent with a steady decline of impact rates with time. Advocates of this view (e.g., Neukum et al. 2001; Hartmann 2003) argue that the Lunar surface record obtained from the Apollo and Luna missions (limited to the equatorial near-side) is strongly biased by major events such as the Imbrium impact, and that the record of older impacts has been largely erased by subsequent events.

To remove the selection effects that result from the limited coverage of the Lunar surface, Cohen et al. (2000) analyzed Lunar meteorites. They found a range of impact melt ages with no sharp impact spike, but with none earlier than 3.92 Gyr, which was argued as supporting the cataclysm hypothesis. Support for the cataclysm hypothesis has also been reported from studies of other solar system bodies. Kring & Cohen (2002) used analysis of meteorites to determine that main belt asteroids were heavily cratered ∼3.9 Gyr ago. The only ancient Martian meteorite (ALH 84001) shows a shock event at 3.92 Gyr (Turner et al. 1997) and a thermal event at ∼3.9 Gyr has been recorded in Hadean zircons on Earth (Trail et al. 2007). While these results are consistent with a cataclysmic bombardment throughout the inner solar system, the evidence for a strong spike in impacts rests on the Lunar record.

A key to the cataclysm interpretation is the ages for the older Lunar basins, Serenitatis and Nectaris, which are given as 3.89 and 3.92 Gyr by Stoffler & Ryder (2001). However, both these ages are controversial. Spudis et al. (2011) has concluded that the Apollo 17 impact melts used to date Serenitatis are not, in fact, from Serentatits which means the basin could in fact be much older. Norman et al. (2010) argue that the Apollo 16 Descartes terrain often considered to be ejecta from Nectaris are in fact ejecta from the Imbrium basin, allowing a much older age for Nectaris. Older ages for Nectaris and Serenitatis would weaken the case for a Lunar cataclysm. The controversy may not be resolved until future Lunar missions are carried out that would enable better age determinations for the older basins, e.g., the proposed MoonRise sample return mission to the South Pole-Aitken basin (Jolliff et al. 2012).

So while it is well established that there was a high impact rate on the Moon at around 3.8–3.9 Gyr, it remains somewhat uncertain as to whether this represents a cataclysmic spike, the tail of a steady decline or some more complex history, at least as far as the Lunar sample record is concerned. Bottke et al. (2007) have analyzed the problem from the point of view of solar system dynamics. They argue that the occurrence of major impact events as late as Imbrium and Orientale (the last major Lunar basin forming impacts at ∼3.8 Gyr) is inconsistent with being the tail of a steadily declining population of objects left over from the formation of the solar system. This is true whatever is assumed about the initial population of objects, since with a large population of objects, collisions between the objects themselves depletes the population of the larger impactors.

Building on that work, the excellent review by Bottke & Norman (2017) describes the current state of the art of research into the true nature of the Late Heavy Bombardment, and the overall decay in the scale and frequency of impacts across the inner solar system during the first billion years of its life. They find that the evidence for a "strong version of the Terminal Cataclysm hypothesis" is now far weaker than it was in the past, but note that both the Imbrium and Orientale basins were formed during the short time interval spanned by that hypothesis. Based on the latest data, they propose a hybrid, two-phase model for the late bombardment of the inner solar system. First, in the very early stages of planet formation, there was a very intense phase of bombardment, lasting until ∼4.4 Gyr before the present day. That phase was driven by the cleanup of left-over planetesimals from the formation of the terrestrial worlds. A second, prolonged phase of intense bombardment followed somewhat later, beginning between 4.2 and 4.0 Gyr before the present day—a phase that included the impacts that created the Imbrium and Orientale basins, and might well have extended for well over a billion years (e.g., Bottke et al. 2012). Rather than being the result of the clean-up phase of terrestrial planet formation, that secondary period of intense bombardment was the result of the migration, and potential dynamical instability, of the solar system's giant planets—as we will discuss in more detail in Section 4.3.2. In the conclusion of their work, Bottke & Norman suggest several regions for future research that should help to shed further light on the nature of the Late Heavy Bombardment, and it seems likely that the coming decade will yield fresh insights into the processes that sculpted the face of solar system as we see it today.

4.2.2. Do Giant Planets Offer Shielding to Terrestrial Worlds?

As described earlier, there are three main types of potentially hazardous objects in our own solar system, each of which finds its origin in one of the three main reservoirs of dynamically stable objects. The near-Earth asteroids originate within the Asteroid belt (e.g., Binzel et al. 1992; Bottke et al. 2000; Morbidelli et al. 2002a; Dunn et al. 2013), between the orbits of Mars and Jupiter, and are injected into the inner solar system, after being created in collisions between larger bodies, primarily by distant perturbations from the planets Jupiter and Saturn. The short-period, or Jupiter-family, comets are continually replenished from the Centaur population (e.g., Horner et al. 2003, 2004a, 2004b; Tiscareno & Malhotra 2003; Volk & Malhotra 2008; Grazier et al. 2018, 2019; Peixinho et al. 2020), which are themselves sourced from objects in the trans-Neptunian region (e.g., Holman & Wisdom 1993; Duncan & Levison 1997; Emel'yanenko et al. 2005; Volk & Malhotra 2008), though a significant contribution to that population may originate among the Neptunian and Jovian Trojan clouds (e.g., Marzari et al. 1995b; Horner & Evans 2006; Horner & Lykawka 2010a; Horner et al. 2012c). The third population of potential impactors, the Oort cloud comets, are sourced from the Öpik–Oort cloud (e.g., Öpik 1932; Oort 1950), and are typically thrown into the inner solar system by the gravitational influence of passing stars and the galactic tide (e.g., Hills 1981, Biermann et al. 1983; Heisler et al. 1987; Matese & Lissauer 2004; Feng & Bailer-Jones 2015; Vokrouhlický et al. 2019), although it has been suggested that some fraction might well be sent inwards as a result of perturbations by a Jovian-mass Solar companion (e.g., Matese et al. 1999; Horner & Evans 2002; Matese & Whitmire 2011), as we will discuss in more detail in Section 4.5.1.

In the case of the near-Earth asteroids and short-period comets, despite the presence of relatively populous reservoirs of parent objects (the asteroid belt, and trans-Neptunian disk, respectively), the number of objects being moved from orbits in those regions to orbits in the inner solar system (where they can imperil the Earth) would be minimal were it not for the perturbative influence of the giant planets. For this reason, it is immediately apparent that the presence of giant planets can play a significant role in determining the frequency with which debris rains down upon the terrestrial planets. In other words—the presence of a massive debris disk around a given star should not be taken, per se, as evidence that the frequency of collisions on a planet in a different part of that planetary system would be prohibitively high for life to develop. If no giant planets were present in that system, then no mechanism would exist to effectively transfer material from that disk to a potentially habitable planet elsewhere in the system, and so one might instead expect the impact regime on such a world to be relatively peaceful, despite the presence of a massive disk.

Despite the clarity of such logic, it has historically been widely taught that the planet Jupiter acts as a giant shield, significantly lowering the rate at which asteroids and comets collide with the Earth. Without Jupiter, the argument goes, significant impacts would be sufficiently frequent on the Earth that the development of life would be stymied, or even prevented absolutely (e.g., Ward & Brownlee 2000). The origin of this idea is unknown, but it likely has its roots around the time that craters on the surface of the Earth were finally acknowledged as being the result of impacts—an idea that was not fully accepted until the 1960s (e.g., Chao et al. 1960; Shoemaker 1963; Shoemaker & Chao 1961; Bjork 1961).

Given that weathering removes all but the largest craters on relatively short geological timescales, it was obvious that most craters could not be ancient features, but had instead been created by impacts in the relatively recent past. At that time, very few short-period comets, and even fewer near-Earth asteroids were known. The obvious culprits, therefore, were the long-period comets, objects sourced from the Oort cloud. Such comets are only tenuously bound to the Sun, with their aphelia located at enormous distances (typically tens of thousands of au), and as such only need to experience a very small perturbation in order to become unbound, and be ejected from the solar system forever. In most cases, the planet that ejects comets in this way is Jupiter, and so therefore it seems an obvious conclusion that, if the planet were not there, far more of these objects would survive to pass through the inner solar system multiple times, and therefore the impact rate at the Earth would be much higher. The idea that Jupiter has shielded us is now firmly entrenched in both the public and scientific consciousness. By extension, it is often argued that giant planets are a necessity in order to shield Earth-like planets from impacts for those planets to be habitable (as in, for example, Greaves 2006, who suggests that the lack of a Jupiter-type planet in the Tau Ceti system has resulted in a significant amount of debris being present that would threaten any Earth-like planets therein).

Given the pervasiveness of this idea, it is somewhat surprising that, until recently, very little work had been done to examine it in any depth. The first study of any note into the question of planetary shielding was carried out by Wetherill (1994), who concluded that, in planetary systems containing only "failed Jupiters" (i.e., planets which only manage to reach the mass of Uranus or Neptune, rather than that of a Jupiter), the impact flux experienced by an Earth-like planet would be of order a thousand times higher than that experienced in our own solar system. This would be a direct result of the reduced efficiency with which small objects would be ejected from such a system, allowing them to remain on planet-crossing orbits for a much longer period of time. However, in order for a given object to pose a threat to an Earth-like planet, it must first be emplaced on an orbit that crosses that of the planet. The dynamical lifetime of such objects is relatively short, and so in order for small bodies to pose an on-going threat, their numbers must continually be replenished—a question which was not addressed in that early work.

Laakso et al. (2006) approached the question of planetary shielding from a different angle. Using numerical integration, they examined the effect of the position and mass of a Jovian planet on the rate of ejection of particles that were initially placed on eccentric orbits that crossed the classical habitable zone from the start of the integration. Using our solar system as a test case for their method, they found the surprising result that Jupiter "in its current orbit, may provide a minimal amount of protection to the Earth." Unfortunately, again, their work did not consider the transfer of material from non-threatening to dangerous orbits, but it did provide the first hints that something might be a little amiss with the canonical view of Jupiter's role.

Our current understanding of the various populations of bodies that can threaten the Earth is vastly different to that which was present around the time that the idea the Jupiter is a shield caught on. Rather than the long-period comets being the main source of impact threat to the Earth, it is now believed that the Near-Earth Asteroids constitute the great bulk of the impact risk, with the comets contributing just a few percent of the total risk (e.g., Chapman & Morrison 1994; Bottke et al. 2002). While it is still true that Jupiter plays a significant role in removing the near-Earth asteroids from threatening orbits, it is also well known that it is the main source of fresh asteroidal material in the inner solar system (e.g., Morbidelli et al. 2002a, 2002b). In the case of the short-period comets (often known as the Jupiter-family), the situation is equally confused—the great majority of such comets make their way onto Earth-crossing orbits as a result of Jupiter's perturbations, and so any conclusions as to the true level of shielding offered by the giant planet must take into account the efficiency with which it creates new threats, as well as the ease with which it removes them.

In order to study this problem, a series of detailed dynamical studies have been carried out (Horner & Jones 2008a, 2008b, 2009, 2010b, 2012; Horner et al. 2010) that examine the influence of Jupiter on the three populations of potentially hazardous objects in our own solar system—namely the near-Earth asteroids, the short-period comets and the long-period comets. The results are startling. In the case of the near-Earth asteroids, it turns out that the Earth actually experiences a greater number of impacts with our Jupiter than it would with no Jupiter at all (see Figure 14, below). However, if Jupiter were just a quarter of its current mass, the impact rate of near-Earth asteroids would be far higher than in the other two cases. This rise and fall of the impact rate as the mass of Jupiter increases is found to be a direct result of the changing location of the ν6 secular resonance, which can drive asteroids trapped within into the inner solar system on astronomically short timescales. In our own solar system, this resonance, although strong, is located almost at the inner edge of the asteroid belt, where very few objects can fall prey to its perturbative influence. When Jupiter is less massive, however, the resonance is located farther from the Sun, and is also somewhat broader, and can therefore influence a greater fraction of the total mass of the asteroid belt. The "worst case" scenario occurs when Jupiter is around a quarter of its current mass (similar in mass to the planet Saturn), at which point the resonance is both broad and strong, and is located in the middle of the asteroid belt, ideally located to disturb the orbits of the maximum number of asteroids.44

Figure 14.

Figure 14. The impact rate experienced by the Earth as a result of near-Earth asteroids (left) and Jupiter-family comets (right), as a function of Jupiter's mass, based on the simulations described in Horner & Jones (2008a, 2009). In both cases, the flux has been normalized to the highest flux experienced by the Earth through the course of those simulations. For both the near-Earth asteroids and Jupiter-family comets, the impact flux Earth would experience without Jupiter is less than a quarter of that we see in the simulations that feature Jupiter as we know it in the solar system. The impact flux due to both population peaks for a "Jupiter" somewhat less massive than our solar system's Saturn, before falling away again—although in both cases, the flux experienced in our solar system still exceeds that experienced when Jupiter is absent from those simulations.

Standard image High-resolution image

For the short-period comets, a similar pattern emerges, albeit for different reasons—as can be seen in Figure 14. At low Jupiter masses, the planet is unable to either efficiently eject comets from the solar system or emplace them on Earth-crossing orbits. At high masses, the planet can very efficiently emplace comets on Earth-crossing orbits, giving them the opportunity to collide with our planet. However, it can also easily remove them from the inner solar system and eject them, so any given comet will most likely have relatively few opportunities to hit the Earth before being removed again. In both these cases, then, the impact flux experienced by Earth is relatively low. Once again, there is a "worst case" scenario, however. Remarkably, this "worst case" again occurs when the Jupiter-like planet has around a quarter of the mass of our Jupiter. At that mass, the planet can efficiently emplace comets on to Earth-crossing orbits. However, it is still of sufficiently low mass that it is very difficult for it to eject those comets from the solar system entirely. As such, any comets placed on Earth-crossing orbits by the giant planet at that mass will have plenty of opportunities to collide with the Earth before they are removed, and so the impact flux reaches a peak. Beyond that peak, as the planet's mass grows larger, objects are ever more efficiently ejected, causing the time spent on Earth-crossing orbits to fall, thereby reducing the impact rate.

In the case of the long-period comets, it seems that the long-held belief holds true—Jupiter does indeed act as a shield. As its mass increases, it becomes ever more efficient at perturbing loosely bound comets in such a way that their orbits become unbound, and they head out of the system, never to return. A more detailed study of the effect of giant planets on the long-period comet flux (Lewis et al. 2013) studied the evolution of cometary orbits from the initial population of the Oort Cloud by the giant planets through to their eventual re-injection and ejection from the inner planetary region. That work once again highlighted the role of giant planets (of mass comparable to, or greater than, Saturn) in ejecting potentially Earth-crossing comets from the system, ensuring that the chance of them colliding with a terrestrial planet is minimized. Once again, for long-period comets, it seems that Jupiter (and Saturn) truly act to reduce the impact threat posed to the Earth.

Although a significant amount of further research into the question of the shielding offered by giant planets is needed before any definitive answer is reached as to the true balance between their roles as friend and foe, it is clear from these studies that, at the very least, the situation is significantly more complicated than had previously been thought (e.g., Grazier 2016; Grazier et al. 2018). Rather than being the benign and beneficial influence on our planet's development that has long been assumed, it seems that Jupiter might actually be more of an enemy than a friend!

4.3. The Migration of the Giant Planets

The idea that giant planets migrate during the process of their formation and evolution is now well accepted. In exoplanetary systems, key evidence for this behavior includes the hot Jupiters (e.g., Mayor & Queloz 1995; Johnson et al. 2006; Vogt et al. 2010; Winn et al. 2010; Bryan et al. 2012; Becker et al. 2015), and a number of dynamically compact systems with planets trapped on mutually resonant orbits (e.g., Johnson et al. 2011; Lissauer et al. 2011; Robertson et al. 2012a, 2012b; Wittenmyer et al. 2012b; Gillon et al. 2017). Such migration is the most common explanation for all giant planets discovered orbiting closer to their host stars than the location of the ice line, with the idea being that the planet first accreted beyond that point in its host system before careening inwards during the latter stages of its formation (e.g., Kuchner & Lecar 2002; Masset & Paploizou 2003; Crida & Morbidelli 2007; Beaugé & Nesvorný 2012)—although in recent years, alternative mechanisms have been proposed to explain at least some of the known short-period giant planets (e.g., Batygin et al. 2016; Bailey & Batygin 2018; Dawson & Johnson 2018). It has even been suggested that such inward migration might often lead to the in-fall of one (or more) giant planets to the atmosphere of a youthful star, polluting that star's atmosphere with the heavy elements accreted, and so artificially inflating the observed metallicity (e.g., Pasquini et al. 2007). A planet migrating in such a fashion might also drag with it vast quantities of volatile material from beyond the ice-line, leading to the accretion of "water-worlds," which might potentially be habitable, in the giant planet's wake (e.g., Fogg & Nelson 2007a, 2007b, 2009; Izidoro et al. 2014b; Darriba et al. 2017).

Within our own solar system, too, evidence abounds for the post-formation migration of the giant planets. However, while currently migration scenarios involving exoplanets typically only discuss inward migration, within our own solar system there is a significant body of evidence that argues for the outward migration of Neptune, and potentially Uranus, leading to the dynamical relaxation of the outer solar system. Indeed, it is currently thought that Neptune migrated over a distance of at least 7 au, and potentially as much as 15 au outward after its formation, doubling the radial extent of the planetary component of our solar system.

4.3.1. Evidence for Migration—The Small Bodies

The stable reservoirs of small bodies within the solar system contain a great deal of information pertaining to the formation and migration of the giant planets, with their current distributions the direct result of the gravitational sculpting they experienced during that period. The main belt asteroids, for example, are distributed across a wide range of orbital eccentricities and inclinations—plots of their locations in a:e and a:i space (as shown in Figure 2) reveal a great deal of intricate structure within the belt, from the gravitationally cleared locations of MMRs with Jupiter to the separation of a small population interior to the main body of the belt from the rest of the population. The locations of the key mean-motion and secular resonances which have sculpted the belt are nicely portrayed in the first three figures of Nagasawa et al. (2000), who discuss whether the current level of excitation observed within the asteroid belt could be the result of secular resonances sweeping through the region as the Solar nebula became depleted.

In other studies, the role of such sweeping resonances as a result of planetary migration is considered pivotal in creating the excited distribution we observed today (e.g., Gomes 1997; Minton & Malhotra 2009). Indeed, Minton & Malhotra (2011) used the distribution of the eccentricities of the main belt asteroids to place constraints on the rate at which Saturn migrated. At the current epoch, the ν6 secular resonance, driven by Saturn, lies at the inner edge of the asteroid belt (for orbits with zero inclination), and delineates the high-inclination boundary for main-belt asteroids at greater heliocentric distances (see e.g., Figure 3 of Nagasawa et al. 2000). In order for the asteroid belt to have survived the inward migration of Saturn, as the secular resonance swept through it, Minton & Malhotra (2011) propose that Saturn must have migrated at a speed of at least 0.15 au Myr−1—although they note that this migration limit is a function of Saturn's orbital eccentricity. If Saturn's eccentricity was smaller at the time of its migration, then slower migrations could have occurred without overly disrupting the asteroid belt.

The Jovian Trojans, too, are thought to be a direct result of planetary migration (Morbidelli et al. 2005; Lykawka & Horner 2010; Nesvorný et al. 2013; Pirani et al. 2019)—the source of their dynamically hot distribution (with some objects exhibiting inclinations up to 55° ((83483) 2001 SC93) and eccentricities as high as 0.272 ((5144) Achates)). Interestingly, models describing the capture of the Jovian Trojans during planetary migration suggest that objects would be captured with a wide range of dynamical stability, from those that are stable on timescales far longer than the age of the solar system to those that remain Trojans for just tens, or hundreds, of years. As such, even four billion years after their capture, objects should still dynamically bleed from the Jovian Trojan clouds to the rest of the outer solar system, as the less stable objects are gradually "shaken free" by the distant perturbations of the planets (e.g., Levison et al. 1997; Di Sisto et al. 2014, 2019; Holt et al. 2020). Recent dynamical simulations have revealed that up to a third of the members of the Jovian Trojan swarms could be unstable (e.g., Di Sisto et al. 2014), while at least one Jovian Trojan exhibits dynamical instability on Gyr timescales (Horner et al. 2012c). That object, (1173) Anchises, was found to have a dynamical half-life of order 619 million years, which is entirely compatible with it being an unstable Trojan captured during planet formation, rather than a recent entrant to the cloud. The Jovian Trojan population might even contain the key to tying down the timing of the migration of the giant planets, with recent simulations of the binary Trojans Patroclus–Menoetius suggesting that the migration of the giant planets must have occurred very early in the solar system's history, prior to the Late Heavy Bombardment (Nesvorný et al. 2018). That work suggests that the Patroclus–Menoetius binary formed in a thick planetesimal disk beyond the orbit of Neptune, and argues that it must have been ejected from that disk, and captured to Jupiter's Trojan population, at a very early stage in the planet formation process, otherwise it would have been disrupted through encounters within that disk.

The Neptunian Trojans, like the Jovian Trojans, have an orbital distribution far too dynamically excited to be considered pre-formed objects. In the past decade, a number of studies have been carried out into the formation and evolution of these objects, revealing that they were most likely captured during Neptune's outward migration (e.g., Lykawka et al. 2009, 2010; Nesvorný & Vokrouhlicky 2009; Horner & Lykawka 2010b, 2012; Parker 2015; Gomes & Nesvorný 2016). Just as for the Jovian Trojans, such a scenario would suggest that there remain unstable members of the Neptunian Trojan population at the current day, gradually bleeding from that population to contribute to the Centaur population, the dynamically unstable parents of the Jupiter Family comets (e.g., Horner & Lykawka 2010a, 2010d). Indeed, recent studies have revealed that two of the known Neptunian Trojans may well be such objects, namely 2001 QR322 (Horner & Lykawka 2010c) and 2008 LC18 (Horner et al. 2012b), with two additional objects (2004 KV18 and 2012 UW117) proving in actuality to be recently captured interlopers to that population (Horner et al. 2011; Wu et al. 2018).

The orbital distribution of objects in the trans-Neptunian region, too, carries a wealth of information on the precise nature of that planet's migration (e.g., Chiang & Jordan 2002; Hahn & Malhotra 2005; Murray-Clay & Chiang 2005; Nesvorný 2015; Gomes et al. 2018; Volk & Malhotra 2019). The Plutino population, trapped within the 3:2 Neptunian MMR, are believed to have been captured and transported as that resonance migrated outward with the planet. Once objects were trapped in the resonance, their orbital eccentricities and inclinations were pumped as they were pushed outwards by the moving resonance, resulting in the distribution observed today (e.g., Malhotra 1993, 1995; Lykawka & Mukai 2008; Levison et al. 2008). The farther Neptune migrated, the more extreme the inclinations and eccentricities acquired would be, and so it is in theory possible to determine, from the Plutino distribution, the precise scale of Neptune's outward motion. That outward migration also played a significant role in sculpting the dynamically stable Edgeworth–Kuiper Belt (e.g., Chiang & Jordan 2002; Hahn & Malhotra 2005; Petit et al. 2011; Nesvorný 2018; Volk & Malhotra 2019), driving the observed asymmetric distribution of objects trapped in Neptune's 2:1 MMR (Murray-Clay & Chiang 2005), and resulting in the excited yet stable population we observe today (e.g., Gladman et al. 2012).

4.3.2. Migration and the Late Heavy Bombardment

In Section 4.2.1, we discussed the possibility that the inner solar system was subject to a period of enhanced impact rates, known as the Late Heavy Bombardment, several hundred million years after the formation of the solar system. While there remains some debate over the true nature and duration of that bombardment, there remains evidence that the impact rate was markedly elevated during that time—with the formation of Imbrium and Orientale basins at around 3.8–3.9 Gyr seemingly inconsistent with the idea that the impact rate was solely due to the steady depletion of a population of impactors dating from the solar system's formation (e.g., Bottke et al. 2007; Bottke & Norman 2017).

Given the ample evidence for the migration of the giant planets detailed in the previous sub-section, it is natural to wonder whether such migration could help to explain the origin of the Late Heavy Bombardment. In 2005, a series of papers exploring that reasoning proposed what became known as the "Nice Model" (e.g., Gomes et al. 2005; Morbidelli et al. 2005; Tsiganis et al. 2005). That model suggests that the four giant planets formed in a more compact architecture than that we see today.45 As those planets interacted with the debris left behind from the planet formation process, they migrated, leading to a very gradual relaxation of their orbits—causing them to slowly spread apart. Eventually, after a few hundred million years, that migration caused their orbits to become dynamically unstable, leading to a period of chaotic evolution that destabilized the outer solar system—and in the process led to the injection of large amounts of debris to the inner solar system, causing the Late Heavy Bombardment.

In the Nice Model, the authors suggest that the four giant planets formed in a compact group, with Jupiter and Saturn located closer together than the location of their mutual 2:1 mean motion resonance, and the ice giants packed in relatively closely beyond the orbits of the two largest planets. Beyond the orbit of the outermost ice giant (which could either be Uranus or Neptune, depending on the particulars of the integration considered), there was a massive, thick disk of planetesimals—stretching outward from a distance of approximately 15 au from the Sun (see e.g., Figures 1 and 2 of Gomes et al. 2005). Over a period of several hundred million years, the orbits of the four giant planets relaxed, ever so slowly. The outermost continually stirred the disk of debris beyond its orbit, scattering material sunwards, and migrating outwards in response (as a result of the conservation of angular momentum). The objects injected to the realm of the giant planets were then scattered in the same manner as the modern Centaurs, experiencing stochastic close encounters with the giant planets before finally being ejected from the solar system. The two planets whose orbits lay in the center of the four-planet sandwich only migrate slightly as a result of this, as they can hand objects both inward (to the next sunward planet's control) or outward (to the next planet out)—although both exhibit slight outward migration due to the net exchange of angular momentum through objects passed inward to Jupiter. Jupiter, on the other hand, migrates inward. When Jupiter throws objects into the inner solar system, there is no mechanism by which they can readily be decoupled from the giant planet's influence,46 and so they will eventually once again encounter the giant planet. But when Jupiter ejects objects from the solar system, they clearly never return. As a result, Jupiter gradually loses angular momentum, in exchange with the ejected objects, and migrates sunwards.

Eventually, this gradual migration leads to a cataclysmic period of instability in the orbits of the giant planets. Jupiter and Saturn first approach, then cross, the location of their mutual 2:1 mean-motion resonance. This destabilizes the orbits of the two ice giants, causing them to evolve chaotically, leading to the ejection of at least one of those planets into the bulk of the massive disk in the solar system's outer reaches. That planet would then disrupt that disk, scattering debris in all directions, and sending a large amount of icy material inwards, to impact upon the planets in the inner solar system—all the while experiencing significant dynamical friction from those encounters that would serve to rapidly circularize its orbit, pulling it away from the possibility of further encounters with the other ice giant.

In addition to the destabilization and disruption of the massive debris disk beyond the orbit of the outermost planet, this period of chaotic evolution would also markedly disrupt and deplete the asteroid belt—again showering the inner solar system with potentially deadly debris. At the same time, the destabilization would allow the emplacement of large quantities of icy material to the outer reaches of the main belt—including, potentially, the ice-rich dwarf planet Ceres (e.g., McKinnon 2008; Levison et al. 2009). Eventually, the orbits of the outer planets would settle into the new, stable, relaxed architecture we observe today. The trans-Neptunian region and asteroid belt would have been significantly denuded of material, and the inner planets would have experienced a delayed period of intense bombardment, triggered by the instability in the outer planet's orbits—the Late Heavy Bombardment. Furthermore, the theory does an elegant job of explaining the capture of the Jovian Trojans (e.g., Morbidelli et al. 2005), plundered from the debris liberated from the asteroid and trans-Neptunian belts, and captured to Jupiter's 1:1 mean-motion resonance during the period when that resonance was itself destablized by the resonance-crossing of the two largest planets. As Jupiter and Saturn moved away from mutual 2:1 mean-motion resonance, any objects temporarily captured to Jupiter's 1:1 MMR would be "frozen in," locked into that resonance as it returns to its former extremely stable state. While other models exist that can also explain the origin of the Jovian Trojan population (e.g., Pirani et al. 2019), the Nice Model remains a fascinating insight into the possibilities of delayed dynamical instabilities in planetary systems, and serves as an important reminder that, just because a planetary system appears to be dynamically stable at the current epoch, that does not necessarily mean that it always has been, or always will be.

In the years since the Nice Model was proposed, a wide variety of variants and alternate theories have been proposed, all invoking the dramatic migration and interaction of the outer planets to explain the various features of the modern solar system. Levison et al. (2008) investigate the impact that the chaotic evolution described in the Nice Model would have on the formation and structure of the Edgeworth–Kuiper Belt. They find that, in order to explain why Neptune stopped its outward migration at 30 au, the trans-Neptunian disk must have been truncated at between 30 and 35 au from the Sun—otherwise the outward motion would have continued until the planet reached the outer edge of the disk (e.g., Gomes et al. 2004). Such truncation of the solar system's protoplanetary disk has been proposed and studied before, in the context of the sharp outer edge of the Edgeworth–Kuiper Belt (e.g., Melita et al. 2002; Levison et al. 2004; Lykawka & Mukai 2008), but those works typically invoked an outer truncation distance closer to ∼50 au, rather than the 30–35 au suggested by Levison et al. (2008). Morbidelli et al. (2010) also expanded on the work of the Nice model, showing how the stochastic "jumping" of Jupiter and Saturn that would result from chaotic close encounters with a Neptune mass planet during the period of instability could explain both the depletion and current structure of the asteroid belt. Follow up work (Nesvorný et al. 2013) showed how such a scenario could result in the current orbital distribution of the Jovian Trojans, and note that it could even help to explain the origin of the observed asymmetry between the populations of the leading and trailing Jovian Trojan clouds.

More recently, still more dramatic versions of a chaotic early solar system have been proposed. Some scenarios (e.g., Batygin et al. 2012; Nesvorný & Morbidelli 2012) invoke the idea that the outer solar system might once have contained more than the four giant planets we observe today, and suggest that the additional planet(s) that were once there were either ejected entirely from the system, or were flung onto long-period, distant orbits (e.g., Bromley & Kenyon 2014, 2016; Trujillo & Sheppard 2014; Cloutier et al. 2015; Nesvorný 2018), as we discuss in more detail in Section 4.5.

A particularly interesting model of early solar system chaotic behavior is known as the "Grand Tack Hypothesis," which is based on the idea that a pair of inwardly migrating giant planets can become trapped in mutual mean-motion resonance, as a result of the outer of the two migrating more rapidly, and catching the innermost up. In certain circumstances, interactions between those two planets and the protoplanetary disk within which they are migrating can result in the direction of their migration reversing, with the pair marching back outwards toward their initial formation locations (e.g., Masset & Snellgrove 2001; Morbidelli & Crida 2007).

In the Grand Tack scenario, Jupiter and Saturn formed somewhat closer to the Sun than their current locations (with Jupiter at ∼3.5 au) before initially migrating inwards, toward the Sun. After Jupiter had migrated some ∼2 au inwards, to around 1.5 au, it and Saturn changed the direction of their migration after becoming trapped in mutual mean motion resonance. Thereafter, the two planets moved back outwards to their current locations (Walsh et al. 2011). This "Grand Tack" (Walsh et al. 2012) of the giant planets would have the effect of both stirring and truncating the solid material in the inner parts of the protoplanetary disk—creating conditions similar to the truncated inner proto-planetary disk invoked by Hansen (2009) to explain the origin of the terrestrial planets. It would also act to significantly deplete the asteroid belt, and would also aid in the delivery of volatile material to the inner planets (e.g., O'Brien et al. 2014).

A key and widely studied advantage of this theory is that it offers a mechanism by which the growth of Mars can be truncated, helping to explain why the Red Planet is so much less massive than the Earth or Venus (e.g., Walsh et al. 2011; Brasser 2013; Kobayashi & Dauphas 2013; Izidoro et al. 2015b; Brasser et al. 2017).47 More recent work (Brasser et al. 2016) suggests that the innermost point of Jupiter's grand tack was likely somewhat more distant than previously thought—around 2 au, rather than the 1.5 au suggested by Walsh et al. (2011). In the simulations carried out by Brasser et al. (2016), a tack inward to 1.5 au was found to be strongly incompatible with the formation of the most massive terrestrial planet (the Earth) occurring at 1 au, even though such a tack was capable of truncating Mars' formation as required. By halting Jupiter's tack at a greater heliocentric distance, those authors were more readily able to replicate the modern inner solar system.

One problem with the Grand Tack hypothesis, however, is that the migration and associated disruption of the asteroid belt would occur long before the time of the Late Heavy Bombardment. It should, however, be noted that the Grand Tack model and the Nice Model are not necessarily mutually exclusive. The Nice Model teaches us that a period of instability among the giant planets can occur after a lengthy period of quiescence—so it is, of course, possible that the giant planets experienced both a period of evolution in the earliest stages of the solar system's formation that matched the behavior proposed in the Grand Tack model, and then, later, experienced a delayed period of instability, resulting in the injection of the ice giants (Uranus and Neptune) to a massive trans-Neptunian disk. However, as yet, the two models have not been unified, and there remains extensive debate on the true narrative of planetary migration and the origin of the Late Heavy Bombardment.

In that light, it is interesting to note that a number of studies have suggested that a Nice-model like dynamical instability in the orbits of the giant planets may well have occurred within the first 100 million years of the solar system's history (e.g., Agnor & Lin 2012; Kaib & Chambers 2016; Clement et al. 2018; Nesvorný 2018; Mojzsis et al. 2019; Quarles & Kaib 2019), with some suggesting that the giant planets had reached their current architecture prior to the end of terrestrial planet formation (e.g., Kaib & Chambers 2016). Such studies have attempted to address the potential incompatibility between the late giant planet migration proposed by the Nice Model and the observed orbits of the terrestrial planets, noting that, had the instability between the giant planets occurred after the terrestrial planets were fully formed, then it should have significantly impacted and disrupted their orbits (e.g., Brasser et al. 2013). This situation is far less problematic if the instability happens at an earlier epoch—and several studies have shown that such an instability could provide an additional mechanism by which the growth of Mars could be truncated (e.g., Clement et al. 2018, 2019a), while also offering a potential explanation for the origin of Mercury's unusual orbit (Roig et al. 2016; Clement et al. 2019b), and being able to well reproduce the observed depletion and structure of the asteroid belt (Clement et al. 2019c). The true history of the migration of the solar system's giant planets remains to be determined, and it will be interesting to see how future studies address the apparently dissonant evidence provided by the solar system's small body populations and impact record.

4.4. The Origin of Water on the Earth, and the Hydration of the Terrestrial Planets

One of the ingredients considered most vital for the development and ongoing survival of life is water (e.g., McKay 1991, 2014; Rampino & Caldeira 1994; Chyba & McDonald 1995; Ward & Brownlee 2000; and many others). In fact, the presence of water is considered so important that the classical definition of the "Habitable Zone" around a star is simply the range of distances from the star over which an Earth-like planet would be able to have stable liquid water at some point on its surface (e.g., Hart 1979; Kasting et al. 1993; Horner & Jones 2010b; Kopparapu et al. 2013, 2014, 2016; Zsom et al. 2013; Agnew et al. 2017).

The role of water is considered to go beyond simply providing a medium in which life can develop, and being the key solvent for its survival. It is thought, for example, that if the Earth was desiccated, plate tectonics would not occur, which would remove a key route through which gases that are removed from the atmosphere by weathering, life, and other processes, are recycled from the surface (e.g., Wallace & Hobbs 2006, Section 2.3). It is even argued that the lack of plate tectonics might lead to the collapse of the dynamo that produces the magnetic field that protects our planet (as is the case for both Venus and Mars; e.g., Nimmo & McKenzie 1998; Nimmo 2002; Breuer & Spohn 2003; O'Neill et al. 2007). For a more detailed review of the various influences on planetary habitability, we direct the interested reader to the detailed review of that topic, Horner & Jones (2010b).

Given that water is such an important part of life on Earth, it is somewhat surprising that the origin of that water is still under heavy debate. Our planet formed well sunwards of the location of the ice-line in the solar system, and so temperatures were far too high for water to condense from the Solar nebula to be directly accreted in situ. It seems obvious, then, to conclude that the terrestrial planets should have accreted solely from dry rocks. It has been proposed, however, that a significant amount of water could be trapped in the form of hydrated silicates in that region, which would allow water to be accreted concomitantly with the planet's silicate budget. The apparent paradox of our wet planet, and its hot origins, has prompted significant amounts of debate, and has led to a number of different models being proposed to detail the hydration of the terrestrial planets. These can be broken down into three main types—endogenous hydration, early exogenous hydration and "late veneer" exogenous hydration.

Models detailing the endogenous hydration of the terrestrial planets suggest that the great bulk of their water was accreted from their local environment, with only a small contribution being added from sources farther from the Sun. In such a scenario, the planets are thought to have accreted the bulk of their water in the form of hydrated silicates, which have been shown to be stable at far higher temperatures than those at which water can exist alone as a solid (e.g., Drake 2004, 2005; Asaduzzaman et al. 2015). One problem with such scenarios, which on the surface seem to provide a promising avenue by which terrestrial planets throughout the universe could gain significant amounts of water during their formation, is that there is a well established correlation between the water content of meteorites that fall on the Earth and the region of the asteroid belt in which they originated (with those that originated farther out in the belt having a greater water content than those with origins in its inner reaches). In particular, Morbidelli et al. (2000) detail results for a class of meteorites known as the Enstatite chondrites, which are thought to originate in the innermost region of the asteroid belt. They highlight the fact that these meteorites are the driest of all known meteorites in the solar system (containing only 0.05%–0.1% water, by mass), which suggests that the rocky material which formed interior to the ice-line was unable to hold much in the way of water, and therefore that the amount of water concomitantly accreted during the formation of the terrestrial planets from local materials most likely was insufficient to explain the Earth's water budget.

Because of these problems tying the Earth's water to the materials from which it formed, most researchers now believe that exogenous models of hydration are the best explanation for our water budget. Such scenarios postulate that, rather than having a local reservoir of hydrated material, the water budget of the terrestrial planets was instead sourced from objects that formed much farther from the Sun, beyond the ice-line, and were then flung inwards to impact on the planets. These models are further broken down into two main types—early accretion models, which suggest that the water was pumped into the inner solar system (typically from objects that formed in the outer reaches of the asteroid belt, just beyond the ice line) while the planets were themselves in the process of accreting (e.g., Morbidelli et al. 2000; Petit et al. 2001; Marty 2012). Some of these models require the bulk of the water to have been delivered in a few stochastic giant impacts (similar to those described in Section 4.1; e.g., Marty 2012), a process which has been invoked to explain the fact that the terrestrial planets appear to have significantly different water budgets (with, for example, the Earth being significantly more hydrated than Mars). It has even recently been suggested that the bulk of Earth's water was delivered in the planetary collision that gave birth to the Moon—with Earth's water being delivered by the impacting planet, Theia (Budde et al. 2019). On the other hand, mechanisms have also been proposed that could deliver water in the form of a continuous early rain of icy pebbles, ranging in size from millimeters to tens of centimeters in diameter, that formed through pebble accretion beyond the ice-line, then migrated inward following the ice-line as it, in turn, briefly moved in to heliocentric distances smaller than that of the Earth's orbit (e.g., Sato et al. 2016).

The "late-veneer" exogenous hydration models, by contrast, suggest that the hydration of the terrestrial planets occurred at some point after their bulk accretion was completed (e.g., Chyba 1987; Owen & Bar-Nun 1995). One potential event which could have been the source of this hydration is the Late Heavy Bombardment (e.g., Oberbeck & Fogleman 1989; Wells et al. 2003), which may well have led to a prolonged influx of material from both the outer Asteroid belt, and the trans-Neptunian disk, through the inner solar system (e.g., Gomes et al. 2005). Such a bombardment, even though it would consist of many impacts of smaller bodies, rather than a few giant impacts (as in the early exogenous scenarios) could easily lead to the terrestrial planets having different water budgets, particularly if the impactors were sourced from both the Asteroid belt and the outer reaches of the solar system, as discussed by Horner et al. (2009). Equally, the Grand Tack model, in which Jupiter and Saturn careen inward through the asteroid belt, and then migrate back out to their current locations, could efficiently deliver material from beyond the snow-line to the terrestrial planets (e.g., O'Brien et al. 2018).

One way in which researchers have attempted to differentiate between the various methods suggested for the delivery of water to the Earth is through the study of the ratio between deuterium and hydrogen in water (the D:H ratio). A number of studies (e.g., Drouart et al. 1999; Dauphas et al. 2000; Mousis et al. 2000; Mousis 2004; Horner et al. 2007, 2008; Yang et al. 2013) of the formation of our solar system have suggested that the D:H ratio in the water accreted to icy bodies should vary significantly as a function of heliocentric distance within the Solar nebula—with the lowest values being found near the ice-line (in the outer Asteroid belt), and the largest values occurring in the region beyond Neptune. Clearly, then, one should expect that water delivered from the asteroid belt would have a measurably different D:H signature than that sourced from the outer regions of the solar system—a result supported by observed large D:H values in comets (e.g., Balsiger et al. 1995; Eberhardt et al. 1995; Bockelée-Morvan et al. 1998; Meier et al. 1998; Villanueva et al. 2009), and the significantly lower values measured in meteorites (e.g., Figure 2 of Drouart et al. 1999). One would expect that it would be a relatively simple procedure to compare the different values expected for different objects to that measured in the Earth's oceans, and therefore have an independent test of the origin of the water stored therein. However, the issue is complicated by the fact that the D:H ratio measured in the Earth's water is not solely the product of its origin, but rather has been significantly modified over geological timescales by a variety of processes. This also greatly hinders comparisons between the deuteration of terrestrial, venutian, and Martian water. For simplicity, many authors assume that the initial D:H ratio of water on these planets was identical (e.g., Gurwell 1995; Krasnopolsky et al. 1998; Lécuyer et al. 2000) and then use the observed differences in the deuteration at the current day to draw conclusions on the processes that have occurred on those planets in the intervening time.

Such difficulties are one of the key reasons that the true origin of the Earth's water has not yet been universally agreed, although, at the current time, the most widely accepted scenario seems to be that the bulk of the water was sourced from far beyond the Earth's orbit, with at least some contribution coming as a "late veneer" during the Late-Heavy Bombardment (e.g., Morbidelli et al. 2012).

The situation is further complicated by the fact that Venus is markedly drier than the Earth at the current epoch, while Mars is believed to have once been both warm, and wet (although most of that water is likely still locked up in permafrost across the planet). It seems likely that both those planets have lost significant amounts of water through it being stripped from their atmospheres (e.g., Jakosky & Phillips 2001; Hodges 2002; Lammer et al. 2003; Valeille et al. 2010). In addition, a number of studies have suggested that both Mars and Venus were dessicated as a result of the solar system's early impact regime (e.g., Davies 2008; Kurosawa 2015; Rickman et al. 2019b). While the final narrative of the inner solar system's hydration remains to be determined, it seems increasingly likely that the giant collisions that dominated the latter stages of terrestrial planet formation may have played a vital role in ensuring the disparate volatile budgets of Venus, Earth and Mars (e.g., Lykawka & Ito 2019).

4.5. Planet-mass Objects Beyond Neptune

Ever since Neptune was discovered as a direct result of its perturbations on the orbit of Uranus, the idea that there might be one (or more) planetary body orbiting farther out has repeatedly come into vogue. The fortuitous discovery of Pluto, in 1930, was a direct result of a search for such a body—a search that simply led to the detection of the first trans-Neptunian object. In the early 1980s, studies of the frequency of mass extinctions on the Earth, allied to the discovery of the Chicxulub impact crater in Mexico, believed to be tied to the extinction of the dinosaurs (e.g., Alvarez et al. 1980; Hildebrand et al. 1991; Schulte et al. 2010), led to the idea that such extinctions were occurring with a period of approximately 26 million years (e.g., Raup & Sepkoski 1984, 1986; Schwartz & James 1984). In order to explain the observed periodicity, a number of competing theories sprang up, including the idea that our Sun might have a distant companion, most likely a brown or red dwarf, moving on a highly eccentric orbit that would, at perihelion, bring it through a sufficiently dense region of the inner Oort cloud that it would trigger a severe shower of comets, leading to impacts on the Earth and a mass extinction (e.g., Alvarez & Muller 1984; Davis et al. 1984; Whitmire & Jackson 1984). In the years that followed, however, the theory of "Nemesis" quickly fell by the wayside, with a number of studies casting doubt on the validity of the hypothesis (e.g., Bailey 1984; Clube & Napier 1984a; Hills 1984; Hut 1984; Torbett & Smoluchowski 1984). However, theories of planetary-mass objects beyond the orbit of Neptune continue to crop up in order to explain a variety of features of our solar system, from the population of detached objects beyond Neptune's orbit to observed aphelion asymmetries among dynamically new long period comets. Here, we detail a variety of those theories, showing how observations of the dynamical structure of the outer solar system can allow us to make predictions about what may lie at greater distances.

4.5.1. A Planet in the Oort Cloud—Source of the Long-period Comet Distribution?

While the idea of "Nemesis," a companion star to the Sun responsible for periodic mass extinctions on Earth, has fallen out of favor, there remains beguiling evidence that suggests that the Sun may be accompanied by a large planet, moving in the Oort cloud. Dynamically new long-period comets, making their first pass through the inner solar system having been perturbed from their cold storage in the Oort cloud, should come in from all directions in approximately equal quantities—their aphelia should be isotropically distributed across the sky. In the early 1980s, studies of the distribution of cometary aphelia revealed evidence of the Biermann comet shower (e.g., Biermann et al. 1983)—a clustering of cometary aphelia in a small area of the sky that is accepted as being evidence of the effect of the passage of a star through the Oort cloud a few million years ago. That passage disrupted the orbits of comets in the region affected by the passing star, causing a shower of new comets to fall inward to the inner solar system. The observed clustering represents the tail-end of the shower event.

It is not just close-encounters with low mass stars that can inject significant numbers of comets to the inner solar system. Fouchard et al. (2011) performed simulations of the effect of a wide variety of stellar encounters, and found that the influence of far more distant, massive stars, can be significant. Indeed, they note that single massive stars could cause what they term "comet drizzles"—lengthy periods (of 100 Myr, or more) of enhanced cometary flux from the Oort cloud to the inner solar system. While the impact of close, low-mass stars would be limited to their tracks through the outskirts of the cloud, that of more distant, massive stars can impact the whole of the cloud, moving cometary nuclei onto orbits that are more likely to be injected into the inner solar system by the influence of the galactic tide. It is by enhancing the population of nuclei in this "tidally active zone" that massive stars can generate such prolonged enhancements to the observed long-period comet flux, as those comets proceed to drizzle into the inner solar system for tens of millions of years.

More recent work (Feng & Bailer-Jones 2015) notes that, while the majority of new long-period comets are injected into the inner solar system by the influence of the galactic tide, a non-negligible fraction (>5%) are flung inwards by encounters with passing stars. They also identify past encounters with the stars Gliese 710 and HIP 14473, which they suggest may have contributed up to 8% and 6%, respectively, of the current long-period comet flux. In the coming years, our knowledge of historical stellar close encounters should increase markedly, as a result of the data being returned by the Gaia space telescope (e.g., Gaia Collaboration et al. 2016, 2018; Evans et al. 2018), and it seems almost certain that such work will lead to the identification of the parent of the Biermann shower, and the discovery of additional showers within the ever-growing catalog of long-period comets (e.g., Rickman et al. 2012; Bailer-Jones 2018; de la Fuente Marcos et al. 2018).

In the late 1990s, two groups independently suggested that another signal was becoming visible in the distribution of new cometary aphelia, as the catalog of known dynamically new comets was increasing. Murray (1999) suggested that the observed aphelia of the long-period comets showed an excess at distances between 30,000 and 50,000 au, and that the aphelia within that region were preferentially aligned along a great circle inclined at ∼30° to the plane of the ecliptic. At the same time, Matese et al. (1999) noticed a similar pattern, claiming an excess of aphelia on a different great circle, this time centered around a galactic longitude of ∼135°. Both groups suggested that the alleged asymmetries could be explained by the presence of a planet of around three times the mass of Jupiter, orbiting within the Oort cloud, and perturbing comets therein toward the inner solar system. After an exhaustive study of the biases that affect cometary observations, Horner & Evans (2002) found that the signal proposed by Murray (1999) did not stand up to detailed investigation, being merely the product of the aforementioned biases. However, they found that such biases could not explain the statistically significant feature observed by Matese et al. (1999). In the decade since that work, the number of known comets with well determined orbits has increased significantly, leading Matese & Whitmire (2011) to revisit the earlier work. In that work, they find that the current catalog of known dynamically new comets contains a strong signal showing the perturbations of the galactic tide upon the Oort cloud. On top of this, they find that the population of these objects contains:

"an ≈20% impulsively produced excess. The extent of the enhanced arc is inconsistent with a weak stellar impulse, but is consistent with a Jovian mass solar companion orbiting in the [Outer Oort cloud]."

On this basis, it seemed that the evidence for our Sun having a Jovian companion (an object that would perhaps be classified as an evolved T- or Y-dwarf, if detected floating freely in space, or in orbit around another star—e.g., Liu et al. 2011; Luhman et al. 2011; Burgasser et al. 2012) was growing firmer as more observations were made. However, if such an object exists, then one would expect that it would be discovered during the analysis of data obtained by the Wide-field Infrared Survey Explorer, WISE Wright et al. 2010, since such an object should be relatively bright at the infrared wavelengths studied by that mission. On that basis, Luhman (2014) performed an all-sky search for such an object using images taken by WISE across multiple epochs. Their work found no such object down to a W2 magnitude of 14.5, which they note suggests that no Saturn-mass object exists within 28,000 au of the Sun, nor any Jupiter-mass objects within 82,000 au of our star—though they note that using alternative models for the brightness of Jupiter-mass objects could bring that survey limit down to a range of 26,000 au for an object the size of Jupiter.

It now seems likely that there is no Jupiter-mass companion, lurking in the Oort Cloud—instead, the origin of the observed asymmetries in the long-period comet flux may be the influence of the Galactic Tide itself (e.g., Feng & Bailey-Jones 2014). In the coming years, the number of newly discovered long-period comets should rise considerable, and analysis of their orbits may likely yield a final, definitive answer on the nature and origin of the observed aphelion asymmetries.

4.5.2. A Trans-Neptunian Planet as an Explanation for the Structure of the Trans-Neptunian Population

In the first few years of the 21st Century, observers found growing evidence that the outer edge of the Edgeworth–Kuiper Belt, at about 47 or 48 au from the Sun, was relatively abrupt (e.g., Gladman et al. 1998a; Jewitt et al. 1998; Chiang & Brown 1999; Allen et al. 2001; Trujillo & Brown 2001; Gladman et al. 2001)—a feature that became known as the "Kuiper Cliff" (e.g., Chiang & Brown 1999; Chiang et al. 2003). Such a sharp cut-off was unexpected—indeed, some theories had even suggested that the region beyond 50 au should contain more objects, and more large objects, than the region now recognized as the Edgeworth–Kuiper Belt (e.g., Stern 1996; Stern & Colwell 1997). At the same time, studies of debris discs around nearby stars had revealed them to extend to much greater astrocentric distances around their hosts than the extent of the Sun's Edgeworth–Kuiper Belt (e.g., Smith & Terrile 1984; Backman & Paresce 1993; Dominik et al. 1998; Greaves et al. 1998; Holland et al. 1998; Augereau et al. 1999; Jourdain de Muizon et al. 1999; Schneider et al. 1999; Kenyon & Bromley 2001; see also Matrà et al. 2018).

A number of possible explanations were put forward in an attempt to explain the origin of the Kuiper Cliff. In some scenarios, the proto-planetary disk from which the solar system formed was truncated at ∼30 au, and the current trans-Neptunian objects were swept outward to their current orbits as Neptune migrated to its current orbit (e.g., Ida et al. 2000a; Levison & Morbidelli 2003; Gomes et al. 2004). To explain such truncation, a close encounter (with a pericenter of a hundred or a few hundred au) between the Sun and a nearby star during the solar system's formation in a relatively dense stellar cluster was suggested (e.g., Ida et al. 2000b; Kobayashi & Ida 2001; Levison et al. 2004; Kobayashi et al. 2005), and the discovery of the dwarf planet (90377) Sedna, in 2003 (Brown et al. 2004), appeared to add some weight to that hypothesis (e.g., Kenyon & Bromley 2004; Morbidelli & Levison 2004). It soon became clear, however, that such a solution could not explain all of the observed features of the belt (e.g., Melita et al. 2005).

An alternative explanation for the sculpting of the outer Edgeworth–Kuiper Belt was the presence of an additional planet, beyond the outer edge of the belt. Brunini & Melita (2002) considered the influence that a Mars-mass object at ∼60 au would have on the primordial Edgeworth–Kuiper Belt, which they modeled as extending far beyond the Kuiper Cliff. They found that such a planet would be able to carve a gap in such a disk with an inner edge at ∼50 au—a good match to the location of the Cliff. In a similar manner, Lykawka & Mukai (2008) found that the influence of a planet several times the mass of Mars, ejected by Neptune to an eccentric orbit trapped in an N:1 resonance with the giant planet (where N is an integer >1), could explain the sculpting of the disk's outer edge, the distribution of the excited Classical/Scattered Disk objects, the formation of distant Resonant objects (beyond 50 au), and the presence of the detached objects. They suggested that, if the planet still lurks in the system's outer reaches, it would move on an orbit with inclination between ∼20° and 40°, and a semimajor axis ∼100–200 au, placing it within the reach of detection by contemporary surveys. de La Fuente Marcos & de La Fuente Marcos (2014a) went one step further—following the discovery of the detached, Sedna-like object 2012 VP113 (Trujillo & Sheppard 2014), they suggested that the most likely explanation for the existence of such objects was the existence of at least two trans-Neptunian planets, whose influence was required to shepherd the orbits of the detached objects that had been discovered at that time, building on the single super-Earth proposed by Iorio (2014) to explain the same new discoveries.

The idea of a trans-Neptunian planet has never really gone away, but it has gained further exposure in recent years, following the discovery of 2012 VP113. A series of papers proposed the existence of a distant, Neptune-mass planet (popularly known as "Planet Nine") in order to explain an apparent clustering in the orbits of a number of extremely distant trans-Neptunian objects with a > 250 au, including (90377) Sedna and 2012 VP113 (e.g., Trujillo & Sheppard 2014; Batygin & Brown 2016a, 2016b; Brown & Batygin 2016), a result that gained support by the discovery of additional objects moving on distant, detached orbits within the apparent cluster (Sheppard & Trujillo 2016). Bromley & Kenyon (2016) considered a scenario in which such a planet initially formed in the vicinity of Jupiter and Saturn, and was scattered outwards onto a highly eccentric orbit. They found that, if the protoplanetary disk around the young Sun featured an extended gaseous disk (as is observed for protoplanetary discs around other stars), then the planet could have undergone orbital circularization at a heliocentric distance greater than 100 au as a result of dynamical friction with the gas of that disk. Once the gaseous disk around the Sun was removed, such a planet could sculpt the orbits of objects well beyond Neptune, creating the observed clustering of the extremely distant TNOs (Batygin & Morbidelli 2017). Bailey et al. (2016) studied the manner in which such a distant planet would perturb the orbits of the four giant planets, and found that its influence could be sufficient to explain the observed misalignment (of approximately 6°) between the spin axis of the Sun and the invariable plane of the solar system.

Other explanations have, of course, been advanced to explain the observed clustering in the orbits of these distant objects—including recent work revisiting and reviving the idea that the structure of the solar system's outer reaches might be the result of a stellar fly-by during the system's youth (Pfalzner et al. 2018). Perhaps the most convincing, however, is the suggestion that the observed clustering may be the result of the observational biases that are inherent in the surveys that have discovered those objects (Shankman et al. 2017). In that work, the authors examined the distribution of distant solar system objects discovered by the Outer Solar System Origins Survey, which discovered eight such objects using the Canada–France–Hawaii Telescope between 2013 and 2017. Shankman et al. demonstrated that, while their discoveries might appear to fit the paradigm of a clustered set of orbits, in line with the Planet Nine theory, once the biases that afflicted the survey were properly accounted for, the orbital distribution of their discoveries "is consistent with being detected from a uniform underlying angular distribution." On the other hand, Brown (2017) analyzed the biases involved in observations of Edgeworth–Kuiper Belt objects, finding that such biases were unlikely to explain the alignment of TNOs located beyond a = 230 au.

The "Planet Nine" theory has received widespread attention, and has caught the imagination of astronomers and the wider public alike, even spawning a citizen science search program, through the Zooniverse website48—a search that has yet to bear fruit. Similar searches using data from the WISE and NEOWISE survey (Fortney et al. 2016; Meisner et al. 2018) have failed to find any evidence of the proposed planet—but if it does exist, it seems likely that it will be detected in the coming years.

4.6. The Formation and Evolution of the Oort Cloud

Stretching halfway to the nearest star, the Oort cloud is one of the great mysteries of the solar system. While we know it is there, as a result of the continual flux of dynamically new comets maintaining the long-period comet population (e.g., Öpik 1932; Oort 1950), it is too distant to study directly. Instead, we must learn what we can from the long period comets it sends our way.

As we discussed earlier, those comets are injected from the Oort cloud through the combined effects of the galactic tide, and perturbations from passing stars (e.g., Feng & Bailer-Jones 2015, and references therein), In coming years, it is likely that the combined efforts of the Gaia space telescope (Gaia Collaboration et al. 2016, 2018; Evans et al. 2018), surveying the motions and distances of approximately 2% of all stars in the galaxy, and the next generation of all sky surveys (such as the Large Scale Synoptic Telescope, LSST; LSST Science Collaboration et al. 2009; Ivezić al., 2019), which will discover vast numbers of new comets, will reveal evidence of additional past encounters between the Sun and passing stars, to add to the Biermann comet shower (Biermann et al. 1983). Within a decade, we may even get our first close-up view of a dynamically new comet, passing through the inner solar system for the first time in more than four billion years, as a result of the recently announced Eurpean Space Agency mission "Comet Interceptor,"49 scheduled for launch in 2028.

But what of the Oort Cloud's origins? It is clear that it can not have formed in situ—instead, it is thought to have been created as a side-effect of the accretion of the giant planets, and their subsequent clearing of the outer solar system. As the giant planets accreted, they were embedded in a disk containing icy planetesimals of all shapes and sizes. Close encounters between the planets and the planetesimals would scatter the latter, and as the mass of the planets grew, so too did the efficiency with which such scattering events could eject those planetesimals to highly eccentric, and even unbound, orbits.

The farther an object strays from the Sun, the weaker the Sun's gravitational influence upon it will be. Once a comet is sufficiently distant, the influence of nearby stars and the galactic tide will therefore be strong enough to significantly alter the orbit of that comet. As a result, there are three possible outcomes for a planetesimal ejected by the giant planets to a high eccentricity orbit, that takes them well beyond the orbit of Neptune. The first is that they are either ejected with such great velocity that they move on a hyperbolic orbit, never to return, or are flung sufficiently far (on a nominally bound orbit) that the influence of nearby stars and the galactic tide can decouple them from the Sun's influence. Such objects, clearly, will not become members of the Oort Cloud. At the other extreme, you have those comets whose aphelia are not sufficiently far from the Sun for passing stars and the galactic tide to significantly perturb them. Those objects will return to the regime of the giant planets unperturbed, no doubt to experience further scattering. Between the two extremes, you have a scenario whereby an ejected object obtains sufficient distance from the Sun for its orbit to be strongly perturbed by the influence of the galactic tide and passing stars, without becoming so distant that those perturbations remove it from the solar system entirely. It is these objects that give birth to the Oort Cloud. The perturbations from the galactic tide and nearby stars act to lift the perihelia of those objects out of the region of the solar system dominated by the planets, placing them on orbits where they can remain for billions of years, held in cold storage.

Over the billions of years that follow, the orbits of comets in the outer reaches of the Oort Cloud will slowly become randomized, as a result of their continual stirring by the galactic tide, and periodic nudges from passing stars. From a distribution that would initially resemble the disk from which those comets were sourced (i.e., relatively low inclinations with respect to the plane of the solar system), the Oort Cloud will eventually obtain its current roughly spheroidal shape, with the inclinations of the comets therein wholly randomized.

The outer boundary, beyond which objects would have been effectively removed from the Sun's influence, would have been closer to the Sun if our birth cluster was particularly massive and dense (with more stars forming closer to the solar system), and at a greater distance if the solar system formed in a relatively loose cluster or stellar association. As noted at the start of Section 4, evidence gathered from meteorites suggests that our birth cluster was relatively massive, containing at least 2000 stars (e.g., Pfalzner et al. 2015). Such a tight cluster is invoked in order to explain the abundance of 26Al in the early solar system, which it is argued was provided by the wind of a nearby massive star.

Dones et al. (2004) provide an excellent review of theories describing the formation and evolution of the Oort Cloud. They note that, on the basis of dynamical simulations, Uranus and Neptune likely dominated the emplacement of cometary material to the Oort Cloud—see their section four for a full overview of studies of the Oort Cloud's formation prior to that date. However, more recent work has reopened the question of the primary source of material to the primordial Oort Cloud. Brasser et al. (2012) performed detailed numerical simulations of the formation of the Oort, taking account of the influence of the other stars in the Sun's birth cluster. They found that the inner Oort cloud was primarily populated by comets ejected by Jupiter and Saturn, with the pericentres of those comets then being raised by encounters with passing stars to decouple them from the planetary domain of the solar system. As such, the question of the "planetary parents" of the Oort cloud remains somewhat in doubt, and the true answer might well depend on the precise nature of the cluster in which the Sun formed, as well as the chaotic migration history of the giant planets. For a more detailed discussion of the Oort cloud's formation, we direct the interested reader to the recent review by Dones et al. (2015), which builds on their earlier work to provide a detailed overview of the formation of all of the solar system's reservoirs of cometary body.

As discussed in Section 4.4, it has been proposed that the level of deuteration trapped in the water of the solar system's icy bodies could act as a tracer of their formation locations within that disk (e.g., Drouart et al. 1999; Dauphas et al. 2000; Mousis et al. 2000; Mousis 2004; Horner et al. 2007, 2008; Yang et al. 2013). As such, Horner et al. (2007) suggested that measurements of the D:H ratio in incoming comets could be used to test the various theories of their formation, showing how a variety of scenarios for the formation regions of cometary bodies would manifest as different distributions of deuteration in the nuclei we observe today. Following that logic, Kavelaars et al. (2011) compared the D:H enrichment measured in Saturn's icy satellite Enceladus with the levels of deuteration measured for a number of long-period comets. They found that the comets displayed similar levels of deuteration to the icy moon, suggesting that they had formed at roughly equivalent locations in the protoplanetary nebula. Given that the established wisdom is that the Oort Cloud was probably primarily populated by Uranus and Neptune, those authors considered this to be evidence that the majority of material in the Oort Cloud formed at a similar heliocentric distance to Saturn, which in turn would suggest that Saturn formed close to Uranus and Neptune. Such a scenario is exactly what was proposed by Thommes et al. (1999, 2002), and that formed the cornerstone of the Nice Model (Gomes et al. 2005; Morbidelli et al. 2005; Tsiganis et al. 2005).

Because of the tenuous grasp with which the Sun holds on to comets in the Oort cloud (particularly the outer reaches), the cloud should continually be bleeding objects to interstellar space, stripped from the Sun by the combined effects of passing stars and the galactic tide. It is fair to assume that other stars behave likewise—and a result, interstellar space should contain vast numbers of free-floating comets and asteroids, some of which, by chance, must occasionally pass through the inner solar system. The first such object, 1I/'Oumuamua, was discovered in 2017 (e.g., Bannister et al. 2017; Jewitt et al. 2017; Meech et al. 2017; Fitzsimmons et al. 2018)—and should another such object be discovered moving on a favorable path while the Comet Interceptor mission stands ready at Earth's L2 Lagrange point, that object would likely be considered an ideal target for the mission. In other words, in the relatively near future, that mission means that there is a chance that we may get our first "up-close-and-personal" view of a comet or asteroid from another planetary system.

Just as passing stars perturb our Oort cloud, so too will the Sun perturb the Oort clouds of its stellar neighbors. Particularly while the Sun was still young, and moving within its birth cluster, this process of mutual Oort cloud stirring would have been pronounced, and one could readily envision a cluster awash with comets, torn from one parent star, co-moving within the cluster. For this reason, Levison et al. (2010) performed numerical simulations to investigate the degree to which comets could be transferred from one star to another—with the idea that our Oort cloud could contain a significant number of comets captured from other stars before the dispersal of our birth cluster. The authors conclude that a significant fraction of comets in the Oort cloud were indeed captured from our Sun's birth neighbors—with such comets potentially making up the vast majority (>90%) of nuclei within the cloud.

While the capture of comets to the cloud might have been efficient while the Sun resided in its birth cluster, recent work by Hanse et al. (2018) has shown that the situation became markedly different once the Sun began wandering the galaxy alone, having left that cluster behind. They consider the capture of comets from field stars (and, by extension, the transfer of Solar comets to those stars) as those stars fly past the Sun. They find that capture events are relatively rare, since the typical encounter velocities of passing stars are far greater (by several orders of magnitude) than the orbital velocities of comets in the Oort cloud. They note that the transfer of any "exocomets" to the Oort cloud will typically only happen during unusually slow (≲0.5 km s−1) and close (≲105 au) encounters—and as a result, such exocomets likely make up just a small fraction of the mass of the cloud (between 10−4 and 10−5). In the same work, those authors were able to model how the Oort cloud is stripped of comets by stellar encounters, suggesting that such encounters have likely caused the Oort cloud to lose between 25% and 65% of its initial mass through the age of the solar system.

As we discussed in Section 4.5, comets from the Oort cloud have been invoked as the cause of periodic mass extinctions on Earth since the early 1980s. In order to explain the suspected ∼26 Myr periodicity in mass extinctions reported by Raup & Sepkoski (1984, 1986), several authors discussed the possibility that the Sun might have an undetected binary companion, a dim red dwarf star called "Nemesis," whose motion through the Oort cloud sent periodic showers of comets to imperil life on Earth (e.g., Alvarez & Muller 1984; Davis et al. 1984; Whitmire & Jackson 1984)—an idea that quickly fell into disfavor (e.g., Bailey 1984; Clube & Napier 1984a; Hills 1984; Hut 1984; Torbett & Smoluchowski 1984).

Schwartz & James (1984) proposed a different mechanism by which the Oort cloud could repeatedly send comet swarms toward the Earth, noting that the ∼26 Myr periodicity in extinctions found by Raup & Sepkoski (1984) was remarkably similar to the period with which the solar system oscillates about the plane of the galaxy. While it is typical to imagine the orbital motion of the Sun around the galaxy as being roughly circular (with an orbital period of ∼225 Myr), the true story is somewhat more complicated. As it moves around the galactic center, the Sun also undergoes a periodic vertical oscillation back and forth through the plane of the galaxy. When the Sun is located above that plane, the mass of stars, gas and dust within it exerts a downward force, pulling our star (and therefore the solar system) down toward the plane. As a result, the system accelerates, falling downwards until it passes through the galactic plane and out the other side. Once it is below the plane of the galaxy, the system now feels the same restoring force pulling upwards, causing its vertical motion to slow, stop, and then reverse. The result is that the Sun's motion around the galactic center is somewhat reminiscent of an automated fairground carousel ride, with the Sun oscillating up and down as it moves around the galactic center.

Schwartz & James (1984) proposed that it was this motion that was to blame for the periodic extinctions noted by Raup & Sepkoski (1984). The vertical oscillations of the Sun result in the solar system passing through the plane of the galaxy repeatedly, with those passages occurring with a frequency comparable to the periodicity proposed for Earth's mass extinctions. As the system passes through the galactic midplane, encounters with stars, and with giant molecular clouds become more likely, increasing in turn the likelihood that the Earth will collide with a comet flung inwards by such an encounter. The idea was revisited by Randall & Reece (2014), who invoked the influence of a smooth disk of dark matter, located at the midplane of the galactic disk, in order to explain how the Oort cloud could always be guaranteed to trigger a comet shower—something that had proved problematic when the only influences on the cloud were considered to be encounters with stars and giant molecular clouds.

An alternative theory connecting the Sun's motion through the galaxy was proposed by Leitch & Vasisht (1998), who suggested that several of Earth's largest extinction events coincided with periods at which the Sun was moving through one or other of our Galaxy's spiral arms. The spiral arms of the Milky Way (and of other spiral galaxies) are regions within which there is a greater concentration of dust, gas, and star formation—and as a result, a greater stellar density. When the Sun moves through one of those arms, then, the likelihood of close encounters with stars (which could inject comets to the inner solar system) is higher. At the same time, the chance of an extinction event caused by a nearby supernova (the death of one of the galaxies most massive stars) is also greatly enhanced, since those stars are almost exclusively found within the spiral arms. Gilman & Erenler (2008) and Filipovic et al. (2013) revisited this idea, using updated maps of the Milky Way's structure to attempt to determine whether any correlation between such spiral-arm crossings and mass extinctions could be found.

It is fair to say, however, that the jury is still out on whether mass extinctions on Earth are truly periodic, and even if they are, many of the theories proposed to explain such periodic extinctions are considered by many to be fanciful or unfeasible. Nevertheless, it is clear that comets from the Oort cloud have posed, and will continue to pose, an impact threat to the Earth—and the origin and evolution of the Oort cloud will remain a hot topic for research for many years to come.

4.7. Dust and Debris in the Solar System

When we look out at the solar system, we see dust everywhere. On any clear night, we observe dust ablating in Earth's atmosphere, producing meteors, and their brighter siblings fireballs and bolides (e.g., Jenniskens 2006). With the unaided eye, we can also observe the Zodiacal Light and Gegenschein (e.g., Roosen 1970; Leinert 1975), both the result of sunlight scattering from dust in the plane of the solar system. Whenever an active cometary nucleus is sufficiently close to the Sun, the sublimation of its surface ices will result in the continual ejection of dust grains—which will be pushed away from the Sun to produce those comet's spectacular dust tails. And when those comets fragment (as they often do; e.g., Crovisier et al. 1996; Sekanina 2000; Jenniskens & Lyytinen 2005; Jenniskens & Vaubaillon 2007), or go into outburst (e.g., Sekanina et al. 1992; Dello Russo et al. 2008; Montalto et al. 2008; Ye & Clark 2019), large quantities of dust are released—occasionally even resulting in spectacular meteor showers, as that dust encounters Earth (e.g., Asher 1999; Wiegert et al. 2005; Jenniskens & Vaubaillon 2007; Jenniskens 2006).

Asteroids, too, are a ready source of dust. The near-Earth asteroid 3200 Phaethon (the parent of the Geminid meteor shower; e.g., Gustafson 1989; Williams & Wu 1993) behaves as a "rock comet," shedding dust near perihelion as a result of surface weathering, caused by the extreme temperatures to which it is subject through the course of its orbit (e.g., Jewitt & Li 2010; Li & Jewitt 2013). More often, asteroids shed dust as a result of collisional processes—from the small scale sputtering caused by the continual hypervelocity impacts of dust on their surface, through to the rare but catastrophic collisions that shatter whole objects, creating vast quantities of dust in the process (e.g., Sykes & Greenberg 1986; Sykes 1990; Reach 1992; Durda & Dermott 1997; Nesvorný et al. 2002; Spahn et al. 2019), and even through activity like that of comets (e.g., Hsieh et al. 2004; Hsieh & Jewitt 2006; Jewitt et al. 2009; Jewitt 2012; Agarwal et al. 2017; Chandler et al. 2019).

Farther from the Sun, too, dust is abundant—from the dusty debris orbiting several of the solar system's small bodies (e.g., Stansberry et al. 2004; Braga-Ribas et al. 2014; Ortiz et al. 2015, 2017) to the ring systems around the giant planets (e.g., Elliot et al. 1977; Smith et al. 1979; Goldreich & Tremaine 1982; Smith et al. 1989). In short, dust is a pervasive presence in the solar system.

4.7.1. The Temporal Evolution of Dust in the Solar System—Steady State or Time-Variable?

Given the wide variety of sources for that dust, it is natural to assume that the amount of dust in the system is roughly in steady state—in other words, to assume that we do not observe the system from a privileged, unusual epoch in the system's history, at which dust levels are elevated above the norm. However, that assumption may well be invalid. As we discussed earlier, evidence is growing that Saturn's spectacular ring system is a relatively recent feature of the solar system (Iess et al. 2019), and that those rings may fade and decay over the next few hundred million years (O'Donoghue et al. 2019). Dust in the solar system is relatively short lived (for reasons we will discuss in Section 4.9), and as such, absent any sources to replenish the system's dust budget, one would expect the solar system to become almost dust free on a relatively short timescale. To put this another way—should an event happen to inject large quantities of dust and debris to the solar system (such as the collision of two large asteroids, or the decay and fragmentation of a giant cometary nucleus), the dust created from that event would decay over time.

The solar system today contains at least three features that might be considered evidence of such stochastic events, each of which has contributed to the fact that the current dust and debris budget of the system can be considered to be enhanced over the potential "steady state" background that would be generated by the ongoing collisional sputtering of dust from the asteroid and trans-Neptunian belts, and from the outgassing of comets.

The first of these, as described above, is the existence of Saturn's spectacular ring system. If the work of Iess et al. (2019) and O'Donoghue et al. (2019) holds true, those rings can be considered a transient phenomenon, rather than being considered to be a permanent fixture of the system. While the rings may decay on timescales of hundreds of millions of years, the timescales of their formation and decay could be taken to suggest that the other giant planets, whose ring systems are currently minor in comparison to those of Saturn, might once have hosted spectacular, fully fledged rings. It would be interesting to discover for what fraction of the solar system's history one or other of the giant planets has displayed Saturn-like rings—and it might well be the case that future observations of Jupiter- and Saturn-analog exoplanets might help us to determine whether such rings are the temporal exception, or the norm.

The second unusual feature is the plethora of "sungrazing comets," moving on near-identical orbits that cause them to skim the Sun's surface at perihelion (e.g., Marsden 1967). Thanks to the work of the SOHO spacecraft, in recent years, the number of known sungrazing comets has grown dramatically, with that spacecraft alone discovering several thousand such objects—most of which either disintegrated as they approached the Sun, or never survived the rigours of their perihelion passage (e.g., Battams & Knight 2017).

Of particular interest among the sungrazing comets are the Kreutz family (after Kreutz 1888), which number among them some of the most spectacular comets of the last millenium, visible, at their brightest, in broad daylight. These include the Great Comets of 1106, 1843, 1882, and, more recently, comet C/1965 S1 Ikeya-Seki. Members of the Kreutz family move on near-identical, retrograde, orbits, inclined to the plane of the solar system by ∼140°. As more members have been discovered, it has been possible to group them into several sub-families, each of which features slightly different orbital elements (e.g., Marsden 1967, 1989).

In order to explain the vast number of Kreutz sungrazers, a process of ongoing fragmentation has been proposed, with the idea that all of the observed sungrazers are fragments of a vast parent body, possibly 100 km (or more) in diameter, that first fragmented at least two millennia ago (e.g., Marsden 1967, 1989; Sekanina & Chodas 2004, 2007). This theory is supported by the fact that the great majority of family members fragment as they pass close to the Sun. So, for example, the Great Comet of 1106 fragmented as it passed by the Sun, and the Great Comet of 1882, and comet C/1965 S1 Ikeya-Seki were the first returns of two of the larger fragments created in that event (e.g., Hasegawa & Nakano 2001).

The different sub-groups within the Kreutz family are thought to be objects that all tie to a particular fragmentation event, farther back in time—with each subsequent fragmentation giving birth to a new sub-sub family, and so on. While it is natural to assume that those fragmentation events would happen at, or around, perihelion, when the combination of thermal and tidal stresses on the nucleus are the greatest (e.g., Bailey et al. 1992; Sekanina & Chodas 2012), there is growing evidence that some Kreutz fragmentation events happen far from the Sun, most likely as a result of thermal stress from the penetration of heat from the previous perihelion passage to the comet's interior (e.g., Sekanina 2001; Sekanina & Chodas 2004, 2012).

Based on the current spread of the sub-groups, it has been suggested that the origin of the Kreutz sungrazing family might be tied to a spectacular comet observed in the year −371 (i.e., 371 BC), documented by Aristotle and Ephorus (e.g., Marsden 1989), or the bright comet of −214 (i.e., 214 BC) (e.g., Sekanina & Chodas 2007). In either case, it is suggested that the parent of the Kreutz sungrazer population was a true behemoth of a comet—with a nucleus of at least 100 km in diameter. While such a large nucleus might at first sound unfeasible, there is evidence that at least two of the great comets in the historical record had nuclei of comparable size—these being C/1729 P1 Sarabat (which was visible with the naked eye for several months, despite a perihelion distance of over 4 au) and C/-43 K1 (often known as Caesar's Comet),50 while comet C/1996 O1 Hale–Bopp, which was visible with the unaided eye for a record-breaking 18 months, had a nucleus of diameter ∼60 km in diameter (e.g., Fernández 2002).

Looking forward, it is virtually certain that we will see more spectacular members of the Kreutz sungrazing family, as the ongoing fragmentation and decay process will likely continue for several thousand years, or more. Indeed, some studies predict the arrival of another cluster of bright Kreutz members in the coming decades (e.g., Sekanina & Chodas 2007, 2012; Sekanina & Kracht 2013), of which the comet C/2011 W3 Lovejoy may have been the first substantial member (Sekanina & Chodas 2012).

In addition to the famous Kreutz sungrazing group, there are at least three other families of sungrazing comets, identified as a result of the plethora of discoveries made by the SOHO spacecraft. These families—the Marsden, Meyer and Kracht groups (e.g., Jones et al. 2018)—are also thought to be relics of the fragmentation of other large sungrazing objects. Indeed, it seems likely that both the Marsden and Kracht groups are part of a broad swath of debris throughout the inner solar system that includes the highly eccentric short-period comet 96P/Macholz, and several meteor streams (including the delta Aquariids, the daytime Aqauriids, and the Quadrantids; e.g., McIntosh 1990; Gonczi et al. 1992; Ohtsuka et al. 2003; Sekanina & Chodas 2005; Jenniskens et al. 2012).

All this suggests that such families may well be a relatively common occurrence through the solar system's history—a finding supported by dynamical studies of the long term evolution of cometary orbits, which suggests a significant fraction might eventually evolve to sungrazing orbits (e.g., Bailey et al. 1992, 1996). The sungrazing comet population is therefore expected to be continually in flux, with new families being born as a natural consequence of cometary evolution. As the number of known sungrazers continues to climb in the coming years, it seems likely that other families may become apparent, relics of more ancient cometary fragmentation episodes. For more information on sungrazing comets, we direct the interested reader to the excellent review by Jones et al. (2018).

The third of the solar system's unusual dust features is known as the "Taurid Stream," a swath of debris in the inner solar system that is so broad that it encounters all four terrestrial planets. Through the northern hemisphere late summer and fall, the stream produces the Northern and Southern Taurid meteor showers on Earth,51 and is also responsible for the active daylight Beta Taurid meteor shower, in June and July each year. In other words, the Taurid stream is so broad that the Earth spends fully one third of each year traversing it. And it is not just the Earth that encounters the Taurid stream—so broad is the swath of debris that all four terrestrial planets encounter it. On Mercury, impacts from Taurid meteoroids have been linked to an annually repeated excess of Ca in the planet's exosphere (e.g., Christou et al. 2015), while both Venus and Mars spend a significant fraction of their orbits moving through the broad stream, at both its ascending and descending nodes (e.g., Christou 2010).

The Taurid stream contains (at least) one active comet—2P/Encke, which has long been considered to be the parent of the meteor showers produced by the stream (e.g., Whipple 1951; Kresák 1978; Babadzhanov et al. 1990). Whipple (1951) notes that comet Encke must "originally have been massive to have persisted so long in a short-period orbit"—since the breadth of the stream suggests a grand old age. In the early 1990s, several studies examined the Taurid stream, and their relationship to comet Encke, suggesting that the stream is at least 20,000 yr old (e.g., Babadzhanov et al. 1990; Steel et al. 1991). Steel et al. (1991) note that the stream's initial progenitor must have been a giant comet that was captured into a particularly short period orbit52 before undergoing a process of ongoing fragmentation. That process generated a significant fraction of the dust we observe in the Zodiacal cloud, with Comet Encke being the largest, and most active, remaining fragment.

The idea that the Taurid stream is the result of an ongoing process of cometary disintegration found support with the discovery of a number of near-Earth asteroids moving within the Taurid complex (e.g., Asher et al. 1993; Babadzhanov 2001; Porubčan et al. 2006; Babadzhanov 2008). To generate such a broad stream of debris, the diameter of the parent object has been estimated as being at least ∼100 km (e.g., Asher & Clube 1993; Clube et al. 1996), with a number of authors suggesting it was most likely a former Centaur, captured to a short-period orbit (e.g., Clube & Napier 1984b; Horner et al. 2003; Napier 2015) after a close encounter with Jupiter. Asher & Clube (1997) suggest that the parent of the stream may have been captured to the inner solar system as much as 50,000 yr ago, becoming trapped in a near-sungrazing orbit in 7:2 mean-motion resonance with Jupiter. Over the millennia that followed, the nucleus of that giant comet would gradually fragment as a result of repeated tidal stress, before experiencing a final, cataclysmic disruption as a result of a very close encounter with one of the solar system's planets (potentially Mercury), approximately 5000 yr ago.

Given the large amounts of material contained within the Taurid stream, it has long been considered of interest as the source of potential impactors at the Earth. On 1908 June 30th, a large meteoroid exploded over Siberia, leveling almost 2200 square kilometers of forest—an impact thought to be the largest on Earth for at least a thousand years. Based on calculations of the radiant point for that impactor performed by Zotkin & Tsikulin (1966), Kresák (1978) suggested that the impactor was a member of the Beta Taurid meteor shower, and that it might therefore have been a fragment of Comet Encke, a finding supported by updated radiant calculations presented in Zotkin & Chigorin (1991). Asher & Steel (1998) showed that a genetic link between the two objects was entirely feasible—suggesting that the two bodies could have separated as recently as 10 kyr ago—a timescale well within the longer lifetime of the Taurid stream as a whole. Hartung (1976) suggested that the impact crater Giordano Bruno, on the Moon, was formed in an impact on June 18th53 1178, based on observations reported to Gervase of Canterbury by "five or more men who were sitting there facing the moon." This, in turn, led to the suggestion that the cratering event was linked to the Beta Taurid shower, and comet Encke (e.g., Hartung 1993), though more recent work (Morota et al. 2009) has found that the crater is far too old for this to be the case—putting its age in the range 1–10 Myr.

In the early 1980s, there was wide discussion of the idea that mass extinctions on Earth occurred periodically, as a result of cometary impacts sourced from the Oort cloud (e.g., Raup & Sepkoski 1984, 1986; Schwartz & James 1984). Building on that narrative, Clube & Napier (1984b) suggested that, for the last 20,000 yr, the Earth has been experiencing a "spike" in impact flux, related to the disintegration from a Chiron-sized (i.e., D ∼ 200 km) comet—the progenitor of Encke and the Taurid stream. They suggested that the debris released in that disintegration had a marked impact on the environment of the Earth during that time—driving climatic variations that helped to cause the last of Earth's glaciations. Asher & Clube (1993) built on that work, suggesting that the influence of the Taurid stream could have contributed to intermittent periods of cooling at roughly ∼200 yr intervals over the past millennium, at times when Earth would have encountered particularly dense patches of debris, trapped within the 7:2 mean-motion resonance with Jupiter. Clube et al. (1996) linked the historical evolution of the Taurid swarm with a variety of ancient catastrophes on Earth—"One could regard otherwise enigmatic events in history such as the sudden collapse of the Indus Valley Civilization of Mohendojaro and of the Old Kingdom in Egypt..., both occurring at ∼2500 BC, as fitting well with the precession of the primary orbital nodes ... and the (Taurid) cometary collision picture in general." They again tied the cooling that triggered the last ice age to the influence of impacts from the Taurid stream—along with tying the stream to events in the Old Testament of the Bible, and the end of the Roman Empire.

This theory, which became known as "Cometary Catastrophism," still occasionally rears its head (e.g., Napier 2010, 2015)—and recent years have seen a growing interest in the study of a relatively dense core to the Taurid stream, as noted in Asher & Clube (1993), trapped in 7:2 mean-motion resonance with Jupiter. That stream undergoes periodic close approaches to the Earth, causing an uptick to Taurid activity, and enhanced numbers of bright fireballs from the stream (as was observed in 1998, 2005, and 2015; e.g., Olech et al. 2017; Spurný et al. 2017). In 2019 June, the Earth was forecast to experience its closest approach to the center of that resonant swarm since 1975—albeit at a distance large enough that any impact risk was expected to be negligible. Clark et al. (2019) proposed to take advantage of this close approach to search for near-Earth objects moving in the swarm, detailing the optimal observation strategy to take advantage of the encounter. At the time of writing, no results have been published detailing the success (or otherwise) of this strategy, suggesting that any such work may have proven unsuccessful. It remains likely, however, that in the coming years, new asteroidal members of the Taurid swarm will be discovered, with such finds likely growing in number as we approach our next encounter with the center of the swarm, in the early 2030s.

It is clear, on the basis of these three distinct features of the solar system (Saturn's rings, the Kreutz sungrazing comets, and the Taurid stream), that the amount of dust and debris in the solar system can vary dramatically, and its seems highly unlikely that we are currently seeing the solar system's steady-state dust background. Instead, we are likely in a period of enhanced dust levels in the inner solar system, with the amount of dust gradually decaying following the disintegration of the Taurid progenitor, tens of thousands of years ago, and the Kreutz progenitor, in the past few thousand years. It might well be the case that for much of the solar system's evolution, the amount of dust present was lower than that we see today—with long periods of quiescence interspersed with short periods of enhanced dust loading, as giant comets fragment, or asteroids collide.

4.7.2. Chondrites: A Window Into the Formation of the Solar System

Primitive meteorites contain a direct record of the environment and the physical processes that took place in the early solar system. Studies of those meteorites complement analytical and numerical models of solar system formation, as well as observational studies of current star formation, but putting this record in its astronomical context is not trivial. Chondrite meteorites, in particular, are regarded as the "building blocks" of the Solar nebula, out of which the solar system formed (Wood 1962). They are the most common type of meteorites (about 90% of observed falls54), and are subdivided into ordinary, carbonaceous, and enstatite classes according to mineralogical, chemical and oxygen isotope systematics (Weisberg et al. 2006; Krot et al. 2007; Scott & Krot 2005b, 2014).

The Hayabusa sample return mission to the asteroid 25143 Itokawa (e.g., Yano et al. 2006) showed that ordinary chondrites are related to the S-class asteroids (Nakamura et al. 2011; Yurimoto et al. 2011), the second most common class of asteroids. Two other missions of this type are underway (Hayabusa II to asteroid 162173 Ryugu; Tsuda et al. 2013, and OSIRIS-REx to asteroid 101955 Bennu; Lauretta et al. 2017). These are C-type asteroids; potentially linked to the carbonaceous chondrites (e.g., Burbine et al. 2002; McSween et al. 2011). These meteorites (particularly the CI sub-type) have elemental abundances close to the Solar photosphere (e.g., Lodders et al. 2009), and are regarded to be a good representation of the composition of the early solar system (e.g., Desch et al. 2018). They also contain large amounts of water (approximately 12–13 wt%; Alexander et al. 2013), which is thought to be consistent with their having accreted an initial water-to-rock mass ratio closer to the solar system primordial value (∼1.2; Krot et al. 2015) than is the case for other well characterized meteoritic material.

Chondrite meteorites are, essentially, cosmic "breccias" aggregated from the ambient material in the early solar system. They provide unique information on the timescales for the formation of the first solids, as well as on the dynamical, thermal and chemical properties of the Solar nebula (e.g., Jones et al. 2000; Wadhwa & Russell 2000; Krot et al. 2009). They are composed of three separate constituents (e.g., Alexander et al. 2007; Davis et al. 2014): Chondrules, Calcium–Aluminum rich Inclusions (CAI) and Matrix. Chondrules are millimeter-sized sub-spherical inclusions of magnesium-silicate minerals, predominantly olivine and pyroxene. They are a major component of chondrites (up to 80% by volume, depending on the chondrule class; Connolly 2005; Scott & Krot 2014). Their properties—texture, shape and mineralogy—indicate that they were molten prior to their incorporation into the meteorite parent bodies, with melting temperatures of order 1800 K (Petaev & Wood 2005; Scott & Krot 2005a). They are thought to have formed rapidly (in a process lasting minutes to hours), with rapid cooling rates of ∼10–1000 K hr−1 (e.g., Jones et al. 2000; Ciesla 2005). Moreover, compound chondrules and the presence of dusty rims seem to indicate that some of them may have experienced multiple processing events.

CAI are regarded as the oldest and most refractory solar system solids, with their formation dated to 4.567 Gyr (Amelin & Ireland 2013). They are thought to have experienced even higher temperatures than chondrules, of order 1700–2000 K (Rubin 2000; Richter et al. 2002; Grossman 2010) over timescales of hours to days. CAI can be up to 13% by volume of chondrites, and exhibit sizes in the range of a few micrometers to centimeters. Their compositions suggest that they condensed out of a solar-composition gas (Grossman 1972). Puzzlingly, CAI and chondrules are embedded in a fine-grained mixture (matrix) of silicates, oxides, organic material and presolar grains, which appears never to have experienced temperatures beyond ∼400 K. Moreover, even though the proportion and composition of chondrule and matrix material can vary among chondrites types, the overall chemical composition of each chondrite group is approximately constant, indicating a level of complementarity between these components (e.g., Bland et al. 2005; Zanda et al. 2006; Hezel & Palme 2008; Krot et al. 2009; Desch et al. 2010). In turn, this suggests a close connection between the high-temperature process leading the formation of chondrules, and the preservation of the (complementary) cold matrix in the nebula.

The prevalence of chondrules and CAI in meteorite samples suggests that they formed via a general process, closely associated with the star and planet formation mechanism (e.g., Wood 1962; Ciesla et al. 2003; Scott & Krot 2005a; Morris & Desch 2010). Their formation timescales are broadly consistent with the inferred protostellar disk lifetimes (106–107 yr; Calvet et al. 2000; Hartmann et al. 2016); but CAI are thought to have formed over a shorter period (<0.1 to ∼0.3 Myr) during the embedded phase of Solar formation (Bizzarro et al. 2004; Scott & Krot 2005b), whereas chondrules may either have formed at that time (Bizzarro et al. 2004) or during a period ranging up to 4 Myr later (Kita et al. 2005; Russell et al. 2005; Scott & Krot 2005a). However, the mechanism(s) responsible for their thermal processing has remained poorly understood for many decades. The presence of presolar grains in meteorite samples (Anders & Zinner 1993; Nittler 2003), geochemical evidence for a poorly mixed nebula (Clayton 2005; Jacobsen & Ranen 2006), and direct evidence from astronomical observations all point to a generally cold protoplanetary disk environment, with insufficient temperatures to melt precursors at the radial distances they sample (∼3 au; e.g., Cassen 2001). Proposed mechanisms have included shock waves (Hood & Horanyi 1993; Connolly & Love 1998; Ciesla et al. 2003; Morris & Desch 2010; Boley et al. 2013; Mann et al. 2016), collisions between asteroids (Lichtenberg et al. 2018) and processing close to the Sun followed by ejection to the outer solar system (e.g., via X-winds; Shu et al. 1996).

Processing via shocks appears to be broadly consistent with some chondrule thermal histories inferred from laboratory experiments (Desch et al. 2005; Morris & Desch 2010). However, the origin of these shocks is not fully determined. On the other hand, processing in the proximity of the early Sun is problematic, as dust may be absent at such distances (Muzerolle et al. 2003) and it may be difficult to explain the chondrule-matrix complementary composition.

Another mechanism has been proposed (Salmeron & Ireland 2012a, 2012b), which invokes formation in a laterally extended, magnetocentrifugal wind accelerated from the disk surfaces (Blandford & Payne 1982; Pudritz et al. 2007; Königl & Salmeron 2011; Frank et al. 2014). Young stars often exhibit powerful outflows of material that are accelerated away along the polar axis of the star–disk system, and current observations suggest that they may extend to radial distances ∼ a few au from the protostar (e.g., Anderson et al. 2003; Coffey et al. 2004, 2007; Bjerkeli et al. 2016). This mechanism is attractive because thermal processing occurs where meteorite parent bodies were formed (the disk) and it naturally explains basic chondrule/chondrites properties (e.g., their tight size range, efficient processing, peak temperatures, compound chondrules, and complementary composition). However, many more properties would need to be explored in the context of this formation scenario, and further modeling is warranted. This is a rapidly evolving field, and harmonizing astronomical observations, evidence from remote sensing missions, and studies of meteorites and sample returns is an important objective of early solar system research, which will drive further theoretical, experimental and numerical work into the future.

4.8. Small Bodies With Rings—Chariklo, Chiron and Haumea

Over the past decade, narrow ring systems and other orbiting material have been discovered orbiting several of the solar system's small bodies (as detailed below, in Table 3). Such gravitationally bound material is thought to be able to persist for long periods of time, even in the absence of cometary activity from the host object, and may take the form of orbiting fragments (essentially moonlets), arcs or rings.

Table 3. The Properties of the Rings Around 10199 Chariklo, 2060 Chiron and 136108 Haumea

ObjectOrbital Radius (km)Widths (km)References
Chariklo390.6 ± 3.3 (inner)7.17 ± 0.14 (inner)[1]
 404.8 ± 3.3 (outer)${3.4}_{-1.4}^{+1.1}$ (outer) 
Chiron32410[2]
Haumea2,28770[3]

References. [1] Braga-Ribas et al. (2014), [2] Ortiz et al. (2015), [3] Ortiz et al. (2017).

Download table as:  ASCIITypeset image

The first definite evidence for material orbiting distant solar system small bodies came with the discovery of bona-fide satellites, such as Charon, Nix, Hydra, Styx and Kerboros around Pluto. Given the suggested collisional origin of those satellite families, it seems likely that their formation was preceded by the existence of temporary ring systems orbiting their parent bodies, as the material ejected from those bodies accreted to form its new moons.

In the last decade, orbital fragments (moonlets) have been detected around the Centaurs 29P/Schwassmann–Wachmann 1 (Stansberry et al. 2004; Gunnarsson et al. 2008; Womack et al. 2017) and (60558) 174P/Echeclus (Rousselot 2008; Fernández 2009). More excitingly, however, occultation observations of the dwarf planet Haumea and the Centaurs 10199 Chariklo and 2060 Chiron have resulted in the detection of narrow ring systems in orbit around them—just as, in 1977, the rings of Uranus were discovered during observations of a stellar occultation by the planet from the Kuiper Airborne Observatory (e.g., Elliot et al. 1977).

The first set of rings to be discovered orbiting an object other than one of the giant planets were confirmed around the Centaur 10199 Chariklo (Braga-Ribas et al. 2014). That work made use of a network of telescopes distributed across South America to observe the occultation of a faint (R = 12.4 mag) star by the Centaur. Such occultation observations are a great boon to researchers, as they allow the true size and shape of an object such as Chariklo to be determined on the basis of the durations of the occultation as observed from a variety of locations, in addition to information on any locations from which the occultation does not occur. By piecing together the various occultation "chords" observed in this way, it is possible to reconstruct the projected size and shape of the object in question. In the case of Chariklo, the sites that observed the occultation recorded the occulted star dimming and brightening on two occasions before the true occultation began, then repeating that behavior after the conclusion of that main occultation event. On the basis of their observations, Braga-Ribas et al. found that Chariklo hosts two narrow, dense rings, with orbital radii 391 and 405 km and widths ∼7 km and ∼3 km, respectively.

Once the rings around Chariklo had been announced to the world, Ortiz et al. (2015) returned to earlier observations of the Centaur 2060 Chiron, which had exhibited a similar behavior during its own occultation of a faint background star. The observed pre- and post-occultation dimming of Chiron had originally been explained as being the result of jet-like features around the Centaur's nucleus (Ruprecht et al. 2013), which has long been known to exhibit cometary activity throughout the entirety of its orbit (e.g., Luu & Jewitt 1990; Meech & Belton 1990; Bus et al. 2001). Instead of being the result of such jetting, Ortiz et al. (2015) proposed that Chiron, too, hosts a dense, narrow ring, some 10 km wide, with an orbital radius of approximately 324 km.

More recently, observations of a stellar occultation by the dwarf planet 136108 Haumea, on 2017 January 21 revealed that, in addition to its satellites, Hi'iaka and Namaka, that dwarf planet, too, hosts a dense, narrow ring (Ortiz et al. 2017; Winter et al. 2019). The orbital radius of that ring is far wider than those around Chariklo and Chiron (which is not unexpected—indeed, the diameter of Haumea is so great that the rings of the two Centaurs would lie within its solid body), at ∼2287 km. Its width is also significantly larger than those around the two Centaurs, estimated at ∼70 km. Interestingly, the ring appears to be coplanar with the orbit of Haumea's outer moon, Hi'iaka, and the equatorial plane of the dwarf planet—suggesting a potential common origin for the moons and rings that is entirely in keeping with the idea that they were created in a significant impact event. Additional evidence for such an event can be found in the extended Haumea collisional family (e.g., Brown et al. 2007; Ragozzine & Brown 2007; Schlichting & Sari 2009; Leinhardt et al. 2010; Lykawka et al. 2012; Volk & Malhotra 2012; Vilenius et al. 2018), as discussed in Section 4.1.2.

The exact origin of the rings around the two Centaurs remains unclear, but theories include debris from cometary activity (Pan & Wu 2016; Wierzchos et al. 2017), a collision between the small body and another body, or between an orbiting satellite and another body (Melita et al. 2017), the tidal disruption of an orbiting satellite (El Moutamid et al. 2014), and the tidal disruption of the Centaur itself due to a close encounter with a planet (Hyodo et al. 2016). At this time, no single origin theory can be discounted, and observations of the two Centaurs only hint at possible origin scenarios. 2060 Chiron has been known to display cometary activity at heliocentric distances beyond the water-ice sublimation distance of approximately 3 au (Bus et al. 1991; Womack & Stern 1999; Jewitt 2009; Womack et al. 2017), which might well help to maintain its ring system. To date, 10199 Chariklo has not been observed to display such activity, but it is certainly possible that it may have done so in the relatively recent past (Wood et al. 2017). So, ring formation by cometary activity remains a possibility.

Given the frequency with which Centaurs are expected to experience close encounters with the giant planets, Wood et al. (2017, 2018) performed detailed n-body simulations to determine whether those encounters could be close enough to either disrupt the rings, or to help generate them (through tidal disruption of their parent bodies). Those simulations revealed that encounters that are close enough to create or disrupt rings around Centaurs should be very rare—sufficiently rare, in fact, that it seems unlikely that either Chariklo or Chiron would have experienced such an encounter during their time in the Centaur region. In other words, the results of those simulations argue against the idea that the rings of either body were created by a close encounter with a giant planet. Furthermore, the lack of disruptive encounters during the lifetimes of those Centaurs means that a primordial origin for the rings (or, at least, an origin that predates the transfer of their hosts to the Centaur region) cannot be ruled out.

In the coming years, it seems highly likely that further occultation observation programs will reveal additional rings orbiting more of the solar system's small bodies—indeed, given the scarcity of such observations for those small bodies in the past, the fact that three ring systems have already been discovered may argue that such rings are in fact relatively commonplace—revealing once again that our knowledge of the minutiae of the solar system's small body populations remains incomplete, and serving as a reminder that there is always something new to discover!

4.9. The Removal of Material From Small Body Reservoirs

As was discussed earlier, the solar system is littered with debris moving on dynamically unstable orbits. In the absence of replenishment, these small body populations would quickly become depleted, on timescales of just a few million years, as a result of collisions with the solar system's planets (e.g., Zahnle & Mac Low 1994; Hammel et al. 1995; Sánchez-Lavega et al. 2010; Hueso et al. 2018) or the Sun (e.g., Michels et al. 1982; Farinella et al. 1994; Jones et al. 2018), fragmentation and disintegration (e.g., Crovisier et al. 1996; Sekanina 2000; Jenniskens & Lyytinen 2005; Jenniskens & Vaubaillon 2007), and ejection from the system as a result of close encounters with one or other of the system's giant planets (e.g., Levison & Duncan 1994, 1997; Bottke et al. 2002; Horner et al. 2004b).

Given the number of objects we observe in the various unstable populations, it is obvious that they must be continually replenished with fresh material, injected from those populations in the solar system that are nominally dynamically stable. In this manner, the near-Earth asteroids are held to be primarily sourced from the asteroid belt (e.g., Wetherill 1988; Morbidelli et al. 2002a; Morbidelli & Vokrouhlický 2003); the short-period comets from the trans-Neptunian region (e.g., Duncan & Levison 1997; Levison & Duncan 1997; Di Sisto & Brunini 2007; Lowry et al. 2008; Volk & Malhotra 2008, 2013), with contributions from the inner Oort cloud (e.g., Emel'yanenko et al. 2005; Brasser et al. 2012; de la Fuente Marcos & de la Fuente Marcos 2014b) and the Jovian and Neptunian Trojans (e.g., Horner & Lykawka 2010a; Horner et al. 2012c; Di Sisto et al. 2019; Holt et al. 2020); and the long-period comets from the Oort cloud (e.g., Öpik 1932; Oort 1950).

At the same time, vast quantities of dust are released into the solar system by cometary activity, and by collisional processes throughout the system—evidence of which can be seen on any clear night, in the form of meteors, the Zodiacal Light (e.g., Leinert 1975; Leinert et al. 1981; Reach et al. 2003), and the Gegenschein (e.g., Brandt & Hodge 1961; Wolstencroft 1967; Roosen 1970; Buffington et al. 2009; Ishiguru et al. 2013). Since dust is continually produced, the fact that the solar system is not substantially dustier is clearly the result of the efficient clearing of dust, by a variety of processes.

In this section, we describe the non-gravitational mechanisms by which planetary systems are kept clear of dust, and those that can help to facilitate the liberation of objects from the dynamically stable regions of the system, to join the various populations of dynamically unstable object.

4.9.1. Effects that Remove Dust

Three major processes dominate the lifetimes of dust grains in a planetary system; Poynting–Robertson drag, radiation pressure, and collisional destruction (e.g., Wyatt & Whipple 1950; Burns et al. 1979; Backman & Paresce 1993; Lagrange et al. 1995; Moro-Martín & Malhotra 2003; Krivov 2010; Wyatt et al. 2011). All of these processes act on timescales much shorter than the stellar lifetime, implying that the dust that permeates our solar system, and the debris discs observed around other stars, are not primordial (e.g., Backman & Paresce 1993; Wyatt 2008; Moro-Martín 2013). An unseen population of larger asteroidal and cometary bodies (planetesimals) are invoked as the mass reservoir for the ongoing production of dust grains we observe around other stars (Artymowicz & Clampin 1997; Dominik & Decin 2003). Analogously, the sources of dust in the solar system include the collisional grinding of asteroidal bodies and material ejected from active comets (e.g., Nesvorný et al. 2010; Koschny et al. 2019).

Radiation Pressure: The dynamics of small dust grains in the solar system are dictated by a combination of the incident stellar radiation field and the gravitational potential of the Sun. The strength of influence that radiation pressure has on a given dust grain can be described by the ratio of the radiation pressure force Frad to the stellar gravitational force Fgrav acting on it, defined as the β parameter (Burns et al. 1979; Krivov 2010). The β ratio is calculated as follows:

Radiation pressure is proportional to the stellar luminosity, L*, the grain cross-section, and the efficiency Qpr with which momentum is transferred from photons to the dust, averaged over the stellar spectrum. The gravitational force, on the other hand, is proportional to the stellar gravitation parameter, GM*, as well as the grain mass, and hence, to its volume and its density, ρ. Since both gravitation and radiation intensity are r−2 laws, the value of β is dimensionless, and is fixed for a dust grain of given radius, s, and density, ρ. Dust grains effectively see a reduced gravitational potential from the central star due to this incident radiation. As a result, for dust grains released by objects55 moving on circular orbits, or around the perihelion passage of objects moving on eccentric orbits, the influence of radiation pressure drives dust grains onto more eccentric orbits than those occupied by their parent bodies.

For sufficiently large and opaque grains, the radiation pressure efficiency, Qpr, is constant near unity. As a result, the β ratio becomes inversely proportional to the grain radius, s. Radiation pressure is less and less important as grain size increases, becoming insufficient to perturb the orbital motion for particles larger than ∼100 μm (e.g., Pawellek et al. 2019). Equally, very small dust grains are inefficient absorbers and scatterers of incident stellar radiation. Hence, in the Rayleigh regime, where grains are much smaller than the dominant stellar photons, Qpr is reduced, and becomes proportional to the grain radius, s. As a result, the β ratio levels off to a lower, constant value, at these small sizes. The exact size range over which radiation pressure is important depends on both the properties of the host star and the composition of the dust.

The grain size at which β = 0.5 is often called the blow-out limit. For dust originating from a larger parent body on a circular orbit moving at Keplerian velocity around the host star, grains with β values <0.5 will remain bound to the star. In such a scenario, grains for which β > 0.5 will become unbound from the host star upon release from their parent body. They are blown out of the system on hyperbolic orbits, leaving on orbital timescales, vacating the terrestrial region on a timescale of years, and leaving the Edgeworth–Kuiper Belt on a timescale of centuries. Around the Sun, the blow-out size of compact silicate dust is ≈1 μm, whereas around an A-type star the blow-out size of identical dust is ≈10 μm (Backman & Paresce 1993).

For dust originating from larger bodies moving on eccentric orbits, the limit between bound and unbound depends on the point in their parent body's orbit at which the dust grains are released (Kresák 1976):

where r is the distance at which the grain is released from its parent body, and a denotes the semimajor axis of that object. For grains released closer to perihelion, the excess kinetic energy results in a blow-out limit that falls in the range 0 < βlim < 0.5 (dependent on the exact motion at the time). By contrast, for grains released closer to an object's aphelion, the limiting value for dust grain blow-out falls in the range 0.5 < βlim < 1, and initially, for low β values, emission near aphelion will actually cause dust grains to move on orbits more circular than that of their parent (as seen in Figure 15, below). Indeed, for particles release at aphelion, a β value that is identical to the orbital eccentricity of the parent body will result in a dust grain moving on a circular orbit.

Figure 15.

Figure 15. Orbits for dust grains emitted from a parent body moving on an orbit with a semimajor axis of 15 au. The left panel shows a scenario where the parent body moves on a circular orbit (i.e., e = 0), while the central and right-hand panels show the behavior when the parent body's orbit has an eccentricity of 0.4. The middle panel shows the behavior of dust emitted at perihelion, while the right hand shows emission at aphelion. In each panel, we plot, in color, the resulting orbits of the dust for a variety of values of β (specifically , , , , , , , and ), using the formulation outlined in Kresák (1976). The solid lines show the orbits of bound grains (those moving on orbits with e < 1). The dashed lines show dust grains moving on hyperbolic orbits (i.e., those with e > 1). The dotted lines show those grains for which the β value is so great that the outward force due to radiation equals or exceeds the inward force due to gravity. The vertical gray dotted–dashed line marks the dividing line between cases where the acceleration due to radiation pressure is less than that due to gravity, and those where radiation pressure exceeds the effect of gravity. In the case of emission from an object moving on an eccentric orbit, the limiting β that determines whether a grain will be bound or unbound depends on the point of release in the parent body's orbit around the Sun. The location of the parent body of the dust at the time of emission is denoted by the "o" symbol, and the Sun by the "*."

Standard image High-resolution image

For cometary parent bodies, where orbital eccentricities, e, are often very high, and dust production occurs primarily in the months around their perihelion passage, remission ≈ q = a (1 − e), and the limiting β ratio can be approximated with βlim ≈ (1 − e)/2. For orbital eccentricities in excess of 0.99, which is typically the case for long-period comets, the lower size limit for bound grains can thus exceed the circular blowout limit by a factor ≳100. This distinction is critical when it comes to the discussion of dust behavior in other planetary systems. Studies of debris discs often discuss the "blowout limit" for the dust we observe in those systems—which brings with it an implicit assumption that the parent bodies are moving on circular, or near circular orbits. In reality, however, this blowout limit is actually an upper limit, since grains ejected by objects moving on eccentric orbits would be blown out of a system when identical grains emitted from a circular orbit would not be removed. This is particularly relevant when one considers observations of systems containing exo-comets (e.g., β Pictoris; Kiefer et al. 2014; Greaves et al. 2016). The stability and distribution of dust grains released from parent bodies on circular and eccentric orbits are illustrated in Figure 15.

It is challenging in the extreme to get an overview of the dust halo of the solar system because we are buried deep within it. As a result, while significant work has studied the evolution of dust in the context of meteor showers and the zodiacal light (as described in Section 4.7.1), the discovery of debris orbiting other stars has proved a great boon to researchers, since it allows the study of the entirety of the debris in a given planetary system at a single time. The presence of extended haloes of sub-blowout sized grains (the radii of which may extend up to several microns) has been observed in the optical and near-infrared scattered light images of debris disk-host stars (e.g., Smith & Terrile 1984; Schneider et al. 2014; Ren et al. 2019). For example, multi-wavelength observations of HD 181327 revealed a difference in debris ring location between optical and millimeter wavelengths that is consistent with radiation pressure induced dust grain size segregation. The millimeter wavelength observations also revealed a broad halo component (or unresolved second belt) to the disk (Marino et al. 2016).

It would be expected that any debris disk with a host star luminous enough to drive dust out of its system56 should exhibit such a halo structure, and it has been suggested that up to half of the emission from debris discs at infrared wavelengths could be produced by sub-micron dust grains, for those stars luminous enough to drive the ejection of dust in this way (Thébault & Kral 2019).

We note here that not all dust haloes observed around other stars are the result of radiation pressure. For example, the young M-dwarf star AU Mic has a bright debris disk surrounded by a significant extended halo. However, as an M-dwarf, the star is clearly insufficiently luminous for that halo to be the result of radiation pressure driven escape. Instead, the presence of a halo is attributable to the strong stellar wind from the young M-dwarf star (Schüppler et al. 2015; Boccaletti et al. 2018). Similar to the pressure induced by stellar radiation, the pressure exerted by stellar wind particles is mostly directed radially away from the star, with the intensity dropping as r−2. Stellar wind pressure is proportional to, and varies with, the stellar mass loss rate and the wind velocity. In the solar system, typical wind pressures are by more than three orders of magnitude lower than radiation pressure (e.g., Burns et al. 1979).

In combination with the influence of Poynting–Robertson drag (as described below) and stellar wind forces, radiation pressure induces a radial distribution of dust grains within a debris disk that is dependent on both the size of the particles and their (initial) orbits (e.g., Burns et al. 1979; Strubbe & Chiang 2006; Krivov 2010). While in the solar system, we can see individual objects as they contribute to the system's dust budget, determining the source of dust in extrasolar debris discs is more challenging. To accurately trace the location of the dust-producing planetesimals within those discs it is therefore necessary to observe larger dust grains which are less strongly influenced by these radiation forces. Modeling of debris discs from mid-infrared to millimeter wavelengths suggests that dust seen at wavelengths in the far-infrared and beyond traces the location of the underlying planetesimal belt (Pawellek et al. 2019).

Poynting–Robertson Drag:  The Poynting–Robertson effect (Poynting 1904; Robertson 1937) induces the in-spiralling of a dust grain orbiting the Sun through that grain's interaction with a "head wind" of solar photons. This head wind is the result of the orbital motion of the dust grains, relative to the mostly radial stellar radiation field. While a dust grain at rest would just be affected by the radiation pressure described above, any orbital motion leads to an additional force proportional to −v/c, where v denotes the grain's orbital velocity, and c is the speed of light. The momentum transfer that causes both radiation pressure and the Poynting–Robertson effect happens when the solar photons are absorbed or scattered by the grain. Subsequent re-emission, if anisotropic, can then lead to additional forces on macroscopic objects such as the Yarkovsky effect (which we describe in more detail in Section 4.9.2, below).

This perceived "head wind" causes a loss of orbital energy, resulting in the gradual decay of the dust grain's orbit. The Poynting–Robertson effect acts to reduce both the semimajor axis and the eccentricity of the dust grain's orbit. Grains therefore migrate toward the Sun from the initial orbits into which they were released until they eventually reach distances where the material sublimates. This happens at a few Solar radii for silicate dust and at several astronomical units for volatiles (e.g., Kobayashi et al. 2011). For grains moving on roughly circular orbits, the timescale for loss via Poynting–Robertson drag is given by

Here, tPR is a function of the grain radius, s, density, ρ, and the radiation pressure efficiency Qpr, combined with the orbital radius of the grain, a, around a star with luminosity L* (Wyatt & Whipple 1950; Burns et al. 1979). The timescale is proportional to grain size and makes Poynting–Robertson drag most effective on μm-sized dust grains, for which it causes grains to spiral inward from 1 au to a few Solar radii in roughly one thousand years. Smaller grains are quickly blown out of the system by direct radiation pressure. For dust grains initially moving in the region of the Edgeworth–Kuiper Belt, the timescale increases by a factor of approximately one thousand (to a million years, or so). Similarly, increasing the grain size to millimeters would again yield an increase of the timescale by a factor of a thousand.

While direct stellar wind pressure has little influence on dust dynamics and lifetimes in most systems, wind drag is more effective. The radial velocity of the wind particles (∼100 to 1000 km s−1) is roughly three orders of magnitude smaller than that of the photons. The dust's azimuthal motion and the momentum transfer from the corresponding "head wind" resulting from the stellar wind particles becomes more significant; the additional factor c/vwind makes wind drag comparable in efficiency to radiation-induced Poynting–Robertson drag. Gustafson (1994) estimates that wind drag contributes about one quarter of the total drag, at times of average Solar wind strength. Around stars with stronger stellar winds, wind drag can dominate (e.g., Reidemeister et al. 2011; Schüppler et al. 2015).

Collisional Destruction: The impact craters that cover most solar system objects bear witness to the importance and frequency of mutual collisions (as described in depth earlier in this manuscript). Relative speeds on the order of kilometers per second mean that such collisions often end the lives of dust grains. At the same time, collisions among bigger objects produce fresh dust. A cascade of collisions transfers mass from planetesimals down to observable dust. Once the material is ground down to sufficiently small sizes, radiation pressure and Poynting–Robertson drag move the dust grains out of the system or toward the Sun, respectively. Objects in regions and of sizes for which the loss timescale due to mutual collisions is shorter than the direct removal timescale due to radiation pressure and drag forces are called collisionally dominated bodies.

For a dust grain moving on a Keplerian orbit, its collisional lifetime, tcoll, will be broadly defined by its orbital period, torb, and the cross-section surface density, σ (cross section per disk surface area; also known as normal geometrical optical depth, τ), of potentially harmful colliders in the disk through which it moves (e.g., Burns et al. 1979; Backman & Paresce 1993; Wyatt & Dent 2002), such that:

where i is the orbital inclination, vrel the typical relative velocity, and vK = (GM*/a)0.5 the Keplerian speed. The fraction containing the inclination and the velocities is of the order of unity, and through substitution of the Keplerian speed and the circumference of the orbit (=2πa) for torb, we obtain the relation in terms of the stellar mass, M*, orbital radius, a, and surface density, σ.

A collision is commonly considered catastrophic if the largest remaining fragment has at most half the mass of the colliders. The critical energy required to disrupt and disperse an object increases with increasing size. The critical specific energy (the energy per unit mass) decreases up to object radii ∼100 m, and then increases for larger objects which are strengthened by self-gravity (e.g., Benz & Asphaug 1999; Leinhardt & Stewart 2012). The critical impactor size required for a catastrophic collision also increases with object size, and as a result, the rate at which such collisions happen decreases. The lifetime of an object against collision increases with increasing grain size, albeit not as strongly as the Poynting–Robertson timescale. Below a critical grain size, Poynting–Robertson drag acts on shorter timescales than collisional disruption, while above that critical size, collisions dominate. As a result of the different dependencies in tcoll and tPR, the critical size at which tcoll = tPR is smaller in denser or more distant regions (e.g., Kuchner & Stark 2010). Observations of the interplanetary dust bands and counts of microcraters suggest that the critical size is around ∼100 μm (corresponding to ∼10−5 g) in the inner solar system (e.g., Fechtig & Grün 1975; Le Sergeant d'Hendecourt & Lamy 1980; Grün et al. 1985; Grogan et al. 2001).

While in the solar system, there is a clear distinction between dust grains large enough to be collisionally dominated and those small enough to be driven by radiation pressure and Poynting–Robertson drag, the same is not true of the debris we have observed beyond the solar system. With increasing disk surface density (or mass), the critical size at which tcoll = tPR decreases until it becomes comparable to or smaller than the blowout limit. At that point, all dust is either bound and dominated by collisions or moving on unbound orbits. At the current epoch, our instrumentation is simply not sensitive enough to detect true analogues of the solar system's dust environment. As a result, all of the currently known debris discs are collisionally dominated, as a direct result of observational biases that favor the detection of bright, dense and cool dust discs (see e.g., Matthews et al. 2010, 2014; Hughes et al. 2018).

Detections of near-Edgeworth–Kuiper Belt analogues (∼10 × FEKB) around Sun-like stars have been made at far-infrared wavelengths in continuum emission by Herschel (Eiroa et al. 2013; Sibthorpe et al. 2018), and have allowed us to begin to place our solar system's dust environment in the context of its peers. These detections approach the predicted limits on Edgeworth–Kuiper Belt flux of 1.2 mJy at 70 microns (Vitense et al. 2012) set by IRAS and COBE at 60 μm, favoring an upper limit <1 MJy sr−1 (Aumann & Good 1990; Backman et al. 1995), and in situ dust measurements from New Horizons (Han et al. 2011). Those observations suggest that the Edgeworth–Kuiper Belt lies around the mid-point of dust disk brightness based on an extrapolation from the observed systems (Moro-Martín et al. 2015), but still lies within the collisionally dominated regime.

In the typical situation where a collisional cascade constantly replenishes and removes material, the size-frequency distribution of dust grains can be approximated by a power law, dN ∝ sPds, with index p ≈ 3.5 (Dohnanyi 1969; O'Brien & Greenberg 2003), where dN/ds is the number of particles per unit grain radius. For the smaller grains that are produced in collisions but whose lifetimes are shortened by Poynting–Robertson drag, the size distribution flattens to an index in the range 2.5 < p < 3.0 (Reidemeister et al. 2011; Wyatt et al. 2011). Such flattening is observed for the dust distribution in the inner solar system (e.g., Fechtig & Grün 1975).

The combination of a collisional cascade and a sharp lower size cutoff set by radiation pressure creates a distinctive wavy pattern in the size distribution of dust grains, deviating from the single power law. The small grains that are unbound by radiation pressure are significantly underabundant with respect to those that are still small, but big enough to stay bound. The latter, barely bound grains, therefore lack a reservoir of potentially destructive colliders; their lifetimes and abundance are increased. In turn, yet bigger grains suffer more strongly from collisions with these barely bound grains; they are again underabundant, and so on (Campo Bagatín et al. 1994). The wavelength, amplitude, and grain size to which this pattern extends depend on the details of the collision physics, including material properties and impact velocities (e.g., Thébault et al. 2003; Krivov et al. 2006). The wavy size distribution can translate to observable waves in the spectral energy distribution (Thébault & Augereau 2007; Kim et al. 2018). When orbital eccentricities induced by radiation pressure exceed those inherited from the belt of parent bodies, collision velocities and rates are increased, the abundance decreased. For parent belts with very low dynamical excitation (e ≲ 0.01), this can cause an additional depletion of the barely bound grains up to grain radii s corresponding to β(s) ≳ e, resulting in a more tenuous halo (Thébault & Wu 2008).

In a collisional cascade, the destruction of two asteroidal sized bodies in a system produces large numbers of small dust grains, causing a temporary brightening of the observed debris disk. Such events occur stochastically, and the brightness increase is short lived (tcoll ∼ 104 yr, Rieke et al. 2005; Jackson et al. 2014). The dust grains produced in such a collision are swiftly removed from the system by the subsequent collisional cascade and dynamical evolution through radiation forces as described above. The presence of very bright asteroid belt analogues orbiting other stars (with Ldust/L* > 10−3, Tdust > 200 K; approximately ten thousand times more luminous than our own asteroid belt) is believed to be evidence of ongoing collisional processes, possibly analogous to the giant impacts seen in the solar system's youth (as described in Section 4.1; see e.g., Song et al. 2005; Fujiwara et al. 2009, 2010; Lisse et al. 2009; Melis et al. 2010; Kenyon & Bromley 2016). The rapid evolution of such bright asteroid belt analog debris discs, including both stochastic variability on periods of months to years and monotonic dissipation over timescales of years to decades, has been observed in a handful of systems (Melis et al. 2012; Meng et al. 2012, 2014, 2015, Su et al. 2019), and suggests that the violent processes that shaped our solar system are far from unique.

4.9.2. Effects that Remove Large Objects

The great majority of near-Earth asteroids are thought to be sourced from the asteroid belt. The primary routes by which these asteroids are transferred from the belt to the inner solar system feature interaction with mean-motion and secular resonances with the giant planets (e.g., Wetherill 1988; Morbidelli et al. 2002a). However, those mechanisms are so effective, on astronomical timescales, that they rapidly clear any debris from their regions of influence in the belt. To maintain a steady flux of new material to the inner solar system through those mechanisms, then, fresh material must first be transferred into those regions from elsewhere in the belt. To address this problem, Morbidelli & Vokrouhlický (2003) investigate the influence of a non-gravitational force known as the Yarkovsky effect on the main belt asteroids, finding that that effect could cause the migration of asteroids through the belt to the regions of instability that direct those asteroids into the inner solar system.

The diurnal Yarkovsky effect, explained in detail in Bottke et al. (2006), and the references therein, involves the absorption and re-emission of photons on an orbital body. Photons of light from the Sun hit an object and are absorbed. As the object is rotating, and its surface has a significant, non-zero, thermal inertia, the infrared radiation of the photon happens at a different angle to the absorption. This produces a small net force. In a prograde rotator, the force has a component in the orbital direction that creates a net increase in orbital rate (e.g., Bottke et al. 2006). The consequence of this small force is an increase in semimajor axis over time. For a retrograde rotator, the result is a decrease in orbital momentum, resulting in a decrease in semimajor axis. The magnitude of the force depends upon how close a body is to the Sun, the obliquity of the body's spin axis with respect to the orbital plane, and the body's thermophysical characteristics (e.g., Bottke et al. 2006). Mathematically, this is represented in equation:

The radiative pressure coefficient (Φ) is dependent upon the radius of the object (R) and the Solar radiation flux (F).

The Stefan–Boltzmann constant (α = 1 − A) comprises the bond albedo of the object (A). The thermal component, (W (', Θω)) is dependent on the radius scaled for penetrative depth $({\text{}}R{\omega }^{{\prime} }={\text{}}R/\updownarrow v)$ and the thermal parameter (Θω). The thermal parameter (Θω) is to account for the thermal properties of the body.

The area to mass ratio of the object has a major effect on how the Yarkovsky effect influences an object. A large object has a large mass to area ratio, reducing the effect of such a small force. If an object is too small, the thermal gradient over the object lessens and the radiative difference becomes minimal. It is generally accepted that the size range where the diurnal Yarkovsky effect has the most influence is between one meter and approximately 10 km (e.g., Bottke et al. 2006).

In addition to the diurnal Yarkovsky effect, a seasonal effect has been described (Bottke et al. 2006). Unlike the diurnal Yarkovsky effect, the seasonal effect can only slow the object, reducing the semimajor axis.

From the above equation, due to the radiative flux (F), it can be seen that the Yarkovsky effect has the most effect in the inner solar system. Simulations have shown that the Yarkovsky effect has an effect on the evolution of the near-Earth Asteroids (e.g., Morbidelli & Vokrouhlický 2003; Farnocchia et al. 2013), Martian Trojans (e.g., Ćuk et al. 2015) and the main asteroid belt (e.g., Nesvorný & Bottke 2004). Further out in the solar system, the impact of the Yarkovsky effect on the Jovian Trojans has been found to be negligible on objects larger than 100 m (Wang & Hou 2017), with objects more distant than Jupiter likely not experiencing the effect to any significant degree.

The Near-Earth asteroid population is of great interest as a result of the potential hazard it poses to the Earth (see Section 4.2.2), and the Yarkovsky effect will play an important role in altering the orbits of these objects, causing the degree to which they can be considered to be hazardous objects to change over time. As a result of the effect's influence on the orbital evolution of NEAs, there has recently been a concerted effort to detect the Yarkovsky effect on individual members of the NEA population, with over 40 confirmed detections to date (e.g., Vokrouhlicky et al. 2015; Ďurech et al. 2018), including the OSIRIS-REx target, (101955) Bennu (Deo & Kushvah 2017; Hergenrother et al. 2019).

Over the past two decades, the influence of the Yarkovsky effect on the migration of asteroids across the asteroid belt has been widely studied, and asteroidal drift under the influence of the Yarkovsky effect has become a key metric in the identification of asteroid collisional families, and the determination of their ages (e.g., Bottke et al. 2001; Bolin et al. 2017, 2018; Spoto et al. 2015; Delbó et al. 2019; Paolicchi et al. 2019).

5. Application to Known Exoplanetary Systems

The thought that the solar system might not be unique, and that there might be planets orbiting other stars, is one that has been debated for at least several hundred years. Famously, the Dominican Friar Giordano Bruno argued, in the third dialog of his 1584 work "De l'Infinito, Universo e Mondi," that the stars we see in the night sky are suns, accompanied by their own planets, and that the cosmos contains an infinite number of suns and an infinite number of planets57 (Bruno 1584). He even suggested that the planets can not be seen because they are too small and too faint, compared to their host stars58—an issue that has proven painfully true over the centuries since!

The first steps toward the detection of planets orbiting other stars came in the 18th and 19th Centuries, with discoveries that would presage the most successful exoplanet detection techniques. In 1783, two years after the discovery of Uranus, the young British astronomer John Goodricke presented an explanation for the peculiar periodic dimmings of the bright star Algol (Goodricke 1783). Goodricke realized that an unseen companion orbited Algol,59 and that the unseen secondary eclipsed the primary every 2.86 days. While Algol's variability had been known for thousands of years (as it is easily seen with the unaided eye; e.g., Jetsu et al. 2013), Goodricke was the first (that we know of) to correctly identify the cause of the periodic dimming—a discovery that would lay the groundwork for the modern exoplanet transit technique.

In the 1840s, two additional discoveries were made that set the scene for the eventual dawn of the Exoplanet Era. In 1844, Friedrich Wilhelm Bessel announced the discovery of unseen companions to the bright stars Sirius and Procyon, on the basis of periodic variations in the proper motions of those stars (Bessel 1844).60 Just two years later, in 1846, the planet Neptune was discovered as a direct result of its gravitational perturbation of the motion of Uranus (e.g., Adams 1846; Galle 1846; Le Verrier 1846).

In all of these cases, observations of one object (Algol, Sirius, Procyon and Uranus) enabled the discovery of another, unseen body, as a result of the unseen object's influence on the target of the observations. In other words, all of the new objects were first discovered indirectly, as a result of their causing otherwise inexplicable changes in the behavior of another body. It was obvious that the same techniques could be used to detect planets orbiting other stars—but the problem, as Bruno had realized, was that planets are far smaller than their host stars—and so their influence on those stars would be too small to detect.

In the middle of the 20th Century, the discovery of a number of planets orbiting nearby red dwarf stars, including Barnard's Star, Lallande 21185, and 61 Cygni, were announced as a result of a long-term astrometric survey carried out using the Sproul Observatory refracting telescope (e.g., Strand 1943; van de Kamp & Lippincott 1951; van de Kamp 1963, 1969a, 1969b). The discoveries were announced based on observed periodic astrometric wobbles from the target stars, which were explained on the basis that those stars had planet-mass unseen companions (in much the same way that Bessel, a century beforehand, had inferred the presence of Sirius B and Procyon B). No other observers were able to replicate his results, however—and it seems that the observed "wobbles" were actually the result of changes to the optical properties of the Sproul refractor caused by the periodic cleaning of its main objective lens (e.g., Gatewood & Eichhorn 1973; Hershey 1973).

The dawn of the Exoplanet Era finally arrived in the late 1980s, with the discovery of planets orbiting the stars Gamma Cephei A and HD 114672 (Campbell et al. 1988; Latham et al. 1989). At the time, however, the planets were thought to be brown dwarfs, too massive to count as true planetary detections. Those discoveries were followed, however, by the detection of three planets orbiting the pulsar PSR 1257+12 (the first two being announced in Wolszczan & Frail 1992; the third in Wolszczan 1994), a single planet orbiting the pulsar PSR B1620-26 (Thorsett et al. 1993), and then the announcement, in 1995, of the "first planet orbiting a Sun-like star"—51 Pegasi b (sometimes known as Bellerophon; Mayor & Queloz 1995).

Prior to the detection of those first planets, it was widely expected that planetary systems around other stars (if they existed) would strongly resemble our own solar system, having formed in a disk of material around their young suns (e.g., Swedenborg 1734; Laplace 1796; Edgeworth 1949; Lissauer 1993). However, those first planets were nothing like those we might have expected, and forced a radical overhaul of our theories of planetary formation and evolution.

In the first three decades of the Exoplanet Era, we have learned that planets are ubiquitous in the cosmos—that, just as Bruno said, the myriad stars in the night sky all host planets of their own. We have also learned that planetary systems are far more diverse than we could have imagined. We have found planets moving on highly eccentric, comet-like orbits (e.g., Naef et al. 2001; Tamuz et al. 2008; Wittenmyer et al. 2017b), and others that are denser than osmium (e.g., Siverd et al. 2012; Johns et al. 2018) or more tenuous than fairy floss (e.g., Masuda 2014).61 Some planetary systems are incredibly tightly packed (e.g., MacDonald et al. 2016; Gillon et al. 2017), and others feature "hot Jupiters" on highly inclined or even retrograde orbits (e.g., Bayliss et al. 2010; Addison et al. 2013; Esposito et al. 2014).

Despite the remarkable diversity observed in exoplanet systems, the solar system remains our key touchstone for the understanding of these alien worlds. Of the several thousand planetary systems that have been discovered to date, the only one that we can study in intricate detail is our own.62 For that reason, in this section, we describe exoplanets in the context of solar system science, showing how our knowledge of our planetary system has helped to shape our understanding of those that orbit other stars.

5.1. Demographics of Exoplanets

Since the dawn of the Exoplanet Era, over 4000 exoplanets have been identified in approximately 3000 systems,63 with many more candidate planets and systems discovered by NASA's TESS and Kepler missions awaiting confirmation using ground based facilities. The vast majority of these systems were identified through transit photometry, which looks for the almost imperceptible "wink" as an unseen companion passes across the face of the star as viewed from the solar system. The Kepler spacecraft (e.g., Borucki et al. 2010; Howell et al. 2014) proved to be the most productive planet search project to date, performing what was essentially the first census of the Exoplanet Era. Between its primary and K2 mission phases, Kepler led to the discovery of 2748 planets, with a further 3307 still awaiting confirmation.64

Despite the apparent wealth of exoplanets discovered in the last thirty years, our current understanding of the exoplanetary systems we have discovered remains far less than our knowledge of the solar system's planets before the launch of the first interplanetary missions in the 1960s. Our knowledge, in the main, is limited to knowing the planet's minimum mass (if discovered using the radial velocity technique) or its size (using the transit method). A small fraction of the planets discovered to date have been observed directly—but even then, we can usually gain only limited understanding of the planet's nature (with the mass of such planets estimated based on an assumed age and their observed brightness; e.g., Marois et al. 2008).65 Beyond these basic characteristics, little more is known for the vast majority of known exoplanets. In some cases, a bulk density can be calculated (for those planets that have been observed using both the transit and radial velocity techniques; e.g., Siverd et al. 2012; Masuda 2014; Johns et al. 2018, and many others). In systems containing multiple planets, some of which transit their hosts, it is possible to obtain improved mass and orbit measurements, and even to uncover the presence of additional planets, through the analysis of transit timing variations (TTVs)—which result from the effect of one planet's gravity perturbing the orbit of another (e.g., Agol et al. 2005; Holman & Murray 2005; Lithwick et al. 2012; Becker et al. 2015; Gajdoš et al. 2019; Hamann et al. 2019; Lam et al. 2020). Specific molecular species have been identified in the atmospheres of a small number of planets through multi-wavelength transit photometry (e.g., Charbonneau et al. 2002; Knutson et al. 2014; Khalafinejad et al. 2017) or direct measurement of the planet's spectrum (e.g., in the case of HR 8799 c; Oppenheimer et al. 2013; Wang et al. 2018a). But, in the main, our knowledge of the planet's physical nature is limited to either a minimum mass, or a size—nothing more.66

Prior to the panoply of exoplanet discoveries beginning in the late 1980s, speculation on the types and abundance of planets within the galaxy was mostly inspired by planetary formation theory, which was rooted in the solar system. The Sun, an average star, hosts a set of planets that seemed to be loosely arranged in their orbits by size and mass, with rocky (terrestrial) inner planets, and more distant gaseous giants. It seemed reasonable to conclude that other Sun-like stars would like have similar groups of planets, arranged in similar ways, with rocky worlds interior to gaseous ones.

The first exoplanets to be discovered around Sun-like stars were far different to what was expected following this line of thought. 51 Pegasi b (Mayor & Queloz 1995) is comparable in mass to Jupiter, but orbits its host at a distance of just over 0.05 au, with a period of 4.23 days. In the years that followed, such "hot Jupiters" dominated exoplanet discoveries—a finding that led to significant innovation in our models of planet formation (e.g., Dawson & Johnson 2018, and references therein). The variety of exoplanets that have been found has revealed an unexpected diversity—in other words, not all planetary systems resemble our own.

With that said, as the sample size of known exoplanets has increased, this conclusions we had made on planet formation based on the sample of one system (the solar system) have somewhat stood the test of time. Our broad understanding (planet formation in a disk around a young star) has proven accurate, and we now know that planets are ubiquitous—most, if not all stars host planets. However, the sheer diversity of these planets—in size, mass, and orbital architecture—has proven to be far beyond the scope of solar system-based predictions for the demographics of exoplanets.

Figure 16 shows the dramatic rise in the number of known exoplanets, organized by detection technique. The two most productive methods are transit and radial velocity (RV) techniques. Here, we will summarise the current state of knowledge in exoplanet demographics as determined through transit and RV surveys. Figure 17 shows the distribution of known exoplanets as a function of their discovery method, their mass, and their orbital period. It is immediately apparent that the transit and RV methods of exoplanet detection are sensitive to different subsets of the population of exoplanets. Both techniques fall victim to significant observational biases, and are limited by the available observational temporal baseline, which effectively determines the maximum orbital period at which a given survey they can detect planets. The transit method is heavily biased toward detecting planets that move on short period orbits, and most effectively finds planets with larger radii (as they produce a significantly more obvious transit signature in the light curves of their host stars). Similarly, the radial velocity method is biased toward finding the most massive planets, with an efficiency that is limited toward short orbital periods by the available observational cadence, and toward long periods by the temporal baseline over which observations have been obtained.

Figure 16.

Figure 16. Exoplanet detections as a function of year, colored by the detection technique. Steep increases in the number of transiting exoplanets in 2014 and 2016 are due to observations from the Kepler mission. Image courtesy of the NASA Exoplanet Archive (https://exoplanetarchive.ipac.caltech.edu/), accessed on 2019 June 13th.

Standard image High-resolution image
Figure 17.

Figure 17. Distribution of known exoplanets in mass-period space colored by detection technique. The different sensitivities of the transit and radial velocity methods are demonstrated by the largely separate samples of green and red points. The masses used are those published in the literature—and as a result, the data for radial velocity planets shows the minimum mass compatible with the observations (i.e., m sin i), while the only transiting planets included are those for which a direct mass measurement has been made. Interpretations of the exoplanet demographics from figures such as this must be made with caution, as the observational biases from each detection technique are present, and the mass of any given planet is subject to the degree of uncertainty with which its host star has been parameterized. Image courtesy of the NASA Exoplanet Archive (https://exoplanetarchive.ipac.caltech.edu/), accessed on 2019 June 13th.

Standard image High-resolution image

When interpreting the sample of known exoplanets, especially as displayed in plots such as Figure 17, it is critical to remember that what is known (and not yet known) about exoplanet demographics is shaped by the biases of the techniques used to detect exoplanets. These biases must be carefully accounted for to extract robust demographic information from the sample of detected exoplanets. To give just one example—when one examines Figure 17, it would be natural to think that hot Jupiters were abundant, and that most planetary systems feature at least one such scorching giant. This is, however, the result of the aforementioned observational biases. Simply put—both the transit and radial velocity techniques are strongly biased toward the detection of hot Jupiters—they are the easiest of all planets to find, and so one would expect them to dominate exoplanet demographics, particularly in the early days of the field. Indeed, the true occurrence rate of hot Jupiters is low—with studies based on data taken by Kepler and TESS consistently returning rates of less than 1% (e.g., Howard et al. 2012; Fressin et al. 2013; Wang et al. 2015; Zhou et al. 2019).

Numerous ground- and space-based transit surveys are responsible for having found most of the presently known exoplanets. One of the most notable efforts is the Kepler mission (Borucki et al. 2010), which acquired a nearly continuous four-year observational baseline for over 150,000 stars. Kepler identified over 4000 exoplanet candidates (Akeson et al. 2013; Thompson et al. 2018), of which more than 2300 have been confirmed or statistically validated (Rowe et al. 2014; Morton et al. 2016). Thanks to the observational baseline and photometric precision of Kepler, the orbital periods of these exoplanets range from less than a day to several hundred days and their radii are as small as ∼1 Earth radius (Twicken et al. 2016).

Hsu et al. (2019) provided a thorough analysis of the vetting and detection efficiency of the final data release for the primary Kepler mission (DR25) using updated stellar parameters from the Gaia DR2 release. They found that a typical FGK star hosts ${1.07}_{-0.08}^{+0.10}$ planets with radii in the range 1.0–16 REarth and orbital periods spanning 4–128 days. Hsu et al. (2019) also estimated that planets with sizes 0.75–1.5 REarth and orbital periods of 237–500 days have an occurrence rate for FGK stars of <27%. This value remains solely an upper limit, as a result of the low detection efficiency and high false alarm probability for planets in that region of parameter space. At even longer periods, an automated planet search through the Kepler data determined that, on average, Sun-like stars have 2.0 ± 0.7 planets between 0.1 and 1 Jupiter radii moving on orbits with periods between 2 and 25 yr (Foreman-Mackey et al. 2016). Interestingly, the exquisite precision of the Kepler photometry has enabled the investigation of specific sub-groups of the exoplanet population for which there are no analogues in the solar system. The radius distribution of short-period (<100 days) exoplanets contains a statistically significant decrease in occurrence between 1.5 and 2 Earth radii (Fulton et al. 2017). This feature is currently the subject of intense investigation, but is generally thought to be related to the erosion of planetary atmospheres by stellar radiation.

At longer orbital periods (hundreds of days), the reliability of candidates from transit surveys decreases and RV confirmation is typically employed to ensure the robustness of planet occurrence rate statistics. Santerne et al. (2016) combined Kepler photometry with long-baseline RV observations from the SOPHIE spectrograph to find that the total occurrence rate of giant planets (with transit depth between 0.4% and 3%) within an orbital period of 400 days is 4.6 ± 0.6%. At wider orbital separations, the efficiency of transit detections decreases sharply and long-baseline RV surveys provide much of the exoplanet demographic information. Between 0.3 and 7 au, giant exoplanets with mass in the range 0.3–13 Jupiter masses have an occurrence rate of ${6.2}_{-1.6}^{+2.8} \% $ (Wittenmyer et al. 2016b).

A primary goal of many studies has been to relate stellar properties to exoplanet occurrence rates. Since stars and planets form from the same protostellar material, it seems reasonable to expect that the properties of the host star would influence those of any planets. Such investigations were restricted to the realm of theory prior to the discovery of exoplanets, but now that planets have been discovered around a wide variety of stars, such studies are becoming ever more important.

In the early days of the exoplanet era, RV surveys of F-, G-, and K-type stars identified a positive correlation between the occurrence rate of giant planets within several au and stellar metallicity (Santos et al. 2003; Fischer & Valenti 2005). This is now commonly known as the "planet metallicity relation" for exoplanets, and is interpreted as providing strong support for the core accretion theory of giant planet formation (Pollack et al. 1996). Subsequent work has found that this relation likely extends to later-type (lower mass) stars (Johnson et al. 2010; Neves et al. 2013).

Planet detection efforts aimed specifically at M dwarf stars have also uncovered interesting relationships between exoplanet demographics and stellar properties. In general, small planets are found to be much more prevalent around M dwarfs. According to Dressing & Charbonneau (2015) and Gaidos et al. (2016), the average M dwarf has roughly 2.2–2.5 exoplanets with radii between Earth and Neptune moving on orbits with periods of ∼200 days or less. M dwarfs also favor the formation of "compact multiple" systems of exoplanets, featuring several Earth-size planets on tightly packed short-period orbits, with the scale of the entire planetary systems comparable to that of the Galilean moons orbiting Jupiter (e.g., Muirhead et al. 2015; Ballard & Johnson 2016). While low mass planets orbiting late-type stars seem to be abundant, the prevalence of giant planets orbiting such stars is notably smaller than that observed around more Sun-like FGK stars (e.g., Cumming et al. 2008). In other words, it seems that the smaller the star, the smaller will be its largest planets.

5.2. The Solar System as an Exoplanet System

As noted in Section 5.1, the architecture of the solar system dominated theories regarding formation processes and orbital evolution until the discovery of other systems. Over time, it has become clear that the range of possible architectures, although not a continuum, is incredibly vast (Winn & Fabrycky 2015). One way to place the solar system within the broader context is to consider the appearance and planetary signatures of our solar system if it were to be observed as an exoplanet system.

In Table 4, we show the semimajor axes and orbital periods for the planets in the solar system. Also included are the radial velocity semi-amplitudes (m s−1) for the planets, under the assumption that the solar system is viewed close to edge-on. The typical precision of current RV surveys is ∼1 m s−1, but next generation instruments are poised to be deployed (Plavchan et al. 2015). The "NN-explore Exoplanet Investigations with Doppler spectroscopy" (NEID) instrument aims to have total per measurement error budget of ∼30 cm s−1 (Halverson et al. 2016). The "Echelle SPectrograph for Rocky Exoplanets and Stable Spectroscopic Observations" (ESPRESSO) instrument is anticipated to achieve a per measurement precision of ∼10 cm s−1 (Pepe et al. 2014). Thus, although current RV surveys are only sensitive to Jupiter and Saturn analogues (subject to their having a sufficiently long observational baseline), Earth/Venus analogues may be detectable in the coming years. Note that although the semi-amplitude of Uranus and Neptune are large compared to the terrestrial planets, their exceptionally long orbital periods mean that the detection of their analogues continues to pose a significant challenge.

Table 4. The Semimajor Axes and Orbital Periods for the Solar System's Planets, and the Radial Velocity Semi-Amplitude (K), Transit Depth, and Transit Probability for Those Planets, As They Would Be Seen By An External Observer

PlanetSemimajor Axis (au)Orbital Period (days)K (m s−1)Transit Depth (ppm)Transit Probability (%)
Mercury0.38787.970.008121.20
Venus0.723224.700.086760.64
Earth1.000365.250.089840.47
Mars1.524686.970.008240.31
Jupiter5.204332.612.48105510.09
Saturn9.54107592.7674980.05
Uranus19.19306890.3013480.02
Neptune30.1601820.2812660.02

Note. We remind the reader that the value of K given above is the maximum amplitude that would be observed, in the case of an orbit that was oriented along the observer's line of sight. In cases where the orbit was tilted to that line of sight, only the line-of-sight component of the motion would be observed, resulting in a smaller signal.

Download table as:  ASCIITypeset image

Also shown in the above table are the predicted transit depths (in parts per million, ppm) and transit probabilities for the eight solar system planets. The transit depth of Mercury lies at the very threshold of the photometric precision of the Kepler mission (Borucki et al. 2010), and while Kepler did detect planets with diameters smaller than or comparable to Mercury (e.g., Kepler-37 b Barclay et al. 2010; Kepler-444 b Campante et al. 2015), those planets moved on orbits with periods far shorter than that of Mercury (∼13.4 and ∼3.6 days, respectively). Such short orbital periods allowed large numbers of transits to be observed during Kepler's primary mission, which greatly increased the ease with which those planets could be detected.

However, the major challenge in the detection of solar system analogues lies in the transit probability, which follows an inverse relationship with the semimajor axis (Kane & von Braun 2008). Simply put, the more distant the planet from its host, the lower the likelihood of it being observed to transit from any given external location. Depending on the duration and window function of the survey (von Braun et al. 2009), most of the solar system's planets would, at best, be detected as single transit events by current and past surveys. Such single transit events are far from optimal, and pose significant challenges with regards to their follow-up and validation (Osborn et al. 2016; Villanueva et al. 2019; Yao et al. 2019). As a result, the bias of survey completeness toward short orbital periods means that the full context of true solar system analogues is an on-going process that will rely on combinations of exoplanet detection techniques to fully realize.

A further critical aspect of our solar system as an analog for exoplanetary systems is the realization that in situ data for exoplanets is unlikely to be achieved for several generations (and possibly significantly longer). Hence, all inferences regarding the connection between planetary atmospheric data and surface processes relies upon the in situ data obtained within the solar system from which models applicable to exoplanets may be constructed. The primary terrestrial sources of such atmospheric data and geological profiles are Venus, Earth, Mars and several of the moons of the giant planets, most particularly Titan.

Furthermore, it is important to study the evolution of the solar system's planets through time, since they occupy a specific evolutionary state at the present epoch (Del Genio et al. 2019). The most relevant solar system bodies to the study of terrestrial planets are Venus and the Earth since they represent the primary target demographic of exoplanet searches in the context of habitability (e.g., Kane et al. 2019b). To illustrate the importance of the study of our own planets, we note that the abundance of oxygen in the Earth's atmosphere was significantly lower than the present day for most of our planet's history (Lyons & Reinhard 2009), while recent studies have revealed that Venus could have retained its liquid surface water up until ∼1 Gya (Way et al. 2016). Searches for potential Venus analogues will play an important role in understanding the bifurcation of atmospheric evolution between Venus and Earth in our solar system (Kane et al. 2013, 2014). More detailed studies of the Venusian atmosphere are required in order to fully characterize runaway greenhouse environments for terrestrial exoplanets (Schaefer & Fegley 2011; Ehrenreich et al. 2012; Kane et al. 2018). Likewise, the solar system's giant planets are critically important laboratories for deriving atmospheric and interior models of giant exoplanets as a function of composition, age, and insolation flux (Mayorga et al. 2016; Withers & Vogt 2017; López-Puertas et al. 2018).

As mentioned previously, analogues of the solar system's ice giants are particularly challenging for us to detect at present. Despite this, data from studies of the solar system's ice giants are being utilized for exoplanet models (Arridge et al. 2014), and scaled down versions of the solar system may make the detection of ice giant planets beyond the snow line more accessible (Kane 2011). Overall, the expanding exploitation of the synergies between planetary science and exoplanetary science is an essential step in reliably inferring the derived properties of exoplanets, most particularly inferences regarding the surface conditions of terrestrial exoplanets.

Over the course of its history, the solar system has exhibited large variability both in the structures within the system and the visible emission from the attendant planets and debris belts that would be detectable by an outside observer. As a result, how the system would have looked to an external observer has changed dramatically over the past 4.5 billion years.

At the earliest epoch, around 3–10 Myr after the Sun arrived on the main sequence, the gaseous proto-planetary disk would still be in the final stages of dissipation, or have already dispersed. At that time, the outer solar system would be dominated by the giant planets, which would still be warm from accreting their gaseous envelopes, and would therefore be radiating brightly in the near infra-red (much as is the case with the four giant planets orbiting HR8799 e.g., Marois et al. 2008, 2010). At this stage the primordial debris belts would most likely still be dynamically cold, producing little dust through collisions, and therefore be faint. In the inner solar system, the terrestrial planets would not yet be fully formed, and the inner regions of the solar system would therefore be full of small, short-lived dust particles, as a chaotic pinball of proto-planetary embryos collided and aggregated to one day form the four terrestrial planets. At infrared wavelengths, the warm dust produced in this manner would be detectable as a bright infrared excess at near and mid-infrared wavelengths, perhaps with silica emission features as a result of the colliding embryos producing high temperature, gaseous SiO, as seen around HD 15407 (Fujiwara et al. 2012) and HD 172555 (Lisse et al. 2009; Johnson et al. 2012).

In the Nice Model for the evolution of the solar system, it was suggested that the migration of Saturn and Jupiter resulted in the two planets crossing a mutual mean motion resonance approximately 700 Myr after the solar system formed. The result of that resonance crossing was a massive dynamical instability, which disrupted the orbits of the giant planets and the solar system's debris belts. The orbits of the outer four planets become chaotic, and the quiescent primordial debris belts were shattered, scattering cometary and asteroidal material far beyond their point of formation by the rearrangement of the giant planets onto their current orbital configuration. The model suggests that this rearrangement resulted in the Late Heavy Bombardment, delivering a drenching of water rich cometary material to the terrestrial planets. Some of the scattered small bodies were captured into the Jovian and Neptunian Trojan populations (e.g., Morbidelli et al. 2005, Lykawka & Horner 2010), while the migration of Jupiter sculpted the Asteroid belt, gutting the inner system and removing ∼90% of its original mass. If the solar system did indeed experience a period of dynamical instability such as that proposed in the Nice Model, then the collisions driven by this proposed period of extreme instability would have resulted in the production of a large density of small dust grains, which would in turn produce an appreciable, although short lived (<104 yr), brightening of the Sun's debris disk by several orders of magnitude (Wyatt et al. 2007; Booth et al. 2009; Raymond et al. 2011). Dust-producing collisions occur stochastically over the lifetime of a planetary system, such that the levels of dust (and hence observed infra-red excess) in the system will peak and decay many times.

Currently, at an age of 4.5 Gyr, the Sun's two principal debris belts are very faint, with fractional luminosities, Ldust/Lstar ∼ 10−7 for the inner solar system (Backman et al. 1995—the definition of "Zodi") and ∼1.2 × 10−7 for the Edgeworth–Kuiper Belt (Vitense et al. 2012).67 The peak of the continuum emission from the Edgeworth–Kuiper Belt (the solar system's cold disk) is 0.5 mJy at a wavelength of approximately 40–50 μm, as seen from a distance of 10 pc—a flux that is less than 1% of the contribution from the Solar photosphere at those wavelengths (Vitense et al. 2012). By contrast, the peak of the continuum emission from the zodiacal light falls at approximately 20 μm (Reach et al. 2003), at a flux level of 0.1 mJy, as seen from a distance of 10 pc (compared to a stellar flux at 2 Jy at the same wavelength).68

These values are lower than those observed for the debris discs detected around nearby Solar type stars (Montesinos et al. 2016; Sibthorpe et al. 2018), which typically exhibit fractional luminosities for cold debris discs of the order 10−4–10−5 and warm discs (e.g., Fujiwara et al. 2013; Kennedy & Wyatt 2013) up to levels of a few percent of the stellar flux (e.g., BD+20 307, Zuckerman et al. 2008; HD 15407, Fujiwara et al. 2012). The known menagerie of debris discs around other stars therefore represent brighter, and more massive (and/or more collisionally active), analogues to the solar system's Edgeworth–Kuiper or Asteroid belts.

Current detection of circumstellar debris dust is limited by instrument sensitivity, and many fainter discs remain to be detected—the Edgeworth–Kuiper Belt is believed to have a disk brightness about average for its age (Moro-Martín et al. 2015), but is as yet undetectable. With this caveat, far-infrared observations by Herschel identified cool debris discs around ∼15% of nearby, Sun-like stars (Trilling et al. 2008; Eiroa et al. 2013; Marshall et al. 2014b; Montesinos et al. 2016; Sibthorpe et al. 2018), with a higher incidence of ∼30% around more massive A-type stars (Chen et al. 2006; Thureau et al. 2014).

The combination of volume-limited far-infrared surveys with exoplanet survey results for the same sample of stars makes a direct statistical comparison of the properties of stars with planets (down to a few Earth masses) and/or debris discs (with dust brightness levels ∼10× the Edgeworth–Kuiper Belt) possible for the first time. This led to the determination that stars with detected planets have an enhanced incidence of circumstellar debris (Matthews et al. 2014). Searching for more refined correlations, such as with planetary mass or stellar metallicity, has led to the identification of weak trends (Wyatt et al. 2012; Marshall et al. 2014a; Meshkat et al. 2017), but such analyses are hampered by the heterogeneity of the underlying disk and planet data sets, making direct determination of the impact of such properties on the occurrence of planets/debris difficult (Moro-Martín et al. 2015; Wittenmyer & Marshall 2015).

Spatially resolved imaging of the scattered light and/or the continuum emission from circumstellar debris has revealed the architectures of these planetary systems in greater detail. In this regard the HST, Gemini/GPI, and VLT/SPHERE at optical/near-infrared wavelengths, and ALMA operating at radio wavelengths have been revolutionary. Most obviously, the complex dynamical structure exhibited by β Pictoris' debris disk, believed to be induced by its directly imaged Jovian-mass companion, clearly exhibits the interplay between the planetary and planetesimal components in a system (e.g., Smith & Terrile 1984; Golimowski et al. 2006; Dent et al. 2014; Matrà et al. 2017, 2019).

At millimeter wavelengths, other systems have exhibited an equally intriguing variety of structures thought to be the result of disk-planet interaction, such as eccentric debris rings (MacGregor et al. 2017; Faramaz et al. 2019), gaps in broad planetesimal belts (Marino et al. 2018, 2019), multiple planetesimal rings (MacGregor et al. 2019), haloes of millimeter-sized dust grains from dust filtering (Marino et al. 2017; MacGregor et al. 2018) and scattered planetesimal populations (Marino et al. 2017; Geiler et al. 2019). At optical and near-infrared wavelengths, high angular resolution coronagraphic imaging reveals similar structures to those seen at millimeter wavelengths, including the presence of haloes of small dust, asymmetries, and eccentric rings (Schneider et al. 2014).

Initial successful efforts toward scattered light imaging of debris discs (building on the detection of the disk around β Pictoris; Smith & Terrile 1984) were dominated by the HST (Schneider et al. 2014; Choquet et al. 2018; Ren et al. 2019) and the reanalysis of archival observations (Choquet et al. 2016). More recently, advances in high contrast adaptive optics have led to a renaissance in ground-based disk imaging, with high profile results including the polarimetric imaging of HR 4796A (Perrin et al. 2015), the resolution of many young discs (e.g., Hung et al. 2015; Millar-Blanchaer et al. 2016; Chauvin et al. 2018; Hom et al. 2020) including new detections (Sissa et al. 2018), the identification of asymmetries (Draper et al. 2016) and complex substructures (Olofsson et al. 2018). The information gleaned from the dust distribution and properties in these two wavelength regimes is complementary, providing constraints on the dust grain scattering albedo and size (e.g., Choquet et al. 2018; Marshall et al. 2018).

Determining the distribution and intensity of the scattered light levels around other stars has been of particular interest due to its implications for future exoplanet direct imaging instruments (e.g., Roberge et al. 2012). This exo-zodiacal light would constitute a confusing background to any attempts at the direct imaging of exoplanets in the habitable zone of their host stars. The direct measurement (by photometry or spectroscopy) of exo-zodiacal light would be stymied by the brightness of the star. Interferometric methods have therefore been employed at optical and infrared wavelengths to infer exo-zodiacal dust levels through the measurement of the visibility deficit revealing the presence of an extended component (i.e., circumstellar dust) to the total emission from a stellar system.

Using the Keck interferometer, limits at the level of 100× to 1000× Zodi were obtained for tens of stars, with a few detections for the brightest discs (Mennesson et al. 2014). Measurements with VLTI surveyed around 90 stars at near-infrared wavelengths spanning a broad range of stellar spectral types, finding a hot dust incidence comparable to that of cool debris discs detected by Spitzer and Herschel but uncorrelated with the presence of a known outer debris belt (Absil et al. 2013; Ertel et al. 2014). Most recently, the Hunt for Observable Signatures of Terrestrial Systems on the Large Binocular Telescope Interferometer searched for dust close to the habitable zone of 38 stars, finding that the presence of habitable zone dust was correlated with cool dust, and that most Solar-type stars were not very dusty, with a median of 3 Zodis and a 95% upper limit of 27 Zodis based on their N-band measurements (Kennedy et al. 2015; Ertel et al. 2018, 2020).

5.3. Debris Dust in Exoplanetary Systems

Almost all stars begin their existence with a circumstellar disk most commonly identified through excess emission at infrared wavelengths (Williams & Cieza 2011; Ribas et al. 2014; Wyatt et al. 2017). The effects of grain growth (at the earliest stages) and collisional evolution lead to a predictable monotonic decay in disk brightness over time, assuming a steady-state evolution (e.g., Wyatt et al. 2007; Kains et al. 2011; Marshall et al. 2011; Löhne et al. 2012; Sierchio et al. 2014). The dust masses in circumstellar discs can be traced from the sub-millimeter emission, revealing a drop in the observed mass at around 1–10 Myr in age (e.g., Panić et al. 2013; Holland et al. 2017), comparable to the approximate lifetime of protoplanetary discs as seen through observation of accretion and gas tracers (Ribas et al. 2014; Richert et al. 2018). More massive and compact (smaller semimajor axis) discs evolve more quickly due to the shorter collisional timescales for dust-producing planetesimals, leading to the same overall brightness decay over time, which follows an approximate 1/t relation. A bright disk is not necessarily a massive disk,69 except potentially at the earliest phase of its evolution.

While the evolutionary trend with time holds for the vast majority of debris discs, there exist several well-known anomalies, such as BD+20 307 (Song et al. 2005) and q1 Eri (Liseau et al. 2008, 2010), that are brighter (higher fractional luminosity) than expected given the stellar age (see Figure 18). The interpretation of these abnormally bright discs is that the observed emission is, wholly or partially, the result of stochastic event(s) that have caused an enhanced rate of dust production at the present epoch, rather than being the result of those discs being unusually massive.

Figure 18.

Figure 18. Here we show the evolution of infrared excess for warm (Asteroid belt) and cold (Edgeworth–Kuiper Belt) analogues as a function of time for A stars (blue, top row) and FGKM stars (red, bottom row). Excess emission is characterized by the ratio of total flux to stellar photospheric flux as measured by the Spitzer MIPS instrument at 24 and 70 μm. Photometric measurements were taken from the Spitzer IRS debris disk catalog (Chen et al. 2014). The horizontal solid line denotes Rwav = Fwav/F* = 1 (i.e., no excess emission). The dashed lines denote the boundary above which "high" excess systems exist R = 2/20 for A-stars at 24/70 μm, R = 1.25/15 for G-stars at 24/70 μm; (Wyatt et al. 2007; Kains et al. 2011). The dotted–dashed curved line is the envelope of 24 μm excesses (150tage−1) as determined by Rieke et al. (2005).

Standard image High-resolution image

The mostly likely mechanism behind this enhanced dust production is thought to be either self-stirring by the largest planetesimals within the disk (e.g., Kenyon & Bromley 2008; Krivov & Booth 2018), or the migration of a planet through the planetesimal belt (e.g., Mustill & Wyatt 2009). The process by which self-stirring could result in excess dust production is subject to many assumptions (such as the size and strength of the planetesimals in the disk), leading to orders-of-magnitude uncertainty in the timescale for the mechanism to act. At the most extreme end of systems with enhanced collision rates are bright (Ld/L* > 0.01) exo-Asteroid belts that display evidence of rapid time variability or decay at near- or mid-infrared wavelengths (e.g., Melis et al. 2012; Meng et al. 2015), ruling out an in situ, steady-state origin for the observed dust. At far-infrared wavelengths, the detection of spatially resolved emission from overly bright debris discs around young stars (e.g., Moór et al. 2015, 2017a, 2017b, Vican et al. 2016); Vican et al. 2016) places a constraint on the disk evolution, revealing several systems that are consistent with either self-stirring or planet migration influencing the observed disk architecture. Given that these discs are anomalously bright, the stirring event should have only been a (relatively) recent occurrence in these systems.

For example, a recent detection of CO gas in the debris disk around HD 32297 reveals that the observed debris disk could be coeval with the gas disk, which in turn suggests that it might therefore be only a few thousand years old (Cataldi et al. 2020). This bright debris disk system is young, but otherwise unremarkable in its extent or architecture. As a result, this discovery suggests that the tacit assumption that the discs we observe around other stars are the same age as the host star and present as a result of (broadly) steady state evolution is not always robust.

A handful of debris discs have mid-infrared spectra with features that reveal the composition of their constituent dust. As a result, the origin of that dust, and the nature of its production, can be directly determined. The debris disk orbiting HD 172555 has SiO gas in its spectrum, consistent with high temperature, shocked material produced in a hypervelocity impact (Johnson et al. 2012), whereas that around HD 109085 reveals a mixture of primordial (icy, amorphous) and processed (crystalline) material consistent with an origin of the parent bodies in the outer belt of that system (Lisse et al. 2013).

As we become better able to study the structure and distribution of dust in exoplanetary systems, it will become possible to model the processes that cause that structure to occur. For example, the debris dust orbiting HR 8799 has been found to lie in two distinct reservoirs, one interior to, and the other exterior to, the star's four known giant planets. Much as is the case in the solar system, the structure of debris belts in the HR 8799 system is clearly driven by the gravitational sculpting resulting from the influence of those four planets (e.g., Su et al. 2009; Contro et al. 2016; Goździewski & Migaszewski 2018; Geiler et al. 2019). By contrast, the sharp inner edges to the dust distributions of two young stars (HR 4796A and HD 141569) have been proposed as being the result of the photophoretic driving of dust through a particularly tenuous dispersing gas disk (e.g., Krauss & Wurm 2005, Krauss et al. 2007; Hermann & Krivov 2007)—a process that has also been invoked to explain the origin of high-temperature inclusions in the solar system's asteroidal and cometary bodies (e.g., Mousis et al. 2007a, 2007b, Moudens et al. 2011).

Among known debris discs, Poynting–Robertson drag (as discussed in Section 4.9.1) has a minimal effect on the radial distribution of dust grains (Wyatt 2005, 2006). These systems are dense enough (∼10 to 1000 times the mass of the Edgeworth–Kuiper Belt) that the dust lifetime is dominated by mutual collisions rather than by migration. For fainter discs, Poynting–Robertson drag can be an important dust removal mechanism, but such systems lie beyond our present capacity to detect. The proposed Spica infrared/sub-millimeter mission, scheduled for launch and operation in the mid-2030's, would provide observations of many exact Edgeworth–Kuiper Belt analogues (Roelfsema et al. 2018).

In the coming years, new facilities will enable us to identify the origin, composition, and dynamics of the dust-producing bodies that comprise debris discs with greater precision than has been achieved to date. Such data will not only help us to better understand and characterize the debris discs in question—it will also play an important role in our efforts to address the more esoteric questions of habitability in these planetary systems in the longer term, since it will offer the potential for us to quantify the impact regimes that would be experienced by potential rocky habitable zone planets, following methods initially developed to study the evolution and history of our own planetary system (e.g., Horner & Jones 2008a, 2009, 2010b, 2010a).

5.4. Comparative Analysis of Planetary Systems

A primary scientific aim of exoplanet searches is to determine the degree to which our solar system represents a typical outcome of planetary system formation and evolution. To this end, much attention and effort is lavished on the quest to detect Earth-like planets in Earth-like orbits. Indeed, this was the chief aim of the Kepler spacecraft mission (Borucki et al. 2010). Space-based transit surveys such as Kepler, CoRoT, and TESS (Barge et al. 2008; Ricker et al. 2015) are particularly adept at detecting such small planets. The photometric sensitivity of these missions reaches about 30 parts per million (ppm) for suitably quiet stars, while the flux decrement due to an Earth-size planet transiting a Solar-type star is 80 ppm. In the four years of the Kepler prime mission, a handful of potential Earth- analog candidates were identified moving on orbits within their host stars' habitable zones (Kane et al. 2016). While the radial velocity (RV) method has been a successful technique for detecting exoplanets for decades (Campbell et al. 1988; Cochran & Hatzes 1994; Butler et al. 1996), the RV signal of an Earth-mass planet in a 1 au orbit is a dauntingly diminutive 8 cm s−1, which remains below the current best precision of a few tens of cm s−1 for instruments such as HARPS and ESPRESSO (Fischer et al. 2016).

Of equal importance, but perhaps capturing less public interest, is the presence of outer giant planets analogous to Jupiter and Saturn, moving on orbits with periods of more than 10 yr. For these planets, the transit method is of minimal use, both because the probability of a planet transiting its host scales inversely to the planet's semimajor axis, and because the transit of such a distant planet would be a singular ∼1 day event in any given 10+ year period. Fortunately, however, the RV method is eminently capable of detecting Jupiter analogues: our Jupiter imposes an RV signal of 12 m s−1 in a 12 yr period on the Sun, and RV surveys have achieved precisions of, or better than, 3 m s−1 for at least the last 25 yr (Fischer et al. 2014; Endl et al. 2016). From the long-running RV surveys, there is a growing consensus that about 3%–6% of Solar-type stars host a "Jupiter analog," typically defined as being a giant planet orbiting beyond the ∼3 au snow line (Wittenmyer et al. 2006, 2011b, 2016b; Zechmeister et al. 2013; Rowan et al. 2016). As the temporal baseline of legacy RV surveys lengthens, true Saturn analogues may be detectable; the RV signal of Saturn is 3 m s−1 in an almost 30 yr orbit. Given that the orbital periods of the ice giants Uranus and Neptune are 84 and 165 yr, respectively, and that their maximum RV amplitudes would be just 0.3 and 0.28 m s−1, the RV method is not, currently, likely to deliver true analogues for those planets any time soon, and so to find such planets, we must turn to other detection techniques.

Direct imaging probes a different but complementary region of parameter space to the RV and transit methods, being most sensitive to giant planets moving on orbits at tens of au from their host stars. To date, the favorable contrast ratios of young planetary systems have biased direct imaging detections toward systems younger than ∼100 Myr. The HR 8799 system (Marois et al. 2008, 2010; Wang et al. 2018a) has been considered a benchmark "scaled up solar system analog," with four giant planets of mass 5–7 Jupiter masses orbiting between 14 and 68 au, as well as two distinct debris belts (potentially analogues of the Asteroid and Edgeworth–Kuiper belts; e.g., Su et al. 2009; Contro et al. 2016; Goździewski & Migaszewski 2018). As mentioned in Section 5.2, scaled down versions of the solar system may also assist in the detection of ice giant analogues via RVs and transits (Kane 2011).

As we noted earlier, the combination of transit and RV observations enables the density of certain exoplanets to be determined. In the main, these results have been used to draw conclusions on the potential composition of the planets in question, but it has also recently been used to highlight the significant disparity between the densities of two almost identically sized planets in the Kepler-107 system. Both Kepler-107 b and c have radii around 1.5–1.6 times that of the Earth, and so it would be natural to assume that RV observations would show them to have similar masses. However, such observations instead reveal that the bulk density of Kepler-107 c is more than twice that of Kepler-107 b (12.6 g cm−3 versus 5.3 g cm−3). As discussed by Bonomo et al. (2019), the best explanation for this disparity is that Kepler-107 c was once the victim of a giant, mantle-stripping collision—as was the case for Mercury in the solar system (e.g., Benz et al. 2007; see also Section 4.1.1, above). These results demonstrate that we are now entering an era where mass and size measurements of exoplanets can begin to give information not only on the composition of those planets, but also on their collisional history. Given the widespread evidence of giant impacts in the solar system, this result is perhaps not particularly surprising—but it is worth stressing that, without the evidence from our own backyard, planets such as Kepler-107 c would probably be hugely confusing additions to the exoplanet catalog. Given the uncertainties inherent in calculating the densities of transiting exoplanets (particularly the smallest), it seems unlikely that we would be able to definitively identify the Earth and Mercury as having relatively high densities compared to the other terrestrial worlds, were we observing the solar system from the outside. However, in the coming years, the precision with which we can make such measurements will improve as new facilities come on-line. In the future, we may well therefore reach a point where such forensic studies can be carried out for planetary systems similar to our own.

Currently there are in excess of two hundred known satellites orbiting the eight planets within the solar system (as detailed in Section 2, in Table 1), most of which orbit the two largest planets in our system, with Jupiter hosting 79 known moons and Saturn hosting 82 known moons. While none of those moons are large enough that we would be able to confidently detect them were they placed in orbit about the known exoplanets, the sheer number of moons in the solar system, particularly the large number orbiting the Jovian planets, indicate a high probability that moons will also exist orbiting exoplanets. As a result, there is a general consensus in the scientific community that exomoons do exist (e.g., Williams et al. 1997; Kipping et al. 2009; Heller 2012; Heller & Pudritz 2015; Zollinger et al. 2017). However, while recent work suggested the first preliminary detections of exomoon signatures (Teachey et al. 2018; Teachey & Kipping 2018), those results remain heavily debated, and as a result, no exomoons have been confirmed to date (Heller et al. 2019; Kreidberg et al. 2019).

The lack of confirmed detections should not, however, be taken as evidence that such moons do not exist. Instead, it is, at least in part, the result of such moons remaining beyond the limits of our detection ability, and it is likely that future surveys with more powerful tools will confirm that, like planets, exomoons are ubiquitous. It may, however, be that we have been looking in the wrong places. Astronomers speculate that exomoons are likely ubiquitous based primarily on our knowledge of the solar system's outer planets, yet, to date, we have not looked for moons around planets that are true analogues to the solar system's gas and ice giants. Despite extensive efforts to detect moons around exoplanets with orbital distances between 0.1 and 1 au (e.g., Kipping et al. 2012), the wait for a confirmed detection goes on. This dearth can potentially be explained by the instability of exomoons in systems that experience intense stellar tides (e.g., Barnes & O'Brien 2002) or inward migration (Spalding et al. 2016). Long-period transiting exoplanets may indeed be better candidates for studies of exomoons than their short-period counterparts.

One of the key challenges inherent to the detection of exomoons is the impact of the orbital motion of the moon around the planet as the planet orbits its star. It has been suggested that a potential method by which exomoons could be detected is through their effect on the transits of their host planet across its star. There are two ways in which such variations could manifest and be detected—through transit duration variations and/or TTVs (Brown et al. 2001; Kipping et al. 2012, 2013). In addition, as the planet transits its host star, the moon may also be seen to transit. However, as the planet orbits the star, the moon also orbits the planet, and so the position of the moon in its orbit about the planet will change each time the planet transits the star. The signature of the moon will therefore vary its position within the transit of the planet depending on its orbital phase, as shown in Figure 19, below. The problems caused by this variability in the timing of the moon's transit relative to that of its host planet are further exacerbated by the possibility that there may be occasions when the moon is lined up perfectly with its host planet, and therefore transiting or passing behind the planet as the planet is in transit. Such a scenario will mean that the transit signature of the moon would disappear for that particular transit, leaving a signal that would be indistinguishable from a solitary, moonless planet.

Figure 19.

Figure 19. The transit of a moon may be detected as a planet–moon system passes in front of its host star. As the planet orbits the star, the moon also orbits the planet, and so will produce a different signature depending on its orbital position as it transits the star. The panels above show the expected planet–moon transit signatures for both a moon trailing behind the planet as it transits (green) and a moon leading the planet as it transits (red). Note that if the moon is transiting or eclipsed by the planet when the planet passes in front of the star, then the signature of the moon will disappear, and the planet will appear to be moonless.

Standard image High-resolution image

The time at which the planet transits can also be affected by the existence of an exomoon. The moon–planet system orbits about the center of mass of the two objects (their barycenter). In the absence of any perturbations from other planets in the system, the barycenter of the planet–moon system will follow a Keplerian orbit around the star. At those times when the moon trails the planet, the planet must therefore be leading the barycenter in its orbit around the star (since the planet and moon must always be directly opposite one another across the barycenter). As a result, when the moon trails the planet, the planet will be seen to transit its host star a little earlier than would otherwise be expected. Equally, at times when the moon is leading the planet in their orbit around their host star, the planet must therefore lag behind, trailing the barycenter. As a result, the planet will be observed to transit the star later than would otherwise be expected. Thus the variance of the transit timing of the planet can be an indication of the existence of an exomoon. However this method is also used to determine the existence of additional planets in orbit around the star causing a similar gravitational effect on the transiting planet. As a result, caution is needed when using this method for exomoons. Other potential methods that have been suggested for exomoon detection include microlensing (Han & Han 2002; Bennett et al. 2014), pulsar timing variation (Lewis et al. 2008), radio emission modulations (Noyola et al. 2014) and direct imaging (Cabrera & Schneider 2007). Hill et al. (2018) found that instruments would need the capacity to resolve a separation of 10–4400 μ arcseconds in order to directly image and resolve a planet–moon system. Direct imaging could also reveal the existence of a moon through the transit of a moon or of its shadow across a bright planet (Heller 2016).

The current focus of exomoon searches on planets at relatively small orbit radii is the direct result of the fact that the majority of the most promising techniques for the detection of exomoons rely on the detection of transits of the moon's host planet. Such transit observations are strongly biased toward short period planets, with transits growing more unlikely as the orbital radius of the planet increases. Two giant exoplanets moving on orbits well beyond 1 au that are possible candidates for in-transit exomoon detection are Kepler-421b (Kipping et al. 2014) and Kepler-167e (Kipping et al. 2016). The ephemerides of these planets have been found to be free of TTVs (Dalba & Muirhead 2016; Dalba & Tamburo 2019), enabling the accurate prediction of future transits that may be searched for photometric signals of exomoons.

Over the past few years, the study of exomoons has become a topic of great interest, with particular focus on the possibility that such moons might increase our chances of finding potentially habitable worlds beyond the solar system. Studies on the occurrence rates of giant planets have found that they are less likely to be found in the habitable zone of their star than smaller terrestrial planets. However, if each giant planet hosts more than one moon, then habitable zone exomoons may well be more numerous than in the habitable zone terrestrial planets (Hill et al. 2018).

One reason that exomoons are considered particularly interesting in the context of the search for life beyond Earth is that they would likely offer a variety of sources of energy to a potential biosphere, rather than relying on the flux received from the host star—an idea fueled by our observations of Io and Europa in the Jovian system. The reflected light and emitted heat from the host planet could help to ensure a clement climate on such a moon, while tidal heating could drive volcanic activity, providing both additional energy and nutrients to any life that developed there (Heller & Barnes 2013; Hinkel & Kane 2013). These combined heating effects act to increase the size of the habitable zone around the host star of such moons, creating a wider temperate area in which they could maintain conditions that are amenable to liquid surface water (Scharf 2006). At the same time, occultations of the host star by the giant planet could help to reduce the incoming energy flux for satellites moving close to the inner edge of the habitable zone—preventing such a moon from overheating. This interplay between tidal and radiative heating from the host planet and the diminution of the incoming stellar flux during occultation events is potentially complex, with the inclination of the moon's orbit, and the presence of additional moons in the system likely to play an important role. For an exomoon orbiting an exoplanetary Uranus, for example, occultation events would occur seasonally, with the satellite spending most of its time in full starlight. At the same time, the orbital eccentricity of the planet itself will doubtless impact upon the potential habitability of its moons. For a planet like BD+14 4559 (whose orbit is shown below, in Figure 20) that spends part of its orbit outside the outer edge of the habitable zone, the extra tidal and radiative energy delivered by the host planet could potentially enable any potentially habitable exomoon to maintain its habitability during this time, though it should also be noted that such a moon would likely lack the time to freeze over during these excursions beyond the nominal outer edge of the habitable zone.

Figure 20.

Figure 20. BD+14 4559 is a giant planet that resides mostly within the habitable zone of its star. Due to the extra energy sources provided to an exomoon, any moon orbiting this planet may still be able to maintain habitable conditions during the part of the planet's orbit when it exits the outer edge of the planetary habitable zone. (Image Source: The Habitable Zone Gallery (http://www.hzgallery.org/index.html; accessed 2020 April 23rd); described in Kane & Gelino 2012).

Standard image High-resolution image

In addition, as we have seen in our own planetary system, large exomoons offer the possibility for liquid water to be found far beyond the boundary of the habitable zone, at least as it has traditionally been defined. Moons like Europa, Enceladus, and Titan have become targets of great interest for astrobiological research, despite the fact that they lie many au beyond the outer edge of the solar system's habitable zone. While such moons would clearly be of great interest were they found orbiting Jupiter-analog exoplanets, it is likely that they would be poor targets for the search for life elsewhere. While it is reasonable to assume that Europa is a habitable world, for example, it is not fair to consider that it would be detectably habitable. Europa's ocean is buried beneath a kilometers thick ice sheet—and even in our own backyard, we cannot yet say whether that ocean contains life. For this reason, the focus on exomoon habitability, in a detectable/measurable sense, will remain on those satellites of giant planets that themselves orbit within (or close to) the habitable zone of their host star—such satellites would offer not only the possibility of being considered potentially habitable worlds, but also the possibility that their habitability could be remotely probed, using the next generation of astronomical telescopes and instrumentation.

5.5. Super-Earths and Mini-Neptunes

In the solar system, Earth and Neptune bound a clear division between the terrestrial and gaseous planets. Separated in size by a factor of four, no known solar system object has a radius larger than Earth's but smaller than Neptune's. It was natural, therefore, to expect that exoplanetary systems might follow this same behavior—rocky, terrestrial worlds close to their host stars would be relatively small (as in the solar system), while the more distant giant planets would all be large—Neptune-sized or bigger.

It has become obvious, however, that this is not the case. One of the great surprises to come out of the Kepler mission was that, rather than being rare, planets between the size of Earth and Neptune are actually abundant. Kepler found such planets in droves—as can be seen in Figure 21, below. Now, it is widely known that super-Earths—likely terrestrial worlds with radii slightly larger than the Earth—and sub-Neptunes—likely planets with thick gaseous atmospheres, reminiscent of the solar system's ice giants, with radii slightly smaller than Neptune—are by far the most common of the planets that we can currently detect in the galaxy. Some authors have even suggested that such planets might be the most abundant of all—outnumbering their smaller and larger siblings—but at the same time, logic would dictate that there are likely to be more small planets than large ones in the cosmos (just as, in the solar system, there are more small bodies than larger ones, and just as there are more small, low mass stars than massive stars). The true size distribution of planets smaller than the super-Earths remains to be uncovered in the coming decade—but regardless of the degree to which they are the most common planets in the cosmos, the super-Earths and sub-Neptunes pose a fascinating conundrum, showing where our ability to infer planetary properties based on the solar system breaks down.

Figure 21.

Figure 21. Known transiting planets by size. This histogram demonstrates how readily the primary Kepler mission discovered super-Earth and mini-Neptune exoplanets. Even when corrected for observing biases, exoplanets with radii in between that of the Earth and Neptune seem to be the most prevalent planets in the galaxy. Data from the NASA Exoplanet Archive (https://exoplanetarchive.ipac.caltech.edu) (accessed 2019 July 9th).

Standard image High-resolution image

Prior to the era of Kepler, the surprising abundance of super-Earth and mini-Neptune exoplanets was already becoming clear, with the detection of such planets being achieved by several RV surveys. For planets moving on orbits with periods less than fifty days, it is clear that smaller planets are more common than giants—with the occurrence rate of exoplanets on such orbits rising by approximately a factor of ten as one moves from Jupiter-mass objects to those only a few times more massive than the Earth (Mayor et al. 2009; Howard et al. 2010; Wittenmyer et al. 2011a). The sensitivity of Kepler to exoplanets in this region of parameter space enabled a more detailed investigation of super-Earth and mini-Neptune prevalence. Accounting for both false positives and completeness, Fressin et al. (2013) found the occurrence rates of both super-Earths and mini-Neptunes with periods less than 85 days to be ∼23%, higher than any other size planet in any region of parameter space considered in their study.

Understanding the occurrence rate of super-Earths and mini-Neptunes is of critical importance to theories of planet formation. Early population synthesis models that simulated planet formation via core accretion anticipated an increasing planet occurrence rate with decreasing planet mass for objects orbiting at several au (e.g., Ida & Lin 2004; Alibert et al. 2005; Mordasini et al. 2009). However, the observed prevalence of super-Earth and mini-Neptune exoplanets at orbital distances within 1 au challenged the theoretical models. During formation, planets whose mass grew to several times that of Earth were expected to either spiral into the host star via Type-II migration or rapidly accrete a gaseous envelope to become a gas giant planet (Ida & Lin 2008; Schlaufman et al. 2009; Alibert et al. 2011)—a process of runaway growth that should result in planets rapidly passing through the super-Earth/mini-Neptune mass range. Producing a super-Earth or mini-Neptune planet would therefore require the unlikely dispersal of the gas disk at a particular time, truncating that process of runaway growth. The mere existence of so many super-Earth and mini-Neptune exoplanets in the galaxy demonstrated that this theory was flawed, and has led to numerous suggested mechanisms to explain how these exoplanets may have formed (e.g., Ida & Lin 2010; Rogers et al. 2011; Hansen & Murray 2012; Chiang & Laughlin 2013; Cossou et al. 2014).

Super-Earths and mini-Neptunes occupy a fascinating region of the mass–radius parameter space whereby vastly different interior and atmospheric compositions are a priori plausible for a given planet. These planets can have substantial fractions of silicate rocks, metals, ices, and gases (e.g., Valencia et al. 2007; Rogers & Seager 2010). A subset of super-Earth and mini-Neptune exoplanets that are particularly amenable to transit and RV characterization have had their radii and masses measured precisely (e.g., Marcy et al. 2014; Weiss & Marcy 2014). Substantial effort has been made to explore the atmospheres and interiors of this subset of well characterized exoplanets. Lopez & Fortney (2014) used planetary evolution models to suggest that, for mini-Neptunes, the planet's radius can be used as a proxy for its hydrogen-helium envelope fraction. Wolfgang et al. (2016) employed a probabilistic mass–radius relationship model to explore the intrinsic astrophysical scatter present in the relation between super-Earth and mini-Neptune mass and radius, while Rogers (2015) demonstrated that most 1.6 Earth-radius exoplanets have densities that are too low to be explained by just iron and rock, suggesting that these exoplanets are not terrestrial in nature. Beyond the small group of studies listed here, the atmospheres and interiors of these transitionary worlds are the subject of intense ongoing investigation.

Given the ubiquity of super-Earth and mini-Neptune exoplanets in the galaxy, one must wonder: why is the solar system devoid of this class of planet? In other planetary systems, the presence of cold, long-period giant exoplanets has been found to be strongly correlated to the existence of inner super-Earth-sized companions (e.g., Zhu & Wu 2018; Bryan et al. 2019). One potential explanation for the lack of such a planet in the inner solar system is that Jupiter could have acted as a barrier to Uranus, Neptune, and Saturn, halting their inward migration and preventing them from becoming super-Earth or mini-Neptune class planets (Izidoro et al. 2015a). Equally, given proper disk conditions, super-Earths may have existed in the solar system previously, clearing the material inside of Mercury's orbit before colliding with the Sun (Martin & Livio 2016). Yet another explanation could be that the solar system does contain a super-Earth or mini-Neptune planet that has so far eluded detection (see Batygin et al. 2019 for a review). Additional explorations of the small and large bodies nearby and across the galaxy will likely shed light on this peculiar aspect of the solar system.

5.6. Exoplanetary Atmospheres

Exoplanetary atmospheres are a window to an exoplanet's composition, formation history, evolution, and even habitability. The proximity of the solar system planets has afforded us the luxury of being able to study their atmosphere up-close and personal. Unlike exoplanets, which cover just a single pixel through the largest telescopes, the solar system's planets are easily resolved, allowing the behavior of their atmospheres to be studied in detail. The advent of planetary space exploration, in the 1960s, meant that we could get closer still—sending orbiters to observe the planets continuously, and even launching probes that could dive into their atmospheres, and, for the terrestrial planets, land on their surfaces. This has allowed us to investigate the atmospheres of the planets in our solar system to a level that will most likely not be possible for an exoplanet in the foreseeable future. Almost all of the information we can gather pertaining to an exoplanet's surface, formation, and evolution will be filtered through distant observations of its atmosphere.

Not long after the discovery of the first exoplanet came the first theoretical predictions regarding exoplanet atmospheric characterization. Seager & Sasselov (2000) proposed that measurements of the wavelength dependence of an exoplanet's transit depth would reveal variations in atmospheric opacity. This could, in turn, be used to infer the properties and composition of an exoplanetary atmosphere through a technique known as transmission spectroscopy. This technique was first successfully used to identify sodium in the atmosphere of HD 209458 b based on four transits observed by the HST (Charbonneau et al. 2002).

Since the initial characterization of the atmosphere of HD 209458 b, several other techniques for exoplanet atmospheric characterization have been developed. The observation of the occultation of short-period, highly irradiated exoplanets by their host stars (secondary eclipses70) led to the first detection of photons directly emitted by an exoplanet atmosphere (Charbonneau et al. 2005; Deming et al. 2005). The observation of secondary eclipses enables the calculation of planetary Bond albedo and atmospheric temperature. Similarly, Knutson et al. (2007) conducted infrared observations of half of the orbital phase of the hot Jupiter HD 189733 b, effectively mapping the temperature distribution across planetary longitude. Phase curve maps, such as those presented in that work, have provided valuable information about energy transport and heat redistribution in the atmospheres of the hottest exoplanets.

Exoplanet atmospheres are also being characterized through high dispersion ground-based spectroscopy, whereby spectral lines of an exoplanet are detected separately from stellar lines thanks to the time varying radial motion of the exoplanet (e.g., Snellen et al. 2010). This technique has lead to confident detections of carbon monoxide and water vapor in exoplanet atmospheres (e.g., Brogi et al. 2012; Birkby et al. 2013), has enabled the estimation of day–night wind velocities on exoplanets, and has provided validation for 3D exoplanet atmosphere models (Kempton et al. 2014).

At greater orbital separations, direct imaging has also become a viable technique for the atmospheric characterization of young, self-luminous exoplanets. The HR 8799 system contains three directly imaged exoplanets that have been observed for over a decade, revealing critical information about the atmospheres and orbital dynamics of young giant planets (e.g., Marois et al. 2008, 2010). However, the immense planet–star contrast and the fundamental limits on angular resolution set by diffraction add difficulty to the direct observation of smaller or cooler (i.e., more mature) exoplanets at visible and near-infrared wavelengths.

Among the various methods of exoplanet atmospheric characterization, transmission spectroscopy is presently the most productive means of probing exoplanet atmospheres. Although the entire sample of exoplanets that have been subject to transmission spectroscopy observations are hotter and have shorter-period orbits than any object in the solar system, substantial efforts have been made to simulate the transmission spectra of the solar system's planets. For Venus, Barstow et al. (2016) generated synthetic spectra using a radiative transfer model to determine that the cloudiness of a Venus-analog exoplanet would preclude atmospheric characterization from an observatory such as the James Webb Space Telescope (JWST), even if the hypothetical planet orbited a small star only 10 parsecs away. Pondering the Earth as an exoplanet has become a cottage industry within exoplanetary science. With appropriately high-signal observations, transmission spectroscopy of an Earth-analog exoplanet could reveal the presence of key biosignature gases as well as other species such as ionized calcium (e.g., Pallé et al. 2009). Importantly, mid-transit observations of an Earth-twin would be hindered by atmospheric refraction, which would mask atmospheric information below ∼10 km (e.g., García Muñoz et al. 2012).

Transmission spectra have also been synthesized for Jupiter, Saturn, and Titan. Irwin et al. (2014) used spectroscopic observations and a radiative transfer model to simulate the transmission spectrum of Jupiter between 0.4 and 15 μm. The transmission spectrum of Jupiter as measured in a single transit would yield accurate constraints on the vertical profiles of temperature and hydrocarbon abundance in Jupiter's stratosphere (Irwin et al. 2014). Dalba et al. (2015) demonstrated that occultations of the Sun by Saturn observed by the Cassini Spacecraft in the near-infrared could be used to derive a transmission spectrum between 1 and 5 μm. The spectrum displayed multiple absorption features due to methane, as well as features from the hydrocarbon byproducts of Saturn's photochemistry (i.e., acetylene, ethane, and higher-order aliphatic hydrocarbons). Saturn's transmission spectrum also displayed a flat continuum between 1 and 5 μm caused by atmospheric refraction. Titan's transmission spectrum was also reconstructed from Cassini solar occultation observations (Robinson et al. 2014). It displayed absorption features from methane, acetylene, ethane, higher-order hydrocarbons, and potentially carbon monoxide. Multiple scattering by the thick haze present in Titan's atmosphere gives the spectrum a distinctive slope increasing toward shorter wavelengths, which is comparable in size to the strong absorption features (Robinson et al. 2014).

5.7. Dynamics of Exoplanetary Systems

In order to fully understand the formation and evolution of our solar system, a variety of computational tools have been developed that allow the dynamical evolution of both the small bodies and the planets themselves to be studied on billion year timescales. These n-body dynamics packages, such as SWIFT (Levison & Duncan 1994), Mercury (Chambers 1999), and REBOUND (Rein & Liu 2012) can clearly also be used to study the dynamics of exoplanetary systems. They can be used, for example, to study the impact regimes that might be experienced by Earth-like planets in those systems (e.g., Horner & Jones 2008a, 2009, 2012; Horner et al. 2010), or to assess which of the known exoplanetary systems could host potentially habitable planets, based on the gravitational influence of the known planets in the system (e.g., Jones et al. 2005, 2006; Agnew et al. 2017, 2019, 2018a, 2018b; Horner et al. 2020).

A more immediate and important use for such tools in the study of exoplanetary systems is to use them to assess whether the proposed planets within a given system are actually dynamically feasible. As the number of known exoplanets has increased, the rigor with which new discoveries are investigated, prior to being announced, has fallen. For the first few discoveries, researchers and referees worked to the maxim that "extraordinary claims require extraordinary evidence," and all possible alternatives to the planetary hypothesis were examined in great detail. In recent years, as the discovery of exoplanets has become routine, an ever increasing population of planets moving on unusual orbits, or orbiting peculiar stars, have been announced (e.g., Wolszczan & Frail 1992; Lee et al. 2009; Beuermann et al. 2010, 2013; Potter et al. 2011; Ramm et al. 2016).

Given the importance of new exoplanet discoveries in helping theorists to better understand planetary formation, it is clearly vital to ensure that each newly proposed planetary system is what it seems to be. Since the great majority of such systems are found by indirect means, there is clearly a risk that some other physical process might be producing a variation that could be misinterpreted as the influence of a planet orbiting a given star. Indeed, the detection of exoplanets is hindered by a plethora of phenomena that can cause stars to vary in a fashion similar to that expected to be caused by planetary companions (such as sunspots, magnetic cycles, periodic variability, companion stars, and many others; e.g., Dumusque et al. 2011; Robertson et al. 2014; Meunier et al. 2015; Fischer et al. 2016; Rajpaul et al. 2016; Nicholson et al. 2018). It is therefore useful to have a "sanity check" for each newly proposed planetary system, in order to make sure that the candidate planets are reasonable.

Dynamical studies of newly discovered planetary systems therefore provide an excellent tool by which the veracity of those systems can be examined. If the candidate planets prove to be dynamically stable on timescales comparable to the lifetime of the system, then they can be considered to be dynamically feasible. On the other hand, if dynamical study of a given system shows that the planets therein interact so strongly that they collide, or eject one another from their host system on timescales far shorter than the system's lifetime, this either suggests that we have fortuitously observed the system during its death throes (a statistically unlikely but certainly plausible possibility) or that the proposed planets do not exist, at least on the orbits claimed.

When such simulations are carried out, the results fall into three broad categories. The first are those systems for which no dynamical instability is observed across the full spread of architectures tested (e.g., Wittenmyer et al. 2014b). These systems prove so dynamically stable that it is perfectly reasonable to assume that the planets truly exist on the orbits that are proposed. Beyond this, however, such results show the systems to be so dynamically stable that there is likely to be plenty of space left for further planets in that system—in other words, they indicate systems where further observations may well prove fruitful in coming years.

The second class of exoplanetary system are those for which such simulations reveal only narrow islands of stability across the phase space studied. Examples of the dynamical maps for two such systems are shown in Figure 22, below. In both cases, the candidate planets move on orbits that are close to mutual mean-motion resonances. In the case of HD 200964 (the left-hand plot; Wittenmyer et al. 2012b), our simulations show that the orbits of the two candidate planets are only stable when they are trapped in mutual 4:3 mean-motion resonance (the narrow island of stability visible at the center of the dynamical map). Similarly, for HD 204313 (the right-hand plot; Robertson et al. 2012b), the planets are only dynamically stable when trapped in mutual 3:2 mean-motion resonance. In both cases, as soon as the planets are moved away from a mutually resonant solution, they begin to strongly perturb with one another, resulting in one or other of them being flung into the center star, the two planets colliding with each other, or one being ejected from the system entirely.

Figure 22.

Figure 22. The dynamical stability of the planetary systems proposed orbiting the stars HD 200964 (Wittenmyer et al. 2012b; left) and HD 204313 (Robertson et al. 2012b; right), as a function of the semimajor axis and eccentricity of the orbit of the outermost planet in that system. The color at each point denotes the mean lifetime of a total of more than fifty individual simulations that began with the planet at that particular a–e location. For both systems, the dynamically stable solutions make up only a small fraction of the total number of simulations carried out. In both cases, the stability of the system is only ensured when the two planets within are trapped in mutual mean motion resonance—the 4:3 and 3:2 resonances, respectively.

Standard image High-resolution image

As a result, we can safely conclude that the planetary systems proposed are dynamically feasible provided that the planets therein are trapped in mutual mean-motion resonance. Such results have an interesting additional outcome—not only do they demonstrate that the systems can be dynamically feasible, such simulations have enabled significant additional constraints to be placed on the orbits of the planets in a number of systems (e.g., Robertson et al. 2012a, 2012b; Wittenmyer et al. 2014a, 2016a), above and beyond those that result purely from the observational data.

The third class of system are those for which the proposed planets in a given system move on orbits that appear highly dynamically unfeasible—sometimes even crossing one another's path (e.g., Horner et al. 2011, 2012d, 2013, 2014b, 2019; Wittenmyer et al. 2012a; 2013; Hinse et al. 2014a, 2014b; Marshall et al. 2020). While it is certainly feasible for mutually crossing orbits to be dynamically stable (examples in our own solar system include the Jovian and Neptunian Trojans, and the Plutinos; e.g., Morbidelli et al. 2005; Di Sisto et al. 2010, 2019; Horner & Lykawka 2010a; Pirani et al. 2019), such stability is typically facilitated by mutual mean motion resonance between the orbits of the objects in question. Even for newly discovered objects in our own solar system, it is critically important to confirm whether their orbits are dynamically stable or dynamically unstable (e.g., Lykawka & Mukai 2007; Gladman et al. 2008; Horner & Lykawka 2010a; Horner et al. 2012c; 2011a; Di Sisto et al. 2014, 2019; Wu et al. 2018). For exoplanetary systems, such dynamical studies are even more important—since strong dynamical instability would infer that the planetary system, as proposed, is simply not feasible, and that either the system is dramatically different to that proposed in the literature, or the observed variations that led to the announcement of planets around a given star must have another explanation.

In recent years, a number of multiple planetary systems have been proposed orbiting very tightly bound eclipsing binary stars,including several around what are known as "post-common envelope binaries" (hereafter PCEBs). Those systems consist of a highly evolved primary star (typically, though not always, a white dwarf) with a secondary stellar companion (often a dim red dwarf star, similar to our Sun's nearest stellar neighbor, Proxima Centauri) orbiting at a remarkably small distance. The two stars, which orbit about their common center of gravity with periods of a couple of hours, move on orbits that are aligned such that they eclipse one another each time the secondary passes between the primary star and the Earth, along our line of sight. Multiple massive companions have been claimed orbiting several such binaries (e.g., Lee et al. 2009; Beuermann et al. 2010; Potter et al. 2011; Qian et al. 2011) on the basis of variations in the timing of the eclipses between them, with those timing variations proposed to result from the small wobble back and forth of the binary stars under the gravitational influence of distant, unseen companions. Of those systems, however, only two have stood up to dynamical scrutiny—namely, the planets orbiting NN Serpentis (e.g., Beuermann et al. 2010; Horner et al. 2012d; Beuermann et al. 2013; Marsh et al. 2014) and UZ Fornacis (Potter et al. 2011; Horner et al. 2014a). Even in the case of NN Serpentis, however, Mustill et al. (2013) showed that the planets could not have survived the evolution of the binary from its main-sequence to post-main sequence state, leading to the suggestion that, if the planets truly exist, they most likely represent a second-generation of planet formation, which occurred after the ejection of the star's common envelope, at the end of the white dwarf star's evolution away from the main sequence—a finding supported by Völschow et al. (2014).

In Figure 23, we show the best-fit proposed orbital solutions (top) and resulting dynamical stability plots (bottom) for the planets proposed to orbit the eclipsing binary stars QS Virginis (Lee et al. 2009; Horner et al. 2012a; left) and HW Virginis (Almeida & Jablonski 2011; Horner et al. 2013; right). In both cases, the proposed solutions prove to be catastrophically dynamically unstable across the whole range of possible solutions—with instability occurring on timescales of just a few hundred or few thousand years. Such extreme instability suggests that the origin of the observed variations in the timing of eclipses in these systems can not be planets, at least, not planets moving on the orbits proposed by Lee et al. (2009) and Almeida & Jablonski (2011). Indeed, given the fact that the planets proposed in the majority of such systems have failed to stand up to dynamical scrutiny, it seems likely that a common non-planetary explanation must be sought to explain the periodic variability in their eclipse timings—a possibility that researchers are continuing to explore as more data becomes available on these fascinating systems (e.g., Goździewski et al. 2015; Völschow et al. 2016; Nasiroglu et al. 2017; Navarrete et al. 2018, 2020).

Figure 23.

Figure 23. The best-fit orbital solutions (top) and dynamical stability maps (bottom) for the planetary systems proposed to orbit the stars QS Virginis (Lee et al. 2009; Horner et al. 2012a; left) and HW Virginis (Almeida & Jablonski 2011; Horner et al. 2013; right).

Standard image High-resolution image

Regardless of the true origin of the observed eclipse-timing variations for these fascinating objects, the results described above demonstrate conclusively the importance of applying dynamical models, first developed to study the evolution of the solar system, to the orbital stability of newly discovered exoplanetary systems. In some cases, such simulations will simply reveal that a proposed planetary system is dynamically stable—but in other cases, they might reveal a false positive, or highlight a case where the process of fitting to the observational data has converged on a solution that does not represent the true global minimum of the orbital phase space that the planets could occupy. We therefore contend that such modeling should form a critical component of the planet discovery process, and have incorporated such analysis into the standard procedure of the Anglo-Australian Planet Search (AAPS) (e.g., Wittenmyer et al. 2014a, 2014b, 2017a).

While the orbital solutions of the transiting and radial velocity exoplanets are well constrained due to the requirement of confirming the cyclical nature of the signal before announcement of detection (along with short orbital periods in the case of transiting planets/hot Jupiters), identification of planets through multi-epoch direct imaging of exoplanet systems (e.g., HR 8799; Marois et al. 2008, 2010), looking for co-moving companions to the star, provide only weak constraints on the orbits of the planets as they are, by and large, on wide orbits, with decades-long orbital periods and therefore do not move much in year-to-year imaging. The dynamical stability of HR 8799 has been extensively studied (e.g., Marois et al. 2008; Goździewski & Migaszewski 2009, 2014, 2018; Marshall et al. 2010; Wang et al. 2018c) leading to a variety of more-or-less stable orbital solutions depending on the assumptions made by the authors of the orbital eccentricities of the planets and the inclination of the system. One consistent result is that three of the planets are trapped in a 4:2:1 mean-motion resonance (a scenario often known as a Laplace resonance, and one that mirrors the orbital architecture of Jupiter's moons Io, Europa and Ganymede). Unstable orbital solutions might then be indicative that the system is undergoing a dynamical rearrangement, analogous to the Late Heavy Bombardment in the solar system, rather than suggesting the planetary system is somehow spurious—a result that would be in keeping with the bright debris discs observed in the system (e.g., Su et al. 2009; Hughes et al. 2011; Matthews et al. 2014; Contro et al. 2016; Goździewski & Migaszewski 2018; Wilner et al. 2018; Geiler et al. 2019). In the future, as more multi-planet systems are discovered using techniques such as direct imaging and astrometry, dynamical methods will become even more important in determining which of the many plausible orbital architectures that could fit the observed data are most likely to be a fair representation of the true state of the newly discovered systems.

5.8. Current and Future Exoplanet Surveys

In the first few years of the exoplanet era, the search for alien worlds was dominated by radial velocity surveys that were limited in the number of stars they could observe, and the frequency with which they could carry out their observations (e.g., Latham et al. 1989; Cochran & Hatzes 1994; Walker et al. 1995; Baranne et al. 1996; Butler et al. 1996; Tinney et al. 2002; Naef et al. 2004). While those surveys were able to build up lengthy temporal baselines, and to begin finding ever more distant planets (e.g., Gregory & Fischer 2010; Ségransan et al. 2011; Vogt et al. 2017; Wittenmyer et al. 2017a; Kane et al. 2019a; Rickman et al. 2019a), the fact that they could only target a small number of stars limited the number of planets that they could discover.

In the middle of the first decade of the 21st Century, the first exoplanet transit surveys came online. Unlike the radial velocity surveys, these dedicated programs were able to observe all night, every night. By using wide-field cameras, those transit surveys (such as the Wide-Angle Search for Planets, WASP; e.g., Kane et al. 2004, 2005a, 2005b; Christian et al. 2006; Butters et al. 2010; HAT-Net and HAT-South; e.g., Bakos et al. 2004, 2007, 2013; Hartman et al. 2011; Zhou et al. 2014; and the multi-purpose CoRoT space observatory; e.g., Baglin et al. 2006; Alonso et al. 2008; Barge et al. 2008; Deleuil et al. 2008; Léger et al. 2009) were able to monitor the brightness of thousands of stars at once, continually—playing a numbers game to ensure large numbers of detections. The ultimate expression of that philosophy came with the launch of Kepler, in 2009. The Kepler spacecraft was a dedicated exoplanet finding machine—with a field of view that allowed it to continually monitor more than 150,000 stars, 24 hr per day, for four years, during which time it achieved a duty cycle in excess of 90% for many of the target stars (e.g., Garcia et al. 2014). In that manner, Kepler soon became by far the most successful exoplanet survey to date—finding some 2351 confirmed planets, and a further 2418 candidate planets that still await follow-up71 (e.g., Borucki et al. 2011; Batalha et al. 2013; Mullally et al. 2015; Coughlin et al. 2016; Morton et al. 2016).

While Kepler was a huge success, it also revealed a shortcoming of ground-based exoplanetary science work. Simply put, there were too many candidate planets to follow-up, and too few ground-based facilities to perform that work. Most radial velocity instruments are located on shared-time telescopes, meaning that researchers have to compete for time to make their observations. This, combined with the time taken to perform those observations, and the fact that a given facility can only perform radial velocity observations of a single star at a time, represents a significant bottleneck to the exoplanet detection process—and one that will only get worse as the TESS mission continues to deliver new candidate planets for follow-up work (e.g., Ricker et al. 2015; Sullivan et al. 2015; Barclay et al. 2018; Stassun et al. 2018). To address the overwhelming wealth of exoplanet candidates that will be delivered by TESS in the coming years, numerous radial velocity facilities are being employed to perform follow-up observations (e.g., Nielsen et al. 2019; Quinn et al. 2019; Vanderburg et al. 2019).

The AAPS, operated from 1997 until 2014 using the Anglo-Australia Telescope, built up a 17 yr long database of RV observations for more than two hundred Sunlike stars (e.g., Wittenmyer et al. 2016b; 2017a, Kane et al. 2019a). The Keck telescopes have also played a significant role in radial velocity searches, such as the use of HIRES by such teams as the California Planet Search, including exoplanet occurrence rates (Howard & Fulton 2016). Other facilities, such as the MINiature Exoplanet Radial Velocity Array (Minerva), and Minerva Australis (based in Australia), use off-the-shelf telescopes to reduce the cost of a dedicated facility (e.g., Swift et al. 2015; Addison et al. 2019), and have already delivered some key science results (e.g., Rodrigeuz et al. 2017; Wilson et al. 2019; Addison et al. 2020). Numerous other precision radial velocity instruments operated in conjunction with the European Southern Observatory include CORALIE (Queloz et al. 2000), HARPS (Pepe et al. 2000), and ESPRESSO (Pepe et al. 2014). Other spectrographs, such as the NEID instrument, have been developed to optimize performance in the infra-red in order to provide improved radial velocities for low-mass stars (Halverson et al. 2016). This list of radial velocity instruments is not intended to be exhaustive, and for a more detailed discussion of the future of Radial Velocity facilities, we direct the interested reader to the recent review by Fischer et al. (2016).

While the Exoplanet Era has been dominated, to date, by the radial velocity and transit methods of exoplanet detection, it is likely that, in the future, astrometric and direct imaging observations will begin to yield increasing number of exoplanet discoveries, and play an important role in their characterization. To date, direct imaging studies have been relatively limited, primarily focusing on young (and therefore hot) planets orbiting nearby stars (e.g., HR 8799; Marois et al. 2008, 2010). With recent developments, however, the number of potential targets for such imaging has increased (e.g., Kane et al. 2018). In addition, direct imaging is now finding a growing role in the confirmation of the planetary nature of objects whose existence has been inferred on the basis of observed long-period radial velocity trends (e.g., Kane et al. 2019a). In the future, this complementarity between different observing techniques will become ever more important—and astrometric observations by the Gaia spacecraft are likely to contribute large numbers of long-period exoplanets that would otherwise be particularly hard to detect with traditional methods (e.g., Perryman et al. 2014; Gaia Collaboration et al. 2016).

Another method that has long offered the promise of being able to detect significant numbers of exoplanets, and to sample a population of planets that are hard to find using other techniques, is gravitational microlensing (e.g., Gaudi 2012). Microlensing has numerous advantages over other methods, including the observational requirement of moderate aperture size telescopes and sensitivity to relatively low masses. Disadvantages include the non-repeatability of the observations and degeneracy in modeling the available data (e.g., Dominik 2009). To date, microlensing observations have discovered almost 100 exoplanets (NASA Exoplanet Archive; Akeson et al. 2013), and it is anticipated that the launch of the Wide Field Infrared Survey Telescope (WFIRST) satellite, in 2025, will lead to a new era in microlensing observations, with the potential that the spacecraft could yield hundreds, if not thousands, of new discoveries (e.g., Yee 2013; Penny et al. 2019).

Planned exoplanet missions for the future span a variety of science goals regarding both detection and characterization of exoplanets. The legacy of space-based transit detection will be continued by such missions as PLAnetary Transits and Oscillations of stars (Rauer et al. 2014), with smaller space-based telescopes like CHaracterizing ExoPlanet Satellite performing follow-up observations of known exoplanetary systems (Broeg et al. 2014). Other missions, such as Atmospheric Remote-sensing Infrared Exoplanet Large-survey (Encrenaz et al. 2018; Tinetti et al. 2018) and JWST (Boccaletti et al. 2005; Seager et al. 2009), will focus their observational efforts on detecting and characterizing the atmospheres of transiting planets via the method of transmission spectroscopy (e.g., Seager & Sasselov 2000; Kempton et al. 2018). Beyond these missions lies a pathway toward direct imaging of exoplanets and the potential for direct spectra of exoplanet atmospheres. In the near-term, the WFIRST mission will utilize a coronagraph that will enable direct imaging of giant planets at wide separations from their host stars (Kane et al. 2018; Lacy et al. 2019). Further afield, the design of the Habitable Exoplanet Imaging Mission (HabEx) is underway as a means to achieve the dream of finally being able to directly image and characterize the atmospheres of terrestrial planets, particularly those that lie in the Habitable Zone of their host stars (Kopparapu et al. 2018; Wang et al. 2018b; Kawashima & Rugheimer 2019). As both the scientific and funding landscape is a constantly evolving environment, the precise direction of future mission design and deployment will depend heavily on the discoveries that are made over the intervening years.

6. Conclusions

In the last three decades, we have entered the Exoplanet Era. Where once we knew of only a single planetary system—our own—we now know that planets are ubiquitous. Almost every star has planets, and planet formation has been confirmed as a natural byproduct of star formation. At the time of writing, some 4025 planets have been confirmed orbiting other stars72—a number that is rising on an almost daily basis. There is, however, a problem. While we are finding ever more exoplanets, our ability to characterize and study those planets and their planetary systems in depth is severely limited. While we know the planets are there, we are decades from being able to get a truly in depth overview of their host systems.

With the solar system, we face the opposite problem. The number of known solar system objects is rapidly approaching a million—and since those objects are in our own celestial backyard, we are able to study them in exquisite detail. For that reason, the solar system will remain extremely valuable as our only source of in situ data for a fully characterized planetary system—and it is clearly important to consider our knowledge of exoplanetary systems in the context of our understanding of the solar system. At the same time, we run into the inverse problem—while we can study the solar system in exquisite detail, it is just one planetary system. To fully understand the formation and history of the solar system, it is vital to place it in the context of the myriad other systems we are now discovering.

In this review, we have therefore attempted to describe the scope of current solar system research—with a particular focus on the components that might have detectable analogues in the exoplanetary systems we know today, or will discover in the coming years.

In section two, we presented an overview of the currently known small solar system objects, from the 205 known planetary satellites and the dwarf planets to the smallest of the small bodies. In section three, we discussed the solar system's eight planets as physical objects, revealing the wealth of information that we have gleaned as a result of our exploration during the six decades of the space age. In section four, we focused on what the diverse populations of solar system bodies have told us about our own planetary system's history, from the giant collisions that sculpted the terrestrial worlds early in the system's youth to the formation and evolution of the known solar system small body populations. Then, in section five, we turned our gaze outwards, considering the growing body of work studying other planetary systems, with the aim of drawing parallels between that knowledge and studies of the solar system.

In the coming years, the number of known exoplanets is expected to grow almost exponentially, and it is certain that, in the process, new discoveries will be made that revolutionize our understanding of planet formation, evolution, and our place in the cosmos. As we move from simply finding exoplanets to striving to characterize them, and potentially even search for life upon them, the solar system will remain the critical touchstone on which that work will be based, with research into our own solar system shedding new light on those alien worlds, and our studies of exoplanets helping illuminate our own system's past, present and future.

This research has made use of the NASA Exoplanet Archive, which is operated by the California Institute of Technology, under contract with the National Aeronautics and Space Administration under the Exoplanet Exploration Program. This research has made use of NASA's Astrophysics Data System. The authors would like to express their sincere appreciation for the hard work of the anonymous referee, whose comments and suggestions greatly improved the scope and clarity of this paper.

Appendix: Plots of Individual Solar System Small Body Populations (Grayscale)

In Figures 26, we presented the distribution of small bodies throughout the solar system, with a variety of colors employed to help the reader distinguish between the various populations of small bodies shown in those plots. In this Appendix, we present grayscale plots that show the distribution of objects in those individual population. For each population, we provide four plots—two that show the spatial distribution of objects at epoch 2000 January 1 00:00:00 UT (in the XY and XZ planes), and two that show the orbital element distribution of the objects (in the semimajor axis—inclination and semimajor axis—eccentricity planes). As with the data plotted in Figures 26, these data were taken from the JPL HORIZONS database, which can be found at https://ssd.jpl.nasa.gov/horizons.cgi, on 2019 February 13th.

Figure A1.

Figure A1. The distribution of all known near-Earth objects. The impact of observational biases on the distribution of known objects is readily apparent—particularly in the lower right panel, where an excess of known objects with perihelion distances of ∼1 au is clearly seen, curving upwards and to the right from the dark dot that denotes the orbit of the Earth.

Standard image High-resolution image
Figure A2.

Figure A2. The distribution of all known Atira asteroids—asteroids whose orbits are entirely inside that of the Earth (i.e., whose aphelia lie closer to the Sun than the Earth's perihelion distance of 0.983 au). Very few such objects are currently known—a combination of their intrinsic scarcity (it is estimated that the Atiras make up only 2% of the total NEA population) and the challenges inherent in their discovery.

Standard image High-resolution image
Figure A3.

Figure A3. The distribution of all known Aten asteroids—those near-Earth asteroids whose orbital semimajor axes are less than that of the Earth (1 au). As they move on orbits with semimajor smaller than that of the Earth, their orbital periods are less than one year. The Aten asteroid population includes the Atira objects, which are generally viewed as a subset of the Aten population, rather than a distinct grouping in their own right—although this may change as more objects are discovered in coming years.

Standard image High-resolution image
Figure A4.

Figure A4. The distribution of the Apollo asteroids—near-Earth asteroids whose orbital semimajor axes are greater than that of the Earth (i.e., a > 1 au), but whose orbits cross that of our planet. The two boundaries of the population are clearly seen in the lower right-hand plot, with the curved outer edge of the population in phase space tracing a line of constant perihelion distance (where the perihelion distance for objects on that line is equal to Earth's aphelion distance, approximately 1.0167 au).

Standard image High-resolution image
Figure A5.

Figure A5. The orbital distribution of the Amor asteroids—near-Earth asteroids whose perihelia fall beyond the distance of Earth's aphelion. In other words, while Amor asteroids can be Earth-approaching, they are not Earth-crossing. The outer edge of the Amor population is typically set at a perihelion distance of around 1.3 au—asteroids that pass perihelion at greater distances from the Sun are not considered to be near-Earth asteroids, but instead are typically included within the population of the Asteroid belt.

Standard image High-resolution image
Figure A6.

Figure A6. The distribution of the main belt asteroids. The Kirkwood gaps are clearly visible—specific regions of the belt in semimajor axis space that relatively free of asteroids. Those regions are the locations of mean-motion resonances with the giant planet Jupiter, and objects moving on such orbits would be destabilized and removed from the asteroid belt relatively quickly as a result of the giant planet's influence. In addition, the influence of secular resonances in sculpting the belt can be seen, particularly in the lower-left hand panel (showing the asteroids in semimajor axis vs inclination space). The curved boundary between the dense and lightly population regions extending upwards from ∼2.1 au is the location of one such resonance—the ν6 secular resonance. Objects in the region just sunward of that sharp boundary (or at higher inclinations) will gradually undergo eccentricity excitation as a result of secular perturbations from Saturn, with the eventual result that they will become Mars crossing, and be ejected from the belt entirely.

Standard image High-resolution image
Figure A7.

Figure A7. The distribution of the Hilda asteroids, trapped in 3:2 mean-motion resonance with Jupiter. When plotted in Cartesian coordinates, the Hildas trace an approximately triangular shape, with the vertices located at the locations of the Jovian L4, L3 and L5 Lagrange points (working around in an anti-clockwise direction from the location of Jupiter). As the objects orbit the Sun, their eccentric paths are such that, whenever they are located between Jupiter and the Sun, they are close to perihelion, and hence far from the giant planet. As Jupiter orbits the Sun, the locations of the vertices of the triangle rotate with the planet, keeping station with the Lagrange points. A given Hilda will, through the course of three complete orbits around the Sun (two orbits of Jupiter) pass through each of the three vertices of the triangle. Given the number of objects located away from the main concentration of the Hildas (including those in the top left panel that seem relatively close to Jupiter, and those in the lower panels between ∼3.8 and 3.9 au), it seems likely that a reasonable number of objects classified here as Hildas are actually interlopers, moving on unstable orbits whose "Hilda-like" behavior is a transient phenomenon (e.g., Brož & Vokrouhlický, 2008).

Standard image High-resolution image
Figure A8.

Figure A8. The distribution of the Jovian Trojans, trapped in 1:1 mean-motion resonance with Jupiter. The great majority of these objects move on "tadpole" orbits, librating around the Jovian L4 and L5 Lagrange points. As a result, in the top-down view (at top left), the great bulk of the Trojan population are concentrated in two lengthy clouds, centered on those two Lagrange points. Rarer, and typically less dynamically stable, are the "horseshoe" Trojans—objects whose libration within the 1:1 resonance takes them around from the L4 to the L5 points, and back again, passing through the L3 Lagrange point, opposite Jupiter in its orbit. It should be noted that not all of the objects plotted here are true Trojans—there may well be a number of interlopers—objects temporarily captured into 1:1 resonance with Jupiter, and those experiencing short-term satellite capture by the giant planet (such as those closest to Jupiter in the top-left hand plot).

Standard image High-resolution image
Figure A9.

Figure A9. The solar system's known Centaur population—dynamically unstable objects moving on orbits between those of the giant planets. The majority of these objects move on orbits with relatively low inclinations—less than ∼30°—a fact that reflects their expected origins in the Scattered Disk, Neptunian and Jovian Trojans, and the Edgeworth–Kuiper Belt. The more inclined Centaurs are thought to be objects that originated in the Oort cloud, captured by the giant planets.

Standard image High-resolution image
Figure A10.

Figure A10. All known trans-Neptunian objects—including members of the Scattered Disk, the Edgeworth–Kuiper Belt, the detached population, and the resonant TNOs. The objects clustered together in vertical "spikes" in the lower two figures are the resonant TNOs—the most prominent population of which are the Plutinos, located at around 39.5 au. The Scattered disk objects, with perihelia between ∼30 and ∼39.5 au from the Sun can be clearly seen as the population that curves upwards to the right, while the large population of Edgeworth–Kuiper Belt objects are found between ∼40 and ∼48 au from the Sun.

Standard image High-resolution image
Figure A11.

Figure A11. The distribution of those objects considered to be "Neptune Trojans"—trapped in 1:1 mean motion resonance with Neptune. While this population is still relatively small, and includes some temporarily captured objects, similarities can be seen to the Jovian Trojan population (shown in Figure A8). The Neptune Trojans are found in two clouds, around the L4 and L5 Lagrange points in Neptune's orbit.

Standard image High-resolution image
Figure A12.

Figure A12. The distribution of the known Plutinos—objects trapped in 2:3 mean-motion resonance with Neptune. These objects are thought to have been captured by the giant planet into that resonance during its outward migration through the early solar system—a process that resulted in the gradual orbital excitation of captured members (which explains the broad range of orbital inclinations and eccentricities exhibited by the population). Those Plutinos with orbital eccentricities greater than ∼0.24 move on Neptune-crossing orbits—but their resonant motion prevents them from experiencing close encounters with the giant planet. This is the reason that, in the top-down view (top left), all those objects currently in the vicinity of Neptune's orbit are located either well ahead, or well behind, the planet in its orbit.

Standard image High-resolution image
Figure A13.

Figure A13. The distribution of objects in the classical Edgeworth–Kuiper Belt. The top-down view here (top left) reflects the observational biases implicit in the discovery of these distant objects. The broad gap toward the bottom of the figure, where few TNOs can be found, corresponds to the direction of the center of the Galaxy (the Milky Way), in Scorpius and Sagittarius, and represents the most challenging part of the sky in which to detect faint, star-like points of light. A similar, but less significant, rarefaction can be seen toward the top of that plot—which corresponds to the other location in which the plane of the ecliptic crosses the Galactic plane—between Taurus and Gemini. Thanks to the lower stellar density in that region, compared with that toward the Galactic center, more objects can readily be found. The dynamically "cold" Edgeworth–Kuiper Belt objects can be seen as an excess of objects with inclinations less than ∼5°. The dynamically "hot" population are more widely dispersed, with inclinations ranging up to, and sometimes exceeding, 30°.

Standard image High-resolution image
Figure A14.

Figure A14. The distribution of the Scattered Disk and detached objects. The two red lines in the lower right-hand plot show the boundaries for the Scattered Disk (as per Table 2), defined as perihelia of ∼32 au and ∼40 au. Objects that fall below the lower of these two lines are considered to be "detached." The effect of resonant sticking can be clearly seen, with clusters of objects at specific semimajor axes. While Scattered Disk objects would be expected to random walk in semimajor axis, as a result of perturbations by the giant planets (principally Neptune), it is possible for them to experience temporary (but lengthy) captures in various distant Neptunian mean-motion resonances, during which time their orbital eccentricity can very dramatically, sometimes even transferring those objects to the detached population, lifting their perihelia well beyond Neptune's immediate control. Objects above the upper of the two red lines are considered to be Centaurs (again, as per Table 2), though they appear in many lists of Scattered Disk objects as a result of the blurred nature of the divide between the two linked populations. The concentration of objects at 39 < a < 40 au are the Plutinos, some of which fulfill the criteria of being either Centaurs or Scattered Disk objects. Here, the fact those objects are trapped in Neptune's 2:3 mean motion resonance supersedes their other orbital parameters in determining their classification, but they are included here for completeness.

Standard image High-resolution image
Figure A15.

Figure A15. The distribution of all known comets. The concentration of objects in the top-left hand plot stretching outward slightly to the left of the vertical is the result of vast number of known members of the Kreutz Sungrazing comet population, the majority of which were discovered by the SOHO spacecraft. When plotted in semimajor axis space, the Jupiter-family comets stand out clearly, with semimajor axes smaller than the orbit of Saturn. Because of the truncation of the semimajor axis plots at 100 au, many long period comets do not appear on the plot—the tail of objects to very high (∼1) eccentricities continues out to semimajor axes of over 100,000 au.

Standard image High-resolution image
Figure A16.

Figure A16. As with Figure A15, we show here the orbital distribution of all known comets. The upper panels now show all comets within a cubic space spanning approximately 1500 au on a side (i.e., roughly ± 750 au in X, Y and Z), while the lower panels show the distribution in orbital element space, with a logarithmic semimajor axis scale, to show the extent of the long-period comet population. The isotropic nature of the long-period/Oort cloud comet flux is clearly seen in the lower left-hand panel, with those comets evenly distributed in orbital inclination—a stark contrast to the Jupiter-family/short period comets, whose orbital inclinations are typically less than ∼30°. It is this division that highlights the different origins of these two populations, with the long-period comets sourced from the Oort cloud, which is to first order isotropically distributed around the Sun.

Standard image High-resolution image
Figure A17.

Figure A17. The known Jupiter-Family comets. The great majority of Jupiter-Family comets move on orbits with relatively small inclinations (<30°), reflecting their direct association with the Centaur and trans-Neptunian populations. Those at higher inclinations (particularly those with retrograde orbits—i.e., inclinations > 90°) are likely more accurately considered to be Halley-type comets—but are included here based on their classifications in the data harvested from the JPL Horizons database. The "arc" feature in the top-down plot (top-left), which lies interior to Jupiter's orbit, just above and to the left of center, is a cluster of comet fragments, created when comet 73P/Schwassmann–Wachmann 3 broke apart in 1995. To date, almost 70 separate fragments of that comet have been identified, each moving on a slightly different orbit. In the ∼25 yr since the fragmentation of the comet, each fragment has completed almost five full orbits of the Sun, and as a result, the fragments have dispersed along a lengthy arc of the orbit of their parent comet.

Standard image High-resolution image
Figure A18.

Figure A18. The known Halley-type comets. Where the bulk of the Jupiter-Family comets moved on orbits with low inclinations, the Halley Type comets feature a wide dispersal in orbital inclination, with several notable members moving on retrograde orbits (such as comet 1P/Halley itself, with an orbital inclination of ∼162°).

Standard image High-resolution image

Footnotes

  • 14 

    For an English language translation of Safronov (1969), we direct the interested reader to Safronov (1972).

  • 15 

    We provide an overview of the current state of our knowledge of the planets as physical objects in section three, below.

  • 16 

    We note that the term "terrestrial planet" can sometimes be taken as inferring a degree of similarity with the Earth that is not entirely justified for such a diverse group of objects, and urge the reader to remember that "terrestrial" does not equal "Earth-like."

  • 17 

    Dwarf planets, as defined by the International Astronomical Union, are objects sufficiently massive that their self-gravity can overcome their material strength. Dwarf planets therefore assume a shape determined by hydrostatic equilibrium—usually nearly spherical, with varying degrees of oblateness dependent on the object's spin rate. However, they are not sufficiently massive to have dynamically cleared the area around their orbit, and are not in orbit around a planet. This definition of "dwarf planet" remains somewhat controversial, with a vocal minority continuing to protest the demotion of Pluto from planet to dwarf. In passing, we note that the timescale for an object to clear the region around its orbit increases with the orbital period of that object, such that if the Earth were placed sufficiently far from the Sun, it would be unable to do so within the age of our solar system, and hence would also be classified as a dwarf planet. For the full text of the IAU resolution that defined the nature of planets, we direct the interested reader to http://www.iau.org/public_press/news/detail/iau0603/. Details of an alternative metric, based on the timescale required for objects to clear their orbital zone (through the ejection or accretion of other objects on similar orbits), can be found in Margot (2015).

  • 18 
  • 19 

    We note that, although the number of objects in the Jovian Trojan population greater than 1 km in diameter likely rivals, or exceeds, those contained in the Asteroid belt, the total mass of the Jovian Trojan population is significantly smaller than that contained in the Asteroid belt.

  • 20 

    A mean-motion resonance (hereafter MMR) occurs when the orbital period of one body is approximately in integer ratio with that of another. The Trojans, mentioned earlier, are therefore said to be in 1:1 resonance with their host planet, while Pluto, which completes two orbits of the Sun for every three completed by Neptune, is said to be trapped in the Neptunian 2:3 MMR. By convention, resonances are described by a ratio n:m. When m > n, the resonance is exterior to the major body involved, and when n > m, the resonance is interior to the orbit of that object.

  • 21 

    We note, here, that some of these 700,000 objects have diameters less than 1 km—and that the observed population of objects larger than 1 km remains incomplete.

  • 22 

    As of 2020 May 20th, based on the information at the JPL Solar System Dynamics page, at https://ssd.jpl.nasa.gov/?sb_elem.

  • 23 

    This is particularly true of those objects that display cometary activity, since their numbers are reduced not only as a result of their dynamical instability, but also as a natural outcome of their ongoing activity, which leads to their eventual devolatilisation. That process leaves some cometary nuclei dormant—and essentially indistinguishable from asteroidal bodies. It can also lead to the total disintegration of the comet in question—as witnessed on several occasions in the past few hundred years (such as was the case for comet 3D/Biela, whose disintegration in the early 1840s gave birth to the Andromedid meteor shower; e.g., Jenniskens & Vaubaillon 2007; and comet 73P/Schwassmann–Wachmann 3, which fragmented in 1995; Crovisier et al. 1996). We note that comet 73P is a particularly interesting example of such a disintegration process, as it is one whose evolution is still unfolding. Having initially fragmented in 1995, the single parent comet clearly no longer exists—but a large number of fragments of that comet are still being tracked. These fragments are identified with sub-designations (e.g., 73P-A, 73P-B, and so on). Once those fragments themselves disintegrate, or are lost, the eventual fate for comet 73P will be a change in designation, to become 73D/Schwassmann–Wachmann 3.

  • 24 

    Chiron holds the distinction of being the first object to ever be given both an asteroidal and cometary designation. It is most commonly referred to as 2060 Chiron.

  • 25 

    Mercury's uncompressed mean density is approximately 5.3 g cm−3 (e.g., Cameron et al. 1988), which is markedly higher than the uncompressed mean density of the Earth (at 4.45 g cm−3; Lewis 1972), which is the second densest of the planets, when measured in this manner. It should be noted that the Earth's bulk density (5.514 g cm−3) is higher than that of Mercury (5.427 g cm−3)—the result of the Earth's significantly higher mass and the resultant compression of our planet under its own gravity. The uncompressed mean density reflects the chemical makeup of a planet, ignoring the effect of gravity—and reveals Mercury to be particularly metal-rich compared to the other terrestrial planets.

  • 26 

    Thanks to the retrograde rotation of Venus, the time between consecutive sunrises on the planet (the Solar day) is approximately 117 days. From the surface of Venus (if it were possible to see through the planet's thick clouds), the Sun would appear to rise in the west, and set in the east.

  • 27 

    The current level of CO2 in Earth's atmosphere is slightly over 0.04% (or 400 parts per million) and is climbing as a result of human activity. At the time of writing (2020 March 25th), the latest CO2 level measured at Mauna Loa Observatory in Hawaii was 415.34 ppm (0.041534%; as per https://www.co2.earth/daily-co2). Prior to the industrial revolution, the level of atmospheric CO2 was lower, at around 280 ppm (0.028%).

  • 28 

    It has been suggested that the tilt of Saturn's spin axis could well be the result of the giant planet having accreted, at a late stage in its formation, a planetesimal several times more massive than the Earth, in an oblique collision—a theory we discuss in more detail in Section 4.1.1.

  • 29 

    Jupiter's polar regions rotate with a period of around 9h 55m 44s, while its equator rotates once every 9h 50m 30s. The most fundamental spin period for the planet as a whole is considered to be the System III period of 9h 55m 29fs71, which corresponds to the rotation of the planet's magnetosphere as measured by radio astronomers (Dessler 1983). In the case of Saturn, the difference between polar and equatorial rotation rates is even greater—10h 38m at the poles versus 10h 14m at the equator.

  • 30 

    Jupiter's equatorial and polar radii are 71492 km and 66854 km, respectively, while those of Saturn are 60268 km and 54364 km, respectively. The resulting oblatenesses of the two planets are 6.5% and 9.7%—far greater than Earth's 0.34%.

  • 31 

    It is now widely accepted that the gas giant planets did not form at precisely their current locations, but instead underwent a certain amount of migration as they grew. A number of different scenarios have been proposed describing that migration—from relatively sedate migration over small distances, to chaotic and "jumpy" migration, driven by encounters with massive planetesimals, and even scenarios where Jupiter migrates first inward (to around 2 au from the Sun), then back out to its current location. The bulk of the evidence for the proposed migration is found in the distribution of the solar system's small body populations—and we discuss the various theories proposed to explain that distribution in detail in Section 4.3. It is even considered possible that the giant planets ejected a planet-mass object from the region in which they formed to an orbit beyond that of Neptune—a theory we discuss in Section 4.5.2.

  • 32 

    The extreme tilt of Uranus and its regular satellite system is often considered to be evidence that the giant planet was involved in a giant collision, either during its formation or soon after, with an Earth-sized protoplanet—a theory we discuss in more detail in Section 4.1.1.

  • 33 

    Such irregular satellites are found orbiting each of the giant planets and are thought to have been captured by their host planets through a variety of mechanisms during the latter stages of planet formation—as we discuss in more detail in the opening to section four, and in Section 4.1.3.

  • 34 

    At sub-cm sizes, the collisional accretion of dust grains is only possible for very low collision velocities, as a result of the extremely low strength of such grains. While the precise details of accretion for grains in the micron- to centimeter-scale remain unclear, it is certainly the case that that accretion was greatly aided by the effects of gas-drag within the proto-planetary disk, which would act to rapidly damp the motion of those small grains, ensuring that their typical collision velocities were very low. In laboratory experiments, dust growth has been demonstrated up to millimeter sizes through mutual collisions, but the centimeter size barrier is insurmountable due to mutual velocities and low surface adhesion resulting in bouncing rather than sticking between the largest grains (e.g., Blum 2010; Windmark et al. 2012b; Testi et al. 2014). However, if larger grains are already present in the disk further growth of small dust grains beyond the centimeter size barrier is a rapid process (∼104 yr; Windmark et al. 2012a). This barrier to growth can also be overcome by assuming some "stickier" dust (e.g., some fraction of H2O/CO2 ice, or the presence of an organic mantle to a grain; e.g., Homma et al. 2019) in their compositions, or a distribution of collision speeds and particle sizes (e.g., Windmark et al. 2012a, 2012b).

  • 35 

    It is certain that other planetary systems are also continually ejecting objects to interstellar space—both during their formation, and their long-term evolution. As a result, it was inevitable that we would eventually observe one of those objects passing through the solar system. The first such interstellar vagabond, 1I/'Oumuamua, was discovered in 2017 (e.g., Bannister et al. 2017; Jewitt et al. 2017; Meech et al. 2017; Fitzsimmons et al. 2018), with the second, the cometary 2I/Borisov, being discovered and passing through perihelion in 2019 (e.g., Fitzsimmons et al. 2019; Jewitt & Luu 2019; Opitom et al. 2019; Guzik et al. 2020; Hallatt & Wiegert 2020; McKay et al. 2020). It is likely that the next generation of surveys will discover many more in the years to come.

  • 36 

    For a detailed overview of current theories of terrestrial planet formation, we direct the interested reader to Izidoro & Raymond's excellent chapter in the Handbook of Exoplanets, published in 2018.

  • 37 

    It is worth recollecting, here, that Venus is thought to have experienced a "runaway greenhouse effect" in the past, allowing it to attain and maintain its current superheated surface conditions. The young Venus may well have had a climate superficially similar to that of the Earth. As the Sun grew more luminous, the planet warmed until it reached a crucial tipping point—the evaporation of water from the planet's oceans accelerated. The increased levels of water vapor in Venus' atmosphere acted to strengthen the greenhouse effect—driving the temperature still higher. Eventually, the oceans were gone, which served to further accelerate the greenhouse warming—by removing the oceans, the water-driven weathering that would act to remove other greenhouse gases from the atmosphere ceased. But those gases continued to build up in the atmosphere, as a result of the planet's volcanic activity. The result? Venus eventually warmed to the point that its surface is hot enough to melt lead—far from the "Earth-like planet" it may once have been.

  • 38 

    Indeed, it seems likely that the Earth would have experienced several impacts of a similar scale to this Martian "giant impact," which was several orders of magnitude less energetic than that between the proto-Earth and Theia, which led to the formation of the Moon. Zahnle et al. (2007) present a fascinating review of the impact such collision would have on the early Earth, in the time after the Moon forming impact, while O'Neill et al. (2017) discuss the role that large impacts (of objects with diameters greater than 500 km) could have played in the genesis of Earth's global system of plate tectonics.

  • 39 

    Excerpt from "Galileo's work on Saturn's Rings," Partridge & Whitaker (1896).

  • 40 

    Though it has been suggested that the belt originally formed with much less mass than classical models of solar system formation would suggest (e.g., Hayashi 1981), or may even initially have been completely empty (e.g., Raymond & Izidoro 2017).

  • 41 

    Interestingly, although data from the New Horizons mission supports the collisional formation mechanism for the Pluto system, it has also revealed that Pluto and Charon have fewer small (≲13 km diameter) craters than one would expect based on current estimates of the population of small trans-Neptunian objects, a result that cannot be explained solely on the basis of geological resurfacing (Singer et al. 2019). This, in turn, has been taken to suggest that there may be fewer such objects than previously thought—a result which might also explain the relatively pristine surface of 2014 MU69 (Stern et al. 2019), which was visited by New Horizons on 2019 January 1st.

  • 42 

    This might seem somewhat counter intuitive, since the orbital velocities in the Edgeworth–Kuiper Belt are much slower than those in the Asteroid belt, which should result in lower mean collision velocities. However, as Marcus et al. (2011) note in their introduction, "collisions typically result in a collection of objects with a velocity dispersion comparable to the parent body's escape velocity (e.g., Durda et al. 2007)...the comparable size of the typical ejection velocity and the background velocity dispersion causes Kuiper Belt families to spread across huge swaths of the trans-Neptunian region, significantly diluting the dynamical clumping of families that is so prominent in the asteroid belt."

  • 43 

    It should be noted that, in the 1970s, there were suggestions that an impact was observed on the Moon in 1178, forming the 20 km diameter crater Giordano Bruno (see e.g., Hartung 1976; Calame & Mulholland 1978). However, that idea has since fallen into disfavor, with recent data suggesting the crater is far older, having most likely formed between one and ten million years ago—e.g., Morota et al. 2009; Basilevsky & Head 2012). We discuss this in more detail in Section 4.7.1.

  • 44 

    Were Jupiter smaller, the asteroid belt would likely look very different to how it appears today—the result of the influence of the broad area of instability caused by the ν6 secular resonance. The results of the simulations described in Horner & Jones (2008, 2008a, 2008b) suggested that the end result might well have been a bifurcated belt—with both inner and outer belts separated by a wide gap, in semimajor axis—a scenario very different from the belt we observe in our own solar system.

  • 45 

    Indeed, in work that presaged the development of the Nice Model, Thommes et al. (1999, 2002) proposed a model that suggested that Uranus and Neptune formed between the orbits of Jupiter and Saturn, to be scattered outward to their current orbits as a result of encounters with proto-Jupiter and proto-Saturn.

  • 46 

    At least, within the scope of their simulations. In reality, a fraction of the material flung inwards by Jupiter would be lost—through collision with the terrestrial planets, collision with the Sun, and fragmentation. That said, it seems improbable to suggest that the amount of material lost this way would come close to balancing that which would eventually be ejected from the solar system by the giant planet—and so the end result (that Jupiter would migrate inwards) remains the same.

  • 47 

    It should be noted, here, that other theories exist to explain Mars' small size, suggesting that the protoplanetary disk was depleted in material where Mars formed (e.g., Izidoro et al. 2014; Raymond & Izidoro 2017); one mechanism by which such depletion could occur is that a secular resonance with the planet Jupiter swept through the region in which Mars was to form, prior to its formation. The passage of that resonance would excite debris in the region of Mars' current orbit, causing the available mass for planet formation to be depleted as a result of enhanced particle fragmentation due to increased mutual collision speeds (Bromley & Kenyon 2017). Equally, Levison et al. (2015) suggest that pebble accretion models might offer an alternative explanation for Mars' small mass.

  • 48 
  • 49 
  • 50 

    Descriptions of the apparitions of these two comets can be found in Volume 1 of Kronk's Cometography (Kronk 1999).

  • 51 

    (Active from September 10th–November 20th, and October 20th–December 10th, respectively, according to the 2020 edition of the International Meteor Organisation's Meteor Shower Calendar, which can be found at https://www.imo.net/files/meteor-shower/cal2020.pdf; accessed 2019 June 29.

  • 52 

    Comet 2P/Encke has the shortest orbital period of any of the "traditional" comets, at 3.3 yr, by some distance. The only comets with shorter orbital periods are the Main Belt Comets.

  • 53 

    Or July—in the paper, Hartung gives the date first as July 18th, and later as June 18th.

  • 54 

    Meteorites can be divided into two categories, based on the circumstances of their recovery. "Falls" are those meteorites whose flight through the atmosphere (as a bright fireball) is observed, and which are located on the ground just hours or days after the appearance of their parent fireball. "Finds" are much more common—meteorites located on the surface of the Earth having lain there for years, decades, or even longer. Essentially, we see "falls" fall to Earth, while "finds" are found fortuitously long after they reached our planet. "Falls" are of particular scientific value, since, if recovered quickly enough, they can be considered to be pristine, and unaffected by chemical weathering/alteration on the surface of our planet. For this reason, global camera networks are now being established (Devillepoix et al. 2020), such as the Australian Desert Fireball Network (e.g., Bland et al. 2012; Howie et al. 2017; Devillepoix et al. 2018; Sansom et al. 2019), to maximize our chances of observing "falls," and being able to locate the meteorites they produce in a timely fashion.

  • 55 

    Dust grains can be released through a variety of processes, such as cometary activity, spallation, large scale impacts/collisions, surface expansion and contraction under heating (as is the case for the Geminid meteor shower parent, (3200) Phaethon, e.g., Jewitt & Li 2010; Li & Jewitt 2013), and even as a result of the rapid rotation of their parent bodies.

  • 56 

    Below a certain luminosity, a star simply cannot accelerate dust grains from a circular orbit to escape velocity purely by radiation pressure alone. For stars on the main sequence, this threshold is typically thought to lie at around mid-K spectral type, though the precise location depends on both the dust composition and density.

  • 57 

    "Ivi innumerabili stelle, astri, globi, soli e terre sensibilmente si veggono..."; taken from http://www.ousia.it/content/Sezioni/Testi/BrunoDeInfinitoUniverso.pdf, accessed on 2019 July 14th.

  • 58 

    "La raggione è perché noi veggiamo gli soli, che son gli più grandi, anzi grandissimi corpi, ma non veggiamo le terre, le quali, per esserno corpi molto minori, sono invisibili..."; taken from http://www.ousia.it/content/Sezioni/Testi/BrunoDeInfinitoUniverso.pdf, accessed on 2019 July 14th.

  • 59 

    "If it were not perhaps too early to hazard even a conjecture on the cau[s]e of this variation, I [s]hould imagine it could hardly be accounted for otherwi[s]e than either by the interpro[s]ition of a large body revolving around Algol, or [s]ome kind of motion of its own, whereby part of its body, covered with [s]pots or [s]uch like matter, is periodically turned toward the earth...," from Goodricke (1783).

  • 60 

    These were the first two "white dwarf" stars to be discovered—Sirius B and Procyon B.

  • 61 

    It should be noted that studies of exoplanetary composition are greatly aided by the investigation of debris in white dwarf systems. Such research has the potential to offer direct comparison between the results of Solar system meteoritical research and exoplanetary science. For more information, see e.g. Melis & Dufour, 2017; Zuckerman et al., 2007; Zuckerman & Young, 2018, and references therein.

  • 62 

    In much the same way, the Sun remains the only star we can study from close range—and so has formed the touchstone of our understanding of other stars, and the Space Weather environments around them. While a detailed discussion of the Sun is beyond the scope of this work, we direct the interested reader to the recent review by Airapetian et al. (2019) for more details.

  • 63 

    As of 2020 May 20th, the NASA Exoplanet Archive (https://exoplanetarchive.ipac.caltech.edu/) details 4154 confirmed exoplanets in 3076 systems.

  • 64 

    Values obtained on 2020 May 20th, from the NASA Exoplanet Archive's "Exoplanet and Candidate Statistics" page, at https://exoplanetarchive.ipac.caltech.edu/docs/counts_detail.html.

  • 65 

    Though it should be noted that this may soon change—Gravity Collaboration et al. (2019) report the direct detection of HR 8799 e by optical interferometry, and describe the determination of the mass of that planet (${10}_{-4}^{+7}$ times the mass of Jupiter) on the basis of the application of synthetic spectra to their observational data.

  • 66 

    Those planets detected through Transit Timing Variations are a notable exception to this general rule of thumb. In such cases, the actual mass (rather than the minimum mass) can be directly determined.

  • 67 

    These values relate to the total excess flux emitted by the discs at long-wavelengths, as would be seen from a distant star. Since almost all that excess comes at infrared wavelengths, these values are often called the "fractional infrared luminosity" (e.g., Perryman 2018, p342).

  • 68 

    Due to the faint nature of the zodiacal light, few studies have considered the detection of the solar system's zodi from another stellar system. The values here are taken from page four of a talk given by Chris Koresko at MPIA in 2003, recovered online at http://www.mpia.de/MIDI-RB/Contributions/Koresko.pdf on 2020 April 3rd. The plot on which the values are based credits Kuchner (2012).

  • 69 

    Since the brightness of the disk is linked to the amount of dust, rather than the mass of larger planetesimals—so an excited low-mass disk (with high collision rates) could appear equally bright as a high-mass, dynamically cold disk (with low collision rates). Both scenarios generate equal amounts of dust, and so result in equally bright discs. In other words, in understanding the nature of a disk from its brightness, there remains a degeneracy between the collision rate and disk mass.

  • 70 

    When the planet passes behind its host, as seen from Earth.

  • 71 

    Values correct as of 2020 May 20th; taken from the NASA Exoplanet Archive, at https://exoplanetarchive.ipac.caltech.edu/docs/counts_detail.html.

  • 72 

    As of 2019 July 30th, according to the NASA Exoplanet Archive, at https://exoplanetarchive.ipac.caltech.edu/.

Please wait… references are loading.
10.1088/1538-3873/ab8eb9