Jet substructure at the Tevatron and LHC: new results, new tools, new benchmarks*

A Altheimer; S Arora; L Asquith; G Brooijmans; J Butterworth; M Campanelli; B Chapleau; A E Cholakian; J P Chou; M Dasgupta; A Davison; J Dolen; S D Ellis; R Essig; J J Fan; R Field; A Fregoso; J Gallicchio; Y Gershtein; A Gomes; A Haas; E Halkiadakis; V Halyo; S Hoeche; A Hook; A Hornig; P Huang; E Izaguirre; M Jankowiak; G Kribs; D Krohn; A J Larkoski; A Lath; C Lee; S J Lee; P Loch; P Maksimovic; M Martinez; D W Miller; T Plehn; K Prokofiev; R Rahmat; S Rappoccio; A Safonov; G P Salam; S Schumann; M D Schwartz; A Schwartzman; M Seymour; J Shao; P Sinervo; M Son; D E Soper; M Spannowsky; I W Stewart; M Strassler; E Strauss; M Takeuchi; J Thaler; S Thomas; B Tweedie; R Vasquez Sierra; C K Vermilion; M Villaplana; M Vos; J Wacker; D Walker; J R Walsh; L-T Wang; S Wilbur; W Zhu

doi:10.1088/0954-3899/39/6/063001

1. Introduction

At the time of the first BOOST meeting, at SLAC in July 2009, several groups had begun to argue that jet substructure—the internal characteristics of hadronic jets—could be useful in identifying the decays of heavy particles at the Large Hadron Collider. By the following year, a trickle had become a flood. Some techniques had received detailed attention from experimental groups, and the increasing quantity of data available meant that background studies were beginning to be possible. Another year has passed, and the stream of theoretical advances shows no signs of abating. Many new substructure measurements and techniques have been proposed, and significant progress has been made in developing the theoretical tools to calculate distributions in substructure observables. Meanwhile, sufficient data now exist to study the boosted hadronic decays of heavy standard model particles such as the W boson and top quark.

This paper, an outgrowth of the BOOST 2011 workshop held at Princeton University in May 2011, aims to summarize recent theoretical and experimental progress, outline goals for the near future and provide benchmark comparisons and tools to help achieve these goals.

In section 2, we review recent substructure results at the Tevatron and LHC. In section 3, we survey new proposals for substructure techniques. In section 4, we describe the new software tools for studying jet substructure in FastJet 3. In section 5, we extend the benchmark top-tagging comparisons found in last year's report [1] with new methods, new Monte Carlo samples and detector simulation. The samples, as well as software to implement the techniques compared and our detector model, are publicly available, either as part of the FastJet 3.0.0 package or as FastJet-based tools. Finally, in section 6, we survey the status of substructure predictions and discuss goals for new calculations and measurements in the coming year.

2. New results from the Tevatron and LHC

Several recent experimental results from the Fermilab Tevatron were presented at the BOOST 2011 workshop, exploring different aspects of boosted physics. These could be categorized into two broad classes: studies that elucidate the behaviour of standard model physics processes when subjected to various degrees of boost, and searches for new physics using boosted object signatures. All the analyses were performed on samples of $\sqrt{s} =1.96$ TeV proton–antiproton collisions produced over the last 8 years at the Tevatron. Results from the Tevatron's two detectors are presented here in section 2.1 for CDF and section 2.2 for D0.

The LHC started colliding protons at a centre of mass energy $\sqrt{s} =$ 7 TeV on 30 March 2010. Just over 18 months and 5 fb⁻¹ later, the ATLAS and CMS experiments have collected sufficient data to have a realistic chance of using substructure techniques to uncover massive boosted particles such as W/Z bosons, top quarks and whatever else may lie in wait. The experimental results presented in these proceedings focus on the results made public prior to the BOOST workshop in May 2011. For both ATLAS and CMS, this means two analyses: a paper from each on jet shapes [2, 3] and a conference note on jet substructure [4, 5].

The most important and interesting results of these studies are summarized in section 2.3 for ATLAS and section 2.4 for CMS. Sensitivities to pile-up and detector effects, seen by the experimental community as the most pressing issues in jet and jet substructure studies, feature prominently.

All of these experiments must in varying degrees grapple with the presence of multiple simultaneous proton–(anti-)proton interactions, or pile-up within every bunch crossing. These additional collisions are uncorrelated with the hard-scattering process that triggers the event and create a background of soft diffuse radiation that offsets the energy measurement of jets and impacts jet shape and substructure measurements. It is essential that measurements of jet substructure be able to disentangle or correct for the influence of pile-up.

Observables designed to be sensitive to the internal structure of jets are expected to also be sensitive to pile-up [1]. Large-radius jets, such as those used in the measurements of jet substructure, are naturally more susceptible to pile-up due to their larger catchment area [6]; the invariant mass of these large jets is particularly affected [7]. Techniques for correcting for these effects—as in the CDF analyses—or mitigating their impact—such as the splitting and filtering procedure pioneered in ATLAS—are essential in producing precision measurements for several of the analyses presented by the Tevatron and LHC experiments. A more thorough review of these issues at ATLAS can be found in [8].

2.1. Results from CDF

2.1.1. Pile-up at CDF

In the context of measurements of the mass, angularity and planar flow of high-p_T jets [9], a new method of correcting the jet mass for pile-up was developed. The use of a complementary cone at right angles to the jet in azimuthal angle ϕ and at approximately the same pseudorapidity allows the energy density from both underlying event (UE) and multiple interactions (MI) to be measured. The UE refers to the interactions and hadronization of partons in the colliding protons other than the partons in the hard process. MI refer to both 'in-time' and 'out-of-time' pile-up: the former describing multiple proton–proton collisions in a single bunch crossing, and the latter describing the delayed instrumental effects of previous crossings. The incoherent contributions to the shift in jet mass from MI were isolated from the partially coherent contributions due to UE by examining the mass shift as a function of the number of good vertices in the event. The average shift in the jet mass when adding the towers from the complementary cone into the jet is shown in figure 1 along with the MI+UE corrections measured for angularity and planar flow. The high mass selections made as part of the angularity and planar flow measurements resulted in too few events to separate the UE (single-vertex) and MI (single- and multiple-vertex) components.

**Figure 1.** The correction to the jet mass (a) from additional energy deposition due to MI+UE (Nvtx > 1 events) compared with the jet mass corrections for UE alone (Nvtx = 1 events) for jets with a cone size of R = 0.7. The estimated shift from the combination of UE and MI in planar flow (b) and angularity (c) as measured in data. The average number of collisions per bunch crossing is ∼3 for this data sample.
Download figure:
Standard image

2.1.2. New physics searches in multijet events at CDF

The results of a search for pair production of a supersymmetric particle that decays strongly and violates R-parity were described in [10]. The final state of interest was at least six quarks, most observed as separate jets, and no missing transverse energy in the event. The challenge for this search was to reduce the large backgrounds from QCD multijet production, which was done by placing specific kinematic requirements on three-jet triplets in the final state.

The analysis sought events in a CDF sample that satisfied a trigger requiring at least four jets with p_T > 15 GeV and the sum of the calorimeter transverse energy greater than 175 GeV. Events were furthermore required to have at least six jets with p_T > 15 GeV and |η| < 2.5, and the scalar sum of the six highest-energy jets was required to be greater than 250 GeV. Any jets beyond the six highest-energy jets were not used in this analysis. The missing transverse energy, ${\unicode{x246}}_{T}$ , was required to be less than 50 GeV in order to reduce contributions from W boson final states and mis-measured QCD events.

All 20 combinations of two three-jet triplets that could be produced from six jets were considered and the invariant mass of each combination, M_jjj, and the sum of the magnitudes of the transverse momenta of the three jets, ∑_jjj|p_T|, were formed. Monte Carlo studies have shown that requiring

$\begin{eqnarray} &&\sum _{jjj} |p_{T} | - M_{jjj} > 115 \;{\rm GeV} \end{eqnarray} \tag{ 1 }$

is an efficient way of separating potential signal combinations (where M_jjj would be a constant reflecting the mass of the parent supersymmetric parent) from the QCD and combinatorial backgrounds. This is in effect a boosted three-jet final state. The shape of the backgrounds in M_jjj were estimated by using a five-jet final state, showing that the background is expected to peak around 100 GeV.

The M_jjj distribution was then fit to a combination of signal and background terms, where the signal was defined by a PYTHIA Monte Carlo calculation for R-parity violating (RPV) gluino pair production. Although the acceptance for the gluino final state is quite low (roughly 5 × 10⁻⁵), CDF was able to set significant limits on the RPV gluino cross section, which were then converted into lower limits on the gluino mass. Lower mass limits ranging from 144 to 154 GeV at 95% C.L. were set on the gluino mass, depending on the assumptions about the spectrum of intermediate supersymmetric final states. What is perhaps as interesting is that CDF observed evidence for a boosted top quark signature. In the top quark mass region of M_jjj ∼ 175 GeV, CDF observed 11 ± 5 jet triplets. Although one expects on average only one top quark event in this kinematic region, the shape of the mass distribution is consistent with what one expects from MC simulations.

2.1.3. Boosted top quark search at CDF

CDF presented updated results on the measurements of jet mass, angularity and planar flow for jets with p_T > 400 GeV from a sample of 5.95 fb⁻¹ [11]. The measured distributions were compared with analytical expressions from NLO QCD calculations, as well as PYTHIA 6.1.4 predictions incorporating full detector simulation. The theory predictions for jet mass were in good agreement with the data, whereas the angularity and planar flow predictions by PYTHIA showed disagreement in detail (primarily at low angularity and low planar flow).

CDF used these data to also search for a signal of boosted top quark production. Candidate events were selected in two channels: the fully hadronic decays where both top quarks produce massive high-p_T jets (the '1+1' channel), and the decays where one top quark decays semi-leptonically resulting in one massive high-p_T jet recoiling against a second lower-mass jet and significant missing transverse energy (the 'SL' final state). A signal region in the 1+1 mode was defined by requiring both leading jets to have a mass between 130 and 210 GeV (see figure 2), while the signal region in the SL mode requires the leading jet to have 130 < m^jet1 < 210 GeV and the event to have missing transverse energy significance

$\begin{eqnarray} &&S_{{\rm MET}} \equiv {\unicode{x246}}_{T} / \sqrt{\sum E_T} \in (4,10). \end{eqnarray} \tag{ 2 }$

This cut rejects primarily QCD dijet events. The remaining QCD backgrounds are estimated by looking at the event rates in sideband regions of jet mass and S_MET. The analysis observes 31 and 26 candidate events in the 1+1 and SL channels, respectively, and the backgrounds are estimated to be 14.6 ± 2.7 and 31.3 ± 8.1 events, respectively. The total number of top quark events expected in the two channels is 4.9 ± 2.1 candidates.

**Figure 2.** The m^jet2 versus m^jet1 distribution for all events with at least one jet with *p_T* > 400 GeV and η < 0.7, using R = 1.0 midpoint cones. MI corrections have been performed and all events are required to have S_MET < 4.
Download figure:
Standard image

Although the data are consistent with a boosted top quark signature, they are not statistically strong enough to claim observation. Rather, they are used to set an upper limit of 38 fb on top quarks produced with p_T > 400 GeV. The SM expectation for this cross section is 4.5 fb.

2.1.4. Search for lepton jets at CDF

There are several theories beyond the standard model that predict the production of cascades of particles that appear in the final state as a 'lepton jet'. The CDF collaboration reported on a search for such objects using 5.1 fb⁻¹ of $\sqrt{s} = 1.96$ TeV proton–antiproton collisions at the Tevatron. The search looked for events with a large number of low-energy leptons produced in association with a W or Z boson.

Events were selected by performing the standard selection for W or Z boson candidates, requiring at least one well-identified electron or muon candidate and then either significant ${\unicode{x246}}_{T}$ or a second well-identified charged lepton of the same flavour but opposite charge. A 'soft lepton' algorithm was then employed to identify additional electron or muon candidates down to a transverse momentum of 1 GeV for electrons and 3 GeV for muons. The numbers of events in the zero or one additional lepton bins were used to scale the expected backgrounds for the signal region defined by two or more soft lepton candidates. The analysis found that the dominant backgrounds came from inclusive W+jets production where one or more of the jets were mis-identified or the Drell–Yan production where the additional leptons were mis-identified jets. Other sources of background, such as W+c-quark, W+b-quark, $t\bar{t}$ and di-vector boson production, were also evaluated.

The potential signal was modelled on a benchmark process defined by a neutralino model with a 'hidden' Higgs boson coupled to a dark sector, where the dark sector particles decay to pairs of charged leptons [12]. The channels with the best signal-to-background for this model were those with a W or Z boson with at least three additional muons, either with none or one additional electron; for example, in the channel with a W + 3 additional muons, one expected 1.5 ± 1.2 background events and nine signal events (only two events were observed). There was no signal observed above background, and a 95% C.L. upper limit on the production cross section of a W or Z boson produced in association with a Higgs boson with the expected couplings of 27 fb was set. This allowed the collaboration to rule out the benchmark model.

This analysis showed the effectiveness of using the lepton jet signatures to search for evidence of new physics.

2.2. Results from D0

2.2.1. Colour flow in D0 $t\bar{t}$ events

The D0 experiment presented recent results showing how the colour flow that is expected to arise between two jets can be used as a discriminant to identify W boson hadronic decays in a 5.3 fb⁻¹ sample of $t\bar{t}$ events [13]. Since the W decay products form a colour singlet, they produce an antenna radiation pattern, with most soft particles emitted between the two jet directions. The 'pull' of the jets' radiation towards each other can be used to more effectively identify dijet systems arising from a specific colour state.

Events were selected by requiring the traditional lepton+jets final state, with a charged lepton, missing transverse energy and four or more jets. At least two of the jets had to be tagged as b-quark candidates, resulting in 728 candidate events with an estimated background rate of 82 ± 9 events in the sample not arising from $t\bar{t}$ production. The jet pairs that had an invariant mass within 30 GeV of the W boson mass were then selected and the shape of the energy depositions studied to look for a signal consistent with the colour singlet. The minimum relative pull angle for jet pairs satisfying |m_W − m_j1j2| < 30 GeV, ΔR_{j1, j2} < 2 and |η| < 1 is shown in figure 3(a). The expected colour effect in the relative orientations of the jet pulls was observed when comparing the daughter jets from W boson decays and the b-quark jets, though the statistical power of the measurement was modest and the kinematics of the pairs of jets were not identical. A more direct comparison was made by producing Monte Carlo $t\bar{t}$ decays with a W boson produced as a colour octet compared with the expected colour singlet state, and a series of detailed studies were performed to evaluate the systematic uncertainties on this measurement.

**Figure 3.** The minimum relative pull angle for W jets (a) is measured for jets satisfying |*m_W* − m_j1j2| < 30 GeV, ΔR_{j1, j2} < 2 and |η| < 1. The expected C.L. bands for f_singlet are shown in (b); the dashed line shows the expected value and the dotted line indicates the measured value of f_singlet.
Download figure:
Standard image

The actual quantity measured was the fraction of singlet W boson decays compared with the total number, giving

$\begin{eqnarray} &&f_{\rm singlet} = 0.65^{+0.37}_{-0.38}\, {\rm (stat.)} \pm 0.22\, {\rm (syst.)}. \end{eqnarray} \tag{ 3 }$

This could be turned into a lower limit on the fraction of singlet decays of f_singlet > 0.277 at 95% C.L. Although this is somewhat higher than the limit that might have been expected given the data sample, it is consistent with the expectation within statistical uncertainties. The expected C.L. bands for this limit are shown in figure 3(b).

This means of measuring the colour flow provides an additional discriminant that can be used to search for decays of other boosted objects (such as $H \rightarrow b\bar{b}$ ) and gives us confidence that the modelling of colour flow is being done well in current Monte Carlo calculations.

2.3. Results from ATLAS

2.3.1. ATLAS and pile-up

In the recent ATLAS analysis of 35 pb⁻¹ of data [4], the sensitivity of the individual jet mass to pile-up is directly tested. The increase in the mean jet mass as a function of the number of reconstructed good vertices (good vertices being defined as those with at least 5 tracks with p_T > 150 MeV), N_PV, is measured for jets with a p_T of at least 300 GeV and rapidity |y| < 2. The results are shown in figure 4 for jets of various radii and built using two different jet algorithms. The mean jet mass is observed to increase linearly with N_PV. For events with only a single primary vertex, the mean jet mass increases linearly with R. The slope of the mean jet mass as a function of the number of primary vertices, though, exhibits a dependence on R that is consistent with the ratio of the third power of the jet resolution parameter. This follows the prediction of pQCD calculations [14]: the extra mass from an additional particle k scales like δm² ≈ 2p_T · k ∼ p_Tp^k_TΔR²_k, so the net change from a uniform pile-up p_T density ρ is δm² ∼ p_Tρ∫dAΔR² ∼ p_TρR⁴, and hence δm ∼ R⁴ρp_T/m. Since m ∼ R, the derivative dm/dρ then scales like R³ for fixed p_T.

Figure 4 also shows the impact of pile-up on each of the three jet algorithms used in the ATLAS study. Specifically, figure 4(b) shows the dependence on N_PV for the mean jet mass before and after the splitting and filtering procedure. The filtering procedure significantly reduces the effect of pile-up on jet mass. In fact, the slope of a straight line fit is consistent with zero. This result demonstrates that certain well-designed jet grooming procedures can significantly reduce the impact of pile-up and that the efficacy of the method can be tested in situ. Methods to correct for these effects and to provide more general pile-up mitigation procedures using jet grooming are underway in ATLAS.

2.3.2. Jet shapes with ATLAS

The study of jet shapes in proton–proton collisions provides information about the details of the parton-to-jet fragmentation process [15]. The internal structure of sufficiently energetic jets is primarily dictated by the emission of multiple gluons from the primary parton, calculable in perturbative QCD (pQCD). The shape of the jet depends on the type of parton (quark or gluon) and is also sensitive to non-perturbative fragmentation effects and UE contributions from the interaction between proton remnants. A proper modelling of the soft contributions is crucial for the understanding of jet production in hadron–hadron collisions and for comparison of the jet cross section measurements with pQCD theoretical predictions.

The ATLAS collaboration recently published results on differential and integrated jet shapes for jets constructed with the anti-k_T algorithm [16] with distance parameter R = 0.6 [2]. The jet shapes were calculated using the constituents associated with the jets, which were 3D topological calorimeter clusters [17].

The analysis follows similar measurements undertaken by CDF for $p\bar{p}$ collisions [18–20] by ZEUS for e^±p [21–24] and OPAL for e⁺e⁻ [25, 26] collisions.

The ATLAS analysis uses data corresponding to 3 pb⁻¹ of total integrated luminosity, collected during 2010. This period of data collection can safely be referred to as ATLAS's early days, when pile-up was not the burning issue that it quickly became in 2011, after about 35 pb⁻¹ had been collected. In 2010, the average number of good vertices in a single hard scatter was approximately 2. In 2011 data, this average is closer to 10, and increasing with luminosity.

The measurements are carried out for jets with p_T > 30 GeV and |y| < 2.8.

The measurements of integrated and differential jet shapes, Ψ (equation (5)) and ρ (equation (4)), are corrected to hadron level and compared to several Monte Carlo predictions. Different phenomenological models are employed to describe the fragmentation processes and UE contributions.

The comparisons presented in [2] were complemented with additional results in a second publication [27] including the predictions from new generators and up-to-date MC tunes, and are reported in this contribution to the proceedings.

The differential jet shape ρ(r) as a function of the distance $r=\sqrt{\Delta y^2 + \Delta \phi ^2}$ to the jet axis is defined as the average fraction of the jet p_T that lies inside an annulus of inner radius r − Δr/2 and outer radius r + Δr/2 around the jet axis:

$\begin{equation} \rho (r) = \frac{1}{\Delta r}\frac{1}{N^{\rm jet}}\sum _{\rm jets}\frac{p_{T}(r-\Delta r/2,r+\Delta r/2)}{p_{T} (0,R)},\ \Delta r/2 \le r \le R-\Delta r/2, \end{equation} \tag{ 4 }$

where p_T(r₁, r₂) denotes the summed p_T of the clusters in the annulus between radius r₁ and r₂, N^jet is the number of jets, and R = 0.6 and Δr = 0.1 are used.

The points from the differential jet shape at different r values are correlated since, by definition, ∑^R₀ρ(r) Δr = 1. Alternatively, the integrated jet shape Ψ(r) is defined as the average fraction of the jet p_T that lies inside a cone of radius r concentric with the jet cone:

$\begin{equation} \Psi (r) = \frac{1}{N^{\rm jet}}\sum _{\rm jets}\frac{p_{T}(0,r)}{p_{T}(0,R)},\ 0 \le r \le R, \end{equation} \tag{ 5 }$

where, by definition, Ψ(r = R) = 1, and the points at different r values are correlated.

A summary of the results presented in [2] are given in figure 5 for the differential jet shape in the bin 30 GeV < p_T < 40 GeV and in figure 6 for the integrated jet shape at r = 0.3 in jets with 30 GeV < p_T < 600 GeV.

The measured jets become narrower as the jet transverse momentum and rapidity increase, although with a rather mild rapidity dependence. The results indicate that the predicted jet shapes are mainly dictated by the details of the implementation of the parton showers and the modelling of the UE in the Monte Carlo generators. The presence of contributions from higher-order pQCD matrix elements in the final state does not affect the predicted jet shapes significantly.

The PYTHIA, HERWIG++ and HERWIG/JIMMY Monte Carlo generators, using the latest tunes to describe minimum bias and UE-related observables in data, provide a good description of the measured jet shapes, although the latest version of HERWIG++ tends to produce jets narrower than the data. As expected, PYTHIA 6 AMBT1 tends to produce jets slightly narrower than the data as it underestimates the UE activity in dijet events [28] and lacks a tuned final-state parton shower from the initial-state radiation (ISR). The effect of the wrong setting in 'HERWIG++ 2.4.2 bug'⁴³ is only visible for jets with p_T < 40 GeV .

The different SHERPA predictions are similar and provide a reasonable description of the data. This indicates that the presence of additional partons from higher-order matrix element contributions does not affect the predicted jet shapes, mainly dictated by the soft radiation in the parton shower. The comparison between SHERPA 1.2.3 and SHERPA 1.3.0 shows that the NLL-inspired corrections included in the latter for the parton shower do not impact significantly the predicted jet shapes. ALPGEN interfaced with PYTHIA predicts too-narrow jets and does not describe the data. This was already the case for ALPGEN interfaced with HERWIG/JIMMY [2] and requires further investigation to determine whether the disagreement observed with the data can be completely attributed to the UE modelling in the MC samples or is also related to the prescription followed by ALPGEN in merging the partons from the matrix elements with the parton showers in the final state.

Finally, POWHEG interfaced with HERWIG/JIMMY provides a reasonable description of the data while the interface with PYTHIA predicts too-narrow jets, which is mainly attributed to the details of the UE modelling.

These results confirm the sensitivity jet shape measurements, and their potential to constrain different Monte Carlo models.

2.3.3. Boosted top quarks with ATLAS

First steps have been made this year by ATLAS in the quest to reconstruct the decay of a boosted top quark in a single jet. The first publication has been delivered in the context of the search for massive exotic resonances decaying to $t\bar{t}$ [29]. This analysis was able to isolate a clean sample of boosted top quarks. These events are the first candidates for boosted top quarks reconstructed as single jets at ATLAS.

Although the statistics with the 2010 dataset were too low to provide a testing ground for the many substructure techniques and variables we are interested in testing on such a clean sample of boosted top quarks, the groundwork was laid down to make the prospects for boosted top analyses on 2011 data very exciting.

2.3.4. Jet substructure with ATLAS

One of the most important tasks for the early ATLAS boosted object analyses was to begin subjecting the various grooming methods and variables, initially implemented in Monte Carlo, to the tests that come with the conditions of a real detector. The recent ATLAS analysis on jet mass and substructure has done just that [4]. The analysis was based on 35 pb⁻¹ of data taken in 2010 and presented the mass and splitting scale of high p_T (>300 GeV) jets reconstructed with the anti-k_T (R = 1.0) algorithm and with the Cambridge–Aachen (R = 1.2) algorithm at different stages of grooming. The grooming method employed was a mass-drop filtering method which looks for hard subjets within the very large parent jet, and discards any radiation not falling within the subjets designated to the hard splitting. As already discussed in section 2.3.1, the behaviour of the mass of large-area jets as a function of the number of good vertices in an event was studied and the process of filtering away the soft radiation in such jets was found to effectively remove this dependence. This is perhaps unsurprising, as the radiation present in an event due to pile-up or UE is expected to be a fairly soft continuum. The analysis discarded all events with more than one good vertex, focusing on the 28% of the 2010 dataset that is regarded as being free from pile-up⁴⁴.

The final distributions resulting from the analysis were unfolded to hadron level. The mass distributions of Cambridge–Aachen jets with R = 1.2 are shown in figure 7 before and after employing mass-drop filtering with the requirement that the heavier subjet carry less than μ = 67% of the mass of the parent jet and the splitting between subjets is fairly symmetric, $y_2 \equiv \frac{\min \left(p^2_{T,a}, \,\, p^2_{T,b}\right)}{m^2_{\rm jet}} \Delta R^2_{ab} < y_2^{\rm cut}=0.09$ . Figure 8 shows the mass and splitting scale ( $\sqrt{d_{12}}= \min (p_{T,a}, p_{T,b})\times \Delta R_{ab}$ ) of anti-k_T jets with R = 1.0.

**Figure 7.** The invariant mass distribution of Cambridge–Aachen R = 1.2 jets (left) and the mass distribution after filtering (right). Both distributions are unfolded to stable-particle level and compared to PYTHIA, HERWIG and HERWIG++. The systematic uncertainties are shown by the shaded band.
Download figure:
Standard image

**Figure 8.** The invariant mass distribution of anti-*k_T* R = 1.0 jets (left) and the splitting scale $\sqrt{d_{12}}$ of the same jets (right). Both distributions are unfolded to stable-particle level and compared to PYTHIA, HERWIG and HERWIG++. The systematic uncertainties are shown by the shaded band.
Download figure:
Standard image

**Figure 8.** The invariant mass distribution of anti-*k_T* R = 1.0 jets (left) and the splitting scale $\sqrt{d_{12}}$ of the same jets (right). Both distributions are unfolded to stable-particle level and compared to PYTHIA, HERWIG and HERWIG++. The systematic uncertainties are shown by the shaded band.
Download figure:
Standard image

In all cases, the distributions are found to be well modelled by a range of leading-order parton shower Monte Carlos to within the systematic uncertainties.

2.4. Results from CMS

2.4.1. CMS and pile-up

One advantage of the CMS detector and reconstruction pipeline is the so-called particle flow algorithm for a more holistic approach to jet reconstruction. Because of this approach, it is possible to remove around 60% of the pile-up from jets directly as it corresponds to charged tracks which can be associated with subleading primary vertices. This aids in the reduction of pile-up in jets with substructure at CMS, and studies show that there is only moderate dependence of the substructure variables on pile-up for the luminosities considered [5].

The CMS collaboration plans to retain a data-driven approach to measuring mis-tag rates, ameliorating pile-up issues there. Further studies are ongoing to utilize other advanced jet reconstruction techniques to further reduce pile-up dependence.

2.4.2. Jet shapes with CMS

Several jet shape measurements have been performed at CMS similar to those at ATLAS described above in section 2.3.2. The first is the examination of the classic jet shapes, and the second is the examination of the jet charged component structure, both in [3].

Several differences exist between the ATLAS and CMS results. While both collaborations use the anti-k_T jet algorithm, CMS uses different values of the radius parameter R. For the inclusive shape measurements, two types of jets were used: calorimeter-only jets and jets where the calorimeter response was corrected using associated tracks ('jet-plus-tracks' jets) [30]. To reduce out-of-cone radiation, a large R of 0.7 was used. The jets were required to have (track-corrected) transverse momenta greater than 15 GeV. For the charged multiplicity measurement, only the jet-plus-tracks jets were used, with R = 0.5. The (track-corrected) transverse momenta for the first two jets were required to be larger than 20 and 10 GeV, respectively.

The distribution of Ψ(r) (equation (5)) is shown in figure 9. The Monte Carlo simulation is doing a very reasonable job at predicting the data, and the agreement improves with larger transverse momentum.

2.4.3. Boosted top quarks and W bosons with CMS

CMS has developed algorithms to identify the hadronic decays of boosted top quarks and W bosons. The top-tagging algorithm is based on the Johns Hopkins tagger [31], with small modifications as described in [32, 1]. The W-tagging algorithm is based on jet pruning [33, 34], adding a requirement on the mass drop of the subjet splitting, which was motivated by the BDRS subjet/filter algorithm [35].

The algorithms were characterized as of the BOOST 2011 conference [5], and subsequent studies have advanced this characterization [36]. The strategy for the studies here is to have data-driven mis-tag predictions which can accurately represent non-top and non-W backgrounds. Sidebands in the data are selected that have little to no true top or W contributions, and the tagging rate is derived as a function of the jet transverse momentum. These per-jet parameterizations are then applied to each jet in a signal selection, which is used to form a data-driven background estimate. This has several advantages, primarily that modelling effects are not relevant. Moreover, pile-up effects are handled directly, because the same data sample is used in the sidebands and the signal region, so the pile-up contribution is the same.

The efficiency for the tagging algorithms is derived from Monte Carlo, and corrected for a data-to-Monte-Carlo scale factor using standard model $t\overline{t}$ events in the semi-leptonic channel in [36]. In that public result, limits are set on the possible cross section for narrow resonances decaying to $t\overline{t}$ pairs. Figure 10 shows the mis-tag rates and efficiencies for the top and W tagging algorithms from CMS.

**Figure 10.** Mis-tag rates and efficiencies for the top tagging (top) and W tagging (bottom) algorithms. The mis-tag rates are measured in a signal-depleted control sample and compared to PYTHIA and HERWIG++ predictions. The efficiencies for tagging top quarks and Ws are estimated from PYTHIA simulation.
Download figure:
Standard image

2.4.4. Jet substructure at CMS

The study of the performance of jet substructure techniques in generic QCD samples is critically important to understand these algorithms, and to derive appropriate sideband regions for data-driven background estimates at CMS. As such, CMS has performed detailed comparisons of jet substructure observables in [5].

For example, figure 11 shows the jet mass for the top-tagging and W-tagging algorithms, compared to several different predictions from Monte Carlo. The simulation seems to be predicting the data quite nicely for all of the generators examined, with slight variations based on the shower model and UE tunes. This seems to have a much larger effect on the substructure than the pile-up contributions.

3. New tools and techniques

Jet substructure techniques attempt to extract additional information about a jet from a detailed study of its constituents. These methods have been mainly aimed at distinguishing boosted hadronic objects like top quarks and W/Z/Higgs bosons from the background of jets initiated by light quarks and gluons. Many such techniques exist, and to help make sense of them, the BOOST 2010 report [1] proposed that they be classified according to three broad categories: (1) methods that directly identify subjets within a jet, (2) methods which 'groom' away uncorrelated radiation within a jet and (3) methods based on energy flow within a jet.

Since the BOOST 2010 workshop, a variety of new techniques have been introduced which will be described in more detail below. A recent and thorough review of substructure techniques applied to top tagging can be found in [37]; here, we focus on the developments since BOOST 2010. N-subjettiness [38, 39] and dipolarity [40] are examples of hybrid jet shapes, which describe the energy flow of a jet (as in method (3) above) with respect to candidate subjet axes (determined using e.g. method (1) above). Jet substructure through angular correlation functions (ACFs) [41] is a complementary technique to energy flow observables. The template overlap method [42] and the shower deconstruction method [43] classify jets with the help of approximations to hard matrix elements and the parton shower. Beyond the highly boosted regime, the HEP (Heidelberg–Eugene–Paris) top tagger [44] is appropriate for identifying moderately boosted top quarks. Meanwhile, substructure techniques have been used in a variety of interesting applications, including separating quark jets from gluon jets [45, 46], tagging jets from ISR [47] and identifying boosted decay products of new physics [48, 49].

3.1. N-subjettiness

In [38], Thaler and van Tilburg introduced a new jet shape 'N-subjettiness' (denoted τ_N), designed to identify boosted N-prong hadronic decays. N-subjettiness quantifies the degree to which jet radiation is aligned along specified subjet axes, such that small values of τ_N correspond to N or fewer subjets, while large values of τ_N indicate more than N subjets. This jet shape was adapted from the event shape N-jettiness introduced in [50] to define exclusive jet cross sections, and similar ideas were pursued by Kim in [39].⁴⁵

Given candidate subjet directions determined by an external algorithm (such as the exclusive k_T procedure [51, 52]), τ_N is defined as

$\begin{equation} \tau _N = \frac{\sum _k p_{T,k} (\min \lbrace \Delta R_{1,k},\Delta R_{2,k},\ldots , \Delta R_{N,k} \rbrace )^\beta }{\sum _k p_{T,k} (R_0)^\beta }, \end{equation} \tag{ 6 }$

where the sum runs over the particles in the jet, p_{T, k} is the transverse momentum of particle k, ΔR_{A, k} is the azimuth-rapidity distance between the subjet axis A and particle k, and R₀ is the characteristic jet radius defined such that 0 ⩽ τ_N ⩽ 1. The constant β is an angular weighting exponent closely related to angularities [53], and 1-subjettiness roughly corresponds to jet angularities [54] with a ≡ 2 − β.

To separate boosted hadronic objects from the QCD jet background, one could use the complete set of τ_N values (with different values of β) in a multivariate analysis. However, reference [38] showed that a simple cut on the ratio τ_N/τ_{N − 1} provides excellent discrimination power for N-prong hadronic objects. In particular, τ₃/τ₂ is a successful boosted top discriminator, and τ₂/τ₁ can identify boosted W/Z and Higgs bosons, with the angular weighting exponent β = 1 (corresponding roughly to jet broadening [55]) providing the best discrimination. In a subsequent work [56], Thaler and van Tilburg showed that the initial step of choosing candidate subjet axes is in fact unnecessary. In particular, the quantity in equation (6) can be minimized over the candidate subjet directions using a variant of the k-means clustering algorithm [57], further improving boosted object discrimination.

3.2. Dipolarity

A new colour flow observable, 'dipolarity', was introduced by Hook, Jankowiak and Wacker to discriminate between different colour configurations of a given pair of subjets j₁ and j₂ [40]. Dipolarity is given by a sum in which each constituent of j₁ + j₂ is weighted by its p_T and its squared angular separation ΔR² from the line segment connecting j₁ and j₂ in the η-ϕ plane:

$\begin{equation} \mathcal {D} \equiv \frac{1}{\Delta R_{12}^2 }\sum _{i\in J} \frac{p_{Ti}}{p_{T_J}}\Delta R_i^2 . \end{equation} \tag{ 7 }$

For subjets j₁ and j₂ in a colour singlet configuration, the radiation pattern is of the dipole form with most radiation clustered in the region between the two subjets. Consequently, $\mathcal {D}$ is expected to be small for colour singlet configurations and larger for other colour configurations, in which j₁ and j₂ are colour-connected to other subjets.

By considering the entire radiation pattern of the two subjets at once, dipolarity is designed to be most effective in the semi-boosted regime, where there can be considerable overlap between the two subjets. This is in contrast to jet pull [58], which was introduced with the low boost regime in mind and which can lose discrimination power if there is substantial overlap. As a first application, dipolarity has been incorporated into the HEP top tagger [44], where it was shown to improve background rejection by probing the colour structure of the reconstructed W boson. More work will be needed to determine whether dipolarity can be applied effectively outside of top tagging.

3.3. Jet substructure without trees

Jankowiak and Larkoski developed a method for identifying substructure within jets via angular correlations [41], introducing an ACF ${\mathcal G}(R)$ :

$\begin{equation} \mathcal {G} (R)\equiv \frac{\sum _{i\ne j}p_{T i}p_{T j} \Delta R_{ij}^2\Theta (R-\Delta R_{ij})}{\sum _{i \ne j}p_{T i}p_{T j}\Delta R_{ij}^2}, \end{equation} \tag{ 8 }$

where the sum runs over all pairs of jet constituents. The ACF measures the contribution to a jet's mass from pairs of constituents separated by an angular scale R or less. A high-p_T QCD jet has an ACF that goes approximately like a power of R, since it is nearly scale-invariant. By contrast, a jet initiated by a heavy particle decay has one or more intrinsic scales, which results in an ACF with one or more 'cliffs'⁴⁶.

For a given jet, numerous infrared/collinear-safe observables can be constructed from the ACF, and these can be used to characterize the jet's substructure. For example, one can look at the angular scales at which cliffs are located as well as the corresponding cliff heights. Cliff heights are closely related to mass drops as utilized in BDRS mass-drop/filtering [35] and can be used to extract mass scales that correspond to hard substructure in the jet. As a first application of these ideas, Jankowiak and Larkoski developed a top tagging algorithm whose performance is competitive with others in the literature. Other applications remain to be explored.

3.4. Template overlap

The energy distribution resulting from hard scatterings can be well described by energy correlation functions in momentum space. In QCD, these naturally describe jet cross sections in terms of energy flow observables, which are peaked around the states associated with the hard scattering that subsequently initiate the jets. Therefore, energy flow observables within the jet should be of particular interest to substructure studies. In [42], Almeida, Lee, Perez, Sterman and Sung developed a method based on the quantitative comparison of the energy flow of observed jets at high-p_T with the flow from selected sets (the templates) of partonic states.

The template overlap procedure can be summarized as follows. Let |j〉 denote the set of particles or calorimeter towers that make up a jet, identified by some algorithm, and take |f〉 to represent a set of partonic momenta p₁...p_n that represent a boosted decay, found by the same algorithm. The functional measure ${\mathcal F}(j,f)\equiv \langle f | j \rangle$ quantifies how well the energy flow |j〉 matches the (templates) |f〉. In practice, reference [42] found good results with a simple construction of functional overlap based on a Gaussian in energy differences within angular regions surrounding the template partons. Any region of partonic phase space for the boosted decays, {f}, defines a template. Knowledge of the signal and background can be used to design a custom analysis for each resonance, to make use of differences in energy flow between signal and background. The template overlap of an observed jet j is defined as $Ov(j,f[j])={\rm max}_{\,\lbrace f\rbrace }\, {\mathcal F}(j,f)$ , the maximum functional overlap of j to a state f[j] within the template region, where f[j] stands for the state of maximum overlap, emphasizing that the value of the overlap functional depends not only on the physical state |j〉 but also on the choice for the set of template functions f.

Template overlaps provide a tool to match unequivocally arbitrary final states j to partonic partners f[j] at any given order. Once a 'peak template' f[j] is found, it can be used to characterize the energy flow of the state, which gives additional information on the likelihood that it is a signal or background. In addition, template overlaps can be combined with higher moments of the energy distribution or jet shapes to further discriminate the event.

3.5. Shower deconstruction

Shower deconstruction, proposed by Soper and Spannowsky, is a method to look for new physics in a hadronic environment [43]. First, one picks the part of the event that is likely to be of interest, for instance, the part contained in a large-radius jet that possibly contains the decay products of a boosted heavy particle. This part of the event is divided into small radius jets called the microjets that are ideally the size of topoclusters or calorimeter towers. If there are too many microjets to analyse, one can discard the microjets with the lowest transverse momenta. Shower deconstruction uses the four-momentum and possibly b-tag information for each microjet.

The aim of shower deconstruction is to calculate a single number χ for each event such that events with small χ are likely to be background events and events with large χ are likely to be signal events. The number χ is an approximation to the ratio P(S)/P(B) of the probability P(S) that a parton shower Monte Carlo that represents the sought signal process would generate the given event to the probability P(B) that a parton shower Monte Carlo that represents the background process would generate the event. The function P(S) is calculated as of a sum, over all possible shower histories for the signal hypothesis, of weights that are a product of splitting-kernels and Sudakov factors. P(B) is calculated the same way, but for the background hypothesis.

Although shower deconstruction is not limited to boosted configurations, the computing time increases strongly with the number of microjets. Boosted configurations are known to ameliorate combinatoric problems in reconstructing a resonance that decays hadronically because all decay products can be in one wide angle jet. Thus, reference [43] presents a first application of shower deconstruction using the HZ production channel, where the boosted Higgs decays to a $b\bar{b}$ pair, first discussed in [35]. The statistical significance obtained with the shower deconstruction algorithm is found to be larger than that obtained with the method of [35].

3.6. HEP top tagger

Unlike other taggers, the HEP top tagger, proposed by Plehn, Salam and Spannowsky, is not motivated by searches for resonances decaying to two highly relativistic top quarks [44]. Instead, its first application was the notorious Higgs search channel $pp \rightarrow t\bar{t}H$ with a hadronically decaying $H \rightarrow b\bar{b}$ [35]. In the standard model, one can expect several percent of the events to have transverse momenta in the range p_{T, H} ≳ m_H and p_{T, t} ≳ m_t for the leading hadronically decaying top quark. To extract the $t\bar{t}H$ signal from the continuum QCD background, reference [44] required two fat jets, one from a boosted Higgs and one from a boosted top quark.

Another application of top tagging in a moderately boosted regime is identifying top partners—like a supersymmetric top squark—decaying to a top quark and an invisible dark matter agent [59]. Similar to the $t\bar{t}H$ channel, such searches suffer from combinatoric backgrounds. Using this tagger, one can exploit purely hadronic top decays and extract the stop pair signal out of backgrounds.

Algorithmically, the HEP top tagger is motivated by the BDRS Higgs tagger. In particular, it starts with a large, R = 1.5, Cambridge–Aachen jet. This size immediately translates into a minimum transverse momentum condition of p_{T, t} ≳ 200 GeV. This fat jet is unclustered using an iterative mass-drop criterion, with a general cutoff at m_j > 30 GeV for the subjets. Next, filtering [35] is applied to sets of three hard subjets, using five constituents, and a combination of three subjets is chosen with jet mass closest to the top mass. To reconstruct the W mass, note that for the top decay kinematics, it is surprisingly likely that more than one of the three m_jj combinations lies within 15% of the W mass. Therefore, the HEP top tagger does not aim to distinguish the b jet from the two W decay jets, but instead applies a more democratic subjet mass criterion described in [59]. Finally, a self-consistency condition is applied on the reconstructed transverse momentum p_{T, t} > 200 GeV.

In a recent application, Plehn, Spannowsky and Takeuchi studied semi-leptonic decays of top partners into two top quarks and missing energy [60]. The hadronic top was reconstructed with the usual tagging algorithm. For the leptonic top, the unmeasured neutrino three-momentum was reconstructed based on the assumption of a boosted top decay. Two of the three unknown components can be reconstructed using the top and W mass constraints. For the third component, one can analyse the top decay in a specific rest frame and find that one of the neutrino momentum components is strongly suppressed. This way, one can approximately reconstruct the neutrino momentum and compare it to the measured two-dimensional missing transverse momentum vector. The results for supersymmetric top squarks are promising over a wide range of masses with a similar reach in the hadronic and semi-leptonic channels.

As this paper was in preparation, several further extensions of the HEP top tagger were proposed [61].

3.7. Quark versus gluon separation

Being able to distinguish light-quark jets from gluon jets on an event-by-event basis could significantly enhance the reach of many new physics searches at the LHC. The two prongs of this effort are finding intra-jet observables whose distributions are significantly different between the flavours, and finding relatively pure samples of quark and gluon jets to measure these observables in data. Identifying quark and gluon jets was also studied by ATLAS in the context of reducing jet energy scale uncertainty [62].

In [45], Gallicchio and Schwartz systematically examined many existing and novel jet substructure observables to find the ones whose distributions, for a given jet η and p_T, are the most powerful single- and multi-variable discriminants. It turned out that a combination of the charged track multiplicity and the p_T-weighted linear radial moment (girth) performed almost as well on particle-level Monte Carlo as discriminants with more variables. Over 95% of the gluon jets can be filtered out while keeping more than half of the light-quark jets.

The best single observable was the number of charged particles within the jet (which were required to have p_T > 500 MeV). The discrimination power improved with jet p_T, and the strength relative to other observables was greatest at high signal efficiency, where mild cuts were required.

Another good single variable, and part of the best pair, was the linear radial moment—a measure of the 'width' or 'girth' of the jet—constructed by adding up the p_T deposits within the jet, weighted by distance from the jet axis. It is defined as

$\begin{equation} g = \sum _{i \in \mathrm{jet}} \frac{ p_T^i }{ p_T^{\rm jet} } \, |\Delta R_i|, \end{equation} \tag{ 9 }$

where $\Delta R_i = \sqrt{ \Delta y_i^2 + \Delta \phi _i^2 }$ and where the true boost-invariant rapidity y should be used when measuring with respect to the (massive) jet axis instead of the geometric pseudorapidity η. This is a boost-invariant version of jet broadening to which it reduces in the limit of massless constituents at small angles to the jet axis.

Finding relatively pure samples was discussed by Gallicchio and Schwartz in [46]. Such samples are necessary because all intra-jet observables have distributions with significant overlap between quark and gluon jets. Combined distributions of an evenly mixed sample do not provide verification, independently for quarks and gluons, of the showering and detector simulation.

Kinematic cuts on multijet and jets+X tree-level samples were optimized to purify first quarks, and then gluons. At the 7 TeV LHC, the pp → γ + 2 jets sample can provide 98% pure quark jets with 200 GeV of transverse momentum and a cross section of 5 pb. To get 10 pb of 200 GeV jets with 90% gluon purity, the pp → 3 jets sample can be used. These samples could provide a direct evaluation of the tagging technique at all jet p_T's, verify and help improve the Monte Carlo generators, and provide a test of perturbative QCD.

3.8. ISR tagging

In [47], Krohn, Randall and Wang studied the feasibility of identifying jets from ISR on an event-by-event basis, and considered how these jets can be used in the interpretation of new physics phenomena. As a proof of principle, they investigated the pair production of new physics states which each decay into jets and missing energy, and suggested that ISR can be identified by looking for jets which are distinguished in either their p_T, rapidity or m/p_T ratio. Using these three criteria they report that they can identify ISR in di-squark (di-gluino) events roughly 40% (15%) of the time with a mis-tag rate of around 10% (15%).

The most obvious application of the technique is in reducing the combinatoric difficulties which arise in event reconstruction. However, the production of ISR is governed by the detailed properties of a hard scattering event, e.g. the flavour of the initial partons and the scale of the hard interaction, and so ISR can be used to distinguish between different production mechanisms yielding events with similar visible final states. In [47], the authors provide an example of this, showing how one can, over many samples, observe the recoil of a new-physics system against ISR and thus infer the mass scale of the system even in the presence of significant missing energy.

3.9. Multitagging for new physics

The application of one or more boosted object taggers can also be used effectively in searches for the tagged objects within new physics event samples themselves. This provides the potential to discover the Higgs boson in a way distinct from standard model search strategies, as well as characterizing the interactions of the new physics by understanding the variety of boosted objects that appear in these samples. Kribs, Martin, Roy and Spannowsky demonstrated that a slightly modified BDRS algorithm [35] was highly effective at finding the lightest supersymmetric Higgs boson in superpartner-enriched event samples, where a boosted Higgs boson appeared in the cascade from heavy supersymmetric particles decaying to light supersymmetric particles with a gravitino [48] or neutralino [49] lightest supersymmetric particle. Vector-like fermionic top partners also provide a rich final state amenable to the simultaneous application of multiple boosted object taggers. Top-partners decay t' → (bW, tZ, th) with roughly (50%, 25%, 25%) branching ratios [63]. Kribs, Martin and Roy showed that combining both the HEP top tagger [44] with the BDRS algorithm [35] could identify both boosted t, boosted h, as well as boosted W/Z (though modified filtering/subjet techniques). This was shown to be highly effective at identifying top partners with the Higgs boson in LHC simulations.

4. Jet substructure in FastJet 3

One of the aims of FastJet 3, available since October 2011 from http://fastjet.fr/, is to facilitate the use and development of jet substructure tools. A novelty relative to the 2.X series is that jets are now 'self-aware'. So whereas previously one would access a jet's constituents via its cluster sequence

vector <PseudoJet> constituents = cluster_sequence.constituents(jet);

one may now write

vector <PseudoJet> constituents = jet.constituents();

and similarly for other properties such as parents, subjets and, where relevant, areas. Aside from being more intuitive to write, this also has the advantage that when dealing with multiple cluster sequences, one no longer needs to remember which cluster sequence to use with which jet.

In FastJet 2.X, the only way in which one could create an object with substructure was by creating a cluster sequence for it. Accordingly a number of tools were written as plugins, which was not necessarily the most natural way of formulating them. In FastJet 3, an easy way of creating substructure is via the 'join' function. Suppose one has W1, W2 and b subjets obtained from some declustering procedure. Then, one may simply write

PseudoJet W = join(W1,W2);

PseudoJet top = join(W, b);

Then, not only will the top's momentum be sensible, but top.constituents() will return the concatenation of the constituents of W1, W2 and b. The high-level top substructure can be accessed with

vector〈PseudoJet〉 top_pieces = top.pieces()

where the vector contains the W and the b. The pieces() function also works on normal jets from a cluster sequence.

A number of substructure tools are distributed with FastJet 3. They are implemented as classes deriving from Transformer, as shown in figure 12. Top taggers should derive from TopTaggerBase. As an example consider the Filter class, which can be used as follows:

#include ``fastjet/tools/Filter.hh''

#include ``fastjet/Selector.hh''

double Rfilt = 0.3;

Selector selector_filter = SelectorNHardest(3);

Filter filter(Rfilt, selector_filter);

PseudoJet filtered_jet = filter(jet);

**Figure 12.** Illustration of some of the classes deriving from `Transformer` in FastJet 3. Classes shown in dark grey are abstract base classes.
Download figure:
Standard image

Note the use of the Selector (itself new to FastJet 3), which provides an easy way of specifying the cuts on the subjets and passing them to the Filter class. Simply changing the selector leads to trimming [64]

Selector selector_trimmer = SelectorPtFractionMin(0.05);

Filter trimmer(Rfilt, selector_trimmer);

PseudoJet trimmed_jet = trimmer(jet);

Transformed jets often have more internal structure than a normal jet. In the case of trimmers and filters, for example, one may want to know which subjects did not pass the selection. For this purpose, there is the structure_of<TransformerName>() function, which returns a reference to a transformer-specific class with extra information about the jet, e.g.

vector<PseudoJet> rejected_subjets =

filtered_jet.structure_of<Filter>().rejected();

Other substructure tools that are part of FastJet 3 include a Pruner [34], a MassDropTagger [35], a JHTopTagger [31] and a RestFrameNSubjettinessTagger [39]. Given the rapid evolution of the substructure field, it is not realistic for FastJet to always distribute a complete and up-to-date set of substructure tools. Accordingly, in the near future we envisage setting up a FastJet 'contrib' area to facilitate dissemination of third-party tools.

FastJet 3 also contains changes unrelated to substructure. These include a new interface to pile-up and underlying background estimation, including a new and very fast grid-based median background estimator. Also of use is a new facility for attaching arbitrary user information to jets, beyond just the user_index of FastJet 2.X. Illustrations of its usage together with HepMC and Pythia 8 are available on request from the FastJet authors. More information on these and other developments is to be found in the FastJet manual, as well as the slides and doxygen documentation on the FastJet web site [65].

5. Benchmark samples and comparisons

As proposals proliferate for LHC searches with jet substructure, the space of questions to ask about substructure techniques grows correspondingly larger. Which techniques work best for which signals? Does the answer depend on the p_T of particles in question, or whether the analysis is a search or a measurement? How do detector limitations and uncertainties affect each method? How do things change as the amount of pile-up present increases? If we study these questions with Monte Carlo event generators, do the answers depend on which generator we choose?

In this section, we will address a few of these questions. Beyond the specific results we give, we hope that the set of samples and tools we use will provide a starting point for future work. The results here can then be seen as benchmarks for comparison.

Our starting point is the set of comparisons presented in the BOOST 2010 report [1]. That analysis compared top taggers exclusively for several reasons. A large number of substructure techniques are either designed for finding tops or can be used that way; the two-step decay of a top quark is an interesting example to study; and finding top jets is immediately useful at the LHC. For the same reasons, as well as limited time and space, we will take the same approach and focus on distinguishing top jets from pure QCD background⁴⁷. Extending last year's results, we will consider new techniques, more Monte Carlo generators—including matched samples, and the effects of a simple detector model.

Samples new and old are publicly available at

5.1. New samples

This year we expand the set of benchmark samples with events generated by the SHERPA and HERWIG++ Monte Carlos. The SHERPA events allow us to examine the effects of higher-order matrix elements, which should provide a more accurate description of hard QCD radiation. The HERWIG++ events are a useful cross-check of last year's parton-shower-level results using a current-generation program⁴⁸.

5.1.1. Matched samples with SHERPA

We consider LHC proton–proton collisions at a centre-of-mass energy of 7 TeV. We have simulated the standard model processes $t\bar{t} +X$ , W/Z + X and pure QCD jets. All samples were produced with version 1.3.1 of the SHERPA event generator [67, 68]. For the electroweak gauge bosons as well as the top quarks, we consider fully hadronic decays only. For each process we generated samples of 100 000 unweighted events for p_T slices of 100 GeV between 0, ..., 1.6 TeV, resulting in an almost flat p_T distribution when combining all samples. As a slicing parameter, we used the maximum of the Monte Carlo top/anti-top p_T, the gauge-boson p_T for the W/Z channels and the leading-jet p_T for pure jet production, where parton-level jets were reconstructed according to the anti-k_T algorithm with R = 1.5. For all processes, the slicing cut is applied at the matrix-element level before parton showering is invoked.

To accurately simulate high-p_T events and to account for configurations with more than one hard parton inside a large jet, we employ SHERPA's matrix-element–parton-shower matching algorithm [69, 70]. We consider the complete sets of tree-level matrix elements with up to two (four) additional final-state partons for $t\bar{t}$ and dijet (W/Z) production from SHERPA's matrix-element generator COMIX [71]. The matrix elements of varying multiplicity are consistently combined to form inclusive samples of fully exclusive events through subsequent parton shower evolution [72] before being subjected to hadronization. The renormalization and factorization scales are dynamically determined on an event-by-event basis according to the matching algorithm [69]. For the parton separation parameter of the matching procedure, we use Q_cut = 20 GeV. The default tune of SHERPA 1.3.1 for the hadronization and UE model parameters was used.

5.1.2. HERWIG++ events

For comparison with last year's HERWIG events, we have also produced samples using HERWIG++ 2.5.0. The initial state consists of two beams of protons at a centre-of-mass energy of 7 TeV. The UE has been simulated using the UE7-2 tune, whose parameters can be found at http://projects.hepforge.org/herwig/MB_UE_tunes/input_cards/LHC-UE7-2.in. The samples have been produced using standard model matrix elements for QCD 2 → 2 scattering and $t\bar{t}$ production. Only fully hadronic final states are considered. For each process 10 000 events were produced for each p_T bin of a width of 100 GeV, from 200 GeV to 800 GeV for $t\bar{t}$ production and from 200 GeV to 1.6 TeV for QCD 2 → 2 scattering. The resulting p_T distribution when combining all samples is approximately flat. The p_T cut is applied at the level of the hard process. B hadrons have been set stable through a lifetime cut of 1 ps for consistency with the HERWIG samples used for BOOST 2010⁴⁹. All other settings have been left as default.

5.2. New tools

The BOOST 2010 report compared the Johns Hopkins top tagger [31], its CMS variant [32], an ATLAS tagger based on k_T scales [73], the so-called Thaler and Wang tagger based on a similar kinematic variable z_cell [74], and a tagger using pruned jet and subjet masses [33, 34]. Detailed descriptions of these taggers may be found in the 2010 report [1]. To this list we add a trimming-based tagger [64] as well as the HEP and N-subjettiness taggers described in section 3.

The HEP tagger used in this section is a somewhat different implementation than the one described in section 3, essentially corresponding to the tagger described in [44].⁵⁰ Rather than applying the subjet mass criteria of [59], we simply identify the W as the pair of the three identified subjets with mass closest to m_W, and measure its mass and helicity angle as in the JH and CMS taggers. Both schemes make use of the mass and decay kinematics of the $t\rightarrow Wb \rightarrow q\bar{q}b$ system, so after optimizing parameters and measurement cuts it is not clear how much of an improvement the new symmetric mass cuts give. We leave this question, just as possible optimizations of the tagging cuts [61], to detailed joint experimental and theoretical studies.

For the N-subjettiness tagger we find subjet axes via 'one-pass minimization' of k_T axes. That is, three subjets are identified via the exclusive k_T algorithm, and then a single iteration of the minimization procedure described in [56] is applied. The parameter β is set to 1.0.

5.3. General analysis description

As for the 2010 report, all analyses begin with anti-k_T jets with R = 1.0, p_T > 200 GeV.⁵¹ For the Hopkins, CMS, HEP and trimmed taggers, jets are first reclustered with Cambridge–Aachen, which is also the algorithm used in the pruning stage of the pruned analysis. For the ATLAS and Thaler and Wang taggers, jets are reclustered with k_T before measuring substructure. N-subjettiness is a jet shape, so no reclustering is necessary.

For each tagger, we identify the set of input parameters to be tuned as well as the set of output variables to cut on. These are given in table 1. The Hopkins, CMS and HEP taggers all have a built-in method of identifying the W subjet. For fairness, in the other taggers we include a W-identifying step: the jet (after grooming for pruning/trimming) is unclustered to three subjets and the pair with the minimum pairwise mass is merged to form the W. This mimics the W-finding step in the CMS tagger.

Table 1. Input parameters and output variables for each top tagging analysis. m⁰_J is the jet mass found by the original algorithm; m_J is after any substructure modification. For the HEP tagger, μ is the mass drop threshold and m^max_subjet is the subjet mass where declustering stops.

Analysis	Input parameters	Output variables
ATLAS	N/A	$\big\{ m_J^0, m{_W^{k_{T}}}, z_{\rm cut}^1, z_{\rm cut}^2\big\}$
CMS	{δ_p, δ_r}	{m⁰_J, m_W, cos θ_h}
Johns Hopkins	{δ_p, δ_r}	{m_J, m_W, cos θ_h}
HEP	{μ, m^max_subjet}	{m_J, m_W, cos θ_h}
N-subjettiness	N/A	{m⁰_J, τ₃/τ₂}
Pruning	{z_cut, R_cut}	{m_J, m_W}
Thaler and Wang	N/A	$\left\lbrace m_J^0, m_W^{k_{T}}, z_{\rm cell} \right\rbrace$
Trimming	{R_sub, p^frac_T}	{m_J, m_W}

The optimal output cuts for any given input variables and signal efficiency are found using TMVA [75]; input variables are scanned with a Python script and the lowest mis-tag rate is kept for each bin in signal efficiency. To avoid pathological optimizations where TMVA finds W jets instead of top jets, we require that the jet mass cut have a lower bound of at least 120 GeV. The statistical noise from fluctuations in the testing and training samples is significant, so in each efficiency plot a five-point moving average is applied.

One might not always want to re-optimize the input parameters of a tagger for each p_T range, signal efficiency, etc. To give a sense of how well each tagger performs without scanning its parameters, we repeat the efficiency scans with a single set of input parameters. For each Monte Carlo sample, we take the parameters found at the 35% signal efficiency point, in the 500–600 GeV p_T bin, as this common point. These parameters are given in table 2.

Table 2. Input parameters for each tagger, found by scanning over reasonable values and taking the lowest mis-tag rate for a 35% signal efficiency, in the 500–600 GeV p_T bin. These parameters are then used in all plots where input parameters are not varied. The ATLAS, N-subjettiness, and Thaler and Wang taggers are not shown as they do not have input parameters.

	Input parameters
Analysis	HERWIG 6.5	HERWIG++	SHERPA
CMS	{δ_p, δ_r} = {0.028, 0.13}	{0.028, 0.20}	{0.056, 0.47}
HEP	{μ, m^max_subjet} = {0.93, 54}	{1.0, 46}	{0.87, 10}
Johns Hopkins	{δ_p, δ_r} = {0.083, 0.19}	{0.056, 0.23}	{0.083, 0.27}
Pruning	{z_cut, R_cut} = {0.056, 0.33}	{0.056, 0.33}	{0.056, 0.22}
Trimming	{p^frac_T, R_sub} = {0.050, 0.19}	{0.050, 0.28}	{0.083, 0.28}

The analyses are implemented in SpartyJet [76, 77], using tools based on FastJet [78, 65]. All are available either as internal FastJet tools in version 3.0.0 or as add-ons available on the FastJet tools site at http://fastjet.fr/tools.html. The Python scripts used to define the analyses, perform the efficiency scans and make the plots, along with instructions, can be found on the BOOST 2011 website, http://boost2011.org.

5.4. Results

We begin with a simple extension of the 2010 results (figure 3 in [1]). In figure 13, we show mis-tag versus efficiency curves for the expanded set of top taggers, as tested on last year's HERWIG 6.5 samples. We can see that the groomed taggers (pruning and trimming plus mass cuts), 'hybrid' taggers that incorporate grooming as well as measurement (CMS, HEP, JH) and the N-subjettiness jet shape outperform the ATLAS and Thaler and Wang taggers, which use subjet measurements exclusively. The groomed, hybrid and jet shape taggers are comparable in performance, with the groomed and N-subjettiness taggers performing best at low signal efficiency. In the lower set of plots, where we do not scan over input parameters, N-subjettiness performs noticeably better over most of the efficiency range.

**Figure 13.** Mis-tag versus efficiency for several top tagging methods, as tested on HERWIG 6.5 $t\bar{t}$ and dijet samples. For figures (a) and (b), the input parameters are optimized for each efficiency point. The input parameters for the unoptimized scans are taken from the 35% efficiency point in figure (b).
Download figure:
Standard image

To complement the HERWIG 6.5 samples from the last year, we have generated equivalent samples in HERWIG++ as a cross-check. The results are shown in figure 14. The results are broadly similar to those for HERWIG 6.5, but the spread between different methods is reduced. With or without input optimization, there is no clear 'winner'.

**Figure 14.** Mis-tag versus efficiency for several top tagging methods, as tested on HERWIG++ $t\bar{t}$ and dijet samples. For figures (a) and (b), the input parameters are optimized for each efficiency point. The input parameters for the unoptimized scans are taken from the 35% efficiency point in figure (b).
Download figure:
Standard image

Both Monte Carlo samples used last year were generated via 2 → 2 processes interfaced with a parton shower; this year we add matched samples from SHERPA to explore the effects of adding higher-order matrix elements. We expect that these higher-order effects should lead to a more precise description of the hard, wide-angle radiation that forms the most challenging background to heavy particle jets.

In figure 15, we repeat the analyses of figure 13 on matched samples from SHERPA. The p_T cuts are the same. Comparing to figure 13 we see that the different multijet modelling predicts substantially worse performance for all taggers, presumably due to enhanced hard substructure in the background. The relative performance of different taggers, though, is similar to the unmatched samples. Other than ATLAS and Thaler and Wang, which continue to be less effective, the taggers are extremely similar in performance. When we do not scan input parameters, N-subjettiness narrowly outperforms the hybrid taggers.

The plots thus far represent events with no detector simulation. How do the results change if we add detector resolution effects? In figure 16, we repeat the analyses, but acting on events run through a simple calorimeter simulation provided by Peter Loch, presented at the BOOST workshop. This simulation smears energy according to a radial profile based on the performance of the ATLAS detector, and then groups energy deposits into calorimeter cells. Each calorimeter cell is then treated as a massless particle with the direction and total energy of that cell. The resulting events provide a crude proxy for the real calorimeter output and give us a way to estimate detector effects on substructure performance. This simulation has yet to be extensively studied, so the results presented here should be considered preliminary and interpreted with caution.

The detector simulation code is available at the author's website, http://atlas.physics.arizona.edu/∼loch, and is also provided with SpartyJet as the RadialSmearingTool tool. You can use it in a SpartyJet analysis via:

builder = SJ.JetBuilder()

builder.add_jetTool_input(SJ.RadialSmearingTool())

<add analyses and run...>

Comparing figures 13–15 with 16, we can see that including realistic detector resolution generally degrades the best achievable performance and changes how algorithms compare to each other. Compared to other taggers, N-subjettiness does worse. For each MC sample, the spread between taggers is still small, with the ATLAS and Thaler and Wang taggers still being outperformed. Considering the right-hand plots in figure 16, we can see that in some cases taggers appear to do better after detector simulation, especially at low signal efficiency. This effect is at least partly due to statistical noise in the efficiency scans, but warrants further study.

The results we have presented in this paper, while certainly not providing the last word in boosted object tagging, suggest some conclusions. By any metric, methods that use only subjet kinematic information, like the ATLAS and Thaler and Wang taggers, are outperformed by groomed, hybrid and jet shape taggers. Differences between the rest of the taggers are largely of similar magnitude to differences between Monte Carlo samples and before and after detector simulation: a more careful study would be needed to definitively say that some are better than others. In these studies, the N-subjettiness tagger does particularly well when compared to other taggers with fixed input parameters, but when optimization is included performances are extremely similar. N-subjettiness also appears to be particularly susceptible to degradation due to detector resolution.

Beyond the limited comparisons given in this section, we hope that the software tools we have developed to make them will facilitate further study. Such a study is clearly warranted to seriously answer the questions posed at the beginning of this section. Further phenomenological exploration of the broad space of substructure techniques will help lead to a more holistic understanding of substructure physics. A Monte Carlo study will complement developments in jet calculations, a subject to which we now turn.

6. What is calculable and measurable in 2012?

Since the start of the LHC physics program, the ATLAS and CMS experiments have made substantial progress in measuring jet properties. Interest in using jet properties for new physics searches has placed a large emphasis on understanding jets more thoroughly than at previous experiments. The improved resolution of the detectors has made this possible, and current progress is very promising for the future of jet physics in the age of the LHC. This has naturally excited the interest of the theory community, whose continued work on jet physics will confront data in the coming years. This section is devoted to a summary of theoretical work related to jet substructure, focusing on observables whose properties can be calculated and measured in the near term.

Two complementary tools are used to understand jet properties: analytic calculation and Monte Carlo simulation. Analytic calculations give an inclusive picture of a jet or event, often projecting onto single observables. MC generators offer totally exclusive events, on which any number of observables can be measured. These programs can simulate effects that are challenging to integrate into a calculation, such as the UE, pile-up and hadronization. State-of-the-art calculations have perturbative precision that often exceeds MC generators (either in fixed order or resummed terms), but these calculations are only feasible for a limited set of observables. Importantly, improvements in analytic calculations help further development and validation of MC generators. Improvements in calculations will also play a key role in interpreting jet measurements at the LHC.

Calculations of jet properties typically focus on single jet observables, also known as jet shapes. These variables are inclusive over the jet constituents, measuring a function of the final state momenta. By contrast, most jet substructure methods involve more complicated partitioning of the jet constituents and observables of localized structure in the jet. While many early studies used subjet information to identify the hadronic decays of heavy particles, recent studies have explored the efficacy of jet shapes in boosted heavy object searches (e.g. [38, 41]). Observables derived from the N-subjettiness jet shapes have been shown to be very competitive in top tagging (see also section 5). This gives hope that calculationally simpler observables may be useful in new physics searches.

In this section, we will discuss the calculation of jet observables, focusing on jet mass, which provides an example of a jet observable that can be usefully calculated. We present aspects of the calculation and discuss improvements that can be made with future work. We then discuss other observables that are useful for measurement and point out where calculations may soon be available.

6.1. Jet mass as an example

Jet mass is the simplest and most phenomenologically useful jet shape. An accurate measurement depends on a good understanding of the jet energy scale and splash-in contributions (pile-up, the UE and ISR). The importance of the measurement and the accessibility of the calculation mean that a comparison between the two can be very useful.

The measurement of a single jet mass is still a complex observable. The calculation is performed for an exclusive jet multiplicity, meaning there must be a veto on additional jets. Consider the cross section for dijet events from the anti-k_T algorithm, differential in both jet masses, d²σ/dm₁dm₂. We must veto on additional jets by placing a p_T cut on candidate jets found by the algorithm. Additionally, the perturbative calculation is only valid for high-p_T jets, meaning a restriction of the phase space for the hard scattering is required.

There are two main regimes in the jet mass distribution. The small mass regime, m_J ≪ p_TJ, contains large logs of m_J/p_TJ. Resummation is required to tame these large logs and restore accuracy. For larger masses, a fixed order calculation suffices to describe the distribution. Next-to-leading order is currently state-of-the-art for a wide range of processes, and can be simulated by Monte Carlo for many processes.

The small jet mass limit is the focus of the most effort in the calculation. Resummation must be performed in this regime, which requires proving certain properties of the cross section. In particular, some kind of factorization must be proven that relates the all-orders, large-log terms to the fixed order terms.

The additional factorization required for resummation of jet mass takes two basic forms, depending on what tools are used. Both perturbative QCD and soft-collinear effective theory (SCET) can be used to resum the jet mass distribution, and the essential physics that each uses is the same. However, the language each uses is distinct, and their comparison would be useful. Here we give an outline of factorization in SCET; some features of resummation in SCET and QCD are compared in the next subsection.

In the SCET formulation of the cross section, an effective theory framework is used [79–82]. Hard modes that produce perturbative emissions of energetic partons are integrated out of QCD. The remaining soft and collinear modes have momenta parameterized by a power counting parameter λ ≪ 1. This power counting parameter is typically determined by the kinematics of the final state; for the jet mass example, λ = m_J/p_TJ. The soft and collinear fields decouple at leading power in the SCET Lagrangian [82], decoupling the N-jet operators used to describe the energetic jets. Factorization also requires showing that the measurement (such as the jet mass with the anti-k_T algorithm and a p_T cut on additional jets) factorizes as well. This means the phase space constraints on soft and collinear modes must separate. This can be shown formally by providing an operator formulation of the phase space constraints [83, 84] and can also be seen at a physical level by a power counting analysis of the measurement in terms of the modes in SCET [85]. This leads to a schematic factorization theorem of the form, for N jets:

$\begin{equation} \sigma = H_N \left[ B_a \times B_b \times J_1 \times \cdots \times J_N \right] \otimes S_N . \end{equation} \tag{ 10 }$

The hard function H_N is the matching coefficient onto SCET and arises from the short-distance interaction that produces the well-separated, energetic partons. The jet functions J_i and beam functions B_{a, b} come from the final and initial state collinear evolution of the hard partons [86]. The soft function S_N comes from the global soft gluon radiation across the event. Each of these functions can be separately calculated in perturbation theory, and renormalization group evolution is used to sum the large logarithms that arise in the cross section. This framework formalizes much of the resummation machinery used in perturbative QCD.

6.1.1. Facets of the calculation

We first discuss the perturbative contributions to the cross section, in particular, the resummation. We focus on jets in the anti-k_T algorithm, since jets defined in other algorithms are complicated by the non-trivial role of clustering between soft gluons. Soft gluon clustering in these algorithms means that the soft phase space does not factorize into single particle phase space for each soft gluon, affecting the subleading logarithms in the perturbative series starting at $\mathcal {O}(\alpha _s^2 \ln ^2)$ [87, 88]. In the anti-k_T algorithm, the phase space for a single soft particle is not affected by the presence of other soft particles.

The leading logs in the cross section are given by the exponentiation of single soft gluon emission in QCD, and single-log and running-coupling corrections can be included. Equivalently in SCET, the leading logs are specified by the cusp anomalous dimensions of the one-loop jet and soft functions, with the single-log terms coming from the non-cusp anomalous dimensions. Running-coupling effects arise in renormalization group evolution. In the limit of small jet radius (R ≪ 1), the leading logs have the form

$\begin{equation} \int _0^m \frac{\mathrm{d}\sigma }{\mathrm{d}m} \sim \sigma _0 \exp \left( - \frac{\alpha _s C_F}{2\pi } \ln ^2 \frac{m}{p_T R^2} \right)\!. \end{equation} \tag{ 11 }$

In principle, subleading logarithms can be straightforwardly computed and resummed. The extension of resummation techniques beyond next-to-leading-log (NLL) is natural using the SCET framework. However, general jet shapes suffer from non-global logarithms that contribute at NLL [89]. Nearly every jet measurement made at the LHC will have non-global observables, and their resummation is more challenging and can only be performed in the leading-colour approximation [89–92]. The physics of the leading non-global logs at fixed order (α²_s) has been well understood in QCD, and their resummation calculated numerically in the large-N_C limit. Recent explorations in SCET have provided additional insights into their structure and origin, providing a possible new path to renormalization group-based resummation [93–95]. Non-global logs are reduced by soft gluon clustering, and so are reduced for algorithms other than anti-k_T [90, 95].

The small-R approximation is advantageous because a simple picture of non-global logs emerges. Each jet has its own non-global correction factor, and cross talk between jets is relegated to a power-suppressed contribution in R. The non-global factor for a jet was calculated in the large-N_c limit in [89] and used for jet mass in [92]. The large-N_c approximation will receive corrections from subleading colour terms which will modify the non-global log contribution at around the 10% level. Since non-global logs contribute at NLL and start at $\mathcal {O}(\alpha _s^2 \ln ^2)$ , subleading terms in colour ought not to matter phenomenologically, as was the case for deep inelastic scattering event shapes [96]. Thus, in the small-R approximation, the cross section is described by a resummed factor for each jet dressed by a calculated non-global factor, and one can include jet mass measurements for each jet [92].

Beyond the small-R approximation, one can account for the corrections by including terms that vanish as powers of R. In QCD, one computes soft gluon emission from all of the colour dipoles in the event, including the incoming partons. Expanding the result as a power series in R, one finds that the terms of relative order R² make a numerical impact (at around the 10% level even for R = 0.4), but terms scaling as R⁴ do not matter even for R = 1. Similarly, the non-global factor must be corrected beyond the small-R limit. In SCET, the sum over soft gluon emission from colour dipoles is given in the soft function, where one expands in

$\begin{equation} t_{ij}^{-1} \equiv \frac{\tan ^2 R/2}{\tan ^2 \theta _{ij}/2} \,, \end{equation} \tag{ 12 }$

with θ_ij the angle between jets i and j [97, 54]. The requirement 1/t_ij ≪ 1 forces the jets to be well separated, which is needed to factorize the cross section. When jets become close together, additional modes in SCET are required to separate the scale dependence on the small dijet invariant mass [98]. These modes parameterize soft radiation between the nearby jets, and SCET can be formally extended to SCET₊, which includes these modes.

Therefore, for a wide range of jet mass measurements, it should be possible to obtain a resummed result that accounts for non-global logarithms and goes beyond the small-R approximation to sufficient accuracy so as to be phenomenologically useful.

A resummed calculation describes the dominant shape of the distribution. However, many other effects must be taken into account. The shape of the large-mass tail of the distribution is described well by fixed-order QCD. The resummed and fixed-order calculations can be combined in a simple way:

$\begin{equation} \frac{\mathrm{d}\sigma }{\mathrm{d}m} = \frac{\mathrm{d}\sigma }{\mathrm{d}m}\bigg |_{{\rm resum}} - \frac{\mathrm{d}\sigma }{\mathrm{d}m}\bigg |_{{\rm resum}}^{{\rm FO exp}} + \frac{\mathrm{d}\sigma }{\mathrm{d}m}\bigg |_{{\rm FO}} . \end{equation} \tag{ 13 }$

The resummed and fixed order distributions are added, and the fixed order expansion of the resummed distribution is subtracted. In the small-mass limit, the two fixed order functions cancel up to power corrections in m/p_T, and the resummed distribution correctly describes the distribution. In the large-mass tail, the resummed distributions (all orders and fixed-order expanded) cancel up to the fixed order in α_s of the expansion, so the fixed-order QCD distribution describes the shape up to higher-order corrections.

Hadronization, pile-up and UE contributions must also be included in a calculation of the jet mass. These non-perturbative contributions affect the small-mass regime of the distribution, and are needed for the calculation to be compared to data. While important, these effects are challenging to include, as no framework exists to systemically include them and they require a more detailed understanding of the data.

Basic models have been formulated to quantify contributions from the UE and pile-up [99, 14, 6, 100]. As jet shapes are measured and calculated more extensively, the comparison between the two is a good testing ground to study these models and fold them into calculations. In the case of 2-jet event shapes in e⁺e⁻ collisions, the correction to the first moment of the event shape from non-perturbative effects has been shown to be related to a universal, measurable non-perturbative matrix element for a wide class of event shapes [101, 102]. In the SCET framework, model soft functions that parameterize non-perturbative effects have also been introduced [103]. Further study is warranted to understand how non-perturbative effects in a hadron collider environment can be introduced, and whether similar universality relations that exist in e⁺e⁻ collisions apply in the hadron collider case.

6.2. Measurable and calculable observables

A wide array of observables have been defined to study the structure of jets. While the jet mass is phenomenologically the most important, it is important to prioritize which other observables should be measured in the near term at the LHC. There are several key reasons to measure jet substructure. The comparison to precision QCD calculations is obviously attractive, and understanding QCD backgrounds helps characterize observables relevant for new physics searches. Additionally, jet substructure measurements can further understanding of detector effects and improve unfolding techniques. This can have an impact beyond jet substructure measurements, such as a reduction in the jet energy scale uncertainty.

There are four general classes of observables used to measure jet substructure and jetty events:

jet mass
jet shapes
groomed jet observables
global event shapes.

Although jet mass is a jet shape, it is central enough to jet physics that special consideration should be given to it as a first measurement. Jet grooming methods are generic substructure methods that remove isolated soft radiation in jets. Three grooming methods have been proposed: filtering [35], pruning [33] and trimming [64]. The measurement of properties of groomed jets can provide data about the structure of the jet that is difficult to capture with simpler observables such as jet shapes. Finally, global event shapes can be measured without clustering into jets. Several event shapes are useful in characterizing soft physics that can contribute to the jet substructure.

Below we discuss each class of observables, identifying key observables in each category. These are measurements that theorists find informative, and will help in the study of the jet substructure. Many of these observables may be calculated in the near term, and their measurement can provide motivation for a detailed comparison.

6.2.1. Jet mass measurements

There are several possible channels in which a jet mass distribution can be measured. These include pure QCD events with inclusive and exclusive jet cross sections, jets produced in association with electroweak bosons and light quark jets produced in association with top quarks. Because the calculation of a jet mass distribution depends on the precise event topologies studied, to compare theory and experiment it is important to carefully choose the measurements. Several key topologies will provide important jet mass measurements. The three we focus on are γ/W/Z + jet, dijet and multijet events. In each topology, there are particular jet mass observables of interest, specialized to the final state.

γ/W/Z + jet events. These have only a single jet in the final state (assuming the W or Z decays leptonically) and provide the cleanest jet measurements. This class of events is an important reference sample for jet energy scale calibration, and can be used to measure fake rates for certain substructure tagging algorithms. They are a source of jet + lepton and jet + MET final states, and are a background to many new physics searches. Theoretically, they can be used as precision tests of QCD and provide a more calculable environment due to the single jet multiplicity.The observable of interest is the differential jet mass distribution, dσ/dm_J, in bins of jet p_T. A jet veto must be imposed to obtain an exclusive sample of single jet events. A hard veto on additional jets, such as $p_{T_J} < 0.1\, p_T^{\gamma }$ , can be useful for jet energy scale calibration. The jet mass distribution in this case is of interest because of the restriction on radiation outside the singe jet. For any value of the jet veto, if the jet mass distribution is finely binned in p_T (several bins instead of one or two), the impact of various components of the calculation can be compared across p_T.
Dijet events. Dijet events with high-p_T jets have a large cross section and make up a significant fraction of the pure QCD events containing high-p_T jets at the LHC. These events are backgrounds to many new physics searches, and their large cross section means that they must be precisely understood.Because there are multiple jets in the event, different jet mass measurements can be made. The doubly differential distribution,
$\begin{equation} \frac{\mathrm{d}^2 \sigma }{\mathrm{d}m_1 \mathrm{d}m_2} , \end{equation} \tag{ 14 }$
with p_T1 > p_T2, is interesting because correlations between the jet masses can be studied. This was explored in the CDF analysis of jet mass [104], where the ratio of event numbers in different bins in the double differential distribution was measured. Define two single-jet mass bins b₁ and b₂ (in the CDF study b₁ = (30, 50) GeV and b₂ = (130, 210) GeV). These bins define four regions:
$\begin{eqnarray} A&: m_1 \in b_1 , \quad m_2 \in b_1 , \nonumber \\ B&: m_1 \in b_2 , \quad m_2 \in b_1 , \nonumber \\ C&: m_1 \in b_1 , \quad m_2 \in b_2 , \\ D&: m_1 \in b_2 , \quad m_2 \in b_2 . \nonumber \end{eqnarray} \tag{ 15 }$
Then, define the ratio R_mass to be
$\begin{equation} R_{{\rm mass}} \equiv \frac{N_B N_C}{N_A N_D} . \end{equation} \tag{ 16 }$
If the jet mass distributions were independent, then R_mass = 1. Correlations between these distributions exist because of soft gluon exchange, although the expected value of R is not known. This observable is of interest due to the discrepancy between Monte Carlo simulation and data, and a theory calculation could help resolve the differences.In addition to the doubly differential mass distribution, the projection onto a single mass is theoretically accessible. Common choices are the heavy jet mass and the sum of jet masses, which have been studied at e⁺e⁻ colliders. The hardest jet's mass is harder to predict theoretically because the relative p_T in dijet events is affected primarily by soft physics.
Multijet events. Events with more than two jets provide a more complex background. The colour correlations between jets are enhanced as the detector becomes more crowded, and these events typically have a hierarchy of jet p_T's. Multijet events are more important at the LHC due to the increased phase space for high-p_T parton production, and significant attention has been paid to understanding them better theoretically and experimentally. Multijet events not only present a theoretical challenge to describe but also offer the opportunity to learn about effects such as colour correlations that figure less prominently in simpler events.In multijet events, placing a hard p_T cut on the leading three jets will give a multijet event sample that allows one to study the structure of the hardest jet as the kinematics of the events change. In particular, the jet mass distribution of the hardest jet can be measured in bins of the ΔR separation to the nearest jet. As jets become closer together, inter-jet soft radiation can have a large effect on the mass distribution. This can be predicted theoretically, and the result is compared with the Monte Carlo.

Because the jet mass is sensitive to the environment of the event, it is important to examine the effect of event-wide characteristics on the mass. As pile-up increases, the jet mass can serve as a probe of the size of the pile-up and the effectiveness of subtraction methods (see section 2.3.1).

The rapidities of the hard jets will affect the jet mass distribution for a number of reasons. First, large-rapidity jets imply a large momentum asymmetry between the initial partons, meaning that one carries a large fraction of the proton momentum (x ∼ 0.1). Since the quark parton distribution functions dominate in this regime, the fraction of quark jets will be enhanced relative to central events. Second, the shape of the parton distribution functions implies that there is likely to be more ISR, since they are less steep in the large x region. Coupled with the fact that the large-rapidity jets are closer to the beams, it is interesting to see the dependence of jet mass on the jet rapidity.

Finally, the total transverse momentum in the event, H_T, is a measure of the activity of the event. Soft jets that are vetoed contribute to H_T and will affect the jet mass, but still fit into the same event selection criteria. The contribution of UE and pile-up will similarly affect the jet mass distribution, and H_T is a simple proxy for the size of these observables.

6.2.2. Jet shape measurements

Jet shapes measure the radiation pattern within a jet by calculating a function of the momenta in the jet. The jet mass is a special jet shape, but there are other important jet shapes whose measurement would be very informative. We discuss several. Both N-subjettiness and planar flow are calculable, as they take moments of the momenta in the jet.

N-subjettiness is a useful variable for jet substructure [38, 56]. N-subjettiness is defined with respect to a set of subjet axes q_i:
$\begin{equation} \tau _N^{(\beta )} = \frac{\sum _k p_{T,k} (\min \lbrace \Delta R_{1,k},\Delta R_{2,k},\ldots , \Delta R_{N,k} \rbrace )^{\beta }}{\sum _k p_{T,k} (R_0)^{\beta }}, \end{equation} \tag{ 17 }$
where β > 0 is a real parameter, R₀ is the radius of the jet and ΔR_{i, k} is the angular separation between particle k and subjet axis q_i. The shape τ^(β)_N probes how 'N-subjet-like' a jet is, and provides an effective veto on additional subjets. Because adding more axes only lowers the value of τ^(β)_N, N-subjettiness cannot provide a veto against fewer subjets.The case N = 1, β = 1 is also known as girth, broadening, or width; β = 2 is related to the jet mass for small masses. These are the most common choices for β, and it would be interesting to compare them. Additionally, other choices of β can be informative, as they will probe the structure of the jet as a function of the angle to the 1-subjettiness axis. τ^(β)₁ is similar to jet angularities, a jet shaped defined for e⁺e⁻ events [53, 105, 97, 54]. The distribution in jet angularity τ_a was calculated to next-to-leading log accuracy in τ_a [54], and the theoretical tools exist to extend this calculation to the distribution of τ^(β)₁ at the LHC.2- and 3-subjettiness are also of interest. These can probe jets with multiple subjets, and the substructure of these jets will more closely resemble a boosted heavy object decay. (Using τ₃/τ₂ to identify top jets was studied in section 5.)
Planar flow is an interesting shape variable that was proposed to distinguish top and QCD jets [106, 74]. Planar flow is defined in terms of the eigenvalues of a 2-by-2 matrix, I_w, constructed from the components of momenta perpendicular to the jet axis:
$\begin{equation} I_w^{kl} = \frac{1}{m_J} \sum _{i\in J} \frac{p_i^{\perp k} p_i^{\perp l}}{E_i} \,, \end{equation} \tag{ 18 }$
where p^⊥k is the kth component of p perpendicular to the jet axes (with respect to a pair of basis axes in the perpendicular plane). From I_w, planar flow is defined as
$\begin{equation} Pf = \frac{4\, {\rm det}\, {I_w}}{{\rm tr}(I_w)^2} . \end{equation} \tag{ 19 }$
If the radiation in the jet is clustered along a line in the perpendicular plane going through the jet axis, then Pf will be close to 0. This will be the case for QCD jets and jets with 2-body kinematics, such as Higgs or W jets. For jets with 3-body (or more) kinematics, such as the top, Pf will be $\mathcal {O}(1)$ .
Subjet and track multiplicities have been shown to be effective discriminators of quark and gluon jets [45, 46]. Subjets can be found by reclustering the jet with a small radius, and finer resolution of the subjet size can improve discrimination between jets from different sources. The study of subjet multiplicity offers the chance to study the sensitivity to p_T or energy cutoffs placed on jet constituents or subjets. Does the energy resolution for subjets significantly improve with a higher energy cutoff on jet constituents? Does the multiplicity become sensitive to pile-up and the UE if the cutoff is taken too low? Similarly, the charged track multiplicity is a useful observable whose power depends on the angular resolution for track identification as well as the performance over the range of charged particle momenta in the jet.

6.2.3. Groomed jet measurements

Many new physics searches can utilize jet grooming algorithms. Filtering, pruning and trimming are jet grooming algorithms that modify the substructure of the jet to remove contributions from the UE and pile-up [35, 33, 34, 64]. These algorithms can be applied to a wide range of channels, and comparing the performance of the three algorithms can be instructive (cf section 6 of [1]). Because each algorithm operates on the substructure differently, it is useful to compare all of them. Experimental studies can quantify the performance of each algorithm and determine which is best for a given application.

Each grooming algorithm operates on a single jet and produces a groomed jet. Therefore, any observable can be measured on the original jet and the groomed jet. The distributions before and after grooming are always of interest to understand the effects of grooming, and grooming algorithms can be used to study a variety of jet properties. The most interesting observables and objects to study with jet grooming are as follows.

Jet mass.
Pile-up and the UE are a large motivation for jet grooming methods. The performance with respect to pile-up for some jet grooming algorithms is currently being studied, and is quantified by studying tracks from secondary vertices. Quantifying the performance with respect to the UE is more challenging, but measures exist [100].
If N-subjettiness is measured, then the effect of the grooming algorithms will be interesting to study. Grooming algorithms are designed to remove isolated soft radiation that can come from the UE or pile-up and contribute to poor mass resolution in the jet. This radiation will also have a large effect on N-subjettiness. Studying the N-subjettiness distribution (for N = 1, 2, 3) will give insight to grooming methods using an observable where it is simple to visualize jets with multiple energetic subjets.
Given the good tracking and angular resolution at the LHC, the experiments can study the performance of grooming methods using only charged tracks.

6.2.4. Global event shapes

Finally, event shapes are also of interest. While these are not direct jet substructure observables, they have a strong connection to jet substructure. Initial studies of jet substructure were driven by Monte Carlo studies, but more recently the theoretical foundations of jet substructure have been explored. Event shapes have traditionally been a testing ground for theoretical tools, and jet shapes are a by-product of these studies. N-subjettiness was built in analogy to N-jettiness, an event shape used to veto against additional jets in an event [50, 107, 108]. Like N-subjettiness, N-jettiness measures how ⩽N-jet-like an event is. Measurements of event shapes can be compared directly to calculations, and the comparison is useful in building theoretical tools to calculate jet substructure properties. These observables are especially amenable to calculation, and in fact distributions of these observables have already been calculated in a variety of applications.

Observables of interest include the following.

0-jettiness, also known as beam thrust, is a measure of total hadronic activity away from the beams. In events without energetic jets, this can be used to study the UE and assist in tuning the Monte Carlo. In new physics searches with a veto on any jets (e.g. h → WW), the beam thrust distribution can be used to understand the theory uncertainties on the jet veto.
1-jettiness can be similarly implemented on events with a single jet, and can be used to veto against additional jets for an exclusive 1-jet sample. An ample background of γ/W/Z + jet exists in which we study the 1-jettiness distribution.
A number of event shapes have been proposed for study in dijet-type events in [109] and examined in detail in [110], including observables related to thrust, broadening and jet rates (Y₂₃). Thrust-like observables show enhanced sensitivity to the UE, possibly allowing for powerful constraints on its properties, while Y₂₃ provides a more direct sensitivity to perturbative effects. The distribution of Y₂₃ has also been studied in [73]. The above observables have all been defined with particles as inputs. One can also study event shapes with jets as inputs and in one such study [111], Monte Carlos with matched matrix elements covering a range of jet multiplicities gave a worse description of event shape data than parton shower Monte Carlos (which only include 2 → 2 matrix elements matched directly onto the parton shower). This is troubling, and comparison to more event shapes can provide a way to differentiate different effects in the Monte Carlo simulations that may be contributing to this issue.

7. Concluding remarks

The field of jet substructure has made remarkable progress since its inception more than 20 years ago [112]. In this paper, we have endeavoured to present a contemporary overview of jet substructure theory and practice. The experimental results published so far are encouraging and promising. They suggest that in the coming years, jet substructure will play an important role in both standard model measurements and beyond-the-standard-model searches. New theoretical ideas for using and understanding jets continue to be developed. As substructure techniques proliferate, a unified understanding will be vital. The theoretical progress and goals discussed above are important steps along this path. Meanwhile, software tools like FastJet and SpartyJet provide a complementary method for understanding and exploring jet substructure techniques. We have given some examples of such exploration in this paper, and we hope that the benchmark samples, tools and comparisons we are making available will be useful tools in advancing jet substructure in 2012 and beyond.

Acknowledgments

We thank the Princeton Center for Theoretical Science for hosting and providing financial support for BOOST 2011. SH, CKV and JRW were supported in part by the US National Science Foundation, grants NSF-PHY-0969510 and NSF-PHY-0705682, the LHC Theory Initiative, Jon Bagger, PI. SH's work was supported by the US Department of Energy under contract DE-AC02-76SF00515. JRW was supported in part by the US Department of Energy under contract DE-AC02-05CH11231. IWS was supported in part by the Office of Nuclear Physics of the U.S. Department of Energy under the grant DE-FG02-94ER40818. GPS was supported in part by grants ANR-09-BLAN-0060 from the French Agence Nationale de la Recherche and PITN-GA-2010-264564 from the European Commission. Computing resources were provided by the Universities of Washington and Louisville, supported by US National Science Foundation contract ARRA-NSF-0959141 and US Department of Energy contract DE-FG02-98ER41089.