Abstract
Recent advances in the study of networked systems have highlighted that our interconnected world is composed of networks that are coupled to each other through different ‘layers’ that each represent one of many possible subsystems or types of interactions. Nevertheless, it is traditional to aggregate multilayer networks into a single weighted network in order to take advantage of existing tools. This is admittedly convenient, but it is also extremely problematic, as important information can be lost as a result. It is therefore important to develop multilayer generalizations of network concepts. In this paper, we analyze triadic relations and generalize the idea of transitivity to multiplex networks. By focusing on triadic relations, which yield the simplest type of transitivity, we generalize the concept and computation of clustering coefficients to multiplex networks. We show how the layered structure of such networks introduces a new degree of freedom that has a fundamental effect on transitivity. We compute multiplex clustering coefficients for several real multiplex networks and illustrate why one must take great care when generalizing standard network concepts to multiplex networks. We also derive analytical expressions for our clustering coefficients for ensemble averages of networks in a family of random multiplex networks. Our analysis illustrates that social networks have a strong tendency to promote redundancy by closing triads at every layer and that they thereby have a different type of multiplex transitivity from transportation networks, which do not exhibit such a tendency. These insights are invisible if one only studies aggregated networks.

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
1. Introduction
The quantitative study of networks is fundamental for investigations of complex systems throughout the biological, social, information, engineering, and physical sciences [1–3]. The broad applicability of networks, and their success in providing insights into the structure and function of both natural and designed systems, has generated considerable excitement across myriad scientific disciplines. Numerous tools have been developed to study networks, and the realization that several common features arise in a diverse variety of networks has facilitated the development of theoretical tools to study them. For example, many networks constructed from empirical data have heavy-tailed degree distributions, satisfy the small-world property, and/or possess modular structures. Such structural features can have important implications for information dissemination, robustness against component failure, and more.
Traditional studies of networks generally assume that nodes are adjacent to each other by a single type of static edge that encapsulates all connections between them. This assumption is almost always a gross oversimplification, and it can lead to misleading results and even the fundamental inability to address certain problems. Most real systems have multilayer structures [4, 5], as there are almost always multiple types of ties or interactions that can occur between nodes, and it is crucial to take them into account. For example, transportation systems include multiple modes of travel, biological systems include multiple signaling channels that operate in parallel, and social networks include multiple types of relationships and multiple modes of communication. We will represent such systems using the formalism of multiplex networks, which allow one to incorporate multiple types of edges between nodes.
The notion of multiplexity was introduced years ago in fields such as engineering [6, 7] and sociology [1, 8–10], but the discussions included few analytical tools to accompany them. This situation arose for a simple reason: although many aspects of single-layer networks are well understood, it is challenging to properly generalize even the simplest concepts to multiplex networks. Theoretical developments on multilayer networks (including both multiplex networks and interconnected networks) have gained steam only in the last few years [11–20], and even basic notions like centrality and diffusion have barely been studied in multiplex settings [21–30]. New degrees of freedom arise from the multilayer structure of multiplex networks, and this brings both new challenges [4, 31] and new phenomena. The new phenomena include multiplexity-induced correlations [19], new types of dynamical feedbacks [26], and ‘costs’ of inter-layer connections [32]. For reviews about networks with multiple layers, see [4, 5].
In the present article, we focus on one of the most important structural properties of networks: triadic relations, which are used to describe the simplest and most fundamental type of transitivity in networks [1, 3, 33–35]. We develop multiplex generalizations of clustering coefficients, which can be done in myriad ways, and (as we will illustrate) the most appropriate generalization depends on the application under study. Such considerations are crucial when developing multiplex generalizations of any single-layer (i.e., ‘monoplex’) network diagnostic. There have been several attempts to define multiplex clustering coefficients [36–40], but there are significant shortcomings in these definitions. For example, some of them do not reduce to the standard single-layer clustering coefficient or are not properly normalized (see appendix
The fact that existing definitions of multiplex clustering coefficients are mostly ad hoc makes them difficult to interpret. In our definitions, we start from the basic concepts of walks and cycles to obtain a transparent and general definition of transitivity. This approach also guarantees that our clustering coefficients are always properly normalized. It reduces to a weighted clustering coefficient [41] of an aggregated network for particular values of the parameters; this allows comparison with existing single-layer diagnostics. We also address two additional, very important issues: (1) multiplex networks have many types of connections, and our multiplex clustering coefficients are (by construction) decomposable, so that the contribution of each type of connection is explicit; (2) because our notion of multiplex clustering coefficients builds on walks and cycles, we do not require every node to be present in all layers, which removes a major (and very unrealistic) simplification that is used in existing definitions.
Using the example of clustering coefficients, we illustrate how the new degrees of freedom that result from the existence of multiple layers in a multiplex network yield rich new phenomena and subtle differences in how one should define key network diagnostics. As an illustration of such phenomena, we derive analytical expressions for the expected values of clustering coefficients on multiplex networks in which each layer is an independent Erdős–Rényi (ER) graph. We find that the clustering coefficients depend on the intra-layer densities in a nontrivial way if the probabilities for an edge to exist are heterogeneous across the layers. We thereby demonstrate for multiplex networks that it is insufficient to generalize existing diagnostics in a naïve manner and that one must instead construct their generalizations from first principles (e.g., as walks and cycles in this case).
2. Methods
2.1. Mathematical representation
We represent a multiplex network using a finite sequence of graphs
, with
, where
is the set of layers. Without loss of generality, we let
and
. For simplicity, we examine unweighted and undirected multiplex networks. We define the intra-layer supra-graph as
, where the set of nodes is
and the set of edges is
. We also define the coupling supra-graph
using the same sets of nodes and the edge set
and its associated adjacency matrix
. If
, we say that
and
are ‘interconnected’. The nonzero entries of the matrix
indicates the connections between corresponding nodes (i.e., between the same entity) on different layers. We say that a multiplex network is ‘node-aligned’ [4] if all layers share the same set of nodes (i.e., if
for all α and κ). The supra-graph is
, where
. The corresponding adjacency matrix is the supra-adjacency matrix
.
Supra-adjacency matrices satisfy
and
, where
is the adjacency matrix of layer α (i.e., the adjacency matrix associated to
) and ⨁ denotes the direct sum of the matrices. We consider undirected networks, so
. For clarity, we denote nodes in a given layer and in monoplex networks using the symbols
; and we denote indices in a supra-adjacency matrix using the symbols
. We also define
to be the set of supra-adjacency matrix indices that correspond to node
, and we refer to the nodes
of a supra-graph as a node-layer pair. An entity u corresponds to a ‘physical node.’
The local clustering coefficient
of node
in an unweighted monoplex network is the number of triangles (i.e., triads) that include node
divided by the number of connected triples (i.e., either 2-stars or triangles) with node
in the center [3, 34]. The local clustering coefficient is a measure of transitivity [33], and it can be interpreted as the density of a focal node’s neighborhood. For our purposes, it is convenient to define the local clustering coefficient
as the number of three-cycles
that start and end at the focal node
divided by the number of three-cycles
such that the second step of the cycle occurs in a complete graph (i.e., assuming that the neighborhood of the focal node is as dense as possible)7
. In mathematical terms,
and
, where
is the adjacency matrix of the graph and
is the adjacency matrix of a complete graph with no self-edges. (In other words,
, where
is a complete square matrix of 1s and
is the identity matrix.)
The local clustering coefficient for node u is thus given by the formula
. This is equivalent to the usual definition of the local clustering coefficient:
, where
is the degree of node
. (The local clustering coefficient is often set to 0 for nodes of degree 0 and 1, and another option is to state that it is not defined in such cases.) One can calculate a single global clustering coefficient for a monoplex network either by averaging
over all nodes or by computing
. Henceforth, we will use the term global clustering coefficient for the latter quantity.
2.2. Triads on multiplex networks
In addition to three-cycles (i.e., triads) that occur within a single layer, multiplex networks also contain cycles that incorporate more than one layer but still have three intra-layer steps. Such cycles are important for the analysis of transitivity in multiplex networks. In social networks, for example, transitivity involves social ties across multiple social environments [1, 42]. In transportation networks, there typically exist several means of transport to return to one’s starting location, and different combinations of transportation modes are important in different cities [43]. For dynamical processes on multiplex networks, it is important to consider three-cycles that traverse different numbers of layers, so one needs to take them into account when defining a multiplex clustering coefficient. We define a supra-walk as a walk on a multiplex network in which, either before or after each intra-layer step, a walk can either continue on the same layer or change to an adjacent layer. We represent this choice using the following matrix:

where
is the
identity matrix,
is the number of node-layer pairs, the parameter β is a weight that accounts for the walk staying in the current layer, and γ is a weight that accounts for the walk stepping to another layer. In a supra-walk, a supra-step consists either of only a single intra-layer step or of a step that includes both an intra-layer step and an inter-layer step, in which one changes from one layer to another (either before or after the intra-layer step). In the latter type of supra-step, note that we are disallowing two consecutive inter-layer steps. The number of three-cycles for node
is then

where the first term corresponds to cycles in which the inter-layer step is taken after an intra-layer one and the second term corresponds to cycles in which the inter-layer step is taken before an intra-layer one. The subscript M refers to the particular way that we define a supra-walk in a multiplex network through the supra-matrices
and
. However, one can also define other types of supra-walks (see appendices
and
are symmetric. This yields

It is useful to decompose multiplex clustering coefficients that are defined in terms of multilayer cycles into so-called elementary cycles by expanding equation (3) and writing it in terms of the matrices
and
. That is, we write
, where
denotes the set of elementary cycles and
are weights of different elementary cycles. We can use symmetries in our definition of cycles and thereby express all of the elementary cycles in a standard form with terms from the set
. See figure 1 for an illustration of elementary cycles and appendix
Figure 1. Sketch of the elementary cycles
,
,
,
, and
. The orange node is the starting point of the cycle. The intra-layer edges are the solid lines, and the intra-layer edges are the dotted curves. In each case, the yellow line represents the second intra-layer step.
Download figure:
Standard image High-resolution imageTo define multiplex clustering coefficients, we need both the number
of cycles and a normalization
. The symbol * stands for any type of cycle: the three-cycle that we define in the main text, an elementary cycle, or the alternatives definition of three-cycles that we give in appendix
. To determine the normalization, it is natural to follow the same procedure as with monoplex clustering coefficients and use a complete multiplex network
, where
is the adjacency matrix for a complete graph on layer α. We can then proceed from any definition of
to
by replacing the second intra-layer step with a step in the complete multiplex network. For example, we obtain
for
. Similarly, one can use any other definition of a cycle (e.g., any of the elementary cycles or the cycles that we discuss in appendix
The above formulation allows us to define local and global clustering coefficients for multiplex networks analogously to their definition in monoplex networks. We can calculate a natural multiplex analog to the usual monoplex local clustering coefficient for any node
of the supra-graph. Additionally, in a multiplex network, a node
of an intra-layer network allows an intermediate description for clustering that lies between local and the global clustering coefficients. We define



where
is as defined before. Note that we refer to clustering coefficients defined either by equation (4) or by equation (5) as local clustering coefficients. In equations (4)–(6), and in our subsequent formulas for clustering coefficients, we are of course requiring denominators to be nonzero (as in the monoplex case). In situations in which a denominator vanishes, we set the value of the associated clustering coefficient to 0.
We can decompose the expression in equation (6) in terms of the contributions from cycles that traverse exactly one, two, and three layers (where
indicates the number of layers) to give



We can similarly decompose equations (4) and (5). Using the decomposition in equation (7) yields an alternative way to average over contributions from the three types of cycles:

where
is a vector that gives the relative weights of the different contributions. We use the term layer-decomposed clustering coefficients for
,
, and
. There are also analogs of equation (10) for the clustering coefficients defined in equations (4) and (5). Each of the clustering coefficients in equations (4)–(6) depends on the values of the parameters β and γ, but the dependence vanishes if
. Unless we explicitly indicate otherwise, we assume in our calculations that
.
2.3. Clustering coefficients for aggregated networks
A common way to study multiplex networks is to aggregate layers to obtain either multi-graphs or weighted networks, where the number of edges or the weight of an edge is the number of different types of edges between a pair of nodes [4]. One can then use any of the numerous ways to define clustering coefficients for weighted monoplex networks [44, 45] to calculate clustering coefficients for the aggregated network.
One of the weighted clustering coefficients is a special case of our multiplex clustering coefficient (for others, see appendix

where
is an element of the weighted adjacency matrix
. The elements of
are the weights of the edges, the quantity
is the maximum weight in
, and
is the adjacency matrix of the complete unweighted graph. We can define the global version CZ of
by summing over all of the nodes in the numerator and the denominator of equation (11) (analogously to equation (6)).
For node-aligned multiplex networks, the clustering coefficients
and CZ are related to our multiplex clustering coefficients
and CM. Letting
and summing over all layers yields
. That is, in this special case, the weighted clustering coefficients
and CZ are equivalent to the corresponding multiplex clustering coefficients
and CM. In particular,
and
. We need the term
to match the normalizations because aggregation removes the information about the number of layers b, so the normalization must be based on the maximum weight instead of the number of layers. That is, a step in the complete weighted network is described by using
in equation (11) instead of using
.
Note that this relationship between our multiplex clustering coefficient and the weighted clustering coefficient in equation (11) is only true for node-aligned multiplex networks. If some nodes are not shared among all layers, then the normalization of our multiplex clustering coefficient depends on how many nodes are present in the local neighborhood of the focal node. This contrasts with the ‘global’ normalization by
used by the weighted clustering coefficient in equation (11).
2.4. Clustering coefficients in Erdős–Rényi networks
Almost all real networks contain some amount of transitivity, and it is often desirable to know if a network contains more transitivity than would be expected by chance. In order to examine this question, one typically compares clustering-coefficient values of a network to what would be expected from some random network that acts as a null model. The simplest random network to use is an Erdős–Rényi (ER) network. In this section, we give formulas for expected clustering coefficients in node-aligned multiplex networks in which each intra-layer network is an ER network that is created independently of other intra-layer networks and the inter-layer connections are created as described in section 2.1.
The expected value of the local clustering coefficient in an unweighted monoplex ER network is equal to the probability p of an edge to exist. That is, the density of the neighborhood of a node, measured by the local clustering coefficient, has the same expectation as the density of the entire network for an ensemble of ER networks. In multiplex networks with ER intra-layer graphs with connection probabilities
, the same result holds only when all of the layers are statistically identical (i.e.,
for all α). Note that this is true even if the network is not node-aligned. However, heterogeneity among layers complicates the behavior of clustering coefficients. If the layers have different connection probabilities, then the expected value of the mean clustering coefficient is a nontrivial function of the connection probabilities. In particular, it is not always equal to the mean of the connection probabilities. For example, the formulas for the expected global layer-decomposed clustering coefficients are



See appendix
3. Results and discussions
We investigate transitivity in empirical multiplex networks by calculating clustering coefficients. In table 1, we give the values of layer-decomposed global clustering coefficients for multiplex networks (four social networks and two transportation networks) constructed from real data. Note that the two transportation networks have different numbers of nodes in different layers (i.e., they are not node-aligned [4]). To help give context to the values, the table also includes the clustering-coefficient values that we obtain for ER networks with matching edge densities in each layer. See appendix
Table 1.
Clustering coefficients CM,
,
, and
that correspond, respectively, to the global, one-layer, two-layer, and three-layer clustering coefficients for various multiplex networks. ‘Tailor Shop’: Kapferer tailor-shop network (n = 39, b = 4) [48]. ‘Management’: Krackhardt office cognitive social structure (n = 21, b = 21) [49]. ‘Families’: Padgett Florentine families social network (n = 16, b = 2) [50]. ‘Bank’: Roethlisberger and Dickson bank wiring-room social network (n = 14, b = 6) [51]. ‘Tube’: the London Underground (i.e., ‘the Tube’) transportation network (n = 314, b = 14) [52]. ‘Airline’: network of flights between cities, in which each layer corresponds to a single airline (n = 3108, b = 530) [53]. The rows labeled ‘orig.’ give the clustering coefficients for the original networks, and the rows labeled ‘ER’ give the expected value and the standard deviation of the clustering coefficient in an ER random network with exactly as many edges in each layer as in the original network. For the original values, we perform a two-tailed Z-test to examine whether the observed clustering coefficients could have been produced by the ER networks. We designate the p-values as follows: *:
, **:
for Bonferroni-corrected tests with 24 hypothesis; ’:
, ”:
for uncorrected tests. We do not use any symbols for values that are not significant. We symmetrize directed networks by considering two nodes to be adjacent if there is at least one edge between them. The social networks in this table are node-aligned multiplex graphs, but the transportation networks are not node-aligned. We report values that are means over different numbers of realizations:
for Tailor Shop,
for Airline,
for Management,
for Families,
for Tube, and
for Bank.
| Tailor Shop | Management | Families | Bank | Tube | Airline | ||
|---|---|---|---|---|---|---|---|
| CM | orig. | 0.319** | 0.206** | 0.223’ | 0.293** | 0.056 | 0.101** |
| ER | 0.186 ± 0.003 | 0.124 ± 0.001 | 0.138 ± 0.035 | 0.195 ± 0.009 | 0.053 ± 0.011 | 0.038 ± 0.000 | |
|
orig. | 0.406** | 0.436** | 0.289’ | 0.537** | 0.013” | 0.100** |
| ER | 0.244 ± 0.010 | 0.196 ± 0.015 | 0.135 ± 0.066 | 0.227 ± 0.038 | 0.053 ± 0.013 | 0.064 ± 0.001 | |
|
orig. | 0.327** | 0.273** | 0.198 | 0.349** | 0.043* | 0.150** |
| ER | 0.191 ± 0.004 | 0.147 ± 0.002 | 0.138 ± 0.040 | 0.203 ± 0.011 | 0.053 ± 0.020 | 0.041 ± 0.000 | |
|
orig. | 0.288** | 0.192** | — | 0.227** | 0.314** | 0.086** |
| ER | 0.165 ± 0.004 | 0.120 ± 0.001 | — | 0.186 ± 0.010 | 0.051 ± 0.043 | 0.037 ± 0.000 |
As we will now discuss, multiplex clustering coefficients give insights that are impossible to infer by calculating weighted clustering coefficients for aggregated networks or by calculating clustering coefficients separately for each layer of a multiplex network.
For each social network in table 1, note that
and
. Consequently, the primary contribution to the triadic structure of these multiplex networks arises from three-cycles that stay within a given layer. To check that the ordering of the different clustering coefficients is not an artifact of the heterogeneity of densities of the different layers, we also calculate the expected values of the clustering coefficients in ER networks with identical edge densities to the data. We observe that all clustering coefficients exhibit larger inter-layer transitivities than would be expected in corresponding ER networks with identical edge densities, although the same ordering relationship (i.e.
) holds. From our results in table 1, it seems that triadic-closure mechanisms in social networks cannot be considered purely at the aggregated network level; these mechanisms appear to be more effective inside of layers than between layers. For example, if there is a connection between individuals u and v and also a connection between v and w in the same layer, then it is more likely that u and w ‘meet’ in the same layer than in some other layer.
The transportation networks that we examine exhibit the opposite pattern from the social networks. For example, for the London Underground (‘Tube’) network, in which each layer corresponds to a line, we observe that
. This reflects the fact that single lines in the Tube are designed to avoid redundant connections. A single-layer triangle would require a line to make a loop among three stations. Two-layer triangles, which are a bit more frequent than single-layer ones, entail that two lines run in almost parallel directions and that one line jumps over a single station. For three-layer triangles, the geographical constraints do not matter because one can construct a triangle with three straight lines.
We also analyze the local triadic closure of the Kapferer tailor-shop social network by examining the local clustering-coefficient values. In figure 2(a), we show a comparison of the layer-decomposed local clustering coefficients (also see figure 6(a) of [40]). Observe that the condition
holds for most of the nodes. In figure 2(b), we subtract the expected values of the clustering coefficients of nodes in a network generated with the configuration model8
from the corresponding clustering-coefficient values observed in the data to discern whether we should also expect to observe the relative order of the local clustering coefficients in an associated random network (with the same layer densities and degree sequences as the data). Similar to our results for global clustering coefficients, we see that taking a null model into account lessens—but does not remove—the difference between the coefficients that count different numbers of layers.
Figure 2. Comparison of different local clustering coefficients in the Kapferer tailor-shop network. Each point corresponds to a node. (A) The raw values of the clustering coefficients. (B) The value of the clustering coefficients minus the expected value of the clustering coefficient for the corresponding node from a mean over 1000 realizations of a configuration model with the same degree sequence in each layer as in the original network. In a realization of the multiplex configuration model, each intra-layer network is an independent realization of the monoplex configuration model.
Download figure:
Standard image High-resolution imageWe investigate the dependence of local triadic structure on degree for one social network and one transportation network. In figure 3(a), we show how the different multiplex clustering coefficients depend on the unweighted degrees of the nodes in the aggregated network for the Kapferer tailor shop. Note that the relative ordering of the values of the mean clustering coefficient does not depend on degree. In figure 3(b), we illustrate that the aggregated network for the airline transportation network exhibits a non-constant difference between the curves of
and the weighted clustering coefficient
. Using a global normalization (see the discussion in section 2.3) reduces the clustering-coefficient values for the small airports much more than it does for the large airports. This, in turn, introduces a bias.
Figure 3. Local clustering coefficients versus unweighted degree of the aggregated network for (A) the Kapferer tailor-shop network and (B) the airline network. The curves give the mean values of the clustering coefficients for a degree range (i.e., we bin similar degrees). Note that the horizontal axis in panel (B) is on a logarithmic scale.
Download figure:
Standard image High-resolution imageThe airline network is organized differently from the London Tube network. When comparing these networks, note that each layer in the former encompasses flights from a single airline. For the airline network (see figure 3(b)), we observe that the two-layer local clustering coefficient is larger than the single-layer one for hubs (i.e., high-degree nodes), but it is smaller for small airports (i.e., low-degree nodes). However, the global clustering coefficient counts the total number of three-cycles and connected triplets, and it thus gives more weight to high-degree nodes than to low-degree nodes. We thus find that the global clustering coefficients for the airline network satisfy
. The intra-airline clustering coefficients have small values, presumably because it is not in the interest of an airline to introduce new flights between two airports that can already be reached by two flights via the same airline through some major airport. The two-layer cycles correspond to cases in which an airline has a connection from an airport to two other airports and a second airline has a direct connection between those latter two airports. Completing a three-layer cycle requires using three distinct airlines, and this type of congregation of airlines to the same area is not frequent in the data. Three-layer cycles are more common than single-layer cycles only for a few of the largest airports.
4. Conclusions
We derived measurements of transitivity for multiplex networks by developing multiplex generalizations of triadic relationships and clustering coefficients. By using examples from empirical data in diverse settings, we showed that different notions of multiplex transitivity are important in different situations. For example, the balance between intra-layer versus inter-layer clustering is different in social networks versus transportation networks (and even in different types of networks within each category, as we illustrated explicitly for transportation networks), reflecting the fact that multilayer transitivity can arise from different mechanisms. Such differences are rooted in the new degrees of freedom that arise from inter-layer connections and are invisible to calculations of clustering coefficients on single-layer networks obtained via aggregation. In other words, transitivity is inherently a multilayer phenomenon: all of these diverse flavors of transitivity reduce to the same description when one throws away the multilayer information. Generalizing clustering coefficients for multiplex networks makes it possible to explore such phenomena and to gain deeper insights into different types of transitivity in networks. The existence of multiple types of transitivity also has important implications for multiplex network motifs and multiplex community structure. In particular, our work on multiplex clustering coefficients demonstrates that the definition of any clustering notion for multiplex networks needs to be able to consider diverse forms of transitivity.
Acknowledgments
All authors were supported by the European Commission FET-Proactive project PLEXMATH (Grant No. 317614). AA also acknowledges financial support from the ICREA Academia, Generalitat de Catalunya (2009-SGR-838), and the James S.McDonnell Foundation; and SG and AA were supported by FIS2012-38266. YM was also supported by MINECO through Grants FIS2011-25167 and by DGA (Spain). MAP acknowledges a grant (EP/J001759/1) from the EPSRC. We thank David Krackhardt and the anonymous referees for useful comments.
Appendix A.: Weighted clustering coefficients
There are two primary weighted clustering coefficients for monoplex networks that provide alternatives to the one that we discussed in the main text [54, 55]. They are


where
is the unweighted adjacency matrix associated with the weighted adjacency matrix
, the degree of node
is
, the strength of
is
, and the quantity
is the maximum weight in
. When using equations (A.1) and (A.2), one also has
and
for nodes of degree
and
.
Appendix B.: Multiplex clustering coefficients in the literature
Let
denote the intra-layer adjacency matrix for layer α. For a weighted multiplex network, we use
to denote the intra-layer weight matrix (i.e., the weighted intra-layer adjacency matrix) for layer α. We use
to denote the weight matrix of the aggregated network. (See section 2.3 in the main text.) The clustering coefficient that was defined in [36] for node-aligned multiplex networks is

which can be expressed in terms of the aggregated network as

The numerator of equation (B.2) is the same as the numerator of the weighted clustering coefficient
, but the denominator is different. Because of the denominator in equation (B.2), the values of the clustering coefficient
do not have to lie in the interval
. For example,
for a complete multiplex network (where n is the number of nodes in the multiplex network), so
when
.
References [37, 38] defined a family of local clustering coefficients for directed and weighted multiplex networks:

where
, t is a threshold, and we recall that
is the set of layers. The clustering coefficient (B.3) does not yield the ordinary monoplex local clustering coefficient for unweighted (i.e., networks with binary weights) and undirected networks when it is calculated for the special case of a monoplex network (i.e., a multiplex network with b = 1 layer). Furthermore, its values are not normalized to lie between 0 and 1. For example, consider a complete multiplex network with n nodes and an arbitrary number of layers. In this case, the clustering coefficient (B.3) takes the value of
for each node. If a multiplex network is undirected (and unweighted), then
can always be calculated when one is only given an aggregated network and the total number of layers in the multiplex network. As an example, for the threshold value t = 1, one obtains

where
is the binary adjacency matrix corresponding to the weighted adjacency matrix
and
is the degree of node
.
Reference [39] defined a clustering coefficient for multiplex networks that are not necessarily node-aligned as

where
is again the set of layers,
, the quantity
is the set of neighbors of node u in the aggregated network,
is the set of nodes in layer α, and
is the set of edges in the subgraph induced by
in the aggregated network. For a node-aligned multiplex network,
and
, so one can write

which is a local clustering coefficient for the aggregated network.
Battiston et al [40] defined two versions of clustering coefficients for node-aligned multiplex networks:


The first definition,
, counts the number of
-type elementary cycles; and the second definition,
, counts the three-layer elementary cycles
. In both of these definitions, the sums in the denominators allow terms in which v = w, so a complete multiplex network has a local clustering coefficient of
for every node.
Reference [31] proposed definitions for global clustering coefficients using a tensorial formalism for multilayer networks; when representing a multiplex network as a third-order tensor, the formulas in [31] reduce to the clustering coefficients that we propose in the present article. (See equation (6) of the main text.)
Parshani et al [56] defined an ‘inter-clustering coefficient’ for two-layer interdependent networks that can be interpreted as multiplex networks [4, 57–59]. Their definition is similar to edge ‘overlap’ [40]; in our framework, it corresponds to counting two-cycles of type
. A few other scholars [60, 61] have also defined generalizations of clustering coefficients for multilayer networks that cannot be interpreted as multiplex networks [4].
In table B1
, we show a summary of the properties satisfied by several different (local and global) multiplex clustering coefficients. In particular, we check the following properties. (1) The value of the clustering coefficient reduces to the values of the associated monoplex clustering coefficient for a single-layer network. (2) The value of the clustering coefficient is normalized so that it takes values that are less than or equal to 1. (All of the clustering coefficients are non-negative.) (3) The clustering coefficient has a value of p in a large (i.e., when the number of nodes
) node-aligned multiplex network in which each layer is an independent ER network with an edge probability of p in each layer. (4) Suppose that we construct a multiplex network by replicating the same given monoplex network in each layer. We indicate whether the clustering coefficient for the multiplex network has the same value as for the monoplex network. (5) There exists a version of the clustering coefficient that is defined for each node-layer pair separately. (6) The clustering coefficient is defined for multiplex networks that are not node-aligned.
Table B1.
Summary of the properties of the different multiplex clustering coefficients. The notation
means that the property holds for both the global version and the local version of the associated clustering coefficient (C.C.).
| Property |
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| (1) Reduces to monoplex C.C. | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
(2)
|
✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
(3) in multiplex ER graph |
✓ | ✓ | ||||||
| (4) Monoplex C.C. for copied layers | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| (5) Defined for node-layer pairs | ✓ | |||||||
| (6) Defined for non-node-aligned | ✓ | ✓ |
Appendix C.: Other possible definitions of cycles
There are many possible ways to define cycles in multiplex networks. If we relax the condition of disallowing two consecutive inter-layer steps, then we can write


where
. (Our discussion in appendix D is helpful for understanding the factor of 1/2.) Unlike the matrix
in equation (3) in the main text, the matrices
and
are symmetric. We can thus interpret them as weighted adjacency matrices of symmetric supra-graphs, and we can then calculate cycles and clustering coefficients in these supra-graphs (see appendix D).
It is sometimes desirable to forbid the option of staying inside of a layer in the first step of the second term of equation (C.2). In this case, one can write

With this restriction, cycles that traverse two edges of the focal node i are only calculated two times instead of four times. We simplify equation (C.3) to obtain

which is similar to equation (3) in the main text. In table C1 , we show the values of the clustering coefficients that we calculate using this last definition of cycle for the empirical networks that we studied in the main text.
Table C1.
Clustering coefficients (rows) for the same empirical networks (columns) from table 1 in the main text. For the Tube and the Airline networks, we only calculate clustering coefficients for non-node-aligned networks. For local clustering coefficients, we average over all nodes to obtain
.
| CC | Families | Bank | Tailor Shop | Management | Tube | Airline |
|---|---|---|---|---|---|---|
|
0.218 | 0.289 | 0.320 | 0.206 | 0.070 | 0.102 |
|
0.289 | 0.537 | 0.406 | 0.436 | 0.013 | 0.100 |
|
0.202 | 0.368 | 0.338 | 0.297 | 0.041 | 0.173 |
|
— | 0.227 | 0.288 | 0.192 | 0.314 | 0.086 |
|
0.164 | 0.377 | 0.344 | 0.309 | 0.123 | 0.120 |
|
0.342 | 0.254 | 0.308 | 0.150 | 0.038 | 0.329 |
|
0.195 | 0.811 | 0.612 | 2.019 | — | — |
|
0.674 | 1.761 | 4.289 | 1.636 | — | — |
|
0.303 | 0.268 | 0.260 | 0.133 | — | — |
|
0.486 | 0.775 | 0.629 | 0.715 | — | — |
|
0.159 | 0.199 | 0.271 | 0.169 | — | — |
|
— | 0.190 | 0.282 | 0.179 | — | — |
Appendix D.: Defining multiplex clustering coefficients using auxiliary networks
An elegant way to generalize clustering coefficients for multiplex networks is to define a new (possibly weighted) auxiliary supra-graph GM so that one can define cycles of interest as weighted three-cycles in GM. Once we have a function that produces the auxiliary supra-adjacency matrix
, we can define the auxiliary complete supra-adjacency matrix
. One can then define a local clustering coefficient for node-layer pair i with the formula

As with a monoplex network, the denominator written in terms of the complete matrix
is equivalent to the usual one written in terms of connectivity. We thereby consider the connectivity of a node in the supra-graph induced by the matrix
. We refer to the matrix
as the multiplex walk matrix because it encodes the permissible steps in a multiplex network. When
is equal to
or to
, the induced supra-graph is directed, so one needs to distinguish between in-degrees and out-degrees.
A key advantage of defining clustering coefficients using an auxiliary supra-graph is that one can then use it to calculate other diagnostics (e.g., degree or strength) for nodes. One can thereby investigate correlations between clustering-coefficient values and the size of the multiplex neighborhood of a node. (The size of the neighborhood is the number of nodes that are reachable in a single step via connections defined by the matrix
.)
We can write the symmetric multiplex walk matrices in equations (C.1) and (C.2) as


To avoid double-counting intra-layer steps in the definition of
, we need to rescale either the intra-layer weight parameter β (i.e., we can write
) or the inter-layer weight parameter γ (i.e., we can write
and also define
).
Consider a supra-graph induced by a multiplex walk matrix. The distinction between the matrices
and
is that
also includes terms of the form
that take into account walks that have an inter-layer step (
) followed by an intra-layer step (
) and then another inter-layer step (
). Therefore, in the supra-graph induced by
, two nodes in the same layer that are not adjacent in that layer are nevertheless adjacent if the same physical nodes are adjacent in another layer.
The matrix
sums the contributions of all node-layer pairs that correspond to the same physical node when
. In other words, if we associate a vector of the canonical basis ei to each node-layer pair i and let
denote all node-layer pairs that correspond to the same physical node, then

produces a vector whose entries are equal to 1 for nodes that belong to the basis vector and which are equal to 0 for nodes that do not belong to that vector. Consequently,
is related to the weighted adjacency matrix of the aggregated graph for
. To be precise, we obtain the following relation:

One can also write the multiplex clustering coefficient induced by equation (2) in terms of the auxiliary supra-adjacency matrix by considering equation (3), which is a simplified version of the equation that counts cycles only in one direction. This yields

The matrix
is not symmetric, which implies that the associated graph is a directed supra-graph. Nevertheless, the clustering coefficient induced by
is the same as that induced by its transpose
if
is symmetric.
Appendix E.: Expressing clustering coefficients using elementary three-cycles
We now give a detailed explanation of the process of decomposing any of our walk-based clustering coefficients into elementary cycles. An elementary cycle is a term that consists of products of the matrices
and
(i.e., there are no sums) after one expands the expression for a cycle (which is a weighted sum of such terms). Because we are only interested in the diagonal elements of the terms and we consider only undirected intra-layer supra-graphs and coupling supra-graphs, we can transpose the terms and still write them in terms of the matrices
and
rather than also using their transposes. There are also multiple ways of writing non-symmetric elementary cycles (e.g.,
).
We adopt a convention in which we transpose all elementary cycles so that we select the one in which the first element is
rather than
when comparing the two versions of the term from left to right. That is, for two equivalent terms, we choose the one that comes first in alphabetical order. To calculate the clustering coefficients that we defined in the appendix (see appendices C and D), we also need to include elementary cycles that start and end in an inter-layer step. The set of elementary three-cycles is thus
,
,
,
,
,
,
,
.
We now write our clustering coefficients using elementary three-cycles. We obtain the normalization formulas by using the elementary three-cycles and then replacing the second
term with
. This yields a standard form for any of our multiplex clustering coefficients. For example,

where


where i is a node-layer pair and the
coefficients are scalars that correspond to the weights for each type of elementary cycle. These weights are different for different types of clustering coefficients; one can choose whatever is appropriate for a given problem. Note that we have absorbed the parameters β and γ into these coefficients (see below and table E1). We illustrate the possible elementary cycles in figure 1 of the main text and in figure E1
.
Figure E1. Sketches of elementary cycles for which both the first and the last step are allowed to be an inter-layer step. These elementary cycles are
,
, and
. The orange node is the starting point of the cycle. The intra-layer edges are the solid lines, and the intra-layer edges are the dotted curves. In each case, the yellow line represents the second intra-layer step. Note that the elementary cycle
includes three ‘degenerate’ versions in which the three-cycle returns to a previously-visited layer. The subscripts in the names of the degenerate cycles indicate the number of layers that are used in each cycle.
Download figure:
Standard image High-resolution imageTable E1.
Coefficients of elementary multiplex three-cycle terms
(see equations (E.2) and (E.3)) for different multiplex clustering coefficients. For example,
for type-M clustering coefficients (i.e., CM,
, and
) is equal to
. For type-SM′ and type-SM clustering coefficients, we calculate the expansions only for node-aligned multiplex networks.
| C.C. |
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
|
2 | ||||||||
| M |
|
2 | 2 | 2 | |||||
|
2 | ||||||||
|
1 | ||||||||
|
|
2 | 2 | 1 | |||||
|
2 | ||||||||
|
1 | ||||||||
| SM′ |
|
|
2 | 4 | 3 | 1 | |||
|
|
|
2 | 2 | |||||
| SM |
|
1 | |||||||
|
|
4 | 4 | 4 | 1 | ||||
|
|
|
|
8 | 4 | ||||
|
|
|
|
|
|
|
|
4 | |
|
|
|
|
|
|
||||
|
|
|
|
One can even express the cycles that include two consecutive inter-layer steps in the standard form of equations (E.2)–(E.3) for node-aligned multiplex networks, because
in this case. Without the assumption that
, the expansion for the coefficient cSM,i is cumbersome because it includes coefficients
with all possible combinations of k and h such that
and
. Furthermore, in the general case, it is also not possible to infer the number of layers in which a walk traverses an intra-layer edge based on the exponents of β and γ for cSM,i and
. For example, in
, the intra-layer elementary triangle
includes a contribution from both
(i.e., the walk stays in the original layer) and
(i.e., the walk visits some other layer but then comes back to the original layer without traversing any intra-layer edges while it was gone). Moreover, all of the terms with b arise from a walk moving to a new layer and then coming right back to the original layer in the next step. Because there are
other layers from which to choose, the influence of cycles with such transient layer visits is amplified by the total number of layers in a network. That is, adding more layers (even ones that do not contain any edges) changes the relative importance of different types of elementary cycles.
In table E1, we show the values of the coefficients
for the different ways that we define three-cycles in multiplex networks. In table E2
, we show their corresponding expansions in terms of elementary cycles for the case
. These cycle decompositions illuminate the difference between
,
,
, and
. The clustering coefficient
gives equal weight to each elementary cycle, whereas
gives half of the weight to
and
cycles (i.e., the elementary cycles that include an implicit double-counting) as compared to the other cycles. The matrices that correspond to elementary cycles with such double-counting are symmetric, and the same cycle is thus counted in two different directions.
Table E2.
Coefficients of the elementary multiplex three-cycle terms
(see eqations (E.2) and (E.3)) for different multiplex clustering coefficients when
. For type-SM′ and type-SM clustering coefficients, we calculate the expansions only for node-aligned multiplex networks.
| C. C. |
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| M | 2 | 2 | 2 | 2 | 2 | 0 | 0 | 0 |
|
1 | 2 | 2 | 1 | 2 | 0 | 0 | 0 |
| SM | 1b2 | 2b2 | 2b2 | b2 | 2b2 | b2 | 2b2 | b2 |
| SM′ |
|
2 |
|
|
2 | 1 | 2 | 0 |
Appendix F.: A simple example
We now use a simple example (see figure F1) to illustrate the differences between the various notions of a multiplex clustering coefficient. Consider a two-layer multiplex network with three nodes in layer a1 and two nodes in layer a2. The three node-layer pairs in layer a1 form a 2-star, the two node-layer pairs that are not connected directly to each other on layer a1 are each adjacent via an inter-layer edge to a counterpart node-layer pair in layer a2, and the two node-layer pairs on layer a2 are adjacent to each other.
Figure F1. A simple, illustrative example of a multiplex network.
Download figure:
Standard image High-resolution imageThe adjacency matrix
for the intra-layer supra-graph is

and the adjacency matrix
of the coupling supra-graph is

Therefore, the supra-adjacency matrix is

The multiplex walk matrix
is

and we note that it is not symmetric. For example, node-layer pair
is reachable from
, but node-layer pair
is not reachable from
. The edge
in this supra-graph represents the walk
in the multiplex network. The symmetric walk matrix
is

The matrix
is the sum of
and
with rescaled diagonal blocks in order to not double-count the edges
and
. Additionally,

which differs from
in the fact that node-layer pairs
and
are connected through the multiplex walk
.
The adjacency matrix of the aggregated graph is

That is, it is a complete graph without self-edges.
We now calculate
using different definitions of a multiplex clustering coefficient. To calculate
, we need to compute the auxiliary complete supra-adjacency matrix
according to equation (D.6). We obtain

The clustering coefficient of node-layer pair
, which is part of two triangles that are reachable along the directions of the edges, is

For node-layer pair
, we get

which is the same as the clustering-coefficient values of the remaining node-layer pairs.
To calculate
, we need to compute
, which we obtain using equation (D.3). We thus obtain

In the supra-graph associated with the supra-adjacency matrix
, all node-layer pairs are adjacent to all other node-layer pairs except those that correspond to the same physical nodes. The clustering coefficient of node-layer pair
, which is part of six triangles, is

The clustering coefficient of node-layer pair
, which is part of one triangle, is

To calculate
, we compute
using equation (D.2). We thus obtain

The only difference between the graphs associated with the matrices
and
is the weight of the edges in
that take into account the fact that intra-layer edges might be repeated in the two layers.
The clustering coefficient of node-layer pair
, which is part of eight triangles, is

The clustering coefficient of node-layer pair
, which is part of four triangles, is

Because we are weighting edges based on the number of times an edge between two nodes is repeated in different layers among a given pair of physical nodes in the normalization, none of the node-layer pairs has a clustering coefficient equal to 1. By contrast, all nodes have clustering coefficients with the same value in the aggregated network, for which the layer information has been lost. In particular, they each have a clustering-coefficient value of 1, independent of the definition of the multiplex clustering coefficient.
Appendix G.: Further discussion of clustering coefficients in multiplex ER networks
The expected values of the local clustering coefficients in node-aligned multiplex ER networks are





Note that
and
, but the two-layer clustering coefficient
arises from a weighted sum of contributions from three different elementary cycles.
In figure G1, we illustrate the behavior of the global and local clustering coefficients in multiplex networks in which the layers consist of ER networks with varying amounts of heterogeneity in the intra-layer edge densities. Although the global and mean local clustering coefficients are equal to each other when averaged over ensembles of monoplex ER networks, we do not obtain a similar result for multiplex networks with ER layers unless the layers have the same value of the parameter p. The global clustering coefficients give more weight than the mean local clustering coefficients to denser layers. This is evident for the intra-layer clustering coefficients
and
, for which the ensemble average of the mean of the local clustering coefficient
is always equal to the mean edge density, whereas the ensemble average of the global clustering coefficient
has values that are greater than or equal to the mean edge density. This situation is a good example of a case in which transivity in multiplex networks differs from the results and intuition from monoplex networks.
Figure G1. (A), (B), (C) Global and (D), (E), (F) local multiplex clustering coefficients in multiplex networks that consist of ER layers. The markers give the results of simulations of 100-node node-aligned multiplex ER networks that we average over 10 realizations. The solid curves are theoretical approximations (see equations (12)–(14) of the main text). Panels (A), (C), (D), (F) show results for three-layer networks, and panels (B), (E) show results for six-layer networks. The ER edge probabilities of the layers are (A), (D)
, (B), (E)
, and (C), (F)
.
Download figure:
Standard image High-resolution imageIn particular, failing to take into account the heterogeneity of edge densities in multiplex networks can lead to incorrect or misleading results when trying to distinguish among values of a clustering coefficient that are what one would expect from an ER random network versus those that are a signature of a triadic-closure process (see figure G1).
Appendix H.: Null model for shuffling inter-layer connections
In table H1, we compare empirical values of layer-decomposed global clustering coefficients with clustering-coefficient values for a null model in which we preserve the topology of each intra-layer network but independently shuffle the labels of the nodes inside of each layer. That is, for each intra-layer network
, we choose a permutation
uniformly at random and construct a new multiplex network starting from
, where
and
. In this way, we effectively randomize inter-layer edges but preserve both the structure of intra-layer networks and the number of inter-layer edges between each pair of layers. For our comparisons using this null model, most of the clustering coefficients take values that are significant for our data sets (see table H1). Because of the way that we construct the null model, the global single-layer clustering coefficients are exactly the same for the original data and the null model.
Table H1.
Clustering coefficients CM,
,
,
that correspond, respectively, to the global, one-layer, two-layer, and three-layer clustering coefficients for various multiplex networks. ‘Tailor Shop’: Kapferer tailor-shop network (n = 39, b = 4) [48]. ‘Management’: Krackhardt office cognitive social structure (n = 21, b = 21) [49]. ‘Families’: Padgett Florentine families social network (n = 16, b = 2) [50]. ‘Bank’: Roethlisberger and Dickson bank wiring-room social network (n = 14, b = 6) [51]. ‘Tube’: the London Underground (i.e., ‘the Tube’) transportation network (n = 314, b = 14) [52]. ‘Airline’: network of flights between cities, in which each layer corresponds to a single airline (n = 3108, b = 530) [53]. The rows labeled ‘orig.’ give the clustering coefficients for the original networks, and the rows labeled ‘NM’ give the expected value and the standard deviation of the clustering coefficient in a null-model network in which we preserve the topology of the intra-layer networks but separately (and independently) shuffle the node labels for each intra-layer network. For the original values, we perform a two-tailed Z-test to examine whether the observed clustering coefficients could have been produced by the null model. We designate the p-values as follows: *:
, **:
for Bonferroni-corrected tests with 18 hypothesis; ’:
, ”:
for uncorrected tests. We do not use any symbols for values that are not significant. We symmetrize directed networks by considering two nodes to be adjacent if there is at least one edge between them. The social networks in this table are node-aligned multiplex graphs, but the transportation networks are not node-aligned. We report values that are means over different numbers of realizations: 105 for Tailor Shop, 103 for Airline, 104 for Management, 105 for Families, 104 for Tube, and 105 for Bank.
| Tailor Shop | Management | Families | Bank | Tube | Airline | ||
|---|---|---|---|---|---|---|---|
| CM | orig. | 0.319** | 0.206** | 0.223 | 0.293** | 0.056** | 0.101** |
| NM | 0.218 ± 0.007 | 0.131 ± 0.003 | 0.194 ± 0.029 | 0.223 ± 0.014 | 0.025 ± 0.008 | 0.054 ± 0.001 | |
|
orig. | 0.406 | 0.436 | 0.289 | 0.537 | 0.013 | 0.100 |
| NM | 0.406 | 0.436 | 0.289 | 0.537 | 0.013 | 0.100 | |
|
orig. | 0.327** | 0.273** | 0.198 | 0.349** | 0.043 | 0.150** |
| NM | 0.220 ± 0.009 | 0.176 ± 0.003 | 0.158 ± 0.041 | 0.240 ± 0.018 | 0.035 ± 0.018 | 0.082 ± 0.002 | |
|
orig. | 0.288** | 0.192** | — | 0.227’ | 0.314** | 0.086** |
| NM | 0.165 ± 0.010 | 0.120 ± 0.003 | — | 0.186 ± 0.016 | 0.053 ± 0.043 | 0.037 ± 0.002 |
Footnotes
- 7
Note that we use the term ‘cycle’ to refer to a walk that starts and ends at the same physical node
. As we will discuss later, in a multiplex network, it is permissible (and relevant) to return to the same node via a different layer from the one that was used originally to leave the node. - 8
We use the configuration model instead of an ER network as a null model because the local clustering-coefficient values are typically correlated with node degree in monoplex networks [3], and an ER-network null model does not preserve degree sequence.

































































































