Generalized Deam–Edwards approach to the statistical mechanics of randomly crosslinked systems

Xiangjun Xing; Bing-Sui Lu; Fangfu Ye; Paul M Goldbart

doi:10.1088/1367-2630/15/8/085017

1. Introduction

Randomly crosslinked systems—including rubbers, gels, liquid crystalline elastomers, cytoskeleton networks and many other related systems—arise widely in nature and are also extensively manufactured by scientists and technologists. Their physical properties are dominated by heterogeneous, system-spanning random networks. A proper specification of this random network structure therefore constitutes the first step toward obtaining a more complete understanding of these complex materials.

In a typical protocol for making randomly crosslinked materials, one starts from a liquid system under particular physical conditions (e.g. of temperature, pressure, solvents, etc), and randomly introduces crosslinkers that link together nearby particles or polymers. The macroscopic liquid state prior to crosslinking is defined as the preparation state. A realistic crosslinking process necessarily takes some finite interval of time, and is therefore inevitably accompanied by some irreversible responses of the system. Inclusion of these irreversible aspects would make the theoretical modeling of the random crosslinking process extremely complicated. To simplify the analysis and nevertheless capture the essential physics of random crosslinking, we shall therefore follow the original idea of Deam and Edwards [1] by considering an idealized, instantaneous crosslinking scheme in which all crosslinkers are introduced into the system at one particular instant in time. If a sufficient number of crosslinkers is introduced, a system-spanning random network emerges, and endows the material with a non-vanishing shear modulus. A continuous transition such as this, from a liquid to a random solid, is usually called gelation—or vulcanization in settings of pre-existing polymers. Gelation and vulcanization are quite different from usual thermodynamic phase transitions, because they are irreversible and the resulting systems acquire intrinsic spatial heterogeneity. The statistical theory for these transitions is therefore necessarily more complicated.

It is well known that the physical properties of randomly crosslinked materials depend both on the state of measurement and on the state of preparation (i.e. the method of preparation). This is, of course, a feature common to all heterogeneous materials, and has been widely appreciated by experimentalists. It was also explicitly pointed out by Deam and Edwards [1] and by de Gennes [2] as early as the 1970s. It therefore follows that the statistical physics of randomly crosslinked materials involves two statistical ensembles: the preparation ensemble and the measurement ensemble. This necessitates a substantial generalization of the standard Gibbs ensemble theory for equilibrium statistical mechanics.

To substantiate this seemingly extreme statement, let us discuss the remarkably distinct properties exhibited by nematic elastomers that have been crosslinked under distinct conditions. Nematic elastomers [3] are crosslinked nematic polymer melts, and their liquid crystalline ordering is strongly coupled to their network elasticity. Two crosslinking protocols have been studied recently by Urayama et al. In one, the system is first crosslinked in the high-temperature, isotropical phase of liquid crystallinity, and is then brought to a lower-temperature, nematic phase. The resulting system (called an isotropic-genesis polydomain (PD) nematic elastomer, or I-PNE) exhibits a PD nematic state, with a typical nematic domain size of a few microns. Upon stretching, the nematic domains gradually rotate and deform, and eventually, beyond certain strain value, they align along the external stress so as to form a monodomain (MD) state. The stress response during the PD–MD transition window is extremely soft, being orders of magnitude lower than that of typical elastomeric materials.

In the other protocol, the system is first quenched into the nematic phase, so that the nematic order is well developed locally, but is still frustrated by nematic defects at larger scales (typically beyond 100 μm). Crosslinking carried out in this state freezes in the patterns of nematic defects. The resulting system (known as a nematic-genesis PD nematic elastomer, or N-PNE) also exhibits a stress-driven PD–MD transition. The observed stress is, however, within the range of typical elastomer materials, and much higher than that of I-PNEs. The striking differences between I-PNEs and N-PNEs are entirely due to their distinct preparation states. Evidently, a theoretical understanding of these differences demands a statistical theory that involves both the preparation ensemble and the measurement ensemble.

A generic protocol for preparing and measuring a randomly crosslinked system has three steps, each statistical in nature. Firstly, there is a preparation state. We shall denote averaging over the thermal fluctuations in this state by square brackets [A]_p , where the subscript p stands for 'preparation'. Secondly, there is an instantaneous crosslinking stage, at which crosslinkers are randomly introduced into the system. Averages over realizations of crosslinkers are denoted by an overbar: $\overline {A}$ . Finally, there is a measurement state. Averages over fluctuations in this state are denoted by angle brackets 〈A〉_m, where the subscript m stands for 'measurement'. Note that these three types of average have a causal ordering in time. Any quantity averaged in a measurement state must also be averaged over crosslinker realizations and over the preparation state, as the latter two ensembles constitute the method of preparation. By contrast, quantities averaged in the preparation ensemble are not to be averaged over crosslinking, because the network does not exist in the preparation state.

From the above example of nematic elastomers, it can be inferred that the structure of a polymer network in a randomly crosslinked system depends not only on the randomness inherent in the crosslinking process but also on the thermal fluctuations in the state of preparation. Two macroscopically identical crosslinking processes using the identical protocol yield two distinct networks which are only statistically similar. More importantly, two such crosslinking processes carried out under different preparation states yield statistically different network structures, and lead to different properties of the systems, even under the same conditions in the measurement state. It therefore takes both the crosslinking methods and the preparation state to have a full statistical description of the structure of the random network in these materials.

In view of the three steps just mentioned, as well as their statistical nature, there are four types of physical quantities that can be measured in a randomly crosslinked system.

1.
Thermodynamic averages in the preparation state: [A]_p, [A B]_p, etc, where the subscripts p stands for 'preparation'.
2.
Thermodynamic averages in the measurement state: $[\overline {\langle A\rangle _{\rm{m}}}]_{\rm{p}}$ , $[\overline {\langle A B\rangle _{\rm{m}}}]_{\rm{p}}$ , etc. Note that these are to be averaged over both their crosslinking randomness and the thermal fluctuations in the preparation state.
3.
Glassy correlations, which characterize sample-to-sample fluctuations in the measurement state: $[\overline {\langle A\rangle _{\rm{m}}\langle B\rangle _{\rm{m}}}]_{\rm{p}}$ . These quantities are important because the systems are random.
4.
Memory correlations: $[A\overline {\langle B\rangle _{\rm{m}}}]_{\rm{p}}$ , which characterize cross-correlations between the preparation and measurement states. Memory correlations therefore characterize the dependence of system properties on their histories of formation.

The purpose of this paper is to discuss various basic aspects of vulcanization theory, which provides a general methodology for the systematic calculation of all four of these types of diagnostic quantity. The remainder of the paper is organized as follows. In section 2 we introduce a toy model of randomly linked anisotropic particles, as a starting point for vulcanization theory. In section 3 we discuss a generalized Deam–Edwards distribution of network structure, and average the free energy of the network over this distribution. At this stage, we shall see a key theme, viz., that both the preparation and the measurement ensemble appear naturally, coupled to one another, in the effective replicated Hamiltonian. In section 4 we discuss in detail various physical quantities that can be calculated using the replicated partition function. We also discuss the significance of the zeroth replica and the connection between replica limit and causality. We also address the distinction between the original Deam–Edwards distribution and our generalization of it. In section 5 we discuss the physical meaning of the vulcanization order parameter. In section 6 we study the mean-field approximation, and from it we derive the classical theory of rubber elasticity, as well as the neoclassical theory elasticity, appropriate for nematic elastomers. Finally in section 7, we draw the concluding remarks by briefly reviewing three basic results about the heterogeneous nature of randomly crosslinked materials, and by indicating some possible future directions.

2. Model of linked particles

To make our discussion concrete, we consider a liquid of particles, each of which may or may not be anisotropic. The particles may, e.g., be entire polymers that we regard in a coarse-grained sense. Under appropriate conditions, the particles may participate in some kind of collective ordering, such as liquid crystalline alignment. Each particle has certain number of functional sites at which the crosslinkers can act. Before crosslinking, the spatial locations of all the functional sites of all the particles, which we index via i, are denoted by c⁰_i. We assume that the positions of these sites c⁰ ≡ {c⁰_i,i = 1,...,N} completely specify the microscopic state of the liquid (i.e. there are no relevant degrees of freedom beyond these). The liquid Hamiltonian (normalized by temperature) is denoted by H⁰_liq(c⁰). The corresponding partition function for the liquid system (in the preparation state) is then given (with the shorthand notation $D {\boldsymbol {c}}^0 = \prod _{i=1}^N d {\boldsymbol {c}}^0_i$ ) by

$\begin{equation} Z^0_{\rm{liq}} = \int D {\boldsymbol{c}}^0 \, \mathrm{e}^{- H^0_{\rm{liq}}({\boldsymbol{c}}^0)}. \end{equation} \tag{ 1 }$

The chemical crosslinkers are modeled as Gaussian springs⁴, each connecting a pair of functional sites (i,j) (for i ⩽ j to avoid double counting). The network structure is uniquely specified by the list of N(N + 1)/2 'link numbers' k_(i,j) that specify the number of springs linking the pair of sites (i,j): χ = {k_(i,j)}.⁵ Then, the summation over network structures corresponds to summations over all (non-negative) integer values for k_(i,j):

$\begin{equation} \sum_{\chi} = \sum_{k_{(1,1)} = 0}^{\infty}\sum_{k_{(1,2)} = 0}^{\infty} \cdots \sum_{k_{(N,N)} = 0}^{\infty} \equiv \sum_{\{(i,j)\}}. \end{equation} \tag{ 2 }$

Another, equivalent, specification of the network structure is given as follows. Firstly, we specify the total number of springs M. Secondly, we treat these M springs as if they are distinguishable, and number them using an integer index e = 1,...,M. The eth spring can be denoted by an integer pair (i_e,j_e) (with i_e ⩽ j_e) which specifies the pair of functional sites it links together. The network structure is then fully but non-uniquely characterized by an integer M and a list of M integer pairs χ' = {(i_e,j_e)}^M_e=1. Because all springs are identical, any arbitrary permutation of the M pairs within the list χ' does not change the network structure. We easily find the number of distinct χ' that correspond to the same network structure (which is uniquely determined by χ), i.e. the 'degeneracy factor':

$\begin{equation} \frac{M!}{\prod_{(i,j)} k_{(i,j)}!}. \end{equation} \tag{ 3 }$

The summation over all network structures can then also be represented as a summation over the crosslinker number M, combined with a sum over the list of 2M integers {i_e,j_e}

$\begin{eqnarray*} &&\sum_{\chi} = \sum_{M=0}^\infty \sum_{i_1, j_1} \cdots \sum_{i_M, j_M}. \end{eqnarray*}$

Note, however, that the probability of a given network structure P_χ' must be divided by the degeneracy factor (3) in order to correct for the over-counting. We shall discuss this issue in detail below. For an illustration of a particular network structure, see figure 1. For the two equivalent specification of this network structure, see table 1.

Table 1. Two equivalent specifications of the network structure shown in figure 1.

χ	k_(1,9)=k_(2,8)=k_(3,4)=k_(4,5)=k_(5,9)=k_(6,7)=1, all other k_(i,j)=0
(M,χ')	M=6, χ'={(i_e,j_e), e=1,...,6}={(1,9),(2,8),(3,4),(4,5),(5,9),(6,7)}

Let us assert that after crosslinking the system is brought to the measurement state, in which the microscopic locations of the sites are now labeled by c_i (rather than c⁰_i of the preparation ensemble). The interaction between pairs of sites linked by springs is modeled via quadratic terms in the separations between sites:

$\begin{equation} \Delta H_{\chi} ({\boldsymbol{c}}) = \frac{1}{2 b^2} \sum_{e=1}^M \left\vert{{\boldsymbol{c}}}_{i_e} - {{\boldsymbol{c}}}_{j_e} \right\vert^2 = \frac{1}{2 b^2} \sum_{(i,j)} k_{(i,j)} \left\vert{{\boldsymbol{c}}}_{i} - {{\boldsymbol{c}}}_{j}\right\vert^2. \end{equation} \tag{ 4 }$

This potential energy increases without bound as any two linked sites are separated, and hence it keeps them close to each other at all times. For later purposes, we also define the liquid–state partition function in the measurement state

$\begin{equation} Z_{\rm{liq}} = \int D {\boldsymbol{c}} \, \mathrm{e}^{- H_{\rm{liq}}({\boldsymbol{c}}) }. \end{equation} \tag{ 5 }$

The partition function and free energy of the crosslinked system are then given, respectively, by

$\begin{equation} Z_{\chi} = \int D{\boldsymbol{c}} \, \mathrm{e}^{-H_{\rm{liq}} ({\boldsymbol{c}}) - \Delta H_{\chi} ({\boldsymbol{c}}) }, \quad F_{\chi} = - T \ln Z_{\chi} \end{equation} \tag{ 6 }$

both of which depend on the network structure χ. Throughout this paper, we shall set the Boltzmann constant to be unity. Note that, generically, H_liq(c) (i.e. the liquid Hamiltonian in the measurement state) differs from its preparation-state counterpart H⁰_liq(c). We then average the free energy over realizations of the network structure χ (with probability P_χ) to obtain

$\begin{equation} \overline{F} = \sum\nolimits_\chi P_{\chi} F_\chi = - T \sum\nolimits_{\chi} P_{\chi} \ln Z_\chi. \end{equation} \tag{ 7 }$

Assuming that the self-averaging property holds, this formula also gives the free energy of a typical sample. The calculation (or modeling) of the probability P_χ for the network structure χ is the sine qua non for obtaining a proper statistical treatment of randomly crosslinked systems. This we now discuss in detail.

3. Generalized Deam–Edwards distribution of network structure

3.1. Statistics of connectivity between a pair of sites

The crosslinking process that we shall consider is an instantaneous crosslinking process, as already mentioned previously. Many springs (crosslinkers) are introduced into the liquid system (in its preparation state), completely randomly and independently of one another. We further assume that the extension of every spring d is distributed according to a Gaussian equilibrium Gibbs–Boltzmann factor with characteristic lengthscale b (cf equation (4)):

$\begin{equation} P( \boldsymbol{d} ) \propto \mathrm{e}^{ -| \boldsymbol{d} |^2/{2 b^2} }. \end{equation} \tag{ 8 }$

In addition, we assume that each functional site c⁰_i has an 'effective region' of radius , centered around c⁰_i. If an end of a spring is introduced at some position x that lies within the 'effective region' around c⁰_i, it is successfully linked to this functional site. We assume that is smaller than half the minimal distance between any two functional sites, so that any end of a spring cannot be linked to two sites simultaneously. This assumption does not limit the generality of our approach (figure 2).

**Figure 2.** Two functional sites, c_i and c_j, each with an 'effective region' of radius and volume v. Any end of a spring that is introduced into one such region is successfully linked to the site at the center of that region.
Download figure:
Standard image High-resolution image

epsilon — **Figure 2.** Two functional sites, c_i and c_j, each with an 'effective region' of radius and volume v. Any end of a spring that is introduced into one such region is successfully linked to the site at the center of that region.
Download figure:
Standard image High-resolution image

Next, let p_k(c⁰_i,c⁰_j) be the probability that a given pair of sites {c⁰_i,c⁰_j} is linked by some integer number k of springs. If we assume that the crosslinkers are introduced into the system at random, independently and homogeneously, then k would be Poisson-distributed

$\begin{equation} p_{k}({\boldsymbol{c}}^0_i,{\boldsymbol{c}}^0_j) = C \, \frac{\hat{\mu}^k}{k!}\, \mathrm{e}^{- k\, \left| {\boldsymbol{c}}^0_i - {\boldsymbol{c}}^0_j \right| ^2/2 b^2}, \end{equation} \tag{ 9 }$

where we have introduced the dimensionless control parameter $\hat {\mu }$ for the crosslinking density⁶, and the constant C is determined via overall normalization, see below. Note that p_k(c⁰_i,c⁰_j) contains a factor proportional to the k^th power of Gibbs–Boltzmann factor (8), which characterizes the equilibrium distribution of each spring. It is convenient to define a (three-dimensional) soft delta function δ_b(x) in terms of a Gaussian spatial profile having variance parameter b²

$\begin{equation} \delta_b({\boldsymbol{x}}) \equiv ( \sqrt{2\pi}/b )^{3} \,\mathrm{e}^{- {\boldsymbol{x}}^2/2 b^2}. \end{equation} \tag{ 10 }$

The function δ_b(x) tends to the Dirac delta function as b → 0. The linking probability (9) can then also be expressed in the following concise form:

$\begin{equation} p_{k}({\boldsymbol{c}}^0_i,{\boldsymbol{c}}^0_j) = \mathrm{e}^{- \tilde{\mu} \, \delta_b({\boldsymbol{c}}^0_i - {\boldsymbol{c}}^0_j)} \frac{1}{k!} \left(\, \tilde{\mu} \, \delta_b({\boldsymbol{c}}^0_i - {\boldsymbol{c}}^0_j) \,\right)^k, \end{equation} \tag{ 11 }$

where we have exchanged the crosslink density control parameter $\hat {\mu }$ for the parameter $\tilde {\mu }$ , defined via

$\begin{equation} \tilde{\mu} \equiv (b/ \sqrt{2\pi} )^{3} \, {\hat{\mu}}. \end{equation} \tag{ 12 }$

It is easy to check that equation (11) is properly normalized: $\sum _{k = 0}^{\infty } p_k({\boldsymbol {c}}^0_i,{\boldsymbol {c}}^0_j) =1$ .

3.2. Statistics of the connectivity of the network

We assume that the liquid, prior to crosslinking, is in thermal equilibrium with respect to Hamiltonian H⁰_liq(c⁰), where c⁰ ≡ {c⁰_i} denotes the locations of all N functional sites in the preparation state⁷. The joint probability density function for all N sites (prior to crosslinking) to be at the locations {c⁰₁,...,c⁰_N} is then given by

$\begin{equation} P({\boldsymbol{c}}^0) = \frac{1}{Z^0_{\rm{liq}} }\, \mathrm{e}^{- H^0_{\rm{liq}}({\boldsymbol{c}}^0)}, \end{equation} \tag{ 13 }$

where the normalizing partition function is given by

$\begin{equation} Z^0_{\rm{liq}} = \int D {\boldsymbol{c}}^0 \, \mathrm{e}^{- H^0_{\rm{liq}}({\boldsymbol{c}}^0)}. \end{equation} \tag{ 14 }$

Now recall that the network structure is characterized by a set of N(N + 1)/2 stochastic variables⁸, or link numbers, {k_(i,j)}, each of them characterizing the number of springs linking a particular pair (i,j). Given a liquid configuration c⁰ prior to crosslinking, these link numbers are fully uncorrelated, and governed by the following conditional probability distribution:

$\begin{eqnarray} P({\chi}|{\boldsymbol{c}}^0) &=& \prod_{(i,j)}p_{k_{(i,j)}}({\boldsymbol{c}}^0_i, {\boldsymbol{c}}^0_j)\nonumber\\ &=& \prod_{(i,j)}\mathrm{e}^{- \tilde{\mu} \, \delta_b({\boldsymbol{c}}^0_i - {\boldsymbol{c}}^0_j)} \frac{1}{k_{(i,j)}!} \left( \tilde{\mu} \, \delta_b({\boldsymbol{c}}^0_i - {\boldsymbol{c}}^0_j) \right)^{k_{(i,j)}}, \end{eqnarray} \tag{ 15 }$

where we have used the linking probability given by equation (11). Making the further definition

$\begin{equation} H_{\rm{norm}} ({\boldsymbol{c}}) \equiv - \tilde{\mu} \, \sum_{i, j } \delta_b({{\boldsymbol{c}}}_i - {{\boldsymbol{c}}}_j), \end{equation} \tag{ 16 }$

we can write this conditional probability as

$\begin{eqnarray} P({\chi}|{\boldsymbol{c}}^0) &=& \mathrm{e}^{H_{\rm{norm}}({\boldsymbol{c}}^0)} \prod_{(i,j)} \frac{1}{k_{(i,j)}!} \left( \tilde{\mu} \, \delta_b({\boldsymbol{c}}^0_i - {\boldsymbol{c}}^0_j) \right)^{k_{(i,j)}}\nonumber\\ &= & \mathrm{e}^{H_{\rm{norm}}({\boldsymbol{c}}^0)} \prod_{(i,j)} \frac{\hat{\mu}^{k_{(i,j)}}}{k_{(i,j)}!} \mathrm{e}^{- k_{(i,j)}\,|{\boldsymbol{c}}^0_i - {\boldsymbol{c}}^0_j|^2/2 b^2}\nonumber\\ &=& \mathrm{e}^{H_{\rm{norm}}({\boldsymbol{c}}^0)-\Delta H_{\chi}({\boldsymbol{c}}^0)} \prod_{(i,j)} \frac{\hat{\mu}^{k_{(i,j)}}}{k_{(i,j)}!}, \end{eqnarray} \tag{ 17 }$

where we have used equations (10), (12) and (4) in the above three equalities, respectively. If we were to specify the network structure using the set χ' = {(i_e,j_e),e = 1,...,M} instead of the set χ, the corresponding conditional probability would be

$\begin{equation} P(\chi'|{\boldsymbol{c}}^0) = \mathrm{e}^{H_{\rm{norm}}({\boldsymbol{c}}^0)} \frac{1}{M!} \prod_{e = 1}^M \tilde{\mu} \, \delta_b({\boldsymbol{c}}^0_{i_e} - {\boldsymbol{c}}^0_{j_e}), \end{equation} \tag{ 18 }$

where we have divided the probability by the proper degeneracy factor (3), so as to cancel the over-counting.

The probability of obtaining the network structure χ can be arrived at by using the law of total probability as

$\begin{eqnarray} P_{\chi} &=&\sum_{{\boldsymbol{c}}^0} P(\chi | {\boldsymbol{c}}^0) P({\boldsymbol{c}}^0)\nonumber\\ &=& \frac{1}{Z^0_{\rm{liq}}} \int D{\boldsymbol{c}}^0 \, \mathrm{e}^{- H_{\rm{liq}}^0({\boldsymbol{c}}^0) + H_{\rm{norm}}({\boldsymbol{c}}^0) -\Delta H_{\chi}({\boldsymbol{c}}^0)}\, \prod_{(i,j)} \frac{\hat{\mu}^{k_{(i,j)}}}{k_{(i,j)}!}\,. \end{eqnarray} \tag{ 19 }$

Furthermore, by making the definition

$\begin{equation} \tilde{Z}_{0,\chi} \equiv \int D {\boldsymbol{c}}^0 \, \mathrm{e}^{-H_{\rm{liq}}^0({\boldsymbol{c}}^0) + H_{\rm{norm}}({\boldsymbol{c}}^0) -\Delta H_{\chi}({\boldsymbol{c}}^0)}\,, \end{equation} \tag{ 20 }$

we obtain

$\begin{equation} P_{\chi} = \frac{\tilde{Z}_{0,\chi}}{Z^0_{\rm{liq}}} \prod_{(i,j)} \frac{\hat{\mu}^{k_{(i,j)}}}{k_{(i,j)}!}. \end{equation} \tag{ 21 }$

Hence we see that, apart from a dimensionless factor, P_χ is the ratio of two partition functions. Even though the variables c⁰_i (associated with preparation state) appear only as dummy variables, the integrations over them in equation (20) cannot be carried out explicitly. Below, when we calculate the disorder average of free energy, we shall find that these variables c⁰_i interact with the corresponding variables c_i in the measurement ensemble. It is this interaction that generates the ubiquitous memory effects in randomly crosslinked materials.

3.3. Normalization of probabilities

The conditional probability that there are precisely M linking springs in the system can be obtained by summing equation (18) over all possible values taken by the indices (i_e,j_e):

$\begin{eqnarray} P(M|{\boldsymbol{c}}^0) &=&\sum_{i_1, j_1 = 1}^N \sum_{i_2, j_1 = 2}^N\cdots\sum_{i_M, j_M = 1}^NP(\chi' | {\boldsymbol{c}}^0)\nonumber\\ &=& \mathrm{e}^{H_{\rm{norm}}({\boldsymbol{c}}^0)}\frac{1}{M!}\left(- H_{\rm{norm}}({\boldsymbol{c}}^0)\right)^{M}. \end{eqnarray} \tag{ 22 }$

It is elementary to check that the conditional probability P(M|c₀) is properly normalized for any fixed c₀:

$\begin{equation} \sum_{M = 0}^{\infty} P(M | {\boldsymbol{c}}^0) = 1. \end{equation} \tag{ 23 }$

To check the normalization of the probability P_χ, we sum equation (19) over all possible network structures. By equation (2), this amounts to summing over all possible linking numbers {k_(i,j) }. Further using equations (4) we arrive at

$\begin{eqnarray} \sum_{\chi} P_{\chi} &=&\frac{1}{Z^0_{\rm{liq}}} \int D{\boldsymbol{c}}^0 \,\mathrm{e}^{- H_{\rm{liq}}^0({\boldsymbol{c}}^0) + H_{\rm{norm}}({\boldsymbol{c}}^0)}\sum_{\chi} \mathrm{e}^{-\Delta H_{\chi}({\boldsymbol{c}}^0)} \,\prod_{(i,j)} \frac{\hat{\mu}^{k_{(i,j)}}}{k_{(i,j)}!}\nonumber\\ &= &\frac{1}{Z^0_{\rm{liq}}} \int D{\boldsymbol{c}}^0 \, \mathrm{e}^{- H_{\rm{liq}}^0({\boldsymbol{c}}^0) + H_{\rm{norm}}({\boldsymbol{c}}^0)} \prod_{(i,j)} \sum_{k = 1}^{\infty}\frac{\hat{\mu}^k}{k!} \mathrm{e}^{ - k \left| {\boldsymbol{c}}_i - {\boldsymbol{c}}_j \right|^2/2 b^2}. \end{eqnarray} \tag{ 24 }$

The summation and product can readily be calculated, leading to the factor $\exp - H_X({\boldsymbol {c}}^0)$ , which precisely cancels the corresponding exponential factor. Hence, as expected, we have the normalization condition

$\begin{equation} \sum_{\chi} P_{\chi} = 1. \end{equation} \tag{ 25 }$

3.4. Averaging over network structures

We are now ready to compute the disorder-averaged free energy, equation (7). For this purpose, we use the following identity, which is commonly called the replica trick⁹:

$\begin{equation} \overline {\ln X} = \lim_{n \rightarrow 0} \frac{1}{n} \ln \overline{X^n}. \end{equation} \tag{ 26 }$

We use this to rewrite the average free energy as

$\begin{eqnarray} \overline{ F} &=& - T \lim_{n \rightarrow 0} \frac{1}{n} \ln \overline{Z_{\chi}^n}= - T \lim_{n \rightarrow 0} \frac{1}{n} \ln\sum_{\chi} P_{\chi} Z_{\chi}^n \nonumber\\ &=& - T \lim_{n \rightarrow 0} \frac{1}{n} \ln\sum_{\chi} \left( \prod_{(i,j)} \frac{\hat{\mu}^{k_{(i,j)}}}{k_{(i,j)}!} \right) \frac{1}{Z_{\rm{liq}}^0}\,\, \tilde{Z}_{0,\chi}\,(Z_{\chi})^n. \end{eqnarray} \tag{ 27 }$

We calculate the right-hand side of equation (27) for positive integers n, and subsequently analytically continue n to real values. More specifically, we calculate Zⁿ_χ by replicating the variables {c_i} a number n times, to obtain an n-tuple (c¹_i,c²_i,...,cⁿ_i). Together with the corresponding variable c⁰_i of the preparation ensemble, they form a (1 + n)-tuple $\hat {{\boldsymbol {c}}}_i \equiv ({\boldsymbol {c}}^{0}_i, {\boldsymbol {c}}^1_i,\ldots, {\boldsymbol {c}}^n_i )$ . The shorthand $\hat {{\boldsymbol {c}}}$ has been extensively used for this (1 + n)-tuple in the vulcanization theory literature. Using equations (6) and (20), the product of partition functions Z_0,χ(Z_χ)ⁿ in equation (27) can be written in terms of functional integrals (with the shorthand notation $D\hat {{\boldsymbol {c}}} \equiv \prod _{\alpha = 0}^n D {\boldsymbol {c}}^{\alpha }$ ) as follows:

$\begin{equation} Z_{0,\chi}\,( Z_{\chi})^n = \int D \hat{{\boldsymbol{c}}} \, \prod_{\alpha = 0}^n \, \mathrm{e}^{- \tilde{H}^0_{\rm{liq}} ({\boldsymbol{c}}^0) - \sum_{\alpha = 1}^n H_{\rm{liq}} ({\boldsymbol{c}}^{\alpha}) - \sum_{\alpha = 0}^n \Delta H_{\chi}({\boldsymbol{c}}^{\alpha}) }. \end{equation} \tag{ 28 }$

Next, by using the definition equation (4) we easily see that

$\begin{eqnarray} \sum_{\alpha = 0}^n \Delta H_{\chi}({\boldsymbol{c}}^{\alpha}) &=& \sum_{\alpha = 0}^n \frac{1}{2 b^2} \sum_{(i,j)} k_{(i,j)} \left| {{\boldsymbol{c}}}_{i} - {{\boldsymbol{c}}}_{j} \right| ^2 \nonumber\\ &=&\frac{1}{2 b^2} \sum_{(i,j)} k_{(i,j)} \left| {\hat{{\boldsymbol{c}}}}_{i} - \hat{{\boldsymbol{c}}}_{j} \right|^2 \nonumber\\ &\equiv& \Delta H_{\chi} (\hat{{\boldsymbol{c}}}), \end{eqnarray} \tag{ 29 }$

where we have used the notation $|\hat {{\boldsymbol {c}}}^{\alpha }|^2 \equiv \sum _{\alpha = 0}^n |{\boldsymbol {c}}_i^{\alpha }|^2$ . We can now substitute equation (28) back into equation (27), exchange the order in which $\sum _{\chi }$ and $\int D \hat {{\boldsymbol {c}}}$ are performed in equation (27), and sum over χ for a fixed configuration of $\hat {{\boldsymbol {c}}}$ . The summation over the realizations of network structure can be calculated using the following result:

$\begin{eqnarray} &&\fl \sum_{\chi} \mathrm{e}^{- \Delta H_{\chi}(\hat{{\boldsymbol{c}}})} \left( \prod_{(i,j)} \frac{\hat{\mu}^{k_{(i,j)}}}{k_{(i,j)}!} \right)= \sum_{\{k_{(i,j)} \}} \left( \prod_{(i,j)} \frac{\hat{\mu}^{k_{(i,j)}}}{k_{(i,j)}!} \right) \mathrm{e}^{- \frac{1}{2 b^2} \sum_{(i,j)} k_{(i,j)} \left| {\hat{{\boldsymbol{c}}}}_{i} - \hat{{\boldsymbol{c}}}_{j} \right|^2} \nonumber\\ &&= \prod_{(i,j)} \sum_{k} \left( \frac{\hat{\mu}^{k}}{k!} \right) \mathrm{e}^{- \frac{1}{2 b^2} k \left| {\hat{{\boldsymbol{c}}}}_{i} - \hat{{\boldsymbol{c}}}_{j} \right|^2} \end{eqnarray} \tag{ 30 }$

$\begin{eqnarray} &=&\prod_{(i,j)} \exp \left( \hat{\mu} \, \mathrm{e}^{- \frac{1}{2 b^2} k \left| {\hat{{\boldsymbol{c}}}}_{i} - \hat{{\boldsymbol{c}}}_{j} \right|^2} \right)\nonumber\\ &=& \exp \sum_{(i,j)} {\mu} \, \delta_b(\hat{{\boldsymbol{c}}}_i - \hat{{\boldsymbol{c}}}_j), \end{eqnarray} \tag{ 31 }$

where in the last equality, we have used the definitions

$\begin{equation} {\mu} \equiv \hat{\mu} (\sqrt{2\pi}/b)^{3(1+n)}, \end{equation} \tag{ 32 }$

$\begin{equation} \delta(\hat{{\boldsymbol{x}}}) \equiv \prod_{\alpha = 0}^n \delta_b({\boldsymbol{x}}^{\alpha}) = \delta_b({\boldsymbol{x}}^0) \delta_b({\boldsymbol{x}}^1) \cdots \delta_b({\boldsymbol{x}}^n). \end{equation} \tag{ 33 }$

As we shall eventually take the replica limit, n → 0, the difference between μ and $\tilde {\mu }$ (see equation (12)) can be ignored.

Combining all these results, we find that the disorder-averaged free energy is given as follows:

$\begin{equation} \overline{F} = - T\, \lim_{n \rightarrow 0} \frac{1}{n} \, \ln\left( \frac{1}{Z^0_{\rm{liq}}} \, Z_{1+n}\right), \end{equation} \tag{ 34a }$

$\begin{equation} Z_{1+n} = \int D\hat{{\boldsymbol{c}}} \, \mathrm{e}^{-H_{1+n}(\hat{{\boldsymbol{c}}}) }, \end{equation} \tag{ 34b }$

$\begin{equation} H_{1+n}(\hat{{\boldsymbol{c}}}) = H_{\rm{liq}}^0({\boldsymbol{c}}^0) - H_{\rm{norm}}({\boldsymbol{c}}^0) + \sum_{\alpha = 1}^n H_{\rm{liq}} ({\boldsymbol{c}}^{\alpha}) + H_{\rm{norm}} (\hat{\boldsymbol{c}}), \end{equation} \tag{ 34c }$

$\begin{equation} H_{\rm{norm}} (\hat{\boldsymbol{c}}) = - {\mu} \sum_{(i, j)}^N \delta_b \left( \hat{{\boldsymbol{c}}}_i - \hat{{\boldsymbol{c}}}_j\right). \end{equation} \tag{ 34d }$

The Hamiltonian $H_{\rm{norm}}[\hat {\boldsymbol {c}}]$ $H_{\rm{norm}}[\hat {\boldsymbol {c}}]$ represents an effective short-range attraction between all pairs of functional sites in the replicated coordinate space $\hat {\boldsymbol {c}} = \{ {\boldsymbol {c}}^{\alpha } \}_{n = 0}^n$ $\hat {\boldsymbol {c}} = \{ {\boldsymbol {c}}^{\alpha } \}_{n = 0}^n$ , resulting from cross-linking. This short-range interaction reduces to a delta function in the limit b → 0.

4. Physical quantities and correlation functions

We shall use the notation $\langle \Psi (\hat {{\boldsymbol {c}}}) \rangle _{1+n}$ to indicate averages of physical quantities with respect to the weight in the replicated partition function equation (34b):

$\begin{equation} \langle \Psi(\hat{{\boldsymbol{c}}}) \rangle_{1+n} = \frac{\int D \hat{{\boldsymbol{c}}}\Psi({\boldsymbol{c}}^0,{\boldsymbol{c}}^1,\ldots, {\boldsymbol{c}}^n)\mathrm{e}^{- H_{1+n}}}{\int D \hat{{\boldsymbol{c}}} \, \mathrm{e}^{- H_{1+n}} }. \end{equation} \tag{ 35 }$

When there is no risk of confusion, we shall neglect the subscript 1 + n on the average and simply write $\langle \Psi (\hat {{\boldsymbol {c}}}) \rangle$ . It is understood that at the end of the calculation of such quantities the replica limit n → 0 should always be taken.

In the replicated Hamiltonian H_1+n, equation (34c), only the last term, $H_{\rm{norm}}(\hat {{\boldsymbol {c}}})$ , couples together various replicas. It can be understood as a short-range attraction between all pairs of the 1 + n replicated positions $\hat {{\boldsymbol {c}}} = ({\boldsymbol {c}}^0,{\boldsymbol {c}}^1,\ldots,{\boldsymbol {c}}^n)$ . The strength of this attraction is controlled by the density of crosslinking μ. If μ = 0, i.e. the system is not crosslinked, H_norm = 0, and H_1+n reduces to a sum of liquid Hamiltonians, one for each replica:

$\begin{eqnarray*} &&H_{1+n}\rightarrow {H}^0_{\rm{liq}} ({\boldsymbol{c}}^0) + \sum_{\alpha = 1}^n H_{\rm{liq}} ({\boldsymbol{c}}^{\alpha}),\nonumber\\ &&Z_{1+n} \rightarrow Z^0_{\rm{liq}} \cdot \left(Z_{\rm{liq}}\right)^n, \nonumber\\ &&\overline{F}\rightarrow - T \ln Z_{\rm{liq}}. \end{eqnarray*}$

Variables in distinct replicas then become completely decoupled from one another, and the replicated partition function Z_1+n describes the equilibrium liquid in the preparation state together with n copies of equilibrium liquid in the measurement state, with no interactions amongst these liquids.

For systems with crosslinks, μ ≠ 0 and so there are non-vanishing correlations between the variables c⁰ and the other c^α. These correlations characterize the dependence of the properties of these systems on the method of preparation. Let us look systematically at the various physical quantities that can be constructed within the framework of an (1 + n)-replica theory, and relate them to the suite of correlators involving the preparation and measurement ensembles discussed in the introduction. Below, we use the shorthand notation A^α ≡ A(c^α) for physical quantities, and we frequently omit the subscript 1 + n on various expectation values, all to ease the notation.

1.
One-replica quantities: 〈A⁰〉 and 〈A^α〉 (α ≠ 0) are average physical quantities in the preparation state and in the measurement state, respectively. In the notation used in section 1 they are written as [A]_p and $[\overline {\langle A \rangle _{\rm{m}}}]_{\rm{p}}$ , respectively.There are also correlation functions in a single replica: 〈A⁰ B⁰〉 = [AB]_p and $\langle A^{\alpha }\,B^{\alpha } \rangle =[\overline {\langle A\,B \rangle _{\rm{m}}}]_{\rm{p}}$ (α ≠ 0), which quantify correlations in the preparation state and in the measurement state, respectively.
2.
Two-replica quantities: $\langle A^{\alpha }\,B^{\beta } \rangle =\overline { \langle A \rangle _{\rm{m}} \langle B \rangle _{\rm{m}}}$ (for α ≠ β;α,β ≠ 0) are non-vanishing in the presence of the quenched disorder arising from random crosslinking, and thus give the glassy correlations in the measurement state. The order parameter for the random solid state is an especially important example of such a multi-replica quantity.Memory correlation functions involving zeroth replica only once, $\langle A^0 B^{\alpha } \rangle _{\mathrm {c}} =[A \overline { \langle B \rangle _{\rm{m}}}]_{\rm{p}}$ (for α ≠ 0) measure the cross correlation between the preparation state and the measurement state. These memory effects are a hallmark property of randomly crosslinked materials. Vulcanization theory provide the unique framework via which they can be systematically calculated.

In the setting of liquid crystalline elastomers, the memory correlators are the quantities that can best distinguish systems prepared under distinct physical conditions, e.g. isotropic genesis nematic elastomers and nematic genesis nematic elastomers. Various correlation functions, expressed in both replica notation and physical notation, are summarized in table 2.

Table 2. Various correlation functions in vulcanization theory, showing the correspondence between physical notation and replica notation. (Note that in the table we have 0 ≠ α ≠ β ≠ 0.) In the replica notation, all averages are defined by equation (35).

	Preparation correlation	Measurement correlation	Glassy correlation	Memory correlation
Physical notation	[AB]_p	$[ \overline {\langle A B \rangle _{\rm{m}}}]_{\rm{p}}$	$[\overline {\langle A \rangle _{\rm{m}} \langle B \rangle _{\rm{m}}}]_{\rm{p}}$	$[A \overline {\langle B \rangle _{\rm{m}}}]_{\rm{p}}$
Replica notation	〈A⁰B⁰〉	〈A^αB^α〉	〈A^αB^β〉	〈A⁰B^α〉

4.1. Significance of the zeroth replica

We can use Z_1+n with appropriate source terms to calculate the average of a physical quantity A(c⁰) (≡ A⁰) that only depends on the degrees of freedom c⁰ of the preparation ensemble. As replicas 1,...,n play no role in such a calculation, we can take the replica limit n → 0 before the calculation of the expectation value. This is equivalent to deleting all variables c^α for α = 1,...,n from the replicated Hamiltonian (34c). In particular, we have

$\begin{eqnarray*} H_{\rm{norm}}(\hat{{\boldsymbol{c}}}) &\rightarrow& H_{\rm{norm}}({\boldsymbol{c}}^0), \nonumber\\ H_{1+n}(\hat{{\boldsymbol{c}}}) &\rightarrow& H_{\rm{liq}}^0({\boldsymbol{c}}^0). \end{eqnarray*}$

We also have to delete the integrals over c^α (for α = 1,...,n) in the functional integrals (34b) and (35). Therefore, we have

$\begin{equation} \langle A^0 \rangle = \frac{\int D \hat{{\boldsymbol{c}}} A^0\, \mathrm{e}^{- H_{1+n}}}{\int D \hat{{\boldsymbol{c}}} \, \mathrm{e}^{- H_{1+n}} } = \frac{\int D {\boldsymbol{c}}^0 \,A^0 \, \mathrm{e}^{- H^0_{\rm{liq}} ({\boldsymbol{c}}^0) } }{\int D {\boldsymbol{c}}^0 \, \mathrm{e}^{- H^0_{\rm{liq}} ({\boldsymbol{c}}^0) } } = \langle A \rangle_{\rm{liq}}^0 \equiv [ A ]_{\rm{p}}. \end{equation} \tag{ 36 }$

As a result, 〈A⁰〉 is indeed the average in the preparation state, as expected.

4.2. Replica limit and causality

In the replicated partition function, equation (34b), there is one copy of the preparation ensemble (the zeroth replica), and n copies of the measurement ensemble. In figure 3 we show a cartoon that shows schematically the relationship between various replicas. All 1 + n replicas interact with one another through the last term in equation (34c), which is linear in the crosslinking density. Note that there are n copies of the measurement ensemble (i.e. replicas 1,...,n) but only one copy of the preparation ensemble (i.e. replica 0). In general, owing to the interactions, the preparation and measurement ensembles can mutually influence each other. For example, if we tune some parameter J⁰ in fluid Hamiltonian H⁰_liq of the preparation state (see equation (34c)), there will be changes in the physical quantities in the measurement state

$\begin{equation} \frac{\partial}{\partial J^0} \left[\overline{\langle A \rangle_{\rm{m}}}\right]_{\rm{p}} \neq 0. \end{equation} \tag{ 37 }$

This is, of course, entirely reasonable, as the properties of polymer networks do depend on their method of preparation. Reciprocally, however, if we change a parameter J in the measurement state, might there, at least in principle, also be responses of physical quantities in the preparation state, e.g.

$\begin{equation} \frac{\partial}{\partial J} [A ]_{\rm{p}} \neq 0 \quad (?). \end{equation} \tag{ 38 }$

Such a response would be entirely unphysical, as it would violate the principle of causality. In the framework of vulcanization theory, the causality principle is rescued via the replica trick: as n → 0, the weight of the measurement ensemble relative to the preparation ensemble tends to zero, and therefore the former can have no quantitative influence on the latter.

**Figure 3.** Cartoon showing the logical relations between different replicas: the preparation ensemble and n copies of the measurement ensemble are coupled to one another via the crosslinking Hamiltonian $H_{\rm{norm}}(\hat {{\boldsymbol {c}}})$ . The relative weight of measurement ensemble vanishes in the replica limit, thus protecting the causality principle.
Download figure:
Standard image High-resolution image

**Figure 3.** Cartoon showing the logical relations between different replicas: the preparation ensemble and n copies of the measurement ensemble are coupled to one another via the crosslinking Hamiltonian $H_{\rm{norm}}(\hat {{\boldsymbol {c}}})$ . The relative weight of measurement ensemble vanishes in the replica limit, thus protecting the causality principle.
Download figure:
Standard image High-resolution image

An explicit proof of this result is simple. Let the parameter J couple linearly to some quantity Φ(c) in the (1 + n)-replica Hamiltonian H_1+n; this leads to the change

$\begin{eqnarray*} &&H_{1+n} \rightarrow H_{1+n} - J \sum_{\alpha = 1}^n \Phi({\boldsymbol{c}}^{\alpha}). \end{eqnarray*}$

Substituting this back into the expectation value equation (36), taking the derivative with respect to J, and finally setting J = 0, we obtain

$\begin{equation} \frac{\partial [ A ]_{\rm{p}} }{\partial J} = \sum_{\alpha = 1}^n \langle A^0 \Phi({\boldsymbol{c}}^{\alpha}) \rangle = n \langle A^0 \Phi({\boldsymbol{c}}^{1}) \rangle, \end{equation} \tag{ 39 }$

where in the last equality we have assumed that permutation symmetry amongst replicas 1 to n remains intact. The right-hand side of equation (39) therefore vanishes in the replica limit, i.e. n → 0. Even in the presence of the spontaneous breaking of this replica permutation symmetry, the right-hand side of equation (39) would even be proportional to n, and hence would vanish in the replica limit.

4.3. Deam–Edwards revisited

In the replicated Hamiltonian (34c), the parameters of the preparation state only appear in H⁰_liq, whereas the parameters of the measurement state only appear in H_liq. These two types of parameters can be separately controlled in experiments, but they jointly determine the statistical physics of randomly crosslinked materials. Let us consider a rather special case in which H⁰_liq = H_liq, i.e. a system that is prepared and measured under identical conditions. What appears inside the (1 + n)-replica Hamiltonian (34c) is, however, H⁰_liq(c⁰) − H_norm(c⁰) instead of just H⁰_liq(c⁰):

$\begin{equation} H_{1+n}(\hat{{\boldsymbol{c}}}) = H_{\rm{liq}}({\boldsymbol{c}}^0) - H_{\rm{norm}}({\boldsymbol{c}}^0) + \sum_{\alpha = 1}^n H_{\rm{liq}} ({\boldsymbol{c}}^{\alpha}) + H_{\rm{norm}} (\hat{\boldsymbol{c}}). \end{equation} \tag{ 40 }$

Consequently, the replica Hamiltonian H_1+n is not invariant under permutations of the replicas that mix the zeroth and the other replicas. Hence, the average of a quantity A in the preparation state is generically different from its average in the measurement state:

$\begin{eqnarray*} &&[ A ]_{\rm{p}} = \langle A^0 \rangle \neq \langle A^{\alpha} \rangle = \overline{ \langle A \rangle_{\rm{m}}}. \end{eqnarray*}$

This is completely reasonable, as the random network structure does affect physical quantities in the measurement state but cannot influence the preparation state. One simple example involves the nematic correlations in liquid crystalline elastomers that have been crosslinked in isotropic state. The random polymer network tends to disorder the nematic degrees of freedom, and therefore reduces the nematic correlations.

Historically, Deam and Edwards [1] chose a particular probability distribution P_χ for the network structure given by

$\begin{equation} P_{\chi}^{\rm{DE}} = \frac{\tilde{Z}_{0,\chi}^{\rm{DE}}}{Z^0_{\rm{liq}}} \prod_{(i,j)} \frac{\hat{\mu}^{k_{(i,j)}}}{k_{(i,j)}!}, \end{equation} \tag{ 41 }$

where

$\begin{equation} \tilde{Z}_{0,\chi}^{\rm{DE}} = \int D {\boldsymbol{c}}^0 \, \mathrm{e}^{-H_{\rm{liq}}^0({\boldsymbol{c}}^0) -\Delta H_{\chi}({\boldsymbol{c}}^0)}.\end{equation} \tag{ 42 }$

The corresponding (1 + n)-replica Hamiltonian is then given by

$\begin{eqnarray} H_{1+n}^{\rm{DE}}(\hat{{\boldsymbol{c}}}) &=&H_{\rm{liq}}({\boldsymbol{c}}^0)+ \sum_{\alpha = 1}^n H_{\rm{liq}} ({\boldsymbol{c}}^{\alpha})+ H_{\rm{norm}} (\hat{\boldsymbol{c}}), \nonumber\\ &= & \sum_{\alpha = 0}^n H_{\rm{liq}} ({\boldsymbol{c}}^{\alpha})+ H_{\rm{norm}} (\hat{\boldsymbol{c}}),\end{eqnarray} \tag{ 43 }$

in which all 1 + n replicas appear symmetrically. The resulting 1 + n replica theory then has an enlarged S_1+n permutation symmetry, rather than the S_n permutation symmetry that is mandatory for replica theories. The Deam–Edwards choice of disorder distribution is appealing, in that it simplifies the analysis considerably. However, it also leads to the unnatural consequence that the averages of arbitrary physical quantities in the preparation state are equal to their counterparts in the measurement state. The S_1+n permutation in the Deam–Edwards theory is therefore not mandatory, as first pointed out by Broderix et al [4]. The explicit breaking of the S_1+n permutation symmetry down to S_n permutation symmetry is physically natural, and opens the way to a systematic analysis of the interrelationship between randomly crosslinked materials and their histories of formation. This is a valuable and important element of the physics contained in vulcanization theory.

4.4. Comparison with spin glasses and other disordered systems

We end this section with a brief discussion of the difference between the vulcanization model and certain well-known spin glass models. Consider, e.g. the Edwards–Anderson model [5]:

$\begin{eqnarray*} &&H = - \sum_{\langle i,j\rangle} J_{ij} S_i S_j. \end{eqnarray*}$

The quenched random variables {J_ij} are typically assumed to have independent Gaussian statistics with vanishing means, and therefore averaging over them (after application of the replica method) is elementary. The final, replicated theory reads

$\begin{eqnarray*} &&H_{\mathrm{R}} = - \frac{1}{2} \sigma_J^2 \sum_{\langle i,j\rangle} \sum_{\alpha, \beta} S^{\alpha}_i S^{\alpha}_j S^{\beta}_i S^{\beta}_j \end{eqnarray*}$

and contains n replicas of the annealed degrees of freedom (i.e. the spins). The variance of the quenched variables σ_J appears only as a parameter in the replicated Hamiltonian. The quenched variables {J_ij} have been integrated out, and therefore do not show up in the replica theory. By contrast, in vulcanization it is not just expedient but indeed manifestly physical to employ an equilibrium preparation state to generate the statistics for the network structure. (For vulcanization, the quenched random variables are the network structure.) Furthermore, correlations between the preparation state variables and the measurement state variables encode the permanent memory effects, and these are experimentally measurable. Finally, we note that the Edwards–Anderson model represents a considerable idealization of disordered systems in real world. In general, the physics of most disordered systems does depend on the method of preparation. Hence, the proper statistical modeling of such systems requires a separate statistical ensemble.

5. Order parameter for vulcanization

The crosslinking-induced interaction (34d) couples distinct functional sites. It can be decoupled using the standard Hubbard–Stratonovich transformation. To proceed, we first notice that Gaussians are closed under convolution. Therefore, the following identity about the (1 + n)d-dimensional soft delta function $\delta _b({\hat {\boldsymbol {x}}})$ (defined in equations (10) and (33)) can readily be established:

$\begin{eqnarray*} &&\delta_b({\hat{\boldsymbol{x}}} -{\hat{\boldsymbol{y}}}) = \int \mathrm{d} {\hat{\boldsymbol{z}}} \, \delta_{b'} ({\hat{\boldsymbol{x}}} - {\hat{\boldsymbol{z}}}) \, \delta_{b'} ({\hat{\boldsymbol{z}}} - {\hat{\boldsymbol{y}}}), \quad\quad b' = b/\sqrt{2}. \end{eqnarray*}$

Equation (34d) can then be written as

$\begin{eqnarray*} H_{\rm{norm}} (\hat{\boldsymbol{c}}) &=&- {\mu} \int \mathrm{d} \hat{\boldsymbol{z}}\sum_{i=1}^N \delta_{b'}\left( \hat{{\boldsymbol{c}}}_i - \hat{\boldsymbol{z}}\right)\sum_{j=1}^N \delta_{b'}\left( \hat{{\boldsymbol{c}}}_j - \hat{\boldsymbol{z}}\right)\\ &=& - {\mu} \int \mathrm{d} \hat{\boldsymbol{z}} \,\omega({\hat{\boldsymbol{z}}},\hat{\boldsymbol{c}})^2, \end{eqnarray*}$

where

$\begin{equation} \omega({\hat{\boldsymbol{z}}},\hat{\boldsymbol{c}}) \equiv \sum_{i=1}^N \delta_{b'} \left( \hat{{\boldsymbol{c}}}_i - \hat{\boldsymbol{z}}\right) \end{equation} \tag{ 44 }$

is the microscopic collective density of functional sites, coarse grained by the Gaussian wave packet $\delta _{b'}({\hat {\boldsymbol {x}}})$ . It depends explicitly on the microscopic configuration of the replicated system. Our definition of the collective field differs from those in the other literature on vulcanization theory [6] via a multiplicative factor of N. We can now introduce an auxiliary field $\Omega ({\hat {\boldsymbol {x}}}) \equiv \Omega ({\boldsymbol {x}}^0,\ldots, {\boldsymbol {x}}^n)$ via the following Hubbard–Stratonovich transformation:

$\begin{eqnarray*} \mathrm{e}^{-H_{\rm{norm}} (\hat{\boldsymbol{c}})} &=& \mathrm{e}^{\, {\mu} \int \mathrm{d} \hat{\boldsymbol{z}} \, \omega({\hat{\boldsymbol{z}}},\hat{\boldsymbol{c}})^2} \\ &=& C \int D \Omega({\hat{\boldsymbol{x}}}) \, \mathrm{e}^{- {\mu} \int \mathrm{d}{\hat{\boldsymbol{x}}} \left( \Omega({\hat{\boldsymbol{x}}})^2 - 2 \, \omega({\hat{\boldsymbol{x}}}) \Omega({\hat{\boldsymbol{x}}})\right)}, \end{eqnarray*}$

where C is a trivial constant that will be neglected henceforth. Insertion of this result into equation (34b) leads to

$\begin{eqnarray} \fl Z_{1+n} &=& \int D \hat{{\boldsymbol{c}}} \int D \Omega({\hat{\boldsymbol{x}}}) \, \mathrm{e}^{ - H^0_{\rm{lid}}[{\boldsymbol{c}}^0] + H_X[{\boldsymbol{c}}^0] - \sum_{\alpha = 1}^n H_{\rm{liq}}^{\alpha}[{\boldsymbol{c}}^{\alpha}] - {\mu} \int \mathrm{d}{\hat{\boldsymbol{x}}} \, \Omega({\hat{\boldsymbol{x}}})^2 + 2 \,{\mu} \int \mathrm{d}{\hat{\boldsymbol{x}}} \, \omega({\hat{\boldsymbol{x}}})\, \Omega({\hat{\boldsymbol{x}}}) } \nonumber\\ \fl &=& \int D \Omega({\hat{\boldsymbol{x}}}) \, \mathrm{e}^{ - {\mu} \int \mathrm{d}{\hat{\boldsymbol{x}}} \, \Omega({\hat{\boldsymbol{x}}})^2 } \int D \hat{{\boldsymbol{c}}} \, \mathrm{e}^{ - H^0_{\rm{lid}}[{\boldsymbol{c}}^0] + H_X[{\boldsymbol{c}}^0] - \sum_{\alpha = 1}^n H_{\rm{liq}}^{\alpha}[{\boldsymbol{c}}^{\alpha}] + 2\,{\mu} \int \mathrm{d}{\hat{\boldsymbol{x}}} \, \omega({\hat{\boldsymbol{x}}})\, \Omega({\hat{\boldsymbol{x}}})}. \end{eqnarray} \tag{ 45 }$

Let us further define a modified liquid state partition function via

$\begin{equation} \tilde{Z}_{\rm{liq}}^0 \equiv \int D{\boldsymbol{c}}^0 \, \mathrm{e}^{- H^0_{\rm{liq}}[{\boldsymbol{c}}^0] + H_X[{\boldsymbol{c}}^0]} \end{equation} \tag{ 46 }$

and the average 〈 ··· 〉⁰_1+n as

$\begin{equation} \langle \Psi(\hat{{\boldsymbol{c}}}) \rangle^0_{1+n} \equiv \frac{1}{\tilde{Z}_{\rm{liq}}^0 \left( Z_{\rm{liq}}\right)^n} \int D \hat{{\boldsymbol{c}}} \, \Psi(\hat{{\boldsymbol{c}}})\, \mathrm{e}^{ - H^0_{\rm{lid}}[{\boldsymbol{c}}^0] + H_X[{\boldsymbol{c}}^0] - \sum_{\alpha = 1}^n H_{\rm{liq}}^{\alpha}[{\boldsymbol{c}}^{\alpha}] }. \end{equation} \tag{ 47 }$

Then we can rewrite equation (45) as

$\begin{equation} Z_{1+n} = \tilde{Z}_{\rm{liq}}^0 \left( Z_{\rm{liq}}\right)^n \int D \Omega({\hat{\boldsymbol{x}}}) \, \mathrm{e}^{- H_{\rm{eff}}(\Omega)}, \end{equation} \tag{ 48 }$

where the effective Hamiltonian H_eff(Ω) is defined as

$\begin{equation} H_{\rm{eff}}(\Omega) = {\mu} \int \mathrm{d}{\hat{\boldsymbol{x}}} \, \Omega({\hat{\boldsymbol{x}}})^2 - \ln \left\langle \mathrm{e}^{ 2\,{\mu} \int \mathrm{d}{\hat{\boldsymbol{x}}} \, \omega({\hat{\boldsymbol{x}}}) \Omega({\hat{\boldsymbol{x}}}) } \right\rangle_{1+n}^0. \end{equation} \tag{ 49 }$

We can now define the average of functions of Ω using the equilibrium partition function associated with H_eff(Ω):

$\begin{equation} \langle \Upsilon(\Omega)\rangle_{\rm{eff}} \equiv \frac{\int D \Omega \, \Upsilon(\Omega) \,\mathrm{e}^{- H_{\rm{eff}}(\Omega)}}{ \int D \Omega \, \mathrm{e}^{- H_{\rm{eff}}(\Omega)}}. \end{equation} \tag{ 50 }$

The following identity is a well-known property of the Hubbard–Stratonovich transformation:

$\begin{equation} \left \langle \Omega({\hat{\boldsymbol{x}}}) \right \rangle_{\rm{eff}} =\left \langle \omega({\hat{\boldsymbol{x}}},\hat{\boldsymbol{c}}) \right \rangle_{1+n} = \left\langle \sum_{i=1}^N \delta_{b^{\prime}}({\boldsymbol{x}}^0 - {\boldsymbol{c}}_i^0) \cdots \delta_{b^{\prime}}({\boldsymbol{x}}^n - {\boldsymbol{c}}_i^n) \right\rangle_{1+n} \end{equation} \tag{ 51 }$

which shows the physical significance of the collective field $\Omega ({\hat {\boldsymbol {x}}})$ , namely that its average is the joint probability density function of the following 1 + n events: in the preparation state there is a particle at some point x⁰, whereas in n independent measurements on the measurement state the same particle is found at positions x¹,...,xⁿ, respectively. Using the physical notation discussed in the introduction, this probability density can be expressed as

$\begin{equation} \left \langle \Omega({\hat{\boldsymbol{x}}}) \right \rangle_{\rm{eff}} = \sum_{i=1}^{N} \left[ \delta_{b^{\prime}}({\boldsymbol{x}}^0- {\boldsymbol{c}}_i) \overline{ \langle \delta_{b^{\prime}}({\boldsymbol{x}}^1- {\boldsymbol{c}}_i) \rangle_{\rm{m}} \cdots \langle \delta_{b^{\prime}}({\boldsymbol{x}}^n- {\boldsymbol{c}}_i) \rangle_{\rm{m}} } \right]_{\rm{p}}, \end{equation} \tag{ 52 }$

where, as usual, the subscripts p and m respectively denote the preparation and measurement states, and the overline indicates an average over realizations of crosslinkers. $\Omega ({\hat {\boldsymbol {x}}})$ encodes information about the localization of particles in the gel phase, and consequently is called the vulcanization order parameter. The definition of $\Omega ({\hat {\boldsymbol {x}}})$ in this work differs from that in some of the other vulcanization literature via a trivial additive constant and a multiplicative constant N.

5.1. One replica sector and marginal distributions

We can integrate out the n variables x¹,...,xⁿ in the order parameter (51), leaving only one variable x⁰, and thereby obtain a one-replica distribution

$\begin{equation} \langle \Omega^{0}({\boldsymbol{x}}^{0}) \rangle_{\rm{eff}} \equiv \int \prod_{\alpha = 1}^n \mathrm{d} {\boldsymbol{x}}^{\alpha} \langle \Omega(\hat{{\boldsymbol{x}}}) \rangle = \left \langle \sum_{i=1}^{N} \delta_{b^{\prime}}({\boldsymbol{x}}^0 - {\boldsymbol{c}}^0_i) \right \rangle_{1+n} = \left[ \rho({\boldsymbol{x}}^0) \right]_{\rm{p}}. \end{equation} \tag{ 53 }$

The resulting object is the average number-density of functional sites in the preparation state, which is manifestly an intensive quantity. At the coarse-grained level, it can also be understood as the average particle density of the liquid prior to crosslinking. The fluctuation δΩ⁰(x⁰) = Ω⁰(x⁰) − 〈Ω⁰(x⁰)〉_eff is also linearly related to the fluctuation of particle density in the preparation state. Similarly, we can obtain the αth replica distribution for α ≠ 0:

$\begin{equation} \langle \Omega^{\alpha}({\boldsymbol{x}}^{\alpha}) \rangle_{\rm{eff}} \equiv \int \prod_{\beta \neq \alpha}^n \mathrm{d} {\boldsymbol{x}}^{\beta} \langle \Omega(\hat{{\boldsymbol{x}}}) \rangle = \left \langle \sum_{i=1}^{N} \delta_{b^{\prime}}({\boldsymbol{x}}^{\alpha} - {\boldsymbol{c}}^0_i) \right \rangle_{1+n} = \left[ \overline{\langle \rho({\boldsymbol{x}}^{\alpha}) \rangle_{\rm{m}} }\right]_{\rm{p}}, \end{equation} \tag{ 54 }$

which describes average particle density in the measurement state. This relationship between the vulcanization order parameter and the particle densities in the preparation and measurement states is the primary reason that we choose the particular normalization of the order parameter field in equation (44).

For swollen gels and elastomers there can be large density fluctuations in the system (either quenched or thermal). These fluctuations, characterized by correlations of the one-replica distributions Ω^α(x^α), may strongly influence the elasticity of system. For unswollen gels and elastomers, the mean particle density at long enough lengthscales is usually uniform and its fluctuations are negligibly small. The one-replica part of free energy then contains no interesting physics, besides the stabilizing of the density fluctuations.

Finally, by integrating out n − 1 variables, we obtain the two-replica correlator

$\begin{equation} \langle \Omega^{\alpha \beta}({\boldsymbol{x}}^{\alpha} - {\boldsymbol{x}}^{\beta}) \rangle_{\rm{eff}} = \int \prod_{\gamma \neq \alpha,\beta}^n \mathrm{d} {\boldsymbol{x}}^{\gamma} \langle \Omega(\hat{{\boldsymbol{x}}}) \rangle \end{equation} \tag{ 55 }$

which encodes the system average of the correlation of the position fluctuations of a site in two independent measurements. Macroscopic translational symmetry dictates that this function depend only on the difference between two coordinates, x^α and x^β. Generically, it contains less information than the full order parameter $\Omega ({\hat {\boldsymbol {x}}})$ .

6. Landau theory and saddle-point approximation

Equations (48) and (50) provide a formal approach to calculate all relevant physical quantities. Our problem is therefore reduced to the calculation of a functional integral over $\Omega ({\hat {\boldsymbol {x}}})$ . At the saddle-point level we approximate the disorder-averaged free energy (34a) using the minimum of the effective Hamiltonian H_eff[Ω], equation (49). Even after making this approximation, the problem remains difficult, because we do not know the effective Hamiltonian (49) exactly, let alone its minimum. We therefore proceed to expand the exponential in equation (49) in a Taylor series in Ω. At this stage, it is convenient to shift the order parameter (51) by a constant N/V¹⁺ⁿ (where N is the total number of functional sites), so that it satisfies the condition

$\begin{equation} \int \mathrm{d} {\hat{\boldsymbol{x}}} \, \Omega({\hat{\boldsymbol{x}}}) = 0. \end{equation} \tag{ 56 }$

Both in the liquid phase and in the gel phase, the system is uniform in density. Therefore the saddle-point order parameter satisfies

$\begin{equation} \int \prod_{\beta ( \neq \alpha )} \mathrm{d} {\boldsymbol{x}}^{\beta} \Omega({\hat{\boldsymbol{x}}}) = \left[ \overline{\langle \rho({\boldsymbol{x}}^{\alpha}) \rangle_{\rm{m}} }\right]_{\rm{p}} - \frac{N}{V} = 0. \end{equation} \tag{ 57 }$

We therefore only have to consider the subspace of $\Omega ({\hat {\boldsymbol {x}}})$ in which equation (57) is obeyed. This subspace is usually called the higher replica sector.

The form of Landau free energy given in equation (49) is generic for all isotropic networks:

$\begin{equation} H_{\rm{HR}} =\int \mathrm{d}{\hat{\boldsymbol{x}}} \left\{ \frac{1}{2} \sum_{\alpha = 0}^n \left( \nabla^{\alpha} \Omega\right)^2 + \frac{1}{2} r \, \Omega^2 - \frac{1}{3} w \, \Omega^3 + \cdots \right\}. \end{equation} \tag{ 58 }$

The coefficients in the Taylor expansion do of course depend on the short-scale structure of the constituent particles. It may appear surprising that the cubic term in equation (58) has a negative coefficient, which seems to suggest that the Landau theory is unstable, at lest for uniform values of the order parameter. By equations (53) and (54), however, a large and uniform $\Omega ({\hat {\boldsymbol {x}}})$ also implies the large deviation of the particle densities from their equilibrium values in each replica (and therefore would violate equation (57)), and this would incur a large free-energy penalty. The Landau theory of vulcanization is therefore locally stable. Minimization of equation (58) leads to the following saddle-point equation:

$\begin{equation} - \sum_{\alpha = 0}^n\left(\nabla^{\alpha}\right)^{2} \Omega({\boldsymbol{x}}) + r \, \Omega({\hat{\boldsymbol{x}}}) - w \, \Omega^2({\hat{\boldsymbol{x}}}) = 0. \end{equation} \tag{ 59 }$

This equation should be solved subject to the constraint (57), and is therefore necessarily non-uniform in replicated space.

6.1. Order parameter at the saddle-point

By solving the saddle-point equation (59), subject to the constraint (57), we obtain the saddle-point value of the order parameter, which we denote by $\overline {\Omega }({\hat {\boldsymbol {x}}})$ . For r > 0, the saddle-point value of $\overline {\Omega }$ is trivially zero, corresponding to the liquid state. For r < 0, the trivial saddle-point becomes unstable, and a non-trivial (i.e. replica-space dependent) saddle-point value emerges, corresponding to the amorphous solid state. It has the following form of superposed Gaussians¹⁰:

$\begin{equation} \overline{\Omega}({\hat{\boldsymbol{x}}}) = q\, \int \mathrm{d}{\boldsymbol{z}} \int \mathrm{d} \tau \, p(\tau) \, \left({2 \pi}{\tau}\right)^{(1+n)d/2} \mathrm{e}^{- \frac{\tau}{2}\sum_{\alpha = 0}^n({\boldsymbol{x}}^{\alpha} - {\boldsymbol{z}})^2} - \frac{\,\,q }{\quad V^{n}}. \end{equation} \tag{ 60 }$

The coefficient q is identified with the number density of the infinite cluster (i.e. the gel fraction). The dummy parameter τ is the inverse variance of the Gaussian fluctuations from their equilibrium positions of the localized particles (belonging to the infinite cluster), and is called the inverse square localization length. The fact that there is a distribution of τ signifies the heterogeneous nature of randomly crosslinked systems: the infinite cluster is more rigid in some places and looser in others. Both q and p(τ) are determined by solving the saddle-point equation. For details, see [6, 7]. It is important to note that they are independent of strain deformation and all other parameters that are tunable in the measurement state. The physics of the saddle-point order parameter can be better understood by focusing on the integrand inside equation (60). The vector z can be interpreted as the mean position of a particle in the gel fraction, and is localized around x⁰, the position of the particle at the moment of crosslinking. After crosslinking, the particle in the gel fraction (x^α,α = 1,...,n) is localized around z, its mean location in the measurement state. The uniform integral over z implies that the gel is statistically homogeneous, after averaging over the quenched disorder.

If the crosslinker density is not high enough, the system is in the sol phase in the measurement state, no infinite cluster emerges, and all particles are delocalized, so that there is no correlation between their positions in different measurements, performed in the preparation and measurement states. Hence, the associated joint pdf is a trivial constant, and the average order parameter is vanishes identically. By contrast, in the gel phase, there is an infinite cluster which constitutes a finite percentage of the overall mass. For any particle in this infinite cluster, different measurements of its position are necessarily correlated: knowledge of its location in the preparation state tells us information of its whereabout in the measurement state, and hence the average order parameter in equation (51) is non-zero. The order parameter $\Omega ({\hat {\boldsymbol {x}}})$ thus distinguishes the gel phase from the sol phase.

We can actually deduce the functional form of the saddle-point order parameter (60) using the definition of the order parameter (52) (after subtracting a trivial constant, so that the constraint (56) is satisfied) together with some simple, intuitive reasoning. We have already noted that particles belonging to the liquid fraction do not contribute to the saddle-point order parameter. For a particle i belonging to the infinite cluster, we expected that its position c_i in the measurement state is Gaussian-distributed around its mean value, which we denote by z_i, with variance 1/τ_i. The thermal averages inside equation (52) combine to give

$\begin{equation} \langle \delta_{b^{\prime}}({\boldsymbol{x}}^1- {\boldsymbol{c}}_i) \rangle_{\rm{m}} \cdots \langle \delta_{b^{\prime}}({\boldsymbol{x}}^n- {\boldsymbol{c}}_i) \rangle_{\rm{m}} = (2 \pi \tau_i)^{nd/2}\, \exp \left\{- \frac{\tau_i}{2} \sum_{\alpha = 1}^n |{\boldsymbol{x}}^{\alpha} - {\boldsymbol{z}}_i |^2\right\}. \end{equation} \tag{ 61 }$

The parameters z_i and τ_i of course depend on the microscopic configuration of the preparation state at the instant of crosslinking, as well as on the crosslinks that are realized. It is important to note that the mean position z_i is distinct from the position of the particle at the instant of crosslinking, which we denote by c⁰_i. Although, for a given configuration of preparation state and crosslinker realization, z_i is fully determined, it becomes a statistical variable after we include the various crosslinker realizations. In fact, we expect that z_i is Gaussian-distributed around the particle position c⁰_i in the preparation state, with the same variance 1/τ_i. Also, for different realizations of crosslinkers, the variance 1/τ_i itself should also be different, obeying some distribution p(τ). An ergodic hypothesis suggests that this distribution is reflected in the microscopic spatial heterogeneity that individual samples of elastomer exhibit. All these considerations suggest that

$\begin{eqnarray} &&\fl \overline{ \langle \delta_{b^{\prime}}({\boldsymbol{x}}^1- {\boldsymbol{c}}_i) \rangle_{\rm{m}} \cdots \langle \delta_{b^{\prime}}({\boldsymbol{x}}^n- {\boldsymbol{c}}_i) \rangle_{\rm{m}} }= \int \mathrm{d} {\boldsymbol{z}}\, \int \mathrm{d}\tau\, (2 \pi \tau)^{(1+n)\,d/2}\,p(\tau)\nonumber\\ &&\times \exp \left\{ - \frac{\tau}{2} \sum_{\alpha = 1}^n | {\boldsymbol{x}}^{\alpha} - {\boldsymbol{z}}|^2 - \frac{\tau}{2} |{\boldsymbol{c}}_i^0- {\boldsymbol{z}}|^2 \right\}. \end{eqnarray} \tag{ 62 }$

Finally, we note that in the preparation state the system is a homogeneous liquid and thus translation invariant. Hence, the position of a given particle c₀ is uniformly distributed over space, and therefore for an arbitrary function f(c⁰_i) we have

$\begin{equation} \left[ f({\boldsymbol{c}}_i^0) \right]_{\rm{p}} = \int \frac{\mathrm{d}{\boldsymbol{c}}}{V}\,f({\boldsymbol{c}}). \end{equation} \tag{ 63 }$

We now substitute equation (62) back into equation (52) and carry out the average over the preparation state and also sum over all of the particles. Thus, we finally arrive at the precise form of the saddle-point order parameter given in equation (60).

6.2. Translational symmetry and order parameter

The 1 + n replicated Hamiltonian H_1+n, equation (34c), possesses the symmetry of independent translations of the replicas, i.e. it is invariant under the α-dependent translations:

$\begin{eqnarray*} &&{\boldsymbol{c}}_i^{\alpha} \rightarrow {\boldsymbol{c}}_i^{\alpha} + {\bf u}^{\alpha}, \quad i = 1, \ldots, N. \end{eqnarray*}$

This symmetry is also possessed by the average order parameter $\overline {\Omega }({\hat {\boldsymbol {x}}})$ in the liquid phase (which has the value zero). In the gel phase, however, this translational symmetry is explicitly broken by the saddle-point order parameter, equation (60). Translations of all particles in one replica leads to a distinct but energetically degenerate order parameter

$\begin{eqnarray*} &&\overline{\Omega}({\boldsymbol{x}}^0, \ldots, {\boldsymbol{x}}^{\alpha} + {\boldsymbol{u}}, \ldots {\boldsymbol{x}}^n) \neq \overline{\Omega}({\boldsymbol{x}}^0, \ldots, {\boldsymbol{x}}^{\alpha}, \ldots {\boldsymbol{x}}^n). \end{eqnarray*}$

This reduction in symmetry is a consequence of the localization of the particles associated with the emergence of an infinite cluster. Nevertheless, translations of all replicas by a common vector u remains a symmetry for equation (60):

$\begin{eqnarray*} &&\overline{\Omega}({\boldsymbol{x}}^0,{\boldsymbol{x}}^1, \ldots, {\boldsymbol{x}}^n) = \overline{\Omega}({\boldsymbol{x}}^0 + \boldsymbol{u},{\boldsymbol{x}}^1 + \boldsymbol{u}, \ldots, {\boldsymbol{c}}^n + \boldsymbol{u}). \end{eqnarray*}$

This macroscopic translational invariance [6] reflects the fact that gels and elastomers are statistically homogeneous. This symmetry can be broken by translational ordering, such as smectic ordering in liquid crystalline elastomers.

6.3. Classical theory of the elasticity of rubber

Because rubbery materials can typically sustain large deformations, their elasticity theory is necessarily nonlinear. This is already apparent in the classical theory of rubber elasticity, which gives the elastic free energy

$\begin{equation} H(\boldsymbol{\Lambda}) = \frac{\mu}{2} {\rm{Tr}\,} \boldsymbol{\Lambda}^{\rm{T}} \boldsymbol{\Lambda} \end{equation} \tag{ 64 }$

in terms of the (spatially uniform) deformation gradient matrix Λ. The matrix Λ relates the undeformed positions of particles belonging to the infinite cluster to their deformed positions z' ≡ Λ·z. As most rubbery materials are nearly incompressible, det Λ can be taken to be unity. This is largely responsible for the nonlinear nature of rubber elasticity. In the framework of statistical physics, the positions of particles fluctuate, and this deformation relation must be understood in the sense of relocation of the mean positions. It is reasonable to suppose that the saddle-point order parameter of the deformed state is given by

$\begin{equation} \overline{\Omega}_{\boldsymbol{\Lambda}}({\hat{\boldsymbol{x}}}) = q\, \int_z \int_{\tau} \mathrm{e}^{- \frac{\tau}{2} ({\boldsymbol{x}}^0 - {\boldsymbol{z}})^2 - \frac{\tau}{2}\sum_{\alpha = 1}^n |{\boldsymbol{x}}^{\alpha} - \boldsymbol{\Lambda} {\boldsymbol{z}}|^2} - \frac{\,\,q}{\quad V^{n}}. \end{equation} \tag{ 65 }$

Note that the mean position z is affinely deformed in the measurement state (replicas 1,...,n), but not in the preparation state (replica 0), and that q and p(τ) are not deformed (i.e. do not depend on Λ). Deformation of zeroth replica can always be 'gauged away' via a coordinate transform of the dummy integration variable z, and hence has no physical significance. It is the relative deformation between the preparation and measurement ensembles that has physical significance.

While it is rather obvious that the gel fraction q should not change with deformation, it is a priori not clear why the distribution of localization length p(τ) should stay the same in the course of deformation. Nevertheless, it has been shown explicitly [7] that equation (65) indeed satisfies the saddle-point equation (59) and the boundary conditions for a solid having a uniform deformation Λ. Furthermore, by inserting equation (65) into the Landau free energy (58)¹¹ it is straightforward to obtain the elastic free energy of the deformed system, as given in equation (64), i.e. the classical theory of rubber elasticity [8]. Moreover, one finds that μ∝|r|³ for the shear modulus¹². It is rather interesting to see that classical rubber elasticity theory emerges at the saddle-point level of vulcanization theory.

6.4. Nematic elastomers and neo-classical elasticity theory

In nematic elastomers, the polymer chains carry liquid crystalline mesogens that are prone to undergoing nematic ordering. The mesogens can either be parts of the polymer backbone (i.e. main-chain nematic polymers) or attached to the backbone side-on (i.e. side-chain nematic polymer). A Landau theory of nematic elastomers has be derived via vulcanization theory elsewhere [7]; here we simply quote the results.

Let the (spatially uniform) nematic order parameter be Q⁰ in the preparation state and Q in the measurement state. The free energy (58) needs to be modified to include possible couplings between the nematic and vulcanization order parameters. To lowest order in these order parameters, the couplings are

$\begin{equation} \int \mathrm{d} {\hat{\boldsymbol{x}}} \left\{ {\boldsymbol{Q}}^0_{ij} \,\, \nabla_i^0\Omega \,\, \nabla_j^0\Omega +\sum_{\alpha=1}^{n} {\boldsymbol{Q}}_{ij}\,\, \nabla_i^{\alpha}\Omega \,\, \nabla_j^{\alpha}\Omega \right\}. \end{equation} \tag{ 66 }$

These couplings can be incorporated into the isotropic term in equation (58) to yield

$\begin{equation} \fl H_{\mathrm HR} =\int \mathrm{d}{\hat{\boldsymbol{x}}} \left\{ \frac{1}{2} \boldsymbol{l}^0_{ij}\, \nabla^{0}_i \Omega\, \nabla^{0}_j \Omega + \frac{1}{2} \sum_{\alpha = 1}^n \boldsymbol{l}_{ij}\, \nabla^{\alpha}_i \Omega\, \nabla^{\alpha}_j \Omega + \frac{1}{2} r \, \Omega^2 - \frac{1}{3} w \, \Omega^3 + \cdots \right\}, \end{equation} \tag{ 67 }$

where

$\begin{equation} \boldsymbol{l}^0 \equiv \boldsymbol{I} + {\boldsymbol{Q}}^0\quad\mathrm{and} \quad \boldsymbol{l} \equiv \boldsymbol{I} + {\boldsymbol{Q}} \end{equation} \tag{ 68 }$

are the polymer step-length tensors in the preparation and measurement states, respectively.

The general saddle-point in the presence of a macroscopic deformation gradient Λ, which we denote by $\overline {\Omega }_{\boldsymbol {\Lambda }}({\hat {\boldsymbol {x}}})$ , can also be obtained, by minimizing the free energy (67):

$\begin{equation} \overline{\Omega}_{\boldsymbol{\Lambda}}({\hat{\boldsymbol{x}}}) = q\, \int_z \int_{\tau} \exp\left\{- \frac{\tau}{2} {\boldsymbol{y}}^0 \cdot \boldsymbol{l}^0 \cdot {\boldsymbol{y}}^0 - \frac{\tau}{2}\sum_{\alpha = 1}^n {\boldsymbol{y}}_{\boldsymbol{\Lambda}}^{\alpha} \cdot \boldsymbol{l} \cdot {\boldsymbol{y}}_{\boldsymbol{\Lambda}}^{\alpha}\right\} - \frac{\,\,q}{\quad V^{n}}, \end{equation} \tag{ 69 }$

where

$\begin{equation} {\boldsymbol{y}}^0 \equiv {\boldsymbol{x}}^0 - {\boldsymbol{z}} \quad{\rm and}\quad {\boldsymbol{y}}^{\alpha}_{\boldsymbol{\Lambda}} \equiv {\boldsymbol{x}}^{\alpha} - \boldsymbol{\Lambda} \cdot {\boldsymbol{z}}. \end{equation} \tag{ 70 }$

Note that the Gaussian fluctuations of the particle positions have variance matrices l⁰ in the preparation state and l in the measurement state. The effect of nematic order is therefore to render the thermal position fluctuations of each molecule anisotropic.

Substituting this deformed saddle-point (69) back into equation (67), and carrying out the prescription specified in equation (34a), we find the following elastic free energy for nematic elastomers:

$\begin{equation} H(\boldsymbol{\Lambda}) = \frac{\mu}{2}\,{\rm{Tr}\,}\,\boldsymbol{l}^0 \boldsymbol{\Lambda}^{\rm{T}} \boldsymbol{l}^{-1} \boldsymbol{\Lambda} \end{equation} \tag{ 71 }$

which is precisely the neo-classical elasticity theory for nematic elastomers, derived by Warner and Terentjev [3, 9] as a generalization of the classical theory of rubber elasticity.

7. Concluding remarks: why vulcanization theory?

It is quite satisfactory that the classical theory of rubber elasticity and the neo-classical elasticity theory of nematic elastomers can be derived from the vulcanization theory as saddle-point approximations. These theories were originally established by studying the statistics of single polymers inside a network, with two key assumptions [3, 8]: (i) that strain deformations are affine at all scales; and (ii) that the polymer statistics are Gaussian. These theories have attained classical status because of their remarkable successes in explaining the basic properties of isotropic and nematic elastomers. Nevertheless, the starting point of single polymer statistics substantially restricts the generalizability of these theories. For example, it is hard to see how one might systematically explore the nature of heterogeneities in random polymer networks using these molecular-level approaches.

Vulcanization theory was developed to understand the heterogeneities of randomly crosslinked systems. In the face of its (perhaps heavy!) machinery—such as field theory, the replica technique and multiple statistical ensembles—if all that could be obtained from it were re-derivations of classical results then it would be hard to argue that vulcanization theory has been worth investing in. Fortunately, however, several fundamental new results have already been obtained using vulcanization theory, concerning the heterogeneity of rubbery materials: (i) the distribution of localization lengths p(τ) in isotropic networks [10]; (ii) the distribution of internal random stresses in isotropic networks [11, 12]; and (iii) the statistics of random fields and memory effects in isotropic-genesis nematic elastomers [13]. We refer the reader to the literature for further details about these topics. We think it is fair to say that we are still in an early stage in the unraveling of the statistical physics of network heterogeneities and memory effects in randomly crosslinked materials. We hope that fresh and interesting results will be available to report in the near future.

Acknowledgments

We thank our numerous co-workers and several constructive anonymous referees for valuable collaborations and guidance. This work was supported by NSFC (China) via grant numbers 11174196 and 91130012, by the US NSF via DMR 09-06780 and DMR 12 07026, and by the Institute for Complex Adaptive Matter.

Generalized Deam–Edwards approach to the statistical mechanics of randomly crosslinked systems

Article metrics

Author e-mails

Author affiliations

Author notes

Dates

Abstract

1. Introduction

2. Model of linked particles

3. Generalized Deam–Edwards distribution of network structure

3.1. Statistics of connectivity between a pair of sites

3.2. Statistics of the connectivity of the network

3.3. Normalization of probabilities

3.4. Averaging over network structures

4. Physical quantities and correlation functions

4.1. Significance of the zeroth replica

4.2. Replica limit and causality

4.3. Deam–Edwards revisited

4.4. Comparison with spin glasses and other disordered systems

5. Order parameter for vulcanization

5.1. One replica sector and marginal distributions

6. Landau theory and saddle-point approximation

6.1. Order parameter at the saddle-point

6.2. Translational symmetry and order parameter

6.3. Classical theory of the elasticity of rubber

6.4. Nematic elastomers and neo-classical elasticity theory

7. Concluding remarks: why vulcanization theory?

Acknowledgments

Footnotes

Generalized Deam–Edwards approach to the statistical mechanics of randomly crosslinked systems

Article metrics

Share this article

Author e-mails

Author affiliations

Author notes

Dates

Abstract

1. Introduction

2. Model of linked particles

3. Generalized Deam–Edwards distribution of network structure

3.1. Statistics of connectivity between a pair of sites

3.2. Statistics of the connectivity of the network

3.3. Normalization of probabilities

3.4. Averaging over network structures

4. Physical quantities and correlation functions

4.1. Significance of the zeroth replica

4.2. Replica limit and causality

4.3. Deam–Edwards revisited

4.4. Comparison with spin glasses and other disordered systems

5. Order parameter for vulcanization

5.1. One replica sector and marginal distributions

6. Landau theory and saddle-point approximation

6.1. Order parameter at the saddle-point

6.2. Translational symmetry and order parameter

6.3. Classical theory of the elasticity of rubber

6.4. Nematic elastomers and neo-classical elasticity theory

7. Concluding remarks: why vulcanization theory?

Acknowledgments

Footnotes