Incorporating age and delay into models for biophysical systems

Wasiur R KhudaBukhsh; Hye-Won Kang; Eben Kenah; Grzegorz A Rempała

doi:10.1088/1478-3975/abc2ab

1. Introduction

We consider biophysical systems described by a set of chemical reactions. The chemically identical molecular entities in the system are called (chemical) species. A chemical reaction refers to the event of creation, annihilation, or conversion of a number of molecules of one or more species. Here, we assume the system is well mixed spatially in that a randomly chosen molecule of a species has an equal chance to chemically interact with any other molecule of any species in the system. A continuous time Markov chain (CTMC) is a natural choice to model the species copy numbers of such systems.

When modeling chemical reaction networks (CRNs) stochastically using CTMCs, one assumes that every reaction occurs instantaneously after an exponentially distributed amount of time. Whenever a reaction takes place, we update the system state. A random time-change representation of the Poisson process is often used to write the trajectory equations and to analyze the system dynamics [1–4]. The sample paths of the CTMC are simulated exactly using the Doob–Gillespie's stochastic simulation algorithm (SSA) [5–7] or the next reaction method by Gibson and Bruck [8].

1.1. Delays are inherent and a useful model reduction tool

It has been reported that some biological processes do not take place instantaneously. In other words, there is a time lag between the initiation and the completion of the process. Time delays are observed inherently in many biological systems, such as gene transcription [9–11] and translation [12], cell cycle in cancer treatment [13], intracellular viral dynamics [14, 15], control of infectious diseases [16], population growth [17, 18], RNA and protein folding [19, 20], and enzyme catalyzed reactions [21, 22]. Sometimes time delays are introduced purposefully as a useful means to reduce model complexity and to compensate for the lack of experimental observation in both deterministic and stochastic models of biological processes.

Intermediate, ancillary processes or unobserved reactions can be replaced by time delays. For example, production of hes1 mRNA from hes1 gene has been modeled using delay differential equations where detailed mRNA synthesis and processing steps are replaced by a time delayed reaction [23]. While modeling the mammalian circadian clock, intermediate protein dynamics can be simplified as transcriptional feedback loops with time delayed variables in delay differential equations [24]. In enzyme catalyzed reactions with multiple intermediates, the production of the final product can be expressed as a distributed delay equation, which is a useful tool when measurements on multiple intermediates in the experiment are not available [25].

Introduction of time delays as a model reduction technique has also been applied in discrete stochastic models for CRNs. For instance, model complexity of unimolecular reaction networks is reduced by generating delay distributions with key model features that are derived by computing first passage times of target species [26]. In [27], the production of yellow fluorescent protein has been described using a time-delayed birth and death process where a randomly distributed time delay was generated to simplify a sequence of steps in gene activation.

1.2. Our contribution

In most previous works in this area, the focus was on investigating stochastic models for CRNs with constant or random time delays. In those models, the probability that a reaction occurs within the next short interval of time is commonly described by a propensity (also known as intensity) function of the reaction. The waiting time for non-delayed reactions is exponentially distributed [28]. In practice, the occurrence of some reactions is not only determined by the molecular counts of the reactants but also affected by the age distributions or lifetimes of the reactant molecules. For example, mRNA decay rates vary depending on the age of each mRNA. Moreover, the age of the mRNAs determines polysome size distributions and protein synthesis rates in translation ([29, 30], chapters 3 and 5 in [31]). It was also reported that an mRNA tail length distribution depends on the average age of mRNA population and that the tail-length distribution plays a significant role in deadenylation and decay dynamics of mRNA populations [32, 33].

When time delays are used to aggregate out ancillary or unobserved processes and reduce model complexity, it makes more sense that the length of time delay depends on the age of each reactant molecule (e.g., mRNA, protein, and enzymes). Therefore, it is worthwhile to consider an individual-based age-structured stochastic model for CRNs.

In this work, we develop a way to describe CRNs with random time delays and non-delayed reaction rates incorporating the age of each reactant and making use of hazard functions in survival analysis [34, 35]. See appendix A for some preliminaries on relevant mathematical and statistical concepts. In our approach, the hazard functions are set as constant, time-dependent, or age-dependent functions generalizing the notion of reaction rate constants in propensity functions. Our model keeps track of the age of each reactant molecule and provides a new way to express time delays in non-Markovian models. Moreover, the method also allows us to describe discrete stochastic CRNs with constant or random time delays without age dependence, as considered in previous works. We study the large-volume limit of the proposed non-Markov CRN and provide a mean-field PDE limit for the age densities by virtue of the law of large numbers (LLN), as opposed to an ODE limit in the classical theory. The PDE limit is based on existing results in the literature [36, 37] and follows from the standard limit theory for measure-valued Markov processes. However, novel usage of the PDE limit can provide further approximations and pave the way for efficient simulation algorithms. For the sake of illustration, we show how the PDE limit can be used to approximate mean first passage times (MFPTs) in the context of CRNs. As another by-product of the LLN, we show how an efficient (fast) hybrid simulation algorithm can be devised when a subset of the CRN is abundantly available, giving a flavor of multiscale approximation. Finally, as simple applications of our approach, we briefly discuss a prokaryotic auto-regulation and the quasi-steady state approximation (QSSA) in the context of the Michaelis–Menten enzyme kinetic reactions. Numerical examples have been provided wherever deemed necessary. For the sake of ready usage of our methods, the Julia scripts used in the numerical examples have been made available via a GitHub repository [38].

The following notational conventions are adhered to throughout the paper. We use 1 _{A}(x) to denote the indicator (or characteristic) function of a set A, i.e., 1 _{A}(x) = 1 if and only if x ∈ A. Given a suitable space E, let D([0, ∞), E) (or D([0, T], E)) denote the space of E-valued càdlàg functions defined on [0, ∞) (or [0, T], for some T > 0). The set of Borel subsets of a set A will be denoted by $\mathcal{B}\left(A\right)$ . The set of natural numbers are denoted by $\mathbb{N}$ . The set of real numbers is denoted by $\mathbb{R}$ . Other notations will be introduced as and when needed.

2. The simplest model with a delay

Let us consider a simple CRN with two chemical species A and B. First, we shall describe the standard Markovian approach and then introduce an age structure to allow non-exponential holding times. The following network describes the production and the degradation of A along with a conversion from A to B

$\begin{equation}\begin{aligned}\hfill \varnothing \enspace {\rightarrow }^{b}\enspace A& {\rightarrow }^{\tau }\enspace B,\hfill \\ \hfill A\enspace {\rightarrow }^{d}\enspace \varnothing & ,\hfill \end{aligned}\end{equation} \tag{ 2.1 }$

where b, τ, and d, depending on whether we are in the Markovian or non-Markovian setup, will be either reaction rate constants or hazard functions for the production of A, the conversion from A to B, and the degradation of A, respectively.

An example similar to the CRN in equation (2.1) was investigated in some previous works with time delays [39, 40]. It is worth noting that the simplistic CRN described in equation (2.1) can be thought of as a model reduction of a more complex CRN. For instance, a series of conversion type reactions

$\begin{equation}A\enspace {\rightarrow }^{{k}_{1}}\enspace {A}_{1}\enspace {\rightarrow }^{{k}_{2}}\enspace {A}_{2}\enspace {\rightarrow }^{{k}_{3}}\enspace \cdots \enspace {\rightarrow }^{{k}_{n}}\enspace B\end{equation} \tag{ 2.2 }$

can be described by a single conversion reaction A ⟶^τ B with an appropriate hazard function τ. For the sake of illustration, let us assume we are in the Markovian setup so that k₁, k₂, ..., k_n are positive constants. We can interpret the CRN in equation (2.2) as follows: one molecule of A gets transformed into a molecule of A₁ after an exponentially distributed (with rate k₁) amount of time. Then, the molecule of A₁ gets transformed into a molecule of A₂ after an exponentially distributed (with rate k₂ this time) amount of time. This process goes on until the molecule finally gets transformed into a molecule of B from a molecule of A_n−1. Therefore, from the perspective of a single A molecule, the amount of time required for the molecule to finally get transformed into a molecule of B is the sum of those exponentially distributed amounts of times (with rates k₁, k₂, ..., k_n). Under independence, the probability distribution of the total amount of time required for a single A molecule to get transformed into a B molecule can be described by a convolution of the individual exponential distributions. Denoting the corresponding hazard function by τ, one can describe the CRN in equation (2.2) by a single conversion reaction A ⟶^τ B. Similarly, a series of birth–death-conversion type reactions

$\begin{equation*}\begin{aligned}\hfill \varnothing \enspace {\rightarrow }^{b}\enspace A\enspace {\rightarrow }^{{k}_{1}}\enspace {A}_{1}\enspace {\rightarrow }^{{k}_{2}}\enspace {A}_{2}\enspace {\rightarrow }^{{k}_{3}}\enspace \cdots \enspace {\rightarrow }^{{k}_{n}}\enspace B,& \hfill \\ \hfill A\enspace {\rightarrow }^{d}\enspace \varnothing ,{A}_{1}\enspace {\rightarrow }^{{d}_{1}}\enspace \varnothing ,{A}_{2}\enspace {\rightarrow }^{{d}_{2}}\enspace \varnothing ,\dots ,{A}_{n}\enspace {\rightarrow }^{{d}_{n}}\enspace \varnothing \end{aligned}\end{equation*}$

can be approximated by a single birth type reaction ∅ ⟶^τ B with an appropriate hazard function τ. Therefore, even a simplistic model such as the CRN in equation (2.1) covers a nontrivial class of CRNs and builds the foundation for studying more complex CRNs.

2.1. Standard Markov approach

The standard way to model the CRN in equation (2.1) is to use a CTMC to describe the counts of molecules of the species A and B over time. In such a model, whenever each reaction fires, the consumption and the production of molecules are instantaneous. Let ${\tilde {X}}_{A},\;{\tilde {X}}_{B}$ denote the stochastic processes counting the copy numbers of the species A and B respectively. Here, the quantities b, τ, and d are reaction rate constants. The propensity functions corresponding to the three chemical reactions are defined as

$\begin{align*}\hfill & {\lambda }_{b}\left(t\right)=b,\hspace{25.0pt}{\lambda }_{\tau }\left(t\right)=\tau {\times}{x}_{A}\left(t\right),\hspace{25.0pt}\hfill \\ \hfill & {\lambda }_{d}\left(t\right)=d{\times}{x}_{A}\left(t\right),\hfill \end{align*}$

where x_i(t) denotes the number of molecules of the chemical species i at time t, for i = A, B. Define T_k to be the waiting time until the next reaction of type birth (k = b), conversion (k = τ), and death (k = d). Then, T_k is exponentially distributed with rate λ_k(t) for k = b, τ, d. The probability of each reaction's occurrence is expressed in terms of the corresponding propensity function as follow:

$\begin{align*}\hfill & \mathsf{P}\left(t{\leqslant}{T}_{k}{< }t+{\Delta}t\vert {\tilde {X}}_{A}\left(t\right)={x}_{A},{\tilde {X}}_{B}\left(t\right)={x}_{B}\right)\hfill \\ \hfill & \hspace{25.0pt}\approx {\lambda }_{k}\left(t\right){\Delta}t+o\left({\Delta}t\right),\hfill \end{align*}$

for k = b, τ, d when Δt is small enough. Then, the trajectory equations can be written in a straightforward fashion following the random time changed representation of Poisson processes as

$\begin{equation*}\begin{aligned}\hfill {\tilde {X}}_{A}\left(t\right)& ={\tilde {X}}_{A}\left(0\right)+{R}_{1}\left(bt\right)-{R}_{2}\left({\int }_{0}^{t}\tau {\tilde {X}}_{A}\left(s\right)\enspace \mathrm{d}s\right)\hfill \\ \hfill & \quad -{R}_{3}\left({\int }_{0}^{t}d\hspace{2.0pt}{\tilde {X}}_{A}\left(s\right)\enspace \mathrm{d}s\right),\hfill \\ \hfill {\tilde {X}}_{B}\left(t\right)& ={R}_{2}\left({\int }_{0}^{t}\tau {\tilde {X}}_{A}\left(s\right)\enspace \mathrm{d}s\right),\hfill \end{aligned}\end{equation*}$

where R₁, R₂, and R₃ are unit rate Poisson processes [2]. We assume we do not have any B molecules in the system initially, i.e., ${\tilde {X}}_{b}\left(0\right)=0$ . Now, if we scale the stochastic processes by a scaling parameter n, e.g., volume of the system, it follows directly from the LLN for Poisson processes [41, 42] that the scaled process $\left({n}^{-1}{\tilde {X}}_{A},{n}^{-1}{\tilde {X}}_{B}\right)$ can be approximated by the solution to the following system of ODEs:

$\begin{align*}\hfill \begin{aligned}\hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{x}_{A}\left(t\right)& =-\left(\tau +d\right){x}_{A}\left(t\right),\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{x}_{B}\left(t\right)& =\tau {x}_{A}\left(t\right).\hfill \end{aligned}\end{align*}$

Notice that the birth rate b vanishes in the limit because we did not assume any scaling of b with respect to n. In general, one would assume that the overall birth rate scales linearly with n so that it is retained in the limit.

2.2. Age-structured model

Now, let us introduce age and delay into the CRN described by equation (2.1). We assume that the production rate of B and the degradation rate of A depend on the age of the reactant molecule of species A. We use 'age' as an umbrella term to refer to the amount of elapsed time since a specific event. Thus, 'age' could mean different things depending on the application area. The most straightforward way is the biological or the physical age, which we take as the time duration since the molecule was born or created. In systems where a certain reaction can fire only when a gene is activated, one could define age as the time duration since activation of the gene. In some cases, it may be desirable to define delays in terms of time duration since the initiation of a reaction. The notion of age is sufficiently general to account for those cases as well. For example, a reaction A → B in which the delay is defined purely in terms of time difference between initiation and completion of the reaction, can be replaced by the reaction system A → F → B where F is a fictitious species. The physical age of this fictitious species F is precisely the time since the initiation of the reaction A → B. Now, putting an appropriate hazard function on the reaction F → B, we can introduce a random or a deterministic delay in the reaction A → B. Therefore, for the CRN in equation (2.1), it seems sufficient to define the age to be the physical age of the molecules of A.

When we have an age-structured model, the counts (copy numbers of the species A and B) are inherently non-Markovian unless the holding times are exponentially distributed. However, if we keep track of the ages of the molecules in addition to the counts, we can get a Markov system, albeit on a more abstract state space. A neat way to do so is to use measure-valued processes that keep track of the age distribution of the molecules over time. Moreover, the measure-valued processes are also Markovian, which allows us to make use of the already existing limit theory for Banach space-valued Markov processes. This approach to age-structured modeling in biology is not new. Our work builds on the existing literature [36, 37, 43, 44]. In the next section, we describe how the measure-valued processes can be utilized in the context of the CRN in equation (2.1).

2.3. The measure-valued process and the limiting system

Let us denote by N_A(t) and N_B(t) the numbers of molecules of the chemical species A and B at time t. Then, individual molecules of A are labeled 1, 2, ..., N_A(t). We denote the age of the ith molecule of the species A by a_i(t) for i = 1, 2, ..., N_A(t). Similarly, we denote by b_j(t) the age of the jth molecule of the species B at time t. Now, we define a measure-valued process ${X}_{t}=\left({X}_{t}^{A},{X}_{t}^{B}\right)$ where ${X}_{t}^{A}$ and ${X}_{t}^{B}$ describe the age distributions of chemical species A and B at time t. To be more precise, we define

$\begin{equation}{X}_{t}^{A}{:=}\sum _{i=1}^{{N}_{A}\left(t\right)}{\delta }_{{a}_{i}\left(t\right)},\hspace{25.0pt}{X}_{t}^{B}{:=}\sum _{i=1}^{{N}_{B}\left(t\right)}{\delta }_{{b}_{i}\left(t\right)},\end{equation} \tag{ 2.3 }$

where δ_x is the Dirac measure, a function that takes value 1 if the argument to the function (a measurable set) contains x and zero otherwise. The components ${X}_{t}^{A}$ and ${X}_{t}^{B}$ of X_t are finite point measures with atoms placed on the individual ages of the molecules. For example, ${X}_{t}^{A}\left(\left(0.5,11.25\right]\right)={\sum }_{i=1}^{{N}_{A}\left(t\right)}{\delta }_{{a}_{i}\left(t\right)}\left(\left(0.5,11.25\right]\right)$ gives us the count of species A molecules with ages in the set (0.5, 11.25] at time t. In general, ${X}_{t}^{A}\left(F\right)$ gives us the count of species A molecules whose ages lie in the set F at time t.

For any point measure $\mu ={\sum }_{i=1}^{n}{\delta }_{{x}_{i}}$ and a measurable function f, we denote the integration of the function f with respect to the measure μ by

$\begin{equation*}\langle \mu ,f\rangle {:=}\int f\enspace d\mu =\sum _{i=1}^{n}f\left({x}_{i}\right).\end{equation*}$

If μ := (μ₁, μ₂, ..., μ_L), for some positive integer L, is a vector of point measures and f is a measurable function, we use the notation ⟨⟨μ, f⟩⟩ to denote

$\begin{equation*}\langle \langle \mu ,f\rangle \rangle {:=}\sum _{i=1}^{L}\langle {\mu }_{i},f\rangle .\end{equation*}$

Therefore, we have ${N}_{A}\left(t\right)=\langle {X}_{t}^{A},1\rangle ={X}_{t}^{A}\left({\mathbb{R}}_{+}\right)$ and ${N}_{B}\left(t\right)=\langle {X}_{t}^{B},1\rangle ={X}_{t}^{B}\left({\mathbb{R}}_{+}\right)$ where 1 stands for the identity function. The set of non-negative real numbers is denoted by ${\mathbb{R}}_{+}$ . The total population size is given by

$\begin{equation*}N\left(t\right){:=}\langle \langle {X}_{t},1\rangle \rangle ={N}_{A}\left(t\right)+{N}_{B}\left(t\right).\end{equation*}$

The process X_t is a Markov process on the space $D\left(\left[0,T\right],{\mathcal{M}}_{P}\left({\mathbb{R}}_{+}\right){\times}{\mathcal{M}}_{P}\left({\mathbb{R}}_{+}\right)\right)$ where T > 0 is a finite time horizon and ${\mathcal{M}}_{P}\left({\mathbb{R}}_{+}\right)$ is the space of finite, point measures on ${\mathbb{R}}_{+}$ .

In order to simplify notations, we introduce maps ${\sigma }_{i}:{\mathcal{M}}_{P}\left({\mathbb{R}}_{+}\right)\to {\mathbb{R}}_{+}$ , for i = 1, 2, 3, ..., the purpose of which is to extract the ith atom (the age of the ith molecule) from a point measure following some partial order (e.g., 'greater or equal to' relation). Therefore, ${\sigma }_{i}\left({X}_{t}^{A}\right)$ gives us the age of the ith molecule of the species A at time t. We can now write down the trajectory equations:

$\begin{align}\hfill {X}_{t}^{A}& =\sum _{k=1}^{{N}_{A}\left(0\right)}{\delta }_{t+{\sigma }_{k}\left({X}_{0}^{A}\right)}+{\int }_{0}^{t}{\int }_{0}^{\infty }{\delta }_{t-s}\enspace {\mathsf{1}}_{\left\{\theta {\leqslant}b\right\}}\enspace \hfill \\ \hfill & \quad {\times}{Q}_{1}\left(\mathrm{d}s,\mathrm{d}\theta \right)-{\int }_{0}^{t}{\int }_{\mathbb{N}}{\int }_{0}^{\infty }{\delta }_{t-s+{\sigma }_{i}\left({X}_{s-}^{A}\right)}\hfill \\ \hfill & \quad {\times}\enspace {\mathsf{1}}_{\left\{i{\leqslant}{N}_{A}\left(s-\right)\right\}}\enspace {\mathsf{1}}_{\left\{\theta {\leqslant}\tau \left({\sigma }_{i}\left({X}_{s-}^{A}\right)\right)\right\}}\enspace {Q}_{2}\left(\mathrm{d}s,di,\mathrm{d}\theta \right)\hfill \\ \hfill & \quad -{\int }_{0}^{t}{\int }_{\mathbb{N}}{\int }_{0}^{\infty }{\delta }_{t-s+{\sigma }_{i}\left({X}_{s-}^{A}\right)}\enspace {\mathsf{1}}_{\left\{i{\leqslant}{N}_{A}\left(s-\right)\right\}}\enspace \hfill \\ \hfill & \quad {\times}{\mathsf{1}}_{\left\{\theta {\leqslant}d\left({\sigma }_{i}\left({X}_{s-}^{A}\right)\right)\right\}}\enspace {Q}_{3}\left(\mathrm{d}s,di,\mathrm{d}\theta \right),\hfill \end{align} \tag{ 2.4 }$

$\begin{align}\hfill {X}_{t}^{B}& ={\int }_{0}^{t}{\int }_{\mathbb{N}}{\int }_{0}^{\infty }{\delta }_{t-s}\enspace {\mathsf{1}}_{\left\{i{\leqslant}{N}_{A}\left(s-\right)\right\}}\enspace {\mathsf{1}}_{\left\{\theta {\leqslant}\tau \left({\sigma }_{i}\left({X}_{s-}^{A}\right)\right)\right\}}\enspace \hfill \\ \hfill & \quad {\times}{Q}_{2}\left(\mathrm{d}s,di,\mathrm{d}\theta \right),\hfill \end{align} \tag{ 2.5 }$

where Q₁, Q₂, Q₃ are independent Poisson point measures (PPMs) with intensity measures ds × dθ, ds × di × dθ, and ds × di × dθ respectively, where di is a counting measure on $\mathbb{N}$ , and ds and dθ are Lebesgue measures on ${\mathbb{R}}_{+}$ . Provided the global jump rates are upper bounded by a finite quantity and the initial population size does not explode ${\mathrm{sup}}_{n}\mathsf{E}\left[{n}^{-1}{N}_{A}\left(0\right)\right]{< }\infty$ , the trajectory equations admit a unique pathwise solution $\left({X}_{t}^{A},{X}_{t}^{B}\right)$ (see [37, theorem 2.5] for a similar derivation).

Under some assumptions on the hazard functions and the initial age distribution of the A molecules, the scaled process n⁻¹ X_t converges to a deterministic, continuous function ${x}_{t}{:=}\left({x}_{t}^{A},{x}_{t}^{B}\right)$ whose components ${x}_{t}^{A}$ and ${x}_{t}^{B}$ are themselves measure-valued functions satisfying

$\begin{align*}\hfill \langle {x}_{t}^{A},{f}_{t}\rangle & =\langle {x}_{0}^{A},{f}_{0}\rangle +{\int }_{0}^{t}{\int }_{0}^{\infty }\left(\frac{\partial }{\partial a}{f}_{s}\left(a\right)+\frac{\partial }{\partial s}{f}_{s}\left(a\right)\right.\hfill \\ \hfill & \quad \left.-{f}_{s}\left(a\right)\left(\tau \left(a\right)+d\left(a\right)\right)\right){x}_{s}^{A}\left(\enspace \mathrm{d}a\right)\enspace \mathrm{d}s\hfill \\ \hfill \langle {x}_{t}^{B},{f}_{t}\rangle & ={\int }_{0}^{t}{\int }_{0}^{\infty }\left(\frac{\partial }{\partial a}{f}_{s}\left(a\right)+\frac{\partial }{\partial s}{f}_{s}\left(a\right)\right.\hfill \\ \hfill & \quad \left.+{f}_{s}\left(0\right)\tau \left(a\right)\right){x}_{s}^{A}\left(\enspace \mathrm{d}a\right)\enspace \mathrm{d}s,\hfill \end{align*}$

for a sufficiently large class of test functions f : (a, s) → f_s(a). The convergence of the scaled stochastic process n⁻¹ X_t to the deterministic function x_t can be proved using techniques similar to those in [36, 37, 43–45]. However, for the sake of completeness, a brief, intuitive argument is presented in appendix B.

Since the measure-valued function ${x}_{t}^{B}$ is determined entirely by ${x}_{t}^{A}$ , it suffices to study ${x}_{t}^{A}$ . The densities y_A(t, a) of the measure ${x}_{t}^{A}$ , when they exist, are an important quantity describing the distribution of age of the species A molecules in the large-volume mean-field limit. The density function y_A should satisfy

$\begin{equation}\left({\partial }_{t}+{\partial }_{s}\right){y}_{A}\left(t,s\right)=-\left(\tau \left(s\right)+d\left(s\right)\right){y}_{A}\left(t,s\right),\end{equation} \tag{ 2.6 }$

with the initial and the boundary conditions

$\begin{equation*}{y}_{A}\left(0,s\right)={f}_{A}\left(s\right),\hspace{25.0pt}{y}_{A}\left(t,0\right)=0,\end{equation*}$

where f_A(s) specifies the age distribution of A molecules at time t = 0. To be more precise, it is the density of the limiting measure ${x}_{0}^{A}$ , which we assume exists, with respect to the Lebesgue measure. Notice that the birth rate b vanishes in the limit, as in case of CTMC model, because we did not assume any scaling of the birth rate with respect to n.

Let y_B denote the limiting proportion of B molecules in the system. Then, y_B can be described entirely in terms of the density y_A as a solution to the ODE:

$\begin{equation}\frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{B}\left(t\right)={\int }_{0}^{\infty }\tau \left(s\right){y}_{A}\left(t,s\right)\enspace \mathrm{d}s,\end{equation} \tag{ 2.7 }$

with the initial condition y_B(0) = 0. Luckily, the limiting system equation (2.6) can be solved explicitly using standard analysis techniques:

$\begin{equation*}{y}_{A}\left(t,s\right)={f}_{A}\left(s-t\right){S}_{\tau }\left(s\right){S}_{d}\left(s\right)/\left({S}_{\tau }\left(s-t\right){S}_{d}\left(s-t\right)\right),\end{equation*}$

where S_τ and S_d are the survival functions of the probability distributions characterized by the hazard functions τ and d respectively (see appendix A for the definition of a survival function). Therefore, the limiting concentration of B molecules can be described by

$\begin{equation*}{y}_{B}\left(t\right)={\int }_{0}^{t}{\int }_{0}^{\infty }\tau \left(v\right){y}_{A}\left(u,v\right)\enspace \mathrm{d}v\enspace \mathrm{d}u.\end{equation*}$

In figure 1, we numerically show the agreement between the theoretical limits in equations (2.6) and (2.7) and the stochastic simulation. More specifically, we compare ${\int }_{0}^{\infty }{y}_{A}\left(t,s\right)\enspace \mathrm{d}s$ with stochastic simulations of $\langle {n}^{-1}{X}_{t}^{A},1\rangle$ and y_B(t), with $\langle {n}^{-1}{X}_{t}^{B},1\rangle$ . As it can be verified, the approximation error vanishes in the limit. Because X_t is a Markov process, the simulation of the stochastic CRN in equation (2.1) can be carried out by adapting the Doob–Gillespie's SSA, which involves simulating two quantities at each step: (1) simulating the next reaction time; and (2) determining the reaction type. Note that, for the CRN in equation (2.1), there are (2N_A(t) + 1) different reactions possible at time t, even though there are only three types of reactions. The next reaction time can be simulated by drawing an exponential random variable with rate equal to the total hazard (the sum of the hazards corresponding to those (2N_A(t) + 1) possible reactions). The total hazard is given by $b+\langle {X}_{t}^{A},\tau \rangle +\langle {X}_{t}^{A},d\rangle$ . The type of reaction is then decided by drawing a categorical random variable whose probability masses are the ratios of the individual hazards and the total hazard. This discrete event simulation algorithm is a straightforward adaptation of Doob–Gillespie's SSA for CTMCs. However, it must be noted that the simulation of a non-Markovian CRN is computationally more expensive than the CTMCs. For the sake of completeness, a pseudocode describing the above procedure is given in algorithm 2.1. An implementation in the Julia programming language [46] is also made available in [38].

Algorithm 2.1. Pseudocode for the exact simulation of the CRN in equation (2.1).

Require n, X₀, K	⊳ K: Maximum number of iterations
Ensure $\left({t}_{1},{X}_{{t}_{1}}\right),\left({t}_{2},{X}_{{t}_{2}}\right),\dots$	⊳ Timings of the reactions along with the measures
1: Set t = 0
2: for i = 1, 2, ..., K do	⊳ Compute the next reaction time
3: Calculate ${\Lambda}={\left(b+\langle {X}_{{t}_{i-1}}^{A},\tau \rangle +\langle {X}_{{t}_{i-1}}^{A},d\rangle \right)}^{-1}$	⊳ Λ⁻¹: Total hazard
4: if 0 < Λ < ∞ then
5: Draw an exponential random variable T with mean Λ, i.e., T ∼ Exponential(Λ)	⊳ Advance time to the next reaction time
6: Set t_i = t_i−1 + T	⊳ Determine the reaction type
7: Define π₁ = Λb	⊳ Probability for the birth reaction
8: Define ${\pi }_{j}={\Lambda}\tau \left({\sigma }_{j-1}\left({X}_{{t}_{i-1}}^{A}\right)\right)$ for j = 2, 3, ..., (N_A(t_i−1) + 1)	⊳ Probabilities for the transformation reaction
9: Define ${\pi }_{j}={\Lambda}d\left({\sigma }_{j-{N}_{A}\left({t}_{i-1}\right)-1}\left({X}_{{t}_{i-1}}^{A}\right)\right)$ for j = (N_A(t_i−1) + 2), (N_A(t_i−1) + 3), ..., (2N_A(t_i−1) + 1)	⊳ Probabilities for the death reaction
10: Set $\pi {:=}\left({\pi }_{1},{\pi }_{2},\dots ,{\pi }_{2{N}_{A}\left({t}_{i-1}\right)+1}\right)$
11: Draw a categorical random variable L with probability distribution π
12: if L = 1 then	⊳ Birth reaction
13: ${X}_{{t}_{i}}^{A}={\delta }_{0}+{\sum }_{k=1}^{{N}_{A}\left({t}_{i-1}\right)}{\delta }_{{\sigma }_{k}\left({X}_{{t}_{i-1}}^{A}\right)+T}$	⊳ Advance ages of all A molecules by T and add an atom {0}
14: ${X}_{{t}_{i}}^{B}={\sum }_{k=1}^{{N}_{B}\left({t}_{i-1}\right)}{\delta }_{{\sigma }_{k}\left({X}_{{t}_{i-1}}^{B}\right)+T}$	⊳ Advance ages of all B molecules by T
15: else if L ⩽ (N_A(t_i−1) + 1) then	⊳ Transformation reaction
16: ${X}_{{t}_{i}}^{A}={\sum }_{k=1}^{{N}_{A}\left({t}_{i-1}\right)}{\delta }_{{\sigma }_{k}\left({X}_{{t}_{i-1}}^{A}\right)+T}-{\delta }_{{\sigma }_{L-1}\left({X}_{{t}_{i-1}}^{A}\right)+T}$	⊳ Remove the atom $\left\{{\sigma }_{L-1}\left({X}_{{t}_{i-1}}^{A}\right)\right\}$ from the measure ${X}_{{t}_{i-1}}^{A}$ and advance ages of all other A molecules by T
17: ${X}_{{t}_{i}}^{B}={\delta }_{0}+{\sum }_{k=1}^{{N}_{B}\left({t}_{i-1}\right)}{\delta }_{{\sigma }_{k}\left({X}_{{t}_{i-1}}^{B}\right)+T}$	⊳ Advance ages of all B molecules by T and add an atom {0}
18: else	⊳ Death reaction
19: ${X}_{{t}_{i}}^{A}={\sum }_{k=1}^{{N}_{A}\left({t}_{i-1}\right)}{\delta }_{{\sigma }_{k}\left({X}_{{t}_{i-1}}^{A}\right)+T}-{\delta }_{{\sigma }_{L-{N}_{A}\left({t}_{i-1}\right)-1}\left({X}_{{t}_{i-1}}^{A}\right)+T}$	⊳ Remove the atom $\left\{{\sigma }_{L-{N}_{A}\left({t}_{i-1}\right)-1}\left({X}_{{t}_{i-1}}^{A}\right)\right\}$ from the measure ${X}_{{t}_{i-1}}^{A}$ and advance ages of all other A molecules by T
20: ${X}_{{t}_{i}}^{B}={\sum }_{k=1}^{{N}_{B}\left({t}_{i-1}\right)}{\delta }_{{\sigma }_{k}\left({X}_{{t}_{i-1}}^{B}\right)+T}$	⊳ Advance ages of all B molecules by T
21: end if
22: else
23: Stop and break loop
24: end if
25: Set i = i + 1.
26: end for

In section 1, we mentioned that introduction of delay into a CRN could also serve the purpose of model reduction. Indeed, the LLN limit y := (y_A, y_B) provides a model reduction of the original non-Markovian CRN in equation (2.1). In the following, we discuss two other examples of usefulness of the LLN limit in the form of a PDE system. The first one approximates mean first passage times while the second one describes a faster simulation algorithm.

2.4. Mean first passage times

Mean first passage times are important quantities in the study of stochastic processes and dynamical systems. In the context of CRNs, they could arise in several ways [47, 48]. For instance, natural questions that could arise for the CRN in equation (2.6) are how long it takes to deplete all molecules of species A or to produce the first molecule of B. One of the benefits of the LLN limit is that it can be used to approximate FPTs when the scaling parameter n is sufficiently large. The following illustrates this point.

Suppose we are interested in the time required to produce the first molecule of B. Following the exact simulation algorithm 2.1 adapted from Doob–Gillespie's SSA, the total hazard for the production of a B molecule is $\langle {X}_{0}^{A},\tau \rangle$ . In the large-volume limit, we can approximate this hazard by ${\int }_{0}^{\infty }n\tau \left(s\right){y}_{A}\left(0,s\right)\enspace \mathrm{d}s$ . Therefore, for a sufficiently large n, the MFPT can be approximated by

$\begin{equation}m={\left({\int }_{0}^{\infty }n\tau \left(s\right){y}_{A}\left(0,s\right)\enspace \mathrm{d}s\right)}^{-1},\end{equation} \tag{ 2.8 }$

which, of course, vanishes in the limit of n → ∞. Moreover, the FPTs can be approximated by a random variable following an exponential distribution with mean m, whenever n is sufficiently large. It follows that we can use a simple likelihood function (based on the exponential distribution) for the purpose of statistical inference of the underlying parameters, provided we have observations on the FPTs. This method, called dynamic survival analysis, of estimating parameters based on timings rather than counts was recently explored in the context of epidemiology in [34]. Dynamic survival analysis of general CRNs will be discussed elsewhere.

In figure 2, we show the accuracy of this approximation when n = 100. The approximation appears to be reasonably accurate. More importantly, this suggests we might be able to devise an efficient simulation algorithm using such approximate results. We explore this idea next.

2.5. Fast hybrid simulation

Consider a situation when the species A is abundantly available at the beginning of the reaction. Naturally, we expect the PDE approximation to the age density of the species A to be quite accurate, even though there will be considerable stochastic fluctuations in the copy numbers of B, at least initially. However, if we approximate the age density of A by the limiting PDE, we can also approximate the initial growth of the B molecules by a Poisson process whose time-varying intensity is driven by the PDE. We use this idea to devise a hybrid simulation algorithm, which is, again, essentially an adaptation of the Doob–Gillespie's SSA in the sense that it only draws next reaction times from an exponential distribution whose mean depends on the solution to the PDE. For the sake of completeness, a pseudocode describing the idea is provided in algorithm 2.2.

Algorithm 2.2. Pseudocode for the hybrid simulation algorithm.

Require n, y_A, K	⊳ K: Maximum number of reactions
Ensure t₁, t₂, ...	⊳ Timings of creation of B molecules
1: Set t₀ = 0
2: for i = 1, 2, ..., K do
3: Calculate ${\Lambda}={\left({\int }_{0}^{\infty }n\tau \left(s\right){y}_{A}\left({t}_{i-1},s\right)\enspace \mathrm{d}s\right)}^{-1}$ .
4: if 0 < Λ < ∞ then
5: Draw an exponential random variable T with mean Λ, i.e., T ∼ Exponential(Λ)
6: Set t_i = t_i−1 + T
7: else
8: Stop and break loop
9: end if
10: Set i = i + 1.
11: end for

In figure 3, we show the accuracy of the hybrid simulation algorithm. Expectedly, the hybrid simulation is considerably faster than the full stochastic simulation of the CRN in equation (2.1). A more elaborate comparison of performance is shown in figure 4. However, it is worth noting that the hybrid simulation algorithm, by design, will underestimate the variance in the counting process for the species B. Therefore, one should use the hybrid simulation when it suffices to get the mean trajectory accurately. Alternatively, one can borrow ideas to estimate the variance correctly in other simulation algorithms [49–51]. Similar ideas to expedite simulations have been proposed previously. For instance, Ganguly et al [52] propose a jump-diffusion approximation to the stochastic CRNs and provide error analysis while others [28, 53] introduce hybrid simulation methods using a piecewise deterministic Markov process.

**Figure 4.** Efficiency of the hybrid simulation algorithm. The figure shows the empirical density of the ratios of execution times and memory usage of the full stochastic simulation and those of the hybrid simulation algorithm described in algorithm 2.2. It is evident that the hybrid simulation algorithm is at least five times faster in terms of execution times and at least four times more efficient in terms of memory usage. The simulation set-up is the same as Figure 3. The performance evaluation of the hybrid simulation is done using the *BenchmarkTools.jl* package [54] in Julia language [46].
Download figure:
Standard image High-resolution image

3. Michaelis–Menten enzyme-kinetic reactions

Michaelis–Menten enzyme-catalyzed chemical reactions form an important class of CRNs particularly because of their vast applications in the industry [55, 56]. Several descriptions of this class of reactions are available in the literature. For the sake of simplicity, in what follows we shall adopt the simplest form of the Michaelis–Menten enzyme-catalyzed reactions. In this form, the CRN comprises a reversible binding of a molecule of a substrate (S) and a molecule of an enzyme (E) into a molecule of a substrate-enzyme complex, and an irreversible conversion of a molecule of the complex into a molecule of a product (P) leaving the molecule of the enzyme free. That is, the system consists of the following reactions:

$\begin{equation}\begin{aligned}\hfill E+S& \;{\rightarrow }^{{k}_{1}}\enspace C,\hfill \\ \hfill C& \;{\rightarrow }^{{k}_{-1}}\enspace E+S,\hfill \\ \hfill C& \;{\rightarrow }^{{k}_{2}}\enspace P+E.\hfill \end{aligned}\end{equation} \tag{ 3.9 }$

In traditional models of enzyme kinetics, the quantities k₁, k₋₁, and k₂ are reaction rate constants. When modeled stochastically using a CTMC, the mean-field limit of the scaled concentrations is described by the following set of ODEs (see [57] for more details):

$\begin{equation}\begin{aligned}\hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}\left[E\right]& =-{k}_{1}\left[E\right]\left[S\right]+\left({k}_{-1}+{k}_{2}\right)\left[C\right],\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}\left[S\right]& =-{k}_{1}\left[E\right]\left[S\right]+{k}_{-1}\left[C\right],\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}\left[C\right]& ={k}_{1}\left[E\right]\left[S\right]-\left({k}_{-1}+{k}_{2}\right)\left[C\right],\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}\left[P\right]& ={k}_{2}\left[C\right].\hfill \end{aligned}\end{equation} \tag{ 3.10 }$

The [⋅] notation is used to denote the concentrations. The ODE system in equation (3.10) has been studied extensively in the literature. We will adopt our measure-valued representation to incorporate potential age structure in the Michaelis–Menten CRN.

3.1. Enzyme kinetics with age structure

We assume the binding reaction depends on the age of the participating molecule of the enzyme. That is, only k₁ depends on the age of the E molecules (and not on the age of the S molecules); k₋₁ and k₂ are constants. For the species E, S, C, and P, define the measure-valued stochastic processes

$\begin{align*}\hfill & {X}_{t}^{E}{:=}\sum _{i=1}^{{N}_{E}\left(t\right)}{\delta }_{{e}_{i}\left(t\right)},\quad {X}_{t}^{S}{:=}\sum _{i=1}^{{N}_{S}\left(t\right)}{\delta }_{{s}_{i}\left(t\right)},\quad \hfill \\ \hfill & {X}_{t}^{C}{:=}\sum _{i=1}^{{N}_{C}\left(t\right)}{\delta }_{{c}_{i}\left(t\right)},\quad {X}_{t}^{P}{:=}\sum _{i=1}^{{N}_{P}\left(t\right)}{\delta }_{{p}_{i}\left(t\right)},\hfill \end{align*}$

where N_E, N_S, N_C, N_P denote the counts of molecules of E, S, C, and P respectively. Similarly, e_i, s_i, c_i, p_i denote the age of the ith molecule of E, S, C, and P respectively. The process X := (X^E, X^S, X^C, X^P) is a Markov process on the space $D\left(\left[0,T\right],{\mathcal{M}}_{P}{\left({\mathbb{R}}_{+}\right)}^{4}\right)$ . Please note that we need to scale the hazard function k₁ corresponding to the bimolecular reaction by n⁻¹ following the stochastic law of mass actions [1].

As before, we are interested in the large-volume limit of the scaled process n⁻¹ X_t. The scaled stochastic process n⁻¹ X_t converges to a deterministic function ${x}_{t}{:=}\left({x}_{t}^{E},{x}_{t}^{S},{x}_{t}^{C},{x}_{t}^{P}\right)$ whose components ${x}_{t}^{E},{x}_{t}^{S},{x}_{t}^{C},{x}_{t}^{P}$ are finite measures on ${\mathbb{R}}_{+}$ by virtue of the LLN.

Let y_E denote the density of the measure ${x}_{t}^{E}$ with respect to the Lebesgue measure. Also, let y_S, y_C, y_P denote the concentrations of the S, C, and P molecules. Then, we get the following limiting system:

$\begin{equation}\begin{aligned}\hfill \left({\partial }_{t}+{\partial }_{s}\right){y}_{E}\left(t,s\right)& =-{k}_{1}\left(s\right){y}_{E}\left(t,s\right){y}_{S}\left(t\right),\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{S}\left(t\right)& =-{y}_{S}\left(t\right){\int }_{0}^{\infty }{k}_{1}\left(s\right){y}_{E}\left(t,s\right)\enspace \mathrm{d}s\hfill \\ \hfill & \quad +{k}_{-1}{y}_{C}\left(t\right),\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{C}\left(t\right)& ={y}_{S}\left(t\right){\int }_{0}^{\infty }{k}_{1}\left(s\right){y}_{E}\left(t,s\right)\enspace \mathrm{d}s\hfill \\ \hfill & \quad -\left({k}_{-1}+{k}_{2}\right){y}_{C}\left(t\right),\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{P}\left(t\right)& ={k}_{2}{y}_{C}\left(t\right),\hfill \end{aligned}\end{equation} \tag{ 3.11 }$

with the boundary condition

$\begin{equation*}{y}_{E}\left(t,0\right)=\left({k}_{-1}+{k}_{2}\right){y}_{C}\left(t\right)\end{equation*}$

and the initial condition y_E(0, s) = f_E(s) such that ${\int }_{0}^{\infty }{f}_{E}\left(s\right)\enspace \mathrm{d}s=\left[{E}_{0}\right]$ . Appropriate initial conditions for S, C, and P are also assumed. This limiting system can now be used to study the effects of delay in the binding reaction. One interesting approximation that has been widely applied in the context of Michaelis–Menten enzyme kinetic reactions is what is known as a quasi-steady state approximation [58]. There are many forms of QSSAs, namely, standard QSSA (sQSSA), total QSSA (tQSSA), and reversible QSSA (rQSSA). Detailed analysis of any of the QSSAs is beyond the scope of the present work. For the purpose of illustration, we informally describe an analogue of the sQSSA here.

3.2. The standard QSSA

The QSSAs are a multiscale approximation of the Michaelis–Menten enzyme-kinetic reactions. The basic assumption behind the standard QSSA is that the substrate-enzyme complex C reaches its steady-state much quicker than the other species. In the deterministic set-up, the approximation is achieved by setting $\frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{C}\left(t\right)=0$ in equation (3.11), which allows one to work with a smaller system of ODEs. Several conditions for the validity of the sQSSA have been proposed in the literature. See [57] for a detailed discussion.

Following the deterministic approach in our case, we set $\frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{C}\left(t\right)=0$ in equation (3.11) to get a reduced PDE system that is analogous to the sQSSA. To be more precise, $\frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{C}\left(t\right)=0$ yields

$\begin{equation*}{y}_{C}\left(t\right)=\frac{{y}_{S}\left(t\right){\int }_{0}^{\infty }{k}_{1}\left(s\right){y}_{E}\left(t,s\right)\enspace \mathrm{d}s}{{k}_{-1}+{k}_{2}},\end{equation*}$

which further yields an approximate system

$\begin{equation}\begin{aligned}\hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{S}\left(t\right)& =-\frac{{k}_{2}}{{k}_{-1}+{k}_{2}}{y}_{S}\left(t\right){\int }_{0}^{\infty }{k}_{1}\left(s\right){y}_{E}\left(t,s\right)\enspace \mathrm{d}s,\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{P}\left(t\right)& =\frac{{k}_{2}}{{k}_{-1}+{k}_{2}}{y}_{S}\left(t\right){\int }_{0}^{\infty }{k}_{1}\left(s\right){y}_{E}\left(t,s\right)\enspace \mathrm{d}s.\hfill \end{aligned}\end{equation} \tag{ 3.12 }$

Recall that y_E solves $\left({\partial }_{t}+{\partial }_{s}\right){y}_{E}\left(t,s\right)=-{k}_{1}\left(s\right){y}_{E}\left(t,s\right){y}_{S}\left(t\right)$ with boundary condition y_E(t, 0) = (k₋₁ + k₂)y_C(t) and initial condition y_E(0, s) = f_E(s). As a consequence, y_E is determined by y_S and y_C, and can be partially solved in terms of y_S and y_C. Therefore, the reduced system of ODEs in equation (3.12) is indeed autonomous and therefore, serves as an sQSSA of the CRN in equation (3.9).

In the stochastic set-up, the QSSAs are obtained by means of the probabilistic multiscaling techniques developed in [3, 4]. The stochastic and the deterministic QSSAs mostly agree with each other with some notable differences. Please see [57] for examples of discrepancies as well as more details on the methods. Here, for paucity of space, we do not consider the stochastic QSSAs or possible discrepancies between stochastic and deterministic methods in the present age-structured models.

4. Prokaryotic auto-regulation

As another example, we consider a simple genetic network with feedback. We apply our approach using an age-dependent measure-valued process to build a model for a simple prokaryotic auto-regulation with a time delay. We modify an auto-regulation mechanism in the prokaryote gene network in [59] (section 1.5.7). We simplify the example by approximating transcription and translation as a one-step process with a time delay and replacing repression of the gene by a protein dimer to repression by a single protein instead. For other related examples for the gene transcription and translation, see section 2.1.1 in [60] and [61–64].

Consider a genetic network with a gene (G), a protein (P), and a gene-protein complex (C). The gene activates production of protein following a hazard function b_P and the protein degrades following a hazard function d_P. The protein can reversibly bind with the gene to form a complex with binding hazard b_C and unbinding hazard d_C. Since the gene-protein complex cannot participate in the production of protein, this is auto-regulation of the gene by its complex. Schematically, the reactions are as follows:

$\begin{align}\hfill G& \;{\rightarrow }^{{b}_{P}}\enspace P+G,\hfill \\ \hfill P+G& \;{\rightarrow }^{{b}_{C}}\enspace C,\hfill \\ \hfill C& \;{\rightarrow }^{{d}_{C}}\enspace P+G,\hfill \\ \hfill P& \;{\rightarrow }^{{d}_{P}}\enspace \varnothing .\hfill \end{align} \tag{ 4.13 }$

In (4.13), we assume that the age of the gene is important. Therefore, the hazard functions b_P and b_C are assumed to be age-dependent whereas d_C and d_P are assumed to be constants. Note that after unbinding of the gene-protein complex, the age of the gene is reset to zero. On the other hand, the age of the gene is not affected by the protein production.

Denote by N_G(t), N_P(t), and N_C(t) the total molecular counts of the gene, the protein, and the gene-protein complex at time t, respectively. For the species G, P, and C, define the measure-valued processes

$\begin{align*}\hfill & {X}_{t}^{G}{:=}\sum _{i=1}^{{N}_{G}\left(t\right)}{\delta }_{{g}_{i}\left(t\right)},\quad {X}_{t}^{P}{:=}\sum _{i=1}^{{N}_{P}\left(t\right)}{\delta }_{{p}_{i}\left(t\right)},\quad \hfill \\ \hfill & {X}_{t}^{C}{:=}\sum _{i=1}^{{N}_{C}\left(t\right)}{\delta }_{{c}_{i}\left(t\right)},\hfill \end{align*}$

where we denote the age of the i-th molecule of the species G, P, and C by g_i, p_i, and c_i respectively. As in the case of the Michaelis–Menten enzyme kinetic reaction, we scale the hazard function b_C corresponding to the bimolecular reaction by n⁻¹ following the stochastic law of mass actions [1].

The LLN limit of the scaled process ${n}^{-1}{X}_{t}{:=}\left({n}^{-1}{X}_{t}^{G},{n}^{-1}{X}_{t}^{P},{n}^{-1}{X}_{t}^{C}\right)$ can be derived following by now familiar arguments of the previous examples. As one would expect, the scaled process n⁻¹ X_t converges to a deterministic function ${x}_{t}{:=}\left({x}_{t}^{G},{x}_{t}^{P},{x}_{t}^{C}\right)$ whose components are finite measures on ${\mathbb{R}}_{+}$ . Since we assume only the age of the gene is important, we consider the limiting age density y_G of the gene, which we obtain as the density, when it exists, of the measure ${x}_{t}^{G}$ with respect to the Lebesgue measure. Similarly, define the limiting concentrations of the product y_P and the complex y_C. The limiting system is then described by

$\begin{equation}\begin{aligned}\hfill \left({\partial }_{t}+{\partial }_{s}\right){y}_{G}\left(t,s\right)& =-{b}_{C}\left(s\right)\enspace {y}_{G}\left(t,s\right){y}_{P}\left(t\right),\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{P}\left(t\right)& ={\int }_{0}^{\infty }{b}_{P}\left(s\right){y}_{G}\left(t,s\right)\enspace \mathrm{d}s\hfill \\ \hfill & \quad -{y}_{P}\left(t\right){\int }_{0}^{\infty }{b}_{C}\left(s\right){y}_{G}\left(t,s\right)\enspace \mathrm{d}s\hfill \\ \hfill & \quad +{d}_{C}\enspace {y}_{C}\left(t\right)-{d}_{P}\enspace {y}_{P}\left(t\right),\hfill \\ \hfill \frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}{y}_{C}\left(t\right)& ={y}_{P}\left(t\right){\int }_{0}^{\infty }{b}_{C}\left(s\right){y}_{G}\left(t,s\right)\enspace \mathrm{d}s\hfill \\ \hfill & \quad -{d}_{C}\enspace {y}_{C}\left(t\right),\hfill \end{aligned}\end{equation} \tag{ 4.14 }$

with the boundary condition

$\begin{equation*}{y}_{G}\left(t,0\right)={d}_{C}\enspace {y}_{C}\left(t\right)\end{equation*}$

and the initial condition y_G(0, s) = f_G(s), which specifies the initial ages of the gene. Note that the hazard function for unbinding of the gene-protein complex appears in the boundary condition since we assumed that the age of the gene is reset to zero when the complex breaks into the gene and the protein. Also, recall that b_P(s) encodes a time delay in transcription and translation. For example, we may set b_P(s) = r1_[τ,∞)(s), which asserts that protein is produced only when the age of the gene is greater than τ with a hazard function r.

5. Discussion

Many biological processes with time delays, including CRNs, cannot be directly modeled using CTMCs due to non-exponentially distributed inter-event times of the processes. The simulation and analysis of systems with an age structure and time delays become challenging since the system dynamics are affected by the inherent randomness (stochasticity) as well as time delays. One way to simulate such stochastic systems with age structure and time delays is to modify simulation algorithms for CTMC models where the next reaction time and type are determined based on molecule counts of reactants. Bratsun et al [39], Barrio et al [65] and Cai [66] constructed modified SSAs, while Anderson [67] introduced a modified next reaction method to simulate discrete stochastic chemical reaction networks with delays. Notably, all of those works assume that the time lags in the delayed reactions are constant. Furthermore, in [68], Caravagna and Hillston described a non-Markovian stochastic process algebra, called Bio-PEPAd, to incorporate deterministic delays and perform formal analysis. Mura et al [69] described how general holding time distributions can be incorporated in the programming language BlenX and studied the effect of the choice of the reaction time distributions. A stochastic simulation algorithm for non-Markovian biochemical reactions based on constraint programming is presented in [70].

CRNs with an age structure and random time delays provide a more realistic description of stochastic biophysical or chemical systems compared to the ones with fixed time delays. Unfortunately, the literature on stochastic systems with random time delays remains sparse. In a previous work by Koyama (chapter 4 in [40]), the author investigated a stochastic kinetic network with a random time delay where a delayed reaction can be interrupted by another reaction and can fail to complete. In another work by Marquez-Lago et al [71], the authors utilized probability distributed time delays to incorporate spatial effects such as diffusion or translocation of molecules in temporal stochastic models. In a recent work by Choi et al [27], the authors described protein production in transcription and translation as a birth and death process with a random time delay.

In this paper, we developed a new way to incorporate an age structure and time delays in CRNs using age-dependent processes. We availed ourselves of previous theoretical works [36, 37, 43, 44] designed to study age-dependent population dynamics. We applied those stochastic models in the context of CRNs to account for the non-Markovian property due to the time delays. The use of age-dependent hazard functions not only enables us to model age-dependent time delays or reaction rates but also covers the modeling of constant and random time delays in the existing literature. We illustrated our method using simple biophysical systems in gene regulation and enzyme kinetics, but it will easily apply to general CRNs.

One potential disadvantage of the age-dependent processes is that simulation can be prohibitive since the age of each individual molecule of the chemical species of interest needs to be tracked over the entire simulation time. Therefore, we derived a large-volume limit of the age-dependent process for CRNs in the form of PDEs using the analytic methods in [36, 37, 43, 44] and used the PDE limit to construct a hybrid simulation algorithm, which, in our example, turned out to be five times faster than the full stochastic simulation. Moreover, we approximated a mean first passage time efficiently utilizing the theoretical limit.

In this work, we emphasized how age-structured processes and their large-volume limits can be applied to model CRNs, in particular, biophysical or chemical systems with time delays. Many previous findings for general CRNs under Markovian assumption can be reinvestigated and extended to non-Markovian settings using age-structured processes. It would be interesting to see how the long time behavior of stochastic CRNs is affected by incorporating age structure. For example, it would be interesting to study stationary distributions of autocatalytic CRNs with switching behavior [72], to identify a class of CRNs maintaining product-form Poisson distributions for all times [73] and to find when CRNs show nonexplosive behavior [74]. Another interesting direction will be to study stability of CRNs [75] and to estimate transition times between different attractors in CRNs [76].

For the sake of simplicity, we have assumed in this paper that the molecular entities of all chemical species are abundant at the same order of magnitude so as to obtain the large-volume limit under the classical scaling. A natural extension of this work is to consider general CRNs with a wide range of molecular abundances and reaction rates where we can apply multiscale approximations to reduce model complexity [1, 3, 77]. We leave such investigation to future work. In this paper, we briefly described how an analogue of QSSA can be derived in the Michaelis–Menten enzyme-kinetic reactions. As shown in the related previous work [57, 58], both deterministic and stochastic QSSAs can be revisited with an extension of our approach to multiscale approximations in enzyme kinetics under non-Markovian setting. Another promising application of our approach seems to be in parameter inference and survival analysis of general CRNs with age structure. Given the current interests in pandemic modeling, such CRNs could lead to interesting examples in population dynamics and epidemiology. We hope to be able to pursue such work in the near future.

We conclude our discussion by briefly mentioning a class of CRNs modeled using Poisson processes with time-varying intensities. While retaining the Markov property, time-varying intensities provide a flexible way to aggregate out unobserved processes and to account for heterogeneity in the system such as cell-to-cell variability, changes in the volume or temperature of a cell affecting reaction rates [67, 78, 79]. However, the crucial difference between those models and ours is that time-varying intensities alone cannot induce a dependence structure of time delays on the initiation times of reactions whereas introduction of an age structure can. This is because time-varying intensities are a property of the system, whereas the age is a property of the individual molecule. Therefore, making the intensities depend explicitly on the individual ages of the molecules, as we do in this paper, provides a richer class of models.

Funding

WKB was supported by the National Institute of Allergy and Infectious Diseases (NIAID) Grant R01 AI116770, the National Science Foundation (NSF) Grant DMS-2027001 and the Ohio State University's President's Postdoctoral Scholars Program (PPSP). EK was supported by NIAID Grant R01 AI116770 and the NSF Grants DMS-2027001 and DMS-1853587. GAR was supported by the NSF Grants DMS-2027001 and DMS-1853587. HWK was supported in part by the NSF under the Grant DMS-1620403. The project was initiated when HWK was visiting the Mathematical Biosciences Institute (MBI) at the Ohio State University in Summer 2019. The authors acknowledge the hospitality and the support of MBI. The content of this manuscript is solely the responsibility of the authors and does not represent the official views of NSF, NIGMS, NIAID, or NIH.

List of symbols

Acronyms

Appendix A.: Preliminaries

For the sake of completeness, we briefly describe some statistical and mathematical preliminaries here. Consider a continuous random variable U taking nonnegative values with cumulative distribution function (CDF) G_U and probability density function (PDF) g_U. The survival function S_U of the random variable U is defined as

$\begin{equation}{S}_{U}\left(t\right){:=}\mathsf{P}\left(U{ >}t\right)=1-{G}_{U}\left(t\right).\end{equation} \tag{ A.15 }$

The hazard function h_U of the random variable U is defined as

$\begin{equation}{h}_{U}\left(t\right){:=}\frac{{g}_{U}\left(t\right)}{{S}_{U}\left(t\right)}.\end{equation} \tag{ A.16 }$

Hazard and survival functions are extensively used in survival analysis to model time to event data, e.g., time to death, time to hospitalization, time to default, time to failure etc. Intuitively, the hazard function describes the probability of failure in an infinitesimally small time period (t, t + Δt) given survival till time t. With little application of calculus, one can see that

$\begin{align}\hfill {h}_{U}\left(t\right)& =\underset{h\to 0}{\mathrm{lim}}\frac{\mathsf{P}\left(t{< }U{< }t+h\vert U{ >}t\right)}{h}\hfill \\ \hfill & \hfill =-\frac{\enspace \mathrm{d}}{\enspace \mathrm{d}t}\mathrm{log}\enspace {S}_{U}\left(t\right),\end{align}$

which yields another useful relationship between the hazard function and the survival function:

$\begin{equation*}{S}_{U}\left(t\right)=\mathrm{exp}\left(-{\int }_{0}^{t}{h}_{U}\left(u\right)\enspace \mathrm{d}u\right)=\mathrm{exp}\left(-{{\Lambda}}_{U}\left(t\right)\right),\end{equation*}$

where ${{\Lambda}}_{U}\left(t\right){:=}{\int }_{0}^{t}{h}_{U}\left(u\right)\enspace \mathrm{d}u$ is called the cumulative hazard function. Hazard and survival functions cannot always be obtained in closed form. Probability distributions for which we can obtain them in closed form include Weibull, exponential, log-logistic distributions. The case of exponential distribution is unique in that it is the only probability distribution for which the hazard function is constant. However, a constant hazard is unrealistic in models for many biophysical systems.

Appendix B.: Brief derivation of the PDE limit

In this section, we provide a brief, intuitive derivation of the PDE limit mentioned in section 2.3. The line of argument follows the standard tightness-uniqueness route for abstract Markov processes and has been used in several prior works [36, 37, 43–45]. A rigorous proof of convergence for a general class of non-Markovian CRNs will be discussed elsewhere.

Consider the CRN in equation (2.1) with the measure-valued process X_t as defined in section 2.3. The components ${X}_{t}^{A},{X}_{t}^{B}$ satisfy the trajectory equations given in equations (2.4) and (2.5). In order to study moments and martingale properties of ${X}_{t}^{A}$ and ${X}_{t}^{B}$ , it is worthwhile to check that

$\begin{align*}\hfill \begin{aligned}\hfill \langle {X}_{t}^{A},{f}_{t}\rangle & =\sum _{k=1}^{{N}_{A}\left(0\right)}{f}_{t}\left(t+{\sigma }_{k}\left({X}_{0}^{A}\right)\right)+{\int }_{0}^{t}{\int }_{0}^{\infty }\hfill \\ \hfill & \quad {\times}{f}_{t}\left(t-s\right)\enspace {\mathsf{1}}_{\left\{\theta {\leqslant}b\right\}}\enspace {Q}_{1}\left(\mathrm{d}s,\mathrm{d}\theta \right)\hfill \\ \hfill & \quad -{\int }_{0}^{t}{\int }_{\mathbb{N}}{\int }_{0}^{\infty }{f}_{t}\left(t-s+{\sigma }_{i}\left({X}_{s-}^{A}\right)\right)\hfill \\ \hfill & \quad {\times}\enspace {\mathsf{1}}_{\left\{i{\leqslant}{N}_{A}\left(s-\right)\right\}}\enspace {\mathsf{1}}_{\left\{\theta {\leqslant}\tau \left({\sigma }_{i}\left({X}_{s-}^{A}\right)\right)\right\}}\enspace \hfill \\ \hfill & \quad {\times}{Q}_{2}\left(\mathrm{d}s,di,\mathrm{d}\theta \right)-{\int }_{0}^{t}{\int }_{\mathbb{N}}{\int }_{0}^{\infty }\hfill \\ \hfill & \quad {\times}{f}_{t}\left(t-s+{\sigma }_{i}\left({X}_{s-}^{A}\right)\right)\enspace {\mathsf{1}}_{\left\{i{\leqslant}{N}_{A}\left(s-\right)\right\}}\enspace \hfill \\ \hfill & \quad {\times}{\mathsf{1}}_{\left\{\theta {\leqslant}d\left({\sigma }_{i}\left({X}_{s-}^{A}\right)\right)\right\}}\enspace {Q}_{3}\left(\mathrm{d}s,di,\mathrm{d}\theta \right),\hfill \end{aligned}\\ \hfill \begin{aligned}\hfill \langle {X}_{t}^{B},{f}_{t}\rangle & ={\int }_{0}^{t}{\int }_{\mathbb{N}}{\int }_{0}^{\infty }{f}_{t}\left(t-s\right)\enspace {\mathsf{1}}_{\left\{i{\leqslant}{N}_{A}\left(s-\right)\right\}}\enspace \hfill \\ \hfill & \quad {\times}{\mathsf{1}}_{\left\{\theta {\leqslant}\tau \left({\sigma }_{i}\left({X}_{s-}^{A}\right)\right)\right\}}\enspace {Q}_{2}\left(\mathrm{d}s,di,\mathrm{d}\theta \right),\hfill \end{aligned}\end{align*}$

for a sufficiently large class of test functions f : (a, s) → f_s(a).

As in the case of standard Markov model in section 2.1, we are now interested in the large-volume limit (n → ∞) of the scaled stochastic process n⁻¹ X_t. By virtue of the LLN, if we assume (i) the hazard functions are continuous, (ii) the global jump rates are bounded above by a finite quantity, (iii) a finite second moment condition on the initial population size ${\mathrm{sup}}_{n}\mathsf{E}\left[{n}^{-2}{N}_{A}{\left(0\right)}^{2}\right]{< }\infty$ , and (iv) the initial age distribution does not explode, we have that the scaled process n⁻¹ X_t converges to a deterministic function ${x}_{t}{:=}\left({x}_{t}^{A},{x}_{t}^{B}\right)$ whose components ${x}_{t}^{A}$ and ${x}_{t}^{B}$ are themselves measure-valued functions. This can be formally justified by verifying that the sequence of processes n⁻¹ X_t is tight and then, showing that the limit points (along subsequences) are unique. We can identify the limit points by studying certain martingale processes associated with the scaled processes n⁻¹ X_t. Outline of the argument is provided below.

B.1. Martingale property and tightness-uniqueness

First, under the above mentioned assumptions, we can show that the components of the scaled process n⁻¹ X_t do not explode (similar derivation in [37, lemma 2.6 and proposition 2.7]). Now, note that the trajectory equations for the processes ${X}_{t}^{A}$ and ${X}_{t}^{B}$ given in equations (2.4) and (2.5) are driven by PPMs. Since we have

$\begin{align*}\hfill {f}_{t}\left(a+t-s\right)& ={f}_{s}\left(a\right)+{\int }_{s}^{t}\left(\frac{\partial }{\partial u}{f}_{u}\left(a+u-s\right)\right.\hfill \\ \hfill & \quad \left.+\frac{\partial }{\partial a}{f}_{u}\left(a+u-s\right)\right)\enspace \mathrm{d}u,\hfill \end{align*}$

and using the compensated PPMs of the PPMs Q₁, Q₂, Q₃, we can show the processes

$\begin{equation*}\begin{aligned}\hfill {M}_{t}^{A,f}& =\langle {n}^{-1}{X}_{t}^{A},{f}_{t}\rangle -\langle {n}^{-1}{X}_{0}^{A},{f}_{0}\rangle \hfill \\ \hfill & \quad -{\int }_{0}^{t}{\int }_{0}^{\infty }\left(\frac{\partial }{\partial a}{f}_{s}\left(a\right)+\frac{\partial }{\partial s}{f}_{s}\left(a\right)\right.\hfill \\ \hfill & \quad \left.-{f}_{s}\left(a\right)\left(\tau \left(a\right)+d\left(a\right)\right)\right){n}^{-1}{X}_{s}^{A}\left(\enspace \mathrm{d}a\right)\enspace \mathrm{d}s\hfill \end{aligned}\end{equation*}$

$\begin{equation*}\begin{aligned}\hfill {M}_{t}^{B,f}& =\langle {n}^{-1}{X}_{t}^{B},{f}_{t}\rangle -{\int }_{0}^{t}{\int }_{0}^{\infty }\left(\frac{\partial }{\partial a}{f}_{s}\left(a\right)+\frac{\partial }{\partial s}{f}_{s}\left(a\right)\right.\hfill \\ \hfill & \quad \left.+{f}_{s}\left(0\right)\tau \left(a\right)\right){n}^{-1}{X}_{s}^{A}\left(\enspace \mathrm{d}a\right)\enspace \mathrm{d}s\hfill \end{aligned}\end{equation*}$

are zero mean, square integrable, càdlàg martingale processes with predictable quadratic variations of the order n⁻¹. Since we expect the predictable quadratic variations to vanish in the limit of n → ∞, the scaled process n⁻¹ X_t converges to a deterministic, continuous function x_t. The tightness of the process n⁻¹ X_t can be established by verifying a criterion due to Roelly [80] in the vague topology and the Aldous–Rebolledo criteria [81]. See [36] or [37, proposition 3.1] for similar calculations. Furthermore, thanks to the martingale representations above, we expect the limit x_t to satisfy

$\begin{align*}\hfill \langle {x}_{t}^{A},{f}_{t}\rangle & =\langle {x}_{0}^{A},{f}_{0}\rangle +{\int }_{0}^{t}{\int }_{0}^{\infty }\left(\frac{\partial }{\partial a}{f}_{s}\left(a\right)+\frac{\partial }{\partial s}{f}_{s}\left(a\right)\right.\hfill \\ \hfill & \quad \left.-{f}_{s}\left(a\right)\left(\tau \left(a\right)+d\left(a\right)\right)\right){x}_{s}^{A}\left(\enspace \mathrm{d}a\right)\enspace \mathrm{d}s\hfill \\ \hfill \langle {x}_{t}^{B},{f}_{t}\rangle & ={\int }_{0}^{t}{\int }_{0}^{\infty }\left(\frac{\partial }{\partial a}{f}_{s}\left(a\right)+\frac{\partial }{\partial s}{f}_{s}\left(a\right)\right.\hfill \\ \hfill & \quad \left.+{f}_{s}\left(0\right)\tau \left(a\right)\right){x}_{s}^{A}\left(\enspace \mathrm{d}a\right)\enspace \mathrm{d}s.\hfill \end{align*}$

The uniqueness of the solutions can be shown by first establishing that the solutions remain bounded on finite time intervals (recall the global jump rates are assumed bounded) and then invoking Grönwall's lemma to show the distance between two possible solutions must vanish proving the desired uniqueness.

Appendix C.: Software

The numerical results in this paper are obtained by the Julia programming language [46]. The Julia scripts (compatible with version 1.4.1) used in this paper have been made available publicly at a dedicated GitHub repository [38].

Incorporating age and delay into models for biophysical systems

Article metrics

Submit

Permissions

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

Peer review information

Abstract

1. Introduction

1.1. Delays are inherent and a useful model reduction tool

1.2. Our contribution

2. The simplest model with a delay

2.1. Standard Markov approach

2.2. Age-structured model

2.3. The measure-valued process and the limiting system

2.4. Mean first passage times

2.5. Fast hybrid simulation

3. Michaelis–Menten enzyme-kinetic reactions

3.1. Enzyme kinetics with age structure

3.2. The standard QSSA

4. Prokaryotic auto-regulation

5. Discussion

Funding

List of symbols

Acronyms

Appendix A.: Preliminaries

Appendix B.: Brief derivation of the PDE limit

B.1. Martingale property and tightness-uniqueness

Appendix C.: Software

Incorporating age and delay into models for biophysical systems

Article metrics

Submit

Permissions

Share this article

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

Peer review information

Abstract

1. Introduction

1.1. Delays are inherent and a useful model reduction tool

1.2. Our contribution

2. The simplest model with a delay

2.1. Standard Markov approach

2.2. Age-structured model

2.3. The measure-valued process and the limiting system

2.4. Mean first passage times

2.5. Fast hybrid simulation

3. Michaelis–Menten enzyme-kinetic reactions

3.1. Enzyme kinetics with age structure

3.2. The standard QSSA

4. Prokaryotic auto-regulation

5. Discussion

Funding

List of symbols

Acronyms

Appendix A.: Preliminaries

Appendix B.: Brief derivation of the PDE limit

B.1. Martingale property and tightness-uniqueness

Appendix C.: Software