Abstract
This paper is concerned with the inverse random source problem for a stochastic time fractional diffusion equation, where the source is assumed to be driven by a Gaussian random field. The direct problem is shown to be well-posed by examining the well-posedness and regularity of the solution for the equivalent stochastic two-point boundary value problem in the frequency domain. For the inverse problem, the Fourier modulus of the diffusion coefficient of the random source is proved to be uniquely determined by the variance of the Fourier transform of the boundary data. As a phase retrieval for the inverse problem, the phaselift method with random masks is applied to recover the diffusion coefficient from its Fourier modulus. Numerical experiments are reported to demonstrate the effectiveness of the proposed method.
Export citation and abstract BibTeX RIS
1. Introduction
In the past two decades, the differential equations involving fractional-order derivatives, known as the fractional differential equations (FDEs), have received increasing attention in applied disciplines. Such models are able to capture more faithfully the dynamics of anomalous diffusion processes in amorphous materials. Consequently, fundamentally different physics can be obtained. For instance, the FDEs can be used to model some anomalous diffusions in a highly heterogeneous aquifer [1], underground environmental problems [21], the relaxation phenomena in complex viscoelastic materials [20], and non-Markovian diffusion processes with memory [30].
Motivated by the significant applications in scientific and industrial fields, the research of inverse problems has gone tremendous developments in the past decade. Recently, the inverse problems on FDEs have become an active research field [6, 11, 12]. In particular, the inverse problems for the time fractional diffusion equations have been extensively studied mathematically and numerically. Generally, the inverse source problems of FDEs are to determine the time-dependent or the space-dependent source functions by using the space-dependent or time-dependent data [2, 4, 22, 39]. There are also cases where boundary conditions are used as the data [31, 38]. Although there has been a lot of research done for the inverse problems of FDEs, there is little work on the stochastic inverse problems for the FDEs.
The inverse random source problems refer to the inverse source problems that involve uncertainties. Due to the randomness and uncertainty, compared to the deterministic counterparts, stochastic inverse problems have more difficulties in addition to the existing obstacle of ill-posedness. For the inverse random source scattering problem, where the wave propagation is governed by the stochastic Helmholtz equation driven by a white noise, it is shown in [14] that the correlation of the random source could be determined uniquely by the correlation of the random wave field. Effective computational methods are developed in [5, 26] for the white noise model, where statistical properties, such as the mean and the variance, of the random source are recovered by using the boundary measurements of the random wave field at multiple frequencies.
There is much less work on the inverse random source problems for the FDEs. Consider the stochastic time fractional diffusion equation
where denotes the Caputo fractional derivative with α ∈ (0, 1), BH is the fractional temporal Brownian motion with Hurst parameter H ∈ (0, 1) and denotes the formal derivative of BH with respect to the time t, D is a bounded domain with the Lipschitz boundary ∂D, and f, g are two deterministic functions supported in D. Given f and g, the direct source problem is to determine u; while the inverse source problem is to determine the unknowns f and g that generate a prescribed u. For this kind of random sources perturbed by a time-dependent random noise, the mild solution of the problem can be obtained by using the Mittag–Leffler functions, and it may lead to the reconstruction formulas between the unknowns and the measurements. In [33], f and |g| are proved to be uniquely determined by the mean and covariance of the final data u(x, T), respectively, under the conditions that and , i.e., the white noise case. It is also pointed out that the inverse problem is not stable as a small variance of the data may lead to a huge error on the reconstructions. The inverse problem for a generalized case with α ∈ (0, 1) and H ∈ (0, 1) is considered in [16], where f and |g| are recovered from the same statistics of the final data u(x, T). Similar results are obtained about the uniqueness and the stability of the inverse problem. A related inverse random source problem on the stochastic fractional diffusion equation for the time-dependent noise may be found in [27]. If the random sources are perturbed by a space-dependent noise, the mild solution approach is not available anymore since the spatial noise may not be regular enough to guarantee the well-posedness of the problem. New techniques need to be explored to reconstruct the diffusion coefficient of this kind of random sources.
In this paper, we consider the one-dimensional stochastic time fractional diffusion equation
where F is a deterministic function satisfying F(0) = 0 and denotes the diffusion coefficient of the random source, and the Caputo fractional derivative , α ∈ (0, 1), is defined by
with being the Gamma function. Here, Wx stands for the spatial Brownian motion satisfying for any x, y ∈ (0, 1), and denotes the formal derivative of Wx , which should be interpreted as a distribution and is known as the white noise.
To deal with the initial boundary value problem of the stochastic diffusion equation (1.1), we consider an equivalent stochastic two-point boundary value problem in the frequency domain, and study the well-posedness and regularity of the solution based on the estimate of the Green function. It then leads to the well-posedness of (1.1) by showing that the solution to (1.1) is the inverse Fourier transform of the solution to the equivalent problem in the frequency domain. For the inverse problem, the Fourier modulus of the diffusion coefficient F is proved to be uniquely determined by the variance of the solution to the equivalent problem in the frequency domain. However, the recovery of F from is apparently not unique, since we measure instead of and lose information about the phase of F. If we could retrieve the phase of F, then it would be trivial to recover F. This kind of problem, i.e., the reconstruction of a signal from the magnitude of its Fourier transform, is generally known as the phase retrieval [34]. It arises in many applications such as diffraction imaging, optics and quantum mechanics, and is usually ill-posed and notoriously difficult to solve. A large amount of methods have been proposed to solve the phase retrieval problem [23]. These approaches can be broadly classified into two categories: utilizing either a priori information about the signal F or additional measurements of the modulus . In this work, we adopt the latter and apply the phaselift method with random masks to collect additional measurements of the modulus , which may be used to uniquely determine |F|, which is the modulus of the diffusion coefficient of the random source. Numerical experiments are reported to demonstrate the effectiveness of the proposed method.
The rest of the paper is organized as follows. In section 2, the time fractional derivative and its properties are introduced. Section 3 is concerned with the well-posedness of the direct problem. The inverse problem is addressed in section 4, which includes the recovery of , the phase retrieval problem, and its numerical method. Section 5 presents several numerical examples to illustrate the effectiveness of the proposed method. The paper is concluded with some general remarks and directions for the future research in section 6.
2. The time fractional derivative
In this section, we discuss the Fourier transform of the Caputo fractional derivative, which serves as a basis to convert the initial boundary value problem of the stochastic time fractional differential equation (1.1) into an equivalent stochastic two-point boundary value problem in the frequency domain.
Lemma 2.1. Let v be a causal function, i.e., v(t) = 0 if t ⩽ 0, whose fractional derivative is well-defined in . Then the Fourier transform of the fractional derivatives of v satisfies
where
denotes the Fourier transform of v.
Proof. The result is obvious for the case α = 1 if . Next we show the assertion for the case α ∈ (0, 1). Define a causal function with α ∈ (0, 1) by
which satisfies
Based on the fact
it suffices to show that the Fourier transform of admits the following form (cf [35, Sec. 2.9.2]):
In fact, let be a simply connected open set with γR being a closed curve shown in figure 1. It is clear to note that the mapping
defines a holomorphic function in UR . It follows from the Cauchy integral theorem that
where and denote the positively oriented a quarter section of the circle with radius R (the dashed curve in figure 1) and the negatively oriented a quarter section of the circle with radius 1/R (the solid curve in figure 1), respectively.
Clearly, we have
Taking the limit of (2.2) as R → +∞ leads to
Let iωz = s. A simple calculation yields
which completes the proof by noting (2.1). □
Remark 2.2. Note that (iω)α is a multi-valued function when α is a fractional number. Throughout the paper, we define
where sgn (⋅) denotes the sign function.
3. The direct problem
In this section, we discuss the direct problem. The well-posedness of the problem (1.1) and the regularity of its mild solution are investigated by studying an equivalent problem in the frequency domain.
3.1. The direct problem in the frequency domain
Since the function F satisfies F(0) = 0, we denote by the zero extension of F in (−∞, 0) and by the Fourier transform of . Consider the two-point boundary value problem of the stochastic differential equation
In the following, we deduce the Green function and present the well-posedness of (3.1).
3.1.1. Green's function
Let s := (iω)α and denote by gω (x, y) the Green function of (3.1). We consider two cases where gω takes two different expressions.
If ω ≠ 0, then gω satisfies
where δ is the Dirac delta function. It is known from solving the second order ordinary differential equation with constant coefficients that gω (x, y) has the general form
where Ai and Bi , i = 1, 2, are to be determined. Using the boundary conditions
and the continuity and jump conditions
we may easily obtain from Remark 2.2 that
where
Here the notations and stand for the real and imaginary parts of a complex value, respectively.
If ω = 0, then g0(x, y) solves
Similarly, we may solve the above two-point boundary value problem and obtain
Lemma 3.1. The Green function gω given in (3.2) and (3.3) satisfies the estimate
where C > 0 is a constant independent of ω.
Proof. If ω = 0, then it follows from a simple calculation that
If ω ≠ 0, we get
where the function h is defined by
Clearly, h is a nonnegative function for all k > 0.
We claim that h(k) is uniformly bounded for all k > 0. On the one hand, due to the fact
there exists a constant C0 such that h(k) < 1 for all k > C0, which shows that h is uniformly bounded on (C0, ∞). On the other hand, by noting
we get that h is also uniformly bounded on (0, C0] due to its smoothness, which completes the proof of the claim.
Using the estimates for h, we have from (3.4) that there exists a constant C > 0 independent of ω such that
which completes the proof. □
3.1.2. Well-posedness
Based on the Green function gω , we are able to show that the two-point boundary value problem of the stochastic differential equation (3.1) admits a unique mild solution. The following result gives the estimate of the mild solution, which plays an important role in the analysis of the time domain problem.
Theorem 3.2. Assume that . Then the stochastic differential equation (3.1) admits a unique mild solution given by
Moreover, the solution U satisfies the estimate
where C > 0 is a constant.
Proof. The existence and uniqueness of the mild solution can be proved similarly as those in [5]. We only show the proof for the estimate (3.6).
By Itô's isometry, Lemma 3.1 and Parseval's identity, we get
which completes the proof. □
Hereafter, the notation a ≲ b stands for a ⩽ Cb, where C is a positive constant whose value is not required but should be clear from the context.
3.2. The direct problem in the time domain
Using the result obtained for the equivalent problem in the frequency domain (3.1), we are now at the position to show the well-posedness of the time domain problem (1.1).
Theorem 3.3. Assume that . Then the initial boundary value problem of the stochastic differential equation (1.1) admits a unique solution u satisfying
where C > 0 is a constant.
where denotes the inverse Fourier transform and U is the mild solution of (3.1). Define
To show the existence of the solution to (1.1), we prove that the function u defined above is a solution of (1.1).
Note that
where is the zero extension of F on (−∞, 0) which is defined at the beginning of this section. Hence, is a causal function with if t ⩽ 0, which implies that
Moreover, it follows from Parseval's identity and Theorem 3.2 that satisfies
which indicates that the Caputo fractional derivative of with respect to t is well-defined.
Taking the inverse Fourier transform with respect to ω on both sides of (3.1), we have
where we have used Lemma 2.1 for the causal function and the fact . The equation (3.10), together with the initial condition (3.8), indicates that u defined in (3.7) is a solution of (1.1) satisfying the estimate (3.9).
The uniqueness of the solution of (1.1) is obtained directly by the equivalence between the time domain problem (1.1) and the frequency domain problem (3.1), as well as the well-posedness of (3.1) given in Theorem 3.2. □
4. The inverse problem
In this section, we address the inverse problem which is to reconstruct the diffusion coefficient F of the source from the measured wave field u(0, t) at the point x = 0 for t > 0.
4.1. The reconstruction of
First we consider the reconstruction of the modulus of the Fourier coefficients of F, and investigate the uniqueness and the issue of instability of the inverse problem.
4.1.1. Uniqueness
It follows from (3.5) that the mean and variance of the solution U(x, ω) at x = 0 satisfies
and
where we have from (3.2) and (3.3) that
if ω ≠ 0 and
if ω = 0.
Proof. For ω = 0, a simple calculation yields
For ω ≠ 0, it holds
where we have used the fact for any , and the function l1 is defined by
It then suffices to show that l1(k) > 0 for all k > 0. Note that
where l2(k) = (2k − 1)e2k − 2k2ek + 1. It is easy to check that l2(k) > 0 and hence l1'(k) > 0 for all k > 0. In fact, by noting that
we get
Hence the function l1 is increasing for all k > 0 and satisfies
which completes the proof. □
Theorem 4.2. Assume that . Then the modulus can be uniquely determined by the data .
Proof. It follows from (4.1) and Lemma 4.1 that
which implies the uniqueness of the reconstruction. □
4.1.2. Instability
By theorem 4.2, the inverse problem admits a unique solution but it lacks stability due to the fact that the denominator ∫D |gω (0, y)|2dy goes to zero as ω → ∞, which is stated in the following theorem.
Theorem 4.3. For any fixed ω ≠ 0, it holds
Proof. By (4.2), we have
Following from the same estimate as that of (3.4), we get
which completes the proof. □
4.2. Phase retrieval
In this section, we discuss the phase retrieval problem. More precisely, we aim to numerically reconstruct |F| from , where the former is the modulus of the diffusion coefficient F of the source in the time domain and the latter is the modulus of the diffusion coefficient in the frequency domain.
To introduce the numerical methods for the phase retrieval, we begin with presenting a discrete version of the phase retrieval problem, and then introduce an additional measurement based framework named phaselift [8], which is adopted to solve the phase retrieval problem.
4.2.1. Discrete phase retrieval problem
Let be a signal of length N, and be its N-point discrete Fourier transform (DFT). Denote by f (m) the conjugate of the mth column of the N-point DFT matrix, i.e.,
Then it is easy to check that ym = ⟨ f (m), x ⟩, where ⟨⋅, ⋅⟩ is the complex inner product defined by
The discrete phase retrieval problem is formulated as follows:
Pioneered by Gerchberg in [19], earlier approaches for the phase retrieval problem are based on alternating projections and can be reformulated as the following least-squares problem:
The algorithm requires oversampling by using an M-point DFT with M > N. It attempts to minimize the above non-convex objective by starting with a random initialization and iteratively imposing the time domain and Fourier magnitude constrains using projections. However, since the projections are taken between a convex set and a non-convex set, the solution usually converges to a local minimum, which leads to the limited recovery ability of the algorithm even in the deterministic setting.
Recent frameworks to attack the ill-posedness of the phase retrieval problem can be broadly classified into two categories: (1) developing a modified model with a prior information; (2) taking more magnitude measurements. The former aims to reduce the number of unknowns by assuming some a prior information of the signal, such as the support constraints [13], positivity and real-valuedness [17], or sparsity [24]. Depending on the applications, the latter can be done in various ways, which include the use of masks [25], optical gratings [28], oblique illuminations [15], or short-time Fourier transform magnitude measurements which utilize overlap between adjacent short-time sections [36].
4.2.2. Phaselift
Based on the semi-definite programming method, the phaselift is an effective approach to solve the phase retrieval problem. It has been shown that the phaselift may yield robust solutions to various quadratic-constraints. Since phase retrieval results are in quadratic constraints, the phaselift is adopted to handle our inverse problem.
There are two main ingredients in this method: multiple structured illuminations and lifting. For the self-contained purpose, we briefly introduce the two components of this method. The details may be found in [8].
First we comment on the multiple structured illuminations. Let be the object of interest, and assume that the illumination schemes, which collect the diffraction pattern of the modulated object , are available, where the pattern may be selected by the user. There are several ways to implement the recovery in practice, such as masking, optical grating, and oblique illuminations. It is usually preferred to adopt the approach in which fewer diffraction patterns are required for a stable recovery.
Next we present the lifting. Based on the diffraction pattern w mentioned above, suppose that we have quadratic measurements of the form
where a (m) can be chosen based on the diffraction pattern w in the following form:
where is defined in (4.3). The phase retrieval problem is then transformed to the feasibility problem:
The core idea of the phaselift is to embed the signal x into a higher dimensional space by using the transformation X = xx *. Next, we lift up and interpret the signal x to this rank-one matrix X . Denoting , we get
where Tr(⋅) denotes the trace of a matrix. Let be the linear operator mapping any positive semi-definite matrices X , denoted by X ⪰ 0, into
Based on the above notation, the phase retrieval problem is equivalent to
where is defined in (4.4). Consequently, the phase retrieval reduces to finding a rank-one positive semi-definite matrix X which satisfies these affine measurement constraints. Equivalently, the phase retrieval problem can be formulated to
The equivalence between (4.7) and (4.8) is obvious since according to (4.5) and it has a rank-one solution. After solving (4.8), we factorize the rank-one solution X as xx *, which then leads to the solution of the phase retrieval problem. Therefore, our inverse problem is equivalent to a rank-minimization problem over an affine slice of the positive semi-definite cone. Furthermore, it is reduced to a problem of low-rank matrix completion or matrix recovery, which is a classical optimization problem that has gained tremendous attention in recent years [9, 10].
However, it is known that the rank-minimization problem (4.8) is NP-hard. The trace norm is thus explored as a convex surrogate [29] for the rank functional in (4.8), which gives the following semi-definite programming problem:
The semi-definite programming problem (4.9) is convex, and there exists a wide choice of numerical solvers including the popular Nesterov accelerated first-order method [32]. As far as the relationship between (4.8) and (4.9) is concerned, it is beyond the scope of this work and we refer to [9, 10] for the detailed discussion.
4.3. Numerical method
In this work, all the numerical algorithms are implemented in MATLAB by modifying the templates for first-order conic solvers (TFOCS) [7]. The TFOCS is a library of MATLAB files which are designed to facilitate the construction of optimal first-order methods for a variety of convex optimization problems including the semi-definite programming problem (4.9) considered in this paper.
To illustrate how we actually handle (4.9), we next briefly introduce the formulation and implementation of a class of optimal first-order methods to solve the following general convex optimization problem:
where is convex and smooth and is convex.
Consider a class of first-order methods to solve (4.10) based on iterations with a generalized projection (cf [7]):
where ||⋅|| is a chosen norm and tk is the step size. The global convergence of (4.11) is guaranteed if tk is bounded away from zero and g( x k+1) has the following upper bound
which can be accomplished by assuming that ∇g satisfies a generalized Lipschitz criterion
for any x , y belonging to the domain of ϕ. Here, ||⋅||* denotes the dual norm of the norm ||⋅|| defined by
Then the bound of (4.12) is assured for any tk ⩽ L−1 under the assumption (4.13), which leads to the convergence
in O(L/) iterations for a simple algorithm, known as the forward–backward algorithm or proximal gradient descent [18], based on (4.11).
Optimal or accelerated first-order methods are able to improve the bound of number of iterations to , and have been studied by many researchers in the past decades (e.g., [37]). The TFOCS implements a variety of the optimal first-order variants based on a variation of the method described by Nesterov in [32]. One variant, the Auslender and Teboulle method [3], is described as follows:
where x 0 is chosen in the domain of ϕ, , and θ0 = 1. Here, θk is usually referred to as the accelerated parameter.
5. Numerical experiments
In this section, we discuss the implementation for solving the direct and inverse random source problems, and report several numerical examples to demonstrate the effectiveness of the proposed methods.
5.1. Numerical discretization of the direct problem
To avoid the inverse crime, we employ the finite difference method to discretize the initial boundary value problem of the stochastic differential equation
over the interval (0, 1) × (0, T) for some T > 0.
Define the partition of the time and spatial intervals with nodes
where ht = T/Nt and hx = 1/Nx , respectively. Let be the numerical approximation to u(xi , tn ). The fractional derivative at (xi , tn ) is approximated by
The second order derivative ∂xx u at (xi , tn ) is approximated by using the central difference method
The white noise at x = xi is approximated by the increment [W(xi+1) − W(xi )]/hx of the Brownian motion W satisfying
where means that the random variables a and b have the same distribution and is a set of independent and identically distributed standard normal random variables, denoted by ξi ∼ N(0, 1).
Using the above approximations, we obtain the following implicit scheme:
By denoting , the above scheme can be rewritten as
for n = 1, ⋯, Nt and i = 0, 1, ⋯, Nx − 1, where we have used the initial condition
Moreover, according to the boundary condition ∂x u(0, t) = 0 in (1.1) and noting that
we define
Hence, when i = 0, (5.1) leads to
By the boundary condition u(1, t) = 0 in (1.1), the following boundary condition is taken into account to the numerical scheme:
and thus (5.1), when i = Nx − 1, turns to be
Combining (5.1) with (5.2) and (5.3), we conclude that the numerical scheme has the following compact matrix form:
5.2. Implementation of the phaselift
For the problem (1.1), the modulus of the Fourier coefficient can be reconstructed from (4.1) by utilizing the variance of the Fourier transform of the field u(0, t). To solve the phase retrieval problem numerically, i.e., to uniquely recover the source at discrete temporal nodes, more measurements are usually required and the phaselift with masks introduced in section 4.2 is used.
Let M 1 and M 2 be two masks with M 1 being the Nt × Nt identity matrix and M 2 being a random binary mask, i.e., an Nt × Nt diagonal matrix whose entries are randomly chosen as 0 or 1. Using these two masks to perform the structured illumination with sources M i f , we can obtain in total 2Nt intensity measurements , where the operator is defined in (4.6) with N = 2Nt , the vectors involved in the operator are defined by
and f (m) is given in (4.3).
To recover the source f , it then suffices to solve the convex optimization problem introduced in section 4.3 based on the objective functional
where ||⋅||2 , λ > 0 is a regularization parameter, and h is the indicator function defined by
Here, b is the quadratic Fourier modulus obtained by the measurement at x = 0 perturbed by the sources with masks. More precisely, has entries
where is the 2Nt -point DFT of the measurement and ωn = 2πnht /Nt .
The Auslender and Teboulle method (4.14) is applied to solve the above problem, which is terminated when the relative residual error of our reconstructed result is less than a fixed tolerance, i.e., . Note that the solution is unique only up to the global phase, i.e., is unique, where c is a complex scalar satisfying |c| = 1. In particular, if F(t) is a real-valued nonnegative function, then the recovery is unique. As a result, the absolute value |F| is recovered in the following three numerical examples.
We set Nt = 65 and Nx = 100, and use the total number of 1000 sample paths to approximate the variance of the solution. In addition, the data is assumed to be polluted by a uniformly distributed noise with the noise level σ = 0.05. In the first example, we set the source function F(t) = sin(t) exp (−t/6) and T = 4π. In the second example, the source function is taken to be F(t) = sin(2t) cos(3t) with T = π. In the third example, a discontinuous function F is considered:
where T = π. We present the results for two different cases: α = 0.4 and α = 0.8 in all of the examples. The reconstructed solutions are plotted against the exact solution in figures 2–4, for the three examples, respectively. In the third example, the results are given for three different noise levels σ = 0, σ = 0.03, and σ = 0.05 in order to test the stability of the method. It can be observed from the numerical experiments that the proposed method is effective and robust to reconstruct the modulus of the source functions.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution image6. Conclusion
In this paper, we have studied an inverse random source problem for the stochastic time fractional diffusion equation driven by a spatial Gaussian random field. By examining the equivalent two-point stochastic boundary value problem in the frequency domain, we show that the direct source problem is well-posed and the inverse source problem has a unique solution. The ill-posed nature is revealed for the inverse problem, i.e., the modulus of the Fourier coefficient of the source function decays in an α order of the frequency. The inverse problem is then converted into a phase retrieval problem which is to recover the original signal from its Fourier modulus and is implemented via the phaselift with random masks. The numerical results show that the method is effective to reconstruct the modulus of the source functions.
The proposed approach based on the Fourier transform can be naturally extended to solve higher dimensional problems. For more complex cases, there are still several interesting problems to be investigated along this line of research. First, the source can be modeled by more general random processes such as spatial fractional Brownian motions. The analysis and results would remain the same for the direct problem. However, the uniqueness of the inverse problem may not be guaranteed due to the correlated kernel of fractional Brownian motions. Second, the Fourier transform requires more constraints about the functions u and f, while the Laplace transform demands less constraints on these functions. Nevertheless, there is no efficient algorithm to deal with the phase retrieval problem on the Laplace transform. In addition, when it comes to the situation that α ∈ (1, 2), more initial conditions are required to ensure the well-posedness of the model equation, which leads to the fact that the transform may not apply in this case. We hope to report the progress on these problems elsewhere in the future.
Acknowledgments
The research of P Li is supported in part by the NSF Grant No. DMS-1912704. The research of X Xu was partially supported by NSFC 11621101, 12071430 and the Fundamental Research Funds for the Central Universities.
Data availability statement
No new data were created or analysed in this study.