Abstract
The inverse problem for radiative transfer is important in many applications, such as optical tomography and remote sensing. Major challenges include large memory requirements and computational expense, which arise from high-dimensionality and the need for iterations in solving the inverse problem. Here, to alleviate these issues, we propose adaptive-mesh inversion: a goal-oriented hp-adaptive mesh refinement method for solving inverse radiative transfer problems. One novel aspect here is that the two optimizations (one for inversion, and one for mesh adaptivity) are treated simultaneously and blended together. By exploiting the connection between duality-based mesh adaptivity and adjoint-based inversion techniques, we propose a goal-oriented error estimator, which is cheap to compute, and can efficiently guide the mesh-refinement to numerically solve the inverse problem. We use discontinuous Galerkin spectral element methods to discretize the forward and the adjoint problems. Then, based on the goal-oriented error estimator, we propose an hp-adaptive algorithm to refine the meshes. Numerical experiments are presented at the end and show convergence speed-up and reduced memory occupation by the goal-oriented mesh adaptive method.
Export citation and abstract BibTeX RIS
Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 license. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
1. Introduction
This paper concerns the numerical solution of inverse radiative transfer equation, which serves as the mathematical foundation for applications such as optical tomography [2, 4, 32], remote sensing [13, 14, 29, 34], and neutron transport [27, 28, 30]. Despite its wide applications, devising numerical methods for inverse radiative transfer is notoriously challenging because of the high-dimensionality of the forward problem, for which standard discretizations with sufficient accuracy would usually require large memory occupation, and can render the solver too slow to be applicable. Therefore, it is of great interest to devise numerical methods which use fewer degrees of freedoms (DOFs) or less memory, while still maintain the accuracy requirement.
Among the various types of memory reduction methods, we are interested in hp-adaptive mesh refinement (hp-AMR) methods. This is for several reasons. Firstly, the method is versatile in the sense that it can efficiently represent the solution in regions where it is smooth, while also capturing local features. This advantage is especially beneficial for applications such as optical tomography and remote sensing, for which it is common to observe a pattern with a few local features embedded in a smooth background distribution (clouds in the sky for remote sensing; narrow, dirac-delta like inflow laser beam for optical tomography; etc). Another reason concerns the well-developed theoretical understanding of AMR, and its success in the field of solid and fluid mechanics. Indeed, the concept of hp-AMR traces back to at least the 1980s by the pioneering work of Babuška and his collaborators [18]; see also the review paper [5] for hp-finite element methods (FEM). It was shown that hp-adaptive FEM can achieve exponential convergence even when the solution presents singularities. This suggests the great potential of using hp-AMR to reduce the DOFs for radiative transfer. Our recent work [15] demonstrates this potential for the forward problem, while here we consider solving the inverse radiative transfer problem.
Before we proceed to the introduction of our proposed method, we first review some related existing work on inverse radiative transfer, or more generally on inverse transport problems. The well-posedness of inverse transport problems was studied in [6, 12, 37] by exploiting the singular decomposition of the albedo operators. Then, the stability of the inverse problem in different scaling was studied in [10, 11, 25, 26]. For the numerical discretization of the inverse problem, the time-independent inverse radiative transfer was considered in [1, 22–24], and the frequency domain problem was considered in [33]. Finally, [32] gives a review on the numerical techniques for the inverse transport problems in medical imaging. In [3], the effect of the numerical error on the quality of the reconstruction was discussed, where it was shown that the numerical error on the forward/adjoint solver can lead to significant errors in the reconstruction of the optical images.
Considering the efficiency of AMR in reducing numerical error, it is not surprising to see that AMR has also been applied to inverse problems. In [19, 20], the authors considered the diffuse optical imaging problem, where the connection between the forward/adjoint solver error and the reconstruction error was studied theoretically, and an AMR method was proposed based on their theoretical findings. In [35, 36], finite volume (FV) based AMR methods for fluorescence/luminescence imaging were studied numerically. In [7, 8], a FEM based AMR method was proposed for fluorescence-enhanced optical tomography, and a large savings in DOFs was observed in numerical tests.
In this paper, we propose a goal oriented hp-AMR method which is distinctive to the existing AMR approaches of solving the inverse radiative transfer equation in the following aspects. First, we devise a novel goal-oriented error estimator, which provides more efficient mesh-refinement strategies compared to energy or L2-norm based error estimators; see [9, 39]. In addition, we use hp-AMR instead of only h-AMR with fixed p. The extra p-adaptivity allows more efficient representation of the solution where it is smooth. Finally, we consider the full radiative transfer equation instead of its diffusion-type approximation. The full radiative transfer equation is a more computationally challenging task because of the interplay of both the advection and the scattering terms. Since we aim for maximal generality of our method for the different regimes of the radiative transfer equation, we use discontinuous Galerkin (DG) methods to approximate the forward and the adjoint equations. The DG methods can handle well both the advection-dominated and the diffusive regimes, by their flexible choices of the numerical traces. For instance, upwind-type fluxes allow the schemes to suppress unphysical oscillations in the advection-dominated regime, while in the diffusive regime, HDG-type fluxes enable the techniques of static condensation to save more DOFs. Finally, the DG setting allows a straightforward implementation of hp-adaptivity, which lies at the center of our memory reduction method.
The development of our method is based on two key observations. The first is that there exists a goal function, and by minimizing this goal function, the discretized error landscape will be sandwiched by the original landscape in the PDE setting (see (4) and (13) for their definitions). This observation motivates the use of this goal function for devising error estimators for AMR. Another observation is that when the numerical error is large, we can use the adjoint solution, which was originally calculated for updating the optical parameters, to also devise the goal-oriented error estimators for doing AMR. This observation suggests that the goal-oriented AMR is especially suitable for the inverse problem setting, since the adjoint solution only needs to be solved for one time but can serve two purposes, namely, the gradient calculation and the error estimator calculation.
The rest of the paper is organized as follows. In section 2, we consider the forward and the inverse problems of the radiative transfer in the PDE setting, and propose an algorithm of reconstructing the optical coefficients. This algorithm will serve as the 'PDE map' for the numerical discretization of the inverse equation. In section 3, we first introduce the DG discretization for the forward and the adjoint problems, and the gradient calculation. Then, we propose the goal function and devise the goal-oriented error estimators. At the end, we combine the discretization and the error estimator and propose an algorithm of reconstructing the optical coefficient with hp-mesh adaptivity. Finally, in section 4, we present numerical experiments to test the performance of our proposed method.
2. Forward and inverse problems
In this section, we introduce the forward and the inverse problems of the radiative transfer equation. Then, an algorithm (algorithm 1) is proposed to reconstruct the optical coefficients in the PDE setting.
Algorithm 1. Reconstructing optical coefficients—PDE setting. |
---|
1: Set the maximal iteration count and the error tolerance ε > 0 |
2: Initialize σ0 |
3: Set k = 0 |
4: while do |
5: Using σk , solve (7c ) = 0 to obtain ui |
6: Using σk and ui , solve (7b ) = 0 to obtain vi |
7: Using σk , ui , vi , and (7a ) to obtain the updating direction |
8: Update the optical coefficients by , where γ is determined by a line search |
9: Update the iteration count: |
10: if then |
11: Break the loop |
12: end if |
13: end while |
2.1. Forward problem
Let be a bounded Lipschitz domain, and S represent the unit sphere in . Let be the boundary of Ω, and n the outward-pointing unit normal vector on the boundary. We denote by
the outflow and the inflow boundary of , respectively. We consider the following (time-independent) radiative transfer equation: find such that
In the above equation, is the radiative intensity at spatial location x and along the direction , while , , , and are the absorption coefficient, the scattering coefficient, the source term, and the inflow radiation, respectively. We assume the scattering phase function to have the form of the Henyey–Greenstein function:
where g is the asymmetric parameter, and c is chosen such that . Here V is the solution space
where Vg is the affine space of V satisfying the inflow condition (1b ). Note that V0 is a subspace of V.
To render the presentation more concise, we rewrite (1) into a more compact form. Let , and , where γ− is the trace operator to , and σ represents the collection of both and , namely . Then the equation (1) transforms into
In the equation (1) (or (3)), we use the square bracket to indicate that (or ) depends linearly on u; the round bracket suggests that (or ) can depend non-linearly on σ. In the setting of the forward problem, the optical property represented by σ and the source/boundary term F are given as input data. Then the task it to solve the equation (1) (or (3)) for the radiative intensity u. We refer to [16] and the references therein for the well-posedness of the forward problem.
2.2. Inverse problem
In the setting of the inverse problem, we are given certain measurements of the solution, here denoted as , while the task is to infer the distribution of the optical property σ or the source term f. The former task fits into the field of optical tomography, while the latter one is usually referred to as inverse source problems. We refer to [2, 6] for a review on these topics.
In this paper, we focus on the optical tomography problem. To be more specific, we consider the task of reconstructing the optical property of the medium σ, based on a set of tests determined by the source terms Fi with , and a set of measurements on the outflow boundary , here denoted as with . This problem can be formulated as an optimization problem. Suppose is the target optical coefficient we would like to reconstruct. We define as the measurement data. For many applications, the data are polluted by noise. In such a case we define where δij represent the noise. The optical tomography problem becomes
where wij are the weights associated to each test-measurement signal yij . A typical choice is ; see [32]. Here, is the regularization term and α is the regularization parameter. Some widely used examples of regularization include the L2-regularization, for which we choose (where is a spatial average), or H1-regularization, for which we choose . For more on regularization techniques and the choices of regularization parameters, we refer to the monograph [17] and the references therein.
Note that is nonlinear and there is no explicit expression for in general cases. To proceed, we transform (4) into a PDE-constrained optimization problem:
To solve the problem, we introduce the Lagrangian
where vi is the Lagrange multiplier. By the Karush–Kuhn–Tucker (KKT) necessity condition [38], the solution of (5a ), if exists, is also the first-order stationary point of Ψ in (6). By taking Fréchet derivatives of (6), we obtain
where is the trial-test bracket, manifested as the summation of the L2-inner product on and , and is the adjoint operator of .
Now, the optimization problem reduces to finding the stationary point such that equations (7) equal 0. Based on (7), we propose the following algorithm 1 of finding . In the algorithm, the forward and the adjoint problems associated with (7c ) and (7b ) are first solved to obtain ui and vi , which are then used to update σ by an updating direction and the step length γ > 0. The step length γ can be determined by a line search [31]. The updating direction can be simply chosen as the gradient given by (7a ), or by Quasi–Newton method such as the BFGS method. We refer to [31] more details.
3. Numerical discretization
In this section, we introduce the numerical approximations to the inverse problem (4), manifested by using the discontinuous Galerkin spectral element (DGSE) methods to discretize the forward problem (7c ) = 0, the adjoint problem (7b ) = 0, and the gradient calculation (7a ). Then, we propose a goal-oriented error estimator which can be easily calculated based on the forward and the adjoint numerical solution. Finally, based on the error estimator, we introduce an hp-AMR algorithm to solve the inverse radiative transfer problem numerically.
3.1. DG
3.1.1. Approximation spaces.
We begin by considering the approximation spaces. For simplicity, we restrict ourselves to consider only a 2D domain Ω discretized by rectangular meshes . The generalization to 3D and polyhedral meshes is straightforward under the DG setting. Since any is rectangular, we can write . We require that the mesh satisfies the one-irregular condition, i.e. for any element , there are at most two neighbour elements connecting to K through each edge. Here h is both an index for the triangulations and the mesh-size, which is defined as . For each , we denote by the triangulation of the unit sphere S. We also let be the collection of the faces of K. Now we introduce the approximation space:
where represents the polynomial degree and . Here we choose where φi supported on is the Lagrange polynomial basis associated to the ith Gauss–Legendre–Lobatto quadrature points, and is the push-forward map. This choice of basis gives us spectral element methods. Note that the space Vh is completely determined by the mesh discretization and the polynomial degrees , , and which are associated to each spatial-angular element . We will sometimes write to denote the mesh encoded with also the polynomial degree information. Finally, to incorporate the boundary condition, we introduce
As a special case, is a subspace of Vh with zero inflow radiation.
For notation conciseness, we define
as the L2 inner product on , and we denote the associated norm as . Also, when there are two elements and which share a common face F, we introduce the jump notation . If the face F is an exterior face, namely, if , then .
3.1.2. DG for the forward and adjoint problems.
Now we are ready to introduce the DG numerical discretization for the radiative transfer equations. We first define the following bilinear form:
where we take the upwind numerical trace :
and is the restriction of uh from the neighbour elements of K on F.
The DG approximation for the ith test of the forward problem (7c ) reads as
On the other hand, the DG approximation for the ith test of the adjoint problem (7b ) reads as
where
are the measurements. For different tests, we choose different underlying hp-meshes. We denote by and as the underlying hp-mesh for the ith test of the forward and the adjoint problem, respectively. We shall assume that is obtained by refining . For instance, can be a one-level h or p-refinement of . This extra refinement for the adjoint meshes will be essential for devising the error estimators for the inverse problem, and will be discussed in more detail in section 3.2.2.
3.1.3. Gradient calculation in the discrete setting.
For the rest of this subsection, we consider how to use the forward and the adjoint numerical solutions, namely and , to calculate the gradient and update the optical parameter in the DG setting. Namely, we consider the DG discretization for the gradient calculation (7a ).
We begin by introducing the approximation spaces for the optical parameter σ. Since we use the DG spectral element method as our radiative transfer equations solver, it is natural to choose the discretized optical parameter σh to live in a DG spectral element space. We shall denote by as the underlying mesh for this space, and by , or as the space.
Here we adopt the parametric reconstruction method [32] as a regularization to relieve the ill-posedness of the original inverse problem (4). This means that we shall set to live in a coarse mesh with large mesh-size. In the extreme case, we choose to be the one-element mesh covering the whole domain Ω. Then, the polynomial degree p can be adjusted in the sense that a lower degree p represents a stronger regularization effect (restriction to the low-frequency modes) and vice versa.
Now, the discretized version of (7a ) becomes
for any test function , where , and and are the solution of (9) and (10), respectively. By letting the test function go over all the basis polynomials of , we obtain a vector as the updating gradient for σh .
We conclude section 3.1 by the following algorithm 2.
Algorithm 2. Reconstructing optical coefficients—DG discretization. |
---|
1: Set the mesh for the optical parameter— |
2: Set the mesh for each test i— for the forward and for the adjoint problem |
3: Set the maximal iteration count and the error tolerance ε > 0 |
4: Initialize |
5: Set k = 0 |
6: while do |
7: Using , solve (9) to obtain |
8: Using and , solve (10) to obtain |
9: Using , , , and (12) to obtain the updating direction |
10: Update the optical coefficients by , where γ is determined by a line search |
11: Update the iteration count: |
12: if ( defined in (13)) then |
13: Break the loop |
14: end if |
15: end while |
3.2. Goal-oriented error estimator
3.2.1. The goal function.
The numerical solver (9) and (10) can be rewritten in a compact form as follows:
where and represent the forward and the adjoint solver, respectively. Using this notation, the discretized inverse problem (4) becomes
where is the index set for all tests. Note that the discretized error landscape functional depends both on the optical property σ and the set of the hp-meshes . If we denote
then the error landscape can be related to its discretized version ) as follows:
As a result, when , we have
The above estimate suggests that, when the error goes to zero, the minimizer of also minimizes and vice versa. This observation suggests that we should aim for devising numerical methods such that is reduced in the most efficient way. We next explain how this can be achieved in more detail.
Toward this aim, for simplicity, write and . Recall that we denote by the minimizer of so we have by definition. Now, for each test i, we introduce the functional :
Then, it holds that
The above identity suggests that we can use the functional to devise error estimators to guide the mesh-refinement of . This approach fits into the well-known framework of goal-oriented error control and AMR; see [9]. To be more specific, the error estimators are obtained by solving the dual equation
which coincides with (10) except the right-hand-side term is different. Now note that
Therefore, when , or namely, when the numerical error dominates the error caused by optical parameters, we have
Thus, in this case, we can use as an alternative of to devise error estimators to guide the mesh refinement of . As a result, we have . This suggests that we do not need to solve another adjoint equation for but can simply use to devise the error estimators.
3.2.2. The error estimators.
For the rest of this subsection, we show how a goal-oriented error estimator can be devised based on a given functional J in the DG setting. Namely, we would like to control by a posteriori error estimators. To make the presentation concise, we shall consider the forward problem (1), and its DG discretization formulated as follows: find such that
Before we present the main result, we first introduce two lemmas on the consistency and the Galerkin orthogonality of the DG bilinear form (8a ).
Lemma 3.1 (consistency of the DG bilinear form). Suppose u solves (1). Then
Proof. By multiplying (1a ) with and integrating on , we obtain
Since u is the exact solution, we have by the consistency of the numerical trace, see (8b ). Then we take summation for all and . This completes the proof.
Lemma 3.2 (Galerkin orthogonality). Suppose u solves (1) and uh solves (14). Then
Proof. We simply choose in (15) and then take the difference between (14) and (15). This completes the proof.
Consider the following dual equation: find such that
Then we define
where
In the above equations, ϕh can be any function in .
Proposition 3.1. Let be the solution of (1), solves (14), and be the solution of the dual problem (17). Then, we have
Proof. Since and , we know . Thus, we can let in (17), which gives us
where the Galerkin orthogonality identity (16) is used for the second equal sign. Then, by the definition of the bilinear form (8a ) and its consistency property (15), we can proceed as follows:
Now, note that
This completes the proof.
Proposition 3.1 suggests that we need to solve a dual equation (17) to calculate the error estimators. Since a direct solve for z is not feasible, we calculate an approximate solution to z by zh . Here zh should live in a more refined space than the space for uh . Otherwise, we can choose and this will render the error estimators vanishing. To be more specific, we seek such that
Here is a more refined space than Vh . For instance, we can choose to be an h-refined or p-refined version of Vh . In this paper, we shall use the p-refined version since it is straightforward to implement. Once zh is calculated, we choose where is the L2 projection onto the space Vh . Namely, the weighting terms become
For the rest of this section, we use the error estimators derived here to design an hp-AMR method to solve the inverse radiative transfer problem (4).
3.3. Goal-oriented mesh adaptation for the inverse problem
Up to this point we have introduced the forward solver (9), the adjoint solver (10), the gradient calculation (12), and the error estimators (18). Next, we show how they can be combined to solver the inverse problem (13) with mesh adaptation. We shall first go over the basic elements of the hp-AMR and then propose the full algorithm.
3.3.1. hp-AMR.
Given a mesh , we can refine it based on the error estimators introduced in (18). For h-refinement only, (18) is sufficient for an AMR algorithm. For hp-AMR, we will need an additional smoothness estimators to complete the algorithm. Since we use a more refined mesh for the duality solution vh , the dominating error comes from uh . Hence, here we use uh for devising the smoothness estimator.
We consider the smoothness estimators proposed in [21]. These estimators are obtained by examining the decaying pattern of the coefficients obtained from a Legendre expansion of the numerical solution. To be more specific, given the forward solution uh , we first calculate its mean intensity . Then, for any element , we calculate the Legendre expansion coefficients of on K, here denoted as with and , where px and py are the polynomial degrees of the tensor-product basis function on K in the x and the y directions, respectively. Then, the smoothness estimator is defined as follows:
With the error estimator ηK (defined in (18)) and the smoothness estimator (defined in (19)), we propose the following algorithm 3 to refine a given mesh .
Algorithm 3. hp-adaptive mesh refinement. |
---|
1: Sort all element in increasing order according to the error estimator ηK defined in (18). |
2: Mark the largest percentage of the elements for refinement—. |
3: For all , calculate the smoothness estimators defined in (19). |
4: For all , we perform the following refinement |
5: if then |
6: Perform p-refinement for K. |
7: else |
8: Perform h-refinement for K. |
9: end if |
Note that for each test i, the error estimators are computed based on the forward solution and the adjoint solution ; see equation (18), where z is replaced by .
3.3.2. The full algorithm.
We conclude this section by proposing the algorithm 4 of solving the discretized inverse problem (13) with mesh adaptation.
Algorithm 4. Reconstructing optical coefficients—DG discretization with hp-adaptive mesh refinement. |
---|
1: Set , the mesh for the optical parameter |
2: Set the maximal iteration count , the maximal DOFs count , and the error |
tolerance ε > 0. |
3: For each test i, initialize , the mesh for solving the forward and the adjoint problems |
4: Initialize the optical parameter . |
5: Set k = 0 |
6: while and do |
7: Using , solve (9) to obtain |
8: Using and , solve (10) to obtain |
9: Using , , , and (12) to obtain the updating direction |
10: Update the optical coefficients by , where γ is determined by a line search |
11: Update the iteration counting: |
12: if ( defined in (13)) then |
13: For each test i, refine the mesh by algorithm 3 |
14: Reinitialize the optical parameter |
15: end if |
16: end while |
Note that previously in algorithm 2, we stop the iteration when the relative difference . Here in algorithm 4, we apply mesh-refinement instead of stopping the iteration. This procedure is repeatedly applied until the maximal DOFs count is reached. This can prevent the algorithm from refining indefinitely. The constant can be chosen according to the memory-occupation restriction or computational time restriction. The parameter λ determines how much information will be inherited from the optical parameter recovered in the previous mesh setting. If the previous mesh setting can provide a relatively good approximation of σ, then we can increase λ for a better initial guess. Otherwise, we choose λ = 0 to avoid potential bias from an inaccurate numerical solution. In the numerical tests we carried out in this paper, it only takes a few steps for to converge, so we always choose λ = 0.
4. Numerical experiments
In this section, we carry out numerical experiments to test out the performance of our proposed numerical methods. In the first subsection, we test the goal-oriented AMR method for the forward problem. Then, in the second subsection, we test the performance of the method for the inverse problem.
4.1. Goal-oriented AMR for the forward problem
In this subsection, we consider the numerical approximation to the forward problem (1).
Before testing the goal-oriented AMR methods, we first test the convergence of the DGSE method without adaptivity. We consider a rectangular spatial domain with the inflow radiation coming from the top and the left boundary:
For the other sides of the boundary, we assume zero inflow radiation and zero source term f. A scatterer is placed at the center of the domain, so that the extinction coefficient is
We choose the single scattering albedo to be , so the scattering coefficient is and the absorption coefficient is . We choose . The asymmetric parameter in the scattering phase function in (1) is chosen as g = 0.8, which is a typical value for water clouds in the atmosphere.
For this test we consider the DG method (9) with the polynomial degree and also with for the spatial discretization. To compare, we also consider an FV spatial discretization method, which is equivalent to the DG method with . For all these methods, we shall uniformly partition the domain into a collection of elements and slowly decrease the element size. For the angular discretization, we apply a uniform partition into 32 pieces by P0 element (FV angular discretization). Figure 1 shows that the DG method has errors that converge as where the dimension d = 2. This is the expected optimal convergence order for the DG method and the FV method (upwind).
We next consider the forward problem with the goal-oriented AMR algorithm. We consider a rectangular spatial domain with the inflow radiation coming from the top boundary:
For the other sides of the boundary, we assume zero inflow radiation and zero source term f. For this test, we also consider a scatterer placed at the center of the domain following the form of equation (20). To better test the AMR algorithm, here we consider the scatterer with a sharper boundary transition by taking . The other settings are the same as the previous test; namely, we choose and g = 0.8. See figure 2 for a visualization of .
Download figure:
Standard image High-resolution imageOur aim is to find an efficient numerical approximation to J(u), where u is the solution of (1). Here we set the goal function J to be (a smoothed version of) a point measurement on the outflow radiation, located at the right side of the domain:
Specifically, the (smoothed) point measurement is put at . In the first test, we use the error estimator ηK defined in (18) to guide mesh refinement. The mesh refinement algorithm 3 is repeatably applied with the refinement ratio , until the DOFs for uh reach the threshold . To better visualize where the meshes are refined, here we use a fixed polynomial degree p = 3.
To compare, we also consider a standard jump estimator
Since the exact solution u is not known, we calculate as an approximation to the true functional error , where is calculated on a more refined mesh than u. Here is calculated with an extra p-refinement for each element .
Figure 3 shows the numerical solutions and the meshes calculated based on the goal-oriented estimator ηK and the standard jump estimator . We observe that the goal-oriented estimator can successfully guide the mesh to refine at the place where the solution has a sharp gradient, namely, top of the scatter. In addition, it refines along the path connecting the place of the sharp gradient and the place of the point measurement (located at the right side boundary at ). In contrast, the standard estimator only refines where there are sharp gradients, without taking into consideration of the effect of the measurement J.
Download figure:
Standard image High-resolution imageIn figure 4, we plot the error . We observe that the goal-oriented estimator reduces the error efficiently while the standard estimator fails to decrease the error. This is consistent with our observation in figure 3, where the goal-oriented estimator refines both the places of the sharp gradient and the path connecting to the measurement, while the standard estimator fails to refine those elements concerning the measurement J. The experiment shows that the goal-oriented estimator can be much more efficient than the standard one, when the goal function is not an L2 or energy-based norm, but instead a measurement on the domain boundary. Note that this is exactly the case for the applications such as optical tomography, which we shall consider in the next subsection.
Download figure:
Standard image High-resolution image4.2. Goal-oriented AMR for the inverse problem
In this subsection, we consider the numerical tests for the inverse problem (13). We aim to test the performance of algorithm 4 and compare it with other mesh-refinement methods. We consider a rectangular domain where on each side, we put 2 inflow radiation laser beams and 20 measurements collecting the angularly-averaged outflow radiation. As a result, we have and in the formulation (4) or (13). The absorption is fixed as and the asymmetric parameter is chosen as g = 0.1. The scattering coefficient σs is decomposed into a summation where is a background state and represents a perturbation. We assume the background state and the absorption are known and we aim to recover the scattering coefficient . We shall consider two test cases with the same background state but with different perturbations . See the upper-left sub-figures of figures 6 and 9 for the true scattering coefficients for the first and the second test case, respectively.
To generate the data , we solve the forward problem (9) on a very refined mesh, namely, a 8 by 8 mesh with the polynomial degree chosen as p = 9, so we can regard as the 'true' solution. The optical parameter σh is set to live on the one-element mesh with p = 19.
To solve the inverse problem, we apply an H1-regularization, namely , with the regularization parameter chosen as ; see (4) where α was first introduced. We implement algorithm 4, with , and for the mesh refinement algorithm 3. For the iteration method, we use the limited memory BFGS algorithm to calculate the updating direction . The parameter γ is determined by a line search method, for which we use the backtracking algorithm which starts with µ = 1 and then followed by the candidate step length until the Armijo condition
is satisfied, where . For more details on the BFGS method and the line search algorithm, we refer to [31].
We test the performance of algorithm 4 by comparing it with other mesh-refinement methods. We set the initial mesh () by a 4 by 4 mesh with p = 2. To be more specific, we shall compare algorithm 4 with other methods, for which, instead of applying the AMR algorithm 3 in the algorithm 4, we apply (1) h-refinement, for which we partition the domain into more elements (uniformly divided), (2) p-refinement, for which we increase the polynomial degree on each element, (3) h-AMR-goal, for which we only apply h-refinement in the algorithm 3, and (4) hp-AMR-standard, for which we use the standard jump error estimator (18) to guide the mesh-refinement in the algorithm 3. We remark that the h-refinement method with p = 2 is similar to a FV method as mesh resolution is increased, while the p-refinement method with an increased polynomial order p is similar to a spectral method in the sense that an exponential (spectral) convergence can be achieved when the solution is sufficiently smooth.
Now we show the results for the first test case. Figure 5 shows that the hp-AMR-goal method is the most efficient method in reducing the error . Namely, it uses the fewest DOFs to achieve the best recovery. The h-AMR-goal method is overall less efficient than the hp-AMR-goal method but their performances are similar. For the hp-AMR-standard, h-Ref, and p-Ref methods, we observe their errors first decrease and then increase, and in all DOFs levels, their recovered optical property is poorer than the one recovered by the hp-AMR-goal method. The error for the hp-AMR-standard method decreases fast at the beginning but then the error increases to back to the original value, but even in its decreasing phase its performance is less efficient than the hp-AMR-goal method. This consolidates our claim that the goal-oriented estimator is necessary for inverse problems. For h-Ref and p-Ref methods, we observe that they decrease the errors much more slowly than hp-AMR-goal. In summary, the hp-AMR-goal method behaves better than other methods in almost any measure, and the h-AMR-goal method behaves less well but similar to hp-AMR-goal.
Download figure:
Standard image High-resolution imageTo see this more clearly, we plot in figure 6, the recovered scattering coefficients by the different refinement methods. To make a fair comparison, for all the refinement methods, we take the snapshot at the refinement step such that the DOFs are closest to 106. The results agree with what we observe in figure 5. Namely, the hpAMR-goal method gives the best approximation to the true scattering coefficient, while using the fewest number of DOFs.
As additional detail of what is seen in figure 6, we observe that the hRef method and hpAMR-standard method both fail to provide a satisfactory recovery to the true distribution (left-top sub-figure). All of the pRef, hAMR-goal, and hp-AMR methods can successfully recover the correct location and the approximate shape of the perturbation . In further detail, though, the h-AMR-goal and hp-AMR-goal methods provide a more accurate recovery for the amplitude of perturbation in comparison to the p-Ref method. Note that here we chose a fixed regularization parameter α for all refinement methods for a fair comparison. Despite this, we remark that if it is allowed for the regularization parameter α to be adapted properly to the different refinement algorithms, the recovery for pRef, hAMR-goal, and hpAMR-goal might further improve.
To visualize how the meshes are refined by the different methods, we plot the forward solution and the corresponding meshes in figure 7. For goal-oriented hp-AMR method, we observe that the mesh is refined both at the place where the forward solution has a sharp gradient, and also at the left boundary where the measurement is located. In contrast, the standard hp-AMR method only refines the mesh where the sharp gradients are located, but fail to refine also at the measurements. The goal-oriented h-AMR method refines both at the sharp gradients and at the boundary where the measurements are located. However, it does not use a high order approximation as in the hp-AMR cases. This could explain why its performance is overall less great than hp-AMR-goal. Therefore, the goal-oriented hp-AMR is the only method that supports both (1) a balanced refinement strategy between resolving the forward and the adjoint solution, and (2) an automatic high-order approximation by p-adaptivity. Considering this, it is not surprising to see that the hp-AMR-goal method recovers the optical coefficient with the best quality in figure 6.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageTo test the reliability of our proposed method, we conduct another experiment (the second test case) with the same setting used for figure 5 but with a different scattering coefficient σs to be recovered; see the top-left figure of figure 9 for a visualization of . From figures 8 and 9, we again observe that our proposed goal-oriented hpAMR method has the best performance compared to all the other refinement approaches. Namely, it uses the fewest DOFs but provides the best quality of the recovered scattering coefficient σs .
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageTo test the robustness of the proposed goal-oriented hp-adaptive algorithm, we conduct additional numerical tests with noise added to the signals yij . Namely, instead of using yij , we use where Xij are independent, identically distributed random variables from a uniform distribution on , and δ represents the noise level. We consider two cases of and .
Figure 10 shows that when the noise level , the performance of the different refinement methods are similar to the noise-free case (figure 5). We again observe that the hp-AMR-goal methods gives the smallest error while using the fewest DOFs. When the noise level δ increases to , we observe an increase of the errors for most refinement methods. Despite this, the hp-AMR-goal method still provides the overall smallest error compared to the other refinement approaches, especially in the last refinement level. In figures 11 and 12, we plot the recovered scattering coefficient by the different refinement methods with the noise level and , respectively. The results are consistent with what we observe in figure 10.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageWe repeat the noise test for the second test case (with the true scatterer shown in the upper-left sub-figure of figure 9). In figure 13, we observe that in both cases of the noise level of and , the hp-AMR-goal method provides the overall smallest error. We plot in figures 14 and 15 the recovered scattering coefficients. The results are consistent with what we observe in figure 13 in the sense that the hp-AMR-goal methods provides the best quality of the recovery in the different noise levels.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution image5. Conclusion
In this paper we propose a goal-oriented hp-AMR method to solve the inverse radiative transfer equation. The method is based on the development of a novel goal-oriented error estimator, which is achieved by connecting two kinds of duality arguments in different fields, namely, (1) the duality-based based mesh adaptivity for goal-oriented error minimization, and (2) the adjoint-based inversion techniques for solving inverse problems. The numerical tests suggest that the proposed method solves the inverse problem with the best quality of the recovered optical coefficients, while using the fewest DOFs. While our method is proposed here for inverse radiative transfer, the general principles of devising the error estimators and designing the refinement algorithms should be able to be naturally extended to enable adaptive-mesh inversion for other types of inverse problems, such as the Calderón problem or inverse scattering.
Potential future work includes further study of combinations of more involved regularization strategies, and applications to remote sensing problems.
Acknowledgments
This research was partially supported by Grant NSF DMS 2324368 and by the Office of the Vice Chancellor for Research and Graduate Education at the University of Wisconsin–Madison with funding from the Wisconsin Alumni Research Foundation. The authors thank two anonymous reviewers whose suggestions led to improvements of the paper.
Data availability statement
No new data were created or analysed in this study.