Brought to you by:
Paper

Image domain multi-material decomposition using single energy CT

, , , , , , , , and

Published 23 March 2020 © 2020 Institute of Physics and Engineering in Medicine
, , Citation Yi Xue et al 2020 Phys. Med. Biol. 65 065014 DOI 10.1088/1361-6560/ab7503

0031-9155/65/6/065014

Abstract

Multi-material decomposition (MMD) technique decomposes the CT images into basis material images and has been promising in clinical practice for material composition quantification within the human body. MMD could be implemented using the image data acquired from spectral CT or its special case, dual-energy CT (DECT) while the spectral CT data acquisition usually requires a hardware modification. In this paper, we propose an image domain MMD method using single energy CT (SECT). The proposed objective function applies a least square data fidelity term to enforce the minimization between the linear combination of decomposed material image and the measured SECT image, and an edge-preserving (EP) regularization term to meet the piecewise constant property of the material image. We apply the optimization transfer principle to form a pixel-wise separable quadratic surrogate (PWSQS) function in each iteration to decrease the objective function. The pixelwise direct inversion method assisted by the two-material assumption (TMA) is applied to obtain a good initial value. The proposed method is evaluated using a digital phantom, a Catphan phantom and the clinical data. A low-pass filtration method is implemented for a comparison purpose. In the phantom study, the proposed TMA method achieves high volume fraction accuracy (VFA) of 79.64% and the proposed EP method further increases the VFA by 15.56% and decreases the decomposition standard deviation (STD) by 81.51% compared with the TMA method. At the comparable noise level, the proposed EP method increases spatial resolution by an overall factor of 1.01 when the modulation transfer function magnitude is decreased to 50% compared with the low-pass filtration method. In clinical data study, the virtual non-contrast image generated by the proposed method achieves the root-mean-squared-relative error of 2.93% compared with the contrast-free ground-truth image.

Export citation and abstract BibTeX RIS

1. Introduction

Multi-material decomposition (MMD) is a promising technique in clinical CT diagnosis to identify the material compositions within the human body (Alvarez and Macovski 1977, Mendonca et al 2014, Shen et al 2018a, 2018b). MMD technique decomposes the CT images into multi-material images by taking into account of both material spatial distribution and composition. The differentiated materials can be used in the clinical applications including the lesion delineation, organ contouring, virtual monoenergetic synthesis, virtual non-contrast (VNC) imaging and liver fibrosis quantification (Kelcz et al 1979, Kalender et al 1988, Sidky et al 2004, Lamb et al 2015, Ding et al 2018).

MMD can be achieved using the single-energy CT (SECT) or spectral CT scans (Stenner et al 2007, Kis et al 2012, Mendonca et al 2014, Shao et al 2015, Ren et al 2018). In the spectral CT scan, the data acquisition usually requires a hardware modification to the conventional CT scanner including multiple scans (Wang and Zhu 2016), fast x-ray tube voltage switching (Mueller et al 2016), dual-source scanning (Maass et al 2011), dual-layer (Johnson et al 2011) and photon counting detector schemes (Shen et al 2018b), primary modulation (Lee et al 2017a, Petrongolo and Zhu 2018) and etc.

Though demonstrated its efficacy in some clinical applications, spectral CT still suffers from the expensive hardware, complicated decomposition schemes and limited accessibility. SECT, however, is routinely applied as the diagnostic modality in almost all the hospitals. SECT based MMD does not require any modification of the hardware nor decrease the systematic decomposition stability. Using SECT for the MMD is proposed in the literature. Some researchers utilized deep learning method to achieve SECT based material decomposition. Liao et al proposed a cascade deep convolutional neural networks (CD-ConvNet) which applies the first layer of convolutional neural network to implement material decomposition while the second one for noise suppression in the decomposed material images(Liao et al 2018). This method has to train a material specific neural network for a dedicated material basis. For example, two networks (i.e. bone- and water- specific neural networks) have to be trained respectively to perform bone and water decomposition. This multiple training operation narrows the generality of the algorithm. Zhao et al proposed a U-net based network to generate dual-energy CT (DECT) images from SECT image and performed material decomposition using the generated DECT data (Zhao et al 2019). This U-net based method is successfully applied in VNC imaging which is an important application of material decomposition. It is, however, challenged by the requirement of large amount of training data. In addition to the deep learning based SECT material decomposition method, Benedek et al proposed a two-pass SECT two-material decomposition method using the x-ray total path length estimation (Kis et al 2012). This method reconstructs the CT image from the original projections using standard FDK algorithm and then calculates the x-ray path length using the ray tracing algorithm (Siddon 1985). The x-ray total path length, which is the sum of path lengths of two materials, is utilized in the decomposition algorithm as a known constraint. Combining the total path length constraint and the projection measurement, two sets of material images are successfully estimated. Crawley et al analyzed the accuracy of SECT in bone-mineral measurements. The bone-mineral determination using SECT relies on the linear correlation between the in-vivo concentrations of calcium hydroxyapatite and dipotassium phosphate (Crawley et al 1988). These two methods successfully achieve dual material decomposition using SECT while the clinical application is hindered by the amount of decomposed materials since three or more materials are required in some clinical situations (Kis et al 2012). Harald et al further compared the accuracy of liver-fat quantification using the spectral CT and SECT (Kramer et al 2017). The work concludes that SECT achieves the same liver-fat quantification accuracy as that using DECT.

In this paper, we propose an image domain MMD method for SECT using the iterative scheme. The sum-to-one constraint and each pixel containing at most two materials assumption are introduced into the decomposition model. The objective function is composed of a least-square data fidelity term and an edge-preserving regularization term to suppress the magnified noise in the decomposition. The proposed objective function is non-convex due to the two-material assumption. We apply the optimization transfer principle to form a pixel-wise separable quadratic surrogate (PWSQS) function in each iteration to decrease the objective function. To acquire a good initial value for the non-convex problem, we design the pixelwise direct matrix inversion method assisted by the two-material assumption. The proposed method is evaluated using a digital phantom, a Catphan©600 phantom and the clinical data.

2. Methods and materials

2.1. MMD mathematical model using SECT

In the image domain MMD theory, the mass attenuation of a mixture ${\mu _M}\left( E \right)$ in the CT image is assumed to be a linear combination of the mass attenuation of different basis materials and written as (Long and Fessler 2014):

Equation (1)

where ${\mu _{tM}}\left( E \right)$ is the mass attenuation coefficient of the tth basis material at the energy level E. The subscript M indicates the mass attenuation coefficient. ${T_0}$ represents the total number of basis materials. ${\beta _t}$ is the mass fraction of the tth basis material and is written as:

Equation (2)

where ${m_t}$ is the mass of the tth basis material.

The relationship between the linear attenuation coefficient (LAC)${\,}{\mu _E}{\,}$ and mass attenuation coefficient ${\mu _M}\,$ is expressed as:

Equation (3)

where ${V_t}$ is the volume of the tth basis material.

Combining equations (1) and (2) into equation (3), we have:

Equation (4)

where ${x_t} = \frac{{{V_t}}}{{\mathop \sum \nolimits_{t = 1}^{{T_0}} {V_t}}}$ represents the volume fraction of the tth basis material and ${\mu _{tE}}$ is the LAC of the tth basis material. ${x_t}$ satisfies the sum-to-one and box [0 1] constraints:

Equation (5)

The MMD model can be constructed from equations (4) and (5) as:

Equation (6)

In the above equation (6), ${T_0}$ unknows need to be estimated from two equations including a CT measurement ${\mu _E}$ combined from basis material LAC equation and a sum-to-one equation. Additional assumptions are needed to solve the ${T_0}$ unknows. We assume that each pixel in the CT image contains no more than two materials, i.e.

Equation (7)

where ${I_{\left\{ \cdot \right\}}}$ denotes an indicator function which equals 1 if the tth material exists and 0 otherwise. p denotes the pth pixel in the CT image. We define $\Omega $ as a two-material candidates library composed of all the two-material candidates formed from ${T_0}$ basis materials. For each two-material candidate in the library $\Omega $, only two materials need to be estimated from equation (6), indicating that the equation (6) is solvable with the assumption equation (7).

To implement the MMD in the CT image of Np pixels, equation (6) is rewritten in the matrix–vector form as:

Equation (8)

where $\vec \mu $ is the measured CT images after column vectorization. Similarly, $\vec x = {\left[ {{{\overrightarrow {{x_1}} }^T},{{\overrightarrow {{x_2}} }^T}, \ldots ,{{\overrightarrow {{x_{{T_0}}}} }^T}} \right]^T}$ is the columnized vector with the size of ${T_0} \times {N_p}$ and each entry as the volume fraction image of the tth basis material. $A \in {\mathbb{R}^{{N_p} \times {T_0}{N_p}}}$ is the material composition matrix and is written as:

Equation (9)

where $ \otimes $ represents the Kronecker product. ${I_{{N_p}}}$ is a ${N_p} \times {N_p}$ identity matrix. ${A_0} = \left[ {{\mu _1},{\mu _2}, \ldots {\mu _{{T_0}}}} \right]$ is the composition matrix composed of the LACs of the basis materials. The matrix ${A_0}$ is determined by selecting a uniform region of interest (ROI) on the CT images that contain the basis materials and taking the mean value of the ROI as the entry of matrix A.

We formulate the image domain SECT MMD using the least square estimation with regularization as following:

Equation (10)

The material-wise edge-preserving regularization, which has distinguished noise suppression ability and edge maintainability, is selected as $R\left( {\vec x} \right)$ (Fessler and Rogers 1996, Long and Fessler 2014):

Equation (11)

where the regularizer for the tth material is

Equation (12)

where $k$ is the kth neighboring pixel of p. The potential function ${\psi _t}$ is a hyperbola:

Equation (13)

The regularization parameters ${\beta _t}$ and ${\delta _t}$ are chosen for different materials individually to achieve desired edge preservation and noise-resolution tradeoff for each material image. We refer to this proposed MMD scheme as the edge-preserving (EP) constrained SECT method hereafter.

2.2. Optimization algorithm

The objective function equation (10) cannot be minimized directly since the two-material assumption is a non-convex constraint. Similar to our previous work, we apply the optimization transfer principles to minimize to find a series of PWSQS functions at each iteration to decrease the cost function monotonically. The PWSQS function of the cost function at the nth iteration is written as (Xue et al 2017):

Equation (14)

where $R_p^{\left( n \right)}\left( {{{\vec x}_p}} \right)$ is the PWSQS function of the regularization term. Applying the De Pierro's additive convexity trick and Huber's optimal curvature as our previous work, we further derive the PWSQS function ${\phi ^{\left( n \right)}}\left( {\vec x} \right)$ as (De Pierro 1995, Erdogan and Fessler 1999, Fessler 2000, Huber 2011, Xue et al 2017):

Equation (15)

where

Equation (16)

Equation (17)

where $H_{Rp}^{\left( n \right)}\,$ and $\dot R_p^{\left( n \right)}$ are the Hessian and gradient of the regularization term and is derived as:

Equation (18)

where${\,}\,{\omega _{{\psi _t}}}\left( z \right) \triangleq {\dot \psi _t}\left( z \right)/z$.

Equation (19)

where

Equation (20)

To satisfy the constraint of each pixel containing at most two material, we loop over all the two-material candidate in the two-material candidate library ${\Omega }$. The candidate with the minimal surrogate objective function (i.e. equation (14)) value is determined to be the optimal two-material candidate for the pixel to be decomposed. For each two-material candidate, $\tau = \left( {i,j} \right) \in {\Omega }$, the surrogate function degenerates to a classical convex quadratic function of a vector with two unknowns, ${\vec x_p}\left( \tau \right) \triangleq {\left[ {{x_{ip,\,}}{x_{jp\,}}} \right]^T}$. Combining the sum-to-one and box [0 1] constraints, the surrogate objective function can be further rewritten as:

Equation (21)

where $H\left( \tau \right)$ and $\vec q\left( \tau \right)\,$ are formed from elements in $H\,$ and $\vec q$ with indexes corresponding to $\tau = \left( {i,j} \right)$, respectively. The above convex quadratic programming problem can be solved utilizing the Generalized Sequential Minimization Algorithm (GSMO).

2.3. Initial value estimation

Due to the nonconvex of objective function equation (10), a good initial guess is needed to initialize the minimization. Here we propose a direct inversion method based on the two-material assumption to realize the initialization. Equation (8) for the pth pixel can be re-written as:

Equation (22)

where ${A_0} = \left[ {{\mu _1},{\mu _2}, \ldots {\mu _{{T_0}}}} \right]$ is a $1 \times {T_0}$ matrix. When combining the two-material assumption and the sum-to-one constraint with ${A_0}$, we have the composition matrix $A'$ for a two-material candidate $\tau = \left( {i,j} \right)$ as:

Equation (23)

where ${\mu _i}$ and ${\mu _j}$ are the LACs of the ith and jth basis materials, respectively. The two-material decomposition of the pth pixel can be written as:

Equation (24)

Equation (24) is initially solved using the direct matrix inversion since the determinant of $A'$ is not equal to zero. The MMD can be achieved by looping over all the two-material candidates with the total number of $C_{{T_0}}^{\,2}$ options in to two-material candidate library $\Omega $ and picking up the optimal solution with minimum Euclidean distance between the LACs of the current pixel and the basis materials. The distance is calculated as:

Equation (25)

If no direct solution to equation (24) exists, the optimal one is estimated by minimizing the least-square form of equation (24) as:

Equation (26)

Equation (26) can be solved using, e.g. the gradient projection (GP) algorithm (Figueiredo et al 2007). We refer to the proposed SECT initialization scheme as the two-material assumption (TMA) constrained SECT method hereafter.

2.4. Implement details

In summary, the pseudo code of the proposed method is shown in table 1, where the CT images and decomposed material images are denoted by vector signs. The initial value is generated using the proposed TMA method as shown in Lines 1–22. In Lines 11–14 of the multi feasible solutions case, the optimal material candidate is selected by minimizing Euclidean distance between the current pixel and the basis materials. In Lines 15–19 of the no feasible solution case, the optimal solution is estimated by minimizing equation (26). Lines 23–35 show the proposed EP method. The Hessian and gradient of the regularization term are computed in Line 25. The two-material candidate looping operation is performed in Lines 26 to 30. The two-material specific convex quadratic programming problem is solved in Line 29. The optimal result is selected in Line 31.

Table 1. Pseudocode of the proposed MMD algorithm for SECT.

line program
1 ***Initial value generation using the proposed TMA method***
2 //${\mu _p}$ is the LAC of the pth pixel; P is the number of pixels. ${\tau _c}$ is the c-th two-material candidate. C is the number of material candidates. (i,j) is the ith and jth material basis in the material candidate ${\tau _c}$.
3 for p = 1:P
4    count = 0; //the variable count counts the number of feasible solutions
5    for c = 1:C
6      $\left[\!\! {\begin{array}{*{20}{c}} {{x_{pi}}} \\ {{x_{pj}}} \end{array}}\!\! \right] = {\left[\!\! {\begin{array}{*{20}{c}} {{\mu _i}}\quad {{\mu _j}} \\ 1 \quad 1 \end{array}} \right]^{ - 1}}\left[\!\! {\begin{array}{*{20}{c}} {{\mu _p}} \\ 1 \end{array}} \!\!\right]$
7      if ${x_{pi}},{x_{pj}} \ge 0$
8         count = count ± 1;
9      end
10    end
11    if count > 0//Multi solution exist
12      //Calculate the Euclidean distance
13      ${\left. {{\tau _c}^{\text{*}}} \right|_{{\tau _c}^{\text{*}} = \left( {{i^*},{j^*}} \right)}} = \arg \mathop {\min }\limits_{{\tau _c} = \left( {i,j} \right)} \sqrt {{{\left\| {{\mu _p} - {\mu _i}} \right\|}^2} + {{\left\| {{\mu _p} - {\mu _j}} \right\|}^2}} $;
14      $x_p^{\text{*}} = {\left( {{x_{p{i^{\text{*}}}}},{x_{p{j^{\text{*}}}}}} \right)^T}$;
15    else //no feasible solution
16      Use GP method to solve:
17      $\left( {{\tau _c}^{\rm{*}},{x_{pi}}^{\rm{*}}{x_{pj}}^{\rm{*}}} \right) = arg\!\!\!\mathop{{\rm{min}}}\limits_{\begin{array}{*{20}{c}} {{\tau _c} = \left( {i,j} \right)}\\ {{x_{pi}},{x_{pj}}} \end{array}} \left[\!\! {\begin{array}{*{20}{c}} {{\mu _i}}\quad{{\mu _j}}\\ 1 \quad 1 \end{array}} \!\!\right]\left[\!\! {\begin{array}{*{20}{c}} {{x_{pi}}}\\ {{x_{pj}}} \end{array}} \!\!\right] - \left[ {\begin{array}{*{20}{c}} {{\mu _p}}\\ 1 \end{array}} \!\!\right]{_2},\{ \begin{array}{*{20}{c}} {{x_{pi}} \ge 0}\\ {{x_{pj}} \ge 0} \end{array}$.
18      $x_p^{\text{*}} = {\left( {{x_{p{i^{\text{*}}}}},{x_{p{j^{\text{*}}}}}} \right)^T}$;
19    end
20 end
21 Obtain $x_p^{\text{*}} \equiv x_p^{\text{*}}$ with padded zeros for $t \notin {\tau _c}$.//t is the tth material
22 Output: ${\vec x^{\left( 0 \right)}} = {\text{col}}{\left( {x_1^{\text{*}},x_2^{\text{*}}, \ldots ,x_p^{\text{*}}} \right)^T}$//initial value
  ***************Proposed EP method******************
23 Input: ${\vec x^{\left( 0 \right)}} = {\text{col}}{\left( {x_1^{\text{*}},x_2^{\text{*}}, \ldots ,x_p^{\text{*}}} \right)^T}$//initial value
24 For each iteration n = 1,..., Niter //Niter is the total number of iterations
25    Compute Hessian H and gradient $\vec q$ using equations (16) and (17).
26    For each material candidate ${\tau _c} = \left( {i,j} \right) \in {\Omega }$
27        ${\vec x_p}\left( {{\tau _c}} \right) = {\left[ {{x_{ip,\,}}{x_{jp\,}}} \right]^T}$, $\vec x_p^{\left( n \right)}\left( {{\tau _c}} \right) = {\left[ {x_{ip}^{\left( n \right)},\,x_{jp}^{\left( n \right)}} \right]^T}$
28        Form $H\left( {{\tau _c}} \right)$ and $\vec q\left( {{\tau _c}} \right)$ from elements in H and $\vec q$ with indexes corresponding to ${\tau _c} = \left( {i,j} \right)$.
29        Find the optimal ${\widehat {\vec x}_p}\left( {{\tau _c}} \right)$ and the function value $\phi _p^{\left( n \right)}\left( {{{\widehat {\vec x}}_p}\left( {{\tau _c}} \right)} \right)$ of equation (21) using GSMO.
30    end
31    $\widehat {{\tau _c}} = _{\,\tau \in {\Omega }}^{\,argmin}\phi _p^{\left( n \right)}\left( {{{\widehat {\vec x}}_p}\left( {{\tau _c}} \right)} \right)$
32    Obtain ${\widehat {\vec x}_p} \equiv {\widehat {\vec x}_p}\left( {\widehat {{\tau _c}}} \right)$ with padded zeros for $t \notin {\tau _c}$.
33    Update ${\vec x^{\left( {n + 1} \right)}} = \left( {{{\widehat {\vec x}}_1}, \ldots ,{{\widehat {\vec x}}_p}, \ldots ,{{\widehat {\vec x}}_P}} \right)$.
34 end
35 Output: ${\vec x^{\left( {Niter} \right)}}$//The decomposed material images

3. Evaluation

3.1. Data acquisition

We evaluate the proposed method using a digital phantom and a Catphan©600 phantom. In the digital phantom study, the three material bases bone, muscle and adipose and the mixture of muscle and adipose are inserted into six rods placed in a digital adipose container. The digital phantom is an ellipse with the long axis length of 220 mm and the short axis length of 160 mm. The LACs of basis materials are obtained from the National Institute of Standards and Technology (NIST) database. To evaluate the proposed method using various imaging doses and energies, the digital phantom data is simulated using different number of incident x-ray photons and energies to mimic the scanning protocols of low- and normal- doses using various energies.

The Catphan©600 phantom data is measured from a tabletop cone-beam CT (CBCT) system with the geometry in accordance with that of a Varian On-Board Imager (OBI, www.varian.com). The system utilizes a CB4030 flat-panel detector (Varian Medical Systems, www.varian.com) which is composed of 1024 × 768 pixels with the pixel size of 0.388 mm × 0.388 mm. To suppress the CBCT scatter photons, the longitudinal beam width of the projection is limited to 15 mm.

The clinical patient data which is scheduled for the abdomial scan in our hospital are acquired on the 256-slice Philips iCT scanner (www.usa.philips.com/healthcare/) using the standard multi-phase protocol including a contrast-free scan and three more contrast-enhanced scans. The contrast-enhanced scans are classified as arterial phase, portal venous phase and delayed phase according to the contrast agent traversal time throughout the patient body. The scanning and reconstruction details of the above three data sets are shown in Table 2.

Table 2. Data acquisition and reconstruction parameters.

  Basis materials Scanning and reconstruction parameter
Material Ratio Term Value
Digital phantom Adipose 1 Long/Short axis (mm) 220/160
  Bone 1 SID/SAD (mm) 1500/1000
  Muscle 1 Projection number 676
  Muscle-adipose 0.7:0.3 Tube energy (keV) photon number 51:1:85 1 × 103/1 × 105
      Detector/pixel size (mm) 1024 × 768/0.388 × 0.388
      Image/Pixel size (mm) 512 × 512/0.5 × 0.5
Catphan Teflon Projection number 655
  Delrin Tube energy (kVp) 75
  Iodine solution Tube current (mA) 99
  Soft-tissue Detector/pixel size (mm) 1024 × 768/0.388 × 0.388
  Polymethylpentene Image/pixel size (mm) 512 × 512/0.5 × 0.5
Clinical data Bone SID/SAD (mm) 1040/570
  Kidney Tube energy (kVp) 120
  Liver Current (mA) 322
  Muscle Slice thickness (mm) 5
  Adipose Image/Pixel size (mm) 512 × 512/0.68 × 0.68

3.2. Comparison study

To evaluate the performance of the proposed method, we compare the proposed algorithm with a classical separate low-pass filtration method (Rutherford et al 1976, Johns and Yaffe 1985) which suppress the decomposition noise using the TMA method as the initialization. For a fair comparison, the image noise standard deviation (STD) using the low-pass filtration method is kept as the same as that using the proposed EP method.

3.3. Evaluation metrics

3.3.1. Digital and catphan phantom study

We define the volume fraction accuracy (VFA) of decomposition as:

Equation (27)

where ${x_t}^{truth}$ and ${\bar x_t}$ are the ground truth of decomposed volume fraction and the mean value within the ROI for the tth material, respectively.

The STD is applied to quantitatively measure each decomposed image noise in an ROI. STD is defined as:

Equation (28)

where m is the pixel index within the ROI. ${x_{tm}}$ is the mth pixel value in the tth material image. M is the total number of pixels in the selected ROI. The overall decomposition STD is defined as the averaged value of STDs in all the decomposed images.

To further evaluate the noise frequency characteristics of the low-pass filtration method and the proposed EP method at the comparable noise level, we measure the noise power spectrum (NPS) in a uniform area of the decomposed materials. The 2D NPS is defined as

Equation (29)

where $f$ denotes the ROI in which gray values are offset to achieve a zero mean, $DF{T_2}\left\{ f \right\}$ is the 2D Discrete Fourier Transform (DFT) on $f$ (Xue et al 2017).

In addition to the NPS, we measure the modulation transfer function (MTF) in the digital and Catphan phantom studies to investigate the decomposed image spatial resolution using the low-pass filtration method and the proposed EP method (Richard et al 2012). The line spread function (LSF) is calculated using the gradient of the object edge profile and is transfered to MTF using the Fourier transform. We average the profile of adjacent boundaries to acquire the resultant MTF with the minimal fluctuation due to image noise. The measured frequencies at MTF magnitude decreased to 0.5 (−3dB) are compared to evaluate the relative spatial resolution (Samei et al 1998).

To evaluate the impact of energy and noise on the composition matrix in the digital phantom study, we calculate the condition number of the composition matrix. The condition number measures the stability of the solution to a linear system of equations when a small perturbation is applied to the system. The higher condition number indicates the more instability of the system after the perturbation (Niu et al 2014).

3.3.2. Clinical data study

The patient data is very complicated and no mature commercial MMD method is applied in the clinic. True value does not exist as a reference to evaluate the proposed method. Instead, we apply the proposed SECT MMD algorithm to acquire the decomposed materials and perform virtual non-contrast imaging (VNC) using the decomposed materials to verify the accuracy of the proposed method indirectly. VNC, which is commonly used in the clinic, is a technique to remove the contrast agent from the original contrast-enhanced CT image to reduce the radiation dose. Currently, VNC is mostly achieved using dual-energy CT technology. In this paper, we apply the proposed method for SECT to acquire the decomposed materials and substitute the contrast agent by the blood with the same volume fraction. The VNC image is synthesized by the linear combination of the substituted materials. The VNC image at a given pixel p and energy E is reconstructed as (Mendonca et al 2014)

Equation (30)

where the t1-th material refers to the contrast agent and the t2-th material refers to the blood (Mendonca et al 2014).

For the evaluation of the clinical data, the CT number percentage error and the root-mean-square-relative error (RMSRE (%)) of the CT number are summarized to quantify the accuracy of VNC generated using the proposed method. The CT number percentage error is calculated as:

Equation (31)

where ${\left( \mu \right)_t}$ is the average CT number of the tth interested tissue or organ in the VNC image and ${({\mu ^{truth}})_t}$ is the true CT number of the tth interested tissue or organ in the contrast-free image. The RMSRE is calculated as:

Equation (32)

where ${T_0}$ denotes the number of interested tissues and organs in the abdomen slice of the human.

4. Results

4.1. Digital phantom study

The digital phantom on the contrast rods slice consists of four materials including bone, muscle, adipose and air. The high- and low-energy CT images are shown in figure 1. The decomposed basis material images are shown in figure 2. The brightness in the decomposed image indicates the volume fraction of each basis material. The proposed TMA method initially decomposes the CT image into four material images in the high- and low-energy studies. Especially in the ROI3, the proposed TMA method successfully differentiates the mixture area into muscle and adipose, indicating the mixture decomposition capability of the TMA method. On the basis of the TMA method, the proposed EP scheme further suppresses the decomposition noise while maintains the material edge, which is indicated as the smoother area of each material and the comparable material edge.

Figure 1.

Figure 1. 52 keV CT image of the digital phantom using 1 × 103 photons per pixel. Display window is [0.01 0.04] mm−1. The components of ROIs are bone (ROI1), muscle (ROI2), mixture (ROI3), adipose (ROI4), and air (ROI5), respectively.

Standard image High-resolution image
Figure 2.

Figure 2. The decomposed bone (the 1st column), muscle (the 2nd column), adipose (the 3rd column) and air (the 4th column) images of the digital phantom. Display windows are [0 1].

Standard image High-resolution image

For quantitative analysis, we select the ROIs indicated by the dashed circles as shown in figure 1. The average values and noise STDs are summarized in table 3. The VFA of the proposed TMA method is 82.49%. The near to one VFA indicates high decomposition accuracy of the proposed TMA method. Especially, in the mixture region of ROI3, the proposed TMA method decomposes the volume fraction of 0.596 and 0.382. The near to the ground truth (i.e. 0.7 and 0.3) volume fraction of the mixture area indicates the mixture decomposition capability of the proposed TMA method. Compared with the TMA method, the proposed EP method further increases the VFA by a factor of 12.23% and decreases the STD from 0.2492 to 0.0401.

Table 3. The means and STDs of decomposed digital phantom images within each ROI.

Methods Truth TMA Low-pass filtration EP
ROI1 Bone 1 ± 0 0.992 ± 0.018 0.992 ± 0.018 0.995 ± 0.009
ROI2 Muscle 1 ± 0 0.653 ± 0.375 0.849 ± 0.026 0.892 ± 0.026
ROI3 Muscle 0.7 ± 0 0.596 ± 0.386 0.554 ± 0.066 0.633 ± 0.082
Adipose 0.3 ± 0 0.382 ± 0.39 0.437 ± 0.057 0.36 ± 0.102
ROI4 Adipose 1 ± 0 0.746 ± 0.296 0.862 ± 0.01 0.984 ± 0.012
ROI5 Air 1 ± 0 0.979 ± 0.03 0.979 ± 0.03 0.98 ± 0.01
VFA   82.49% 83.60% 92.58%
Average STD   0.2492 0.0347 0.0401

The proposed method suppresses the noise into the comparable level with the low-pass filtration method. We measure the NPS of the decomposed adipose image generated using the TMA method, low-pass filtration method and the EP method to evaluate the noise frequency characteristics. The result is shown in figure 3. Compared with the TMA method, the NPS generated using the low-pass filtration method has gone through a great change while the proposed EP method remains more NPS structure.

Figure 3.

Figure 3. Measured NPS on the decomposed adipose image generated using different methods. The display windows are [600 6000].

Standard image High-resolution image

The MTF is calculated to investigate the spatial resolution maintenance of the proposed EP method. We measure the typical MTFs of muscle and mixture using the low-pass filtration and EP methods. The result is plotted in figure 4. The spatial resolution using the proposed EP method is increased by an overall factor of 0.48 when the MTF magnitude is decreased to 50% compared with that using the low-pass filtration method.

Figure 4.

Figure 4. MTF curves measured on the muscle and mixture areas.

Standard image High-resolution image

To evaluate the impact of energy and noise on the composition matrix, we calculate the condition number of the composition matrix. The condition number result is shown in figure 5 as the red parts. The crosses and marks indicate the condition number in low- and normal- dose studies. In both studies, the condition number of the composition matrix increases along with the energy, indicating the worse decomposition stability at a higher energy. Additionally, the condition number curves in the low and normal dose studies are in high consistence. This observation shows the measurement of composition matrix is independent of the imaging dose in the CT scan.

Figure 5.

Figure 5. Condition number and volume fraction accuracy using different energies and scanning doses.

Standard image High-resolution image

The volume fraction accuracy is calculated to evaluate the impact of energy and noise on the decomposition and is shown as blue part in figure 5. The upper, lower, left and right triangles represent the volume fraction accuracy using TMA and EP method in the low- and normal dose studies, respectively. The blue solid, dotted, short dotted, long and short dotted lines are generated to fit the triangle scatters. Both in the low- and normal dose studies, the linearly fitted curves decrease along with the increasement of the energy, indicating a worse decomposition accuracy at higher energy. This observation is consistent with the condition number increasement along with the energy. The higher energy leads to a larger condition number which means the stability of the decomposition is degraded and thus results in inferior decomposition accuracy.

Additionally, the decomposition accuracy declines severely when the noise is magnified in the low dose scan as shown in figure 5. The reason is that the magnified noise leads to the incorrect material composition selection within the two-material candidates looping operation. The incorrect material composition selection ultimately results in the decomposition bias, namely low decomposition accuracy. As illustrated by the higher dotted blue line than the solid blue line in figure 5, the decomposition accuracy decline is alleviated using the propose EP method compared with that using the TMA method in the low-dose study since the applied edge-preserving achieves noise suppression within the material decomposition.

4.2. Catphan phantom study

The basis materials selected in the Catphan phantom on the contrast rods slice are Teflon, Delrin, iodine solution, Polymethylpentene (PMP), inner soft-tissue and air. The reconstructed CT image are shown in figure 6. The decomposed basis material images are shown in figure 7. The brightness in the decomposed image indicates the volume fraction of each basis material. The rods inside the Catphan phantom are composed of the pure material and thus the ideal volume fraction (i.e. ground truth) for each rod is 1. The proposed TMA method initially decomposes the Catphan CT image into six material images in the high- and low-energy studies. Compared with the proposed TMA method, the proposed EP scheme further suppresses the decomposition noise while maintains the material edge.

Figure 6.

Figure 6. DECT images of the Catphan©600 phantom on the contrast rods slice using a table-top cone-beam CT (CBCT) system. Display window is [0.010 0.030] mm−1. The materials within the ROIs are Teflon (ROI1), Delrin (ROI2), iodine solution (ROI3), Polymethylpentene (PMP) (ROI4), inner soft-tissue (ROI5) and air (ROI6), respectively.

Standard image High-resolution image
Figure 7.

Figure 7. The decomposed basis material images of the Catphan©600 phantom on the contrast rods slice using: (1) the TMA, (2) the low-pass filtration and (3) the proposed EP schemes. The display window is [0 1]. The material type is shown on the bottom of each subfigure. The small rods are zoom-in displayed for better visualization at the bottom right within each subfigure.

Standard image High-resolution image

For quantitative analysis, we select the ROIs indicated by the dashed circles as shown in figure 6. The average values and noise STDs are summarized in table 4. The VFA of the proposed TMA method is 76.78%. Compared with the TMA method, the proposed EP method further increases the VFA by a factor of 18.9% and decreases the STD from 0.1947 to 0.0406.

Table 4. The means and STDs of decomposed Catphan phantom images within each ROI.

Methods Truth TMA Low-pass filtration EP
ROI1 Teflon 1 ± 0 0.9134 ± 0.1344 0.9913 ± 0.0125 0.9217 ± 0.0164
ROI2 Delrin 1 ± 0 0.5748 ± 0.3034 0.6042 ± 0.0352 0.8484 ± 0.0361
ROI3 Iodine 1 ± 0 0.6550 ± 0.2729 0.6895 ± 0.0930 0.9097 ± 0.0762
ROI4 PMP 1 ± 0 0.8168 ± 0.1800 0.8205 ± 0.0422 0.9328 ± 0.0463
ROI5 Soft-tissue 1 ± 0 0.6510 ± 0.2589 0.6779 ± 0.0529 0.8708 ± 0.0587
ROI6 Air 1 ± 0 0.9962 ± 0.0187 0.9960 ± 0.0115 0.9981 ± 0.0098
VFA   76.78% 79.65% 91.35%
Average STD   0.1947 0.0412 0.0406

The proposed method suppresses the noise into the comparable level with the low-pass filtration method. We further measure the NPS of the decomposed soft-tissue image generated using the TMA method, low-pass filtration method and the EP method to evaluate the noise frequency characteristics. The result is shown in figure 8. Compared with the TMA method, the NPS generated using the low-pass filtration method has gone through a great change while the proposed EP method remains more NPS structure.

Figure 8.

Figure 8. Measured NPS on the decomposed soft-tissue image generated using different methods. The display windows is [10 300].

Standard image High-resolution image

The MTF is calculated to investigate the spatial resolution maintenance of the proposed EP method. We measure the typical MTFs of Delrin and PMP using the low-pass filtration and EP methods in the high- and low-energy scans. The result is plotted in figure 9. The spatial resolution using the proposed EP method is increased by an overall factor of 1.55 when the MTF magnitude is decreased to 50% compared with that using the low-pass filtration method.

Figure 9.

Figure 9. MTF curves measured on the iodine solution and PMP areas.

Standard image High-resolution image

4.3. Clinical data study

The CT images of contrast-free phase, arterial phase, portal venous phase and delayed phase are shown in the first row in the figure 10. We apply bone, muscle, contrast agent, adipose and air as the basis materials to perform the material decomposition using the proposed method for the three contrast-enhanced phases. By applying Eq. (30), we acquire the VNC image of arterial, portal venous and delayed phases as shown in figure 10. The VNC images of three phases generated utilizing the proposed method are faithfully consistent with the contrast-free image (i.e. the ground truth image), especially the kidney structure as pointed by the yellow arrow in figure 10. It should be noted that each two phases have a few seconds time interval. The four phases are not completely consistent due to the organ motion in the scanning interval, for instance the liver region pointed by the red arrow in the contrast-free image in figure 10.

Figure 10.

Figure 10. The top row shows the CT images of: (a0) contrast-free phase and contrast-enhanced phases including (a1) arterial phase, (a2) portal venous phase and (a3) delayed phase of the abdominal slice of the patient. The bottom row displays the corresponding VNC images of the above contrast-enhanced CT images of three phases: (b1) arterial phase, (b2) portal venous phase and (b3) delayed phase. The display window is (0.015 0.027) mm−1.

Standard image High-resolution image

For quantitative analysis, we select five regions of interest: bone (ROI1), kidney (ROI2), liver (ROI3), muscle (ROI4) and adipose (ROI5) as shown in the arterial VNC image in figure 10 to calculate the CT number percentage error and RMSRE. The quantitative results are shown in table 5. The RMSREs of arterial, portal venous and delayed phases are 3.6%, 3.0% and 2.2%, respectively. The VNC images generated by the proposed method achieve high accuracy using the contrast-free CT image as the ground truth. The proposed method successfully removes the contrast agent from the contrast-enhanced images and maintains consistent CT number to the contrast-free CT image.

Table 5. The percentage errors and RMSRE of the interested region in three phases.

Materials Percentage Errors (%) RMSRE (%)
Bone Kidney Liver Muscle Adipose
Arterial 7.6 1.9 0.6 0.9 0.5 3.6
Portal venous 6.2 1.9 0.7 0.9 0.4 3.0
Delayed 4.0 2.4 0.3 1.3 0.2 2.2

4.4. Implementation details

The proposed EP method contains two tunable parameters including ${\beta _t}$ which controls the noise-resolution and ${\delta _t}$ which controls the edge preservation. ${\beta _t}$ and ${\delta _t}$ are material-specific since each material has its unique volume fraction distribution and neighbored pixels. We thus empirically select the optimal combination of ${\beta _t}$ and ${\delta _t}$ to achieve the tradeoff between the noise suppression and spatial resolution maintenance. The regularization coefficients ${\beta _t}$ and the edge-preserving parameters ${\delta _t}$ for each material in the digital and Catphan studies are listed in table 6.

Table 6. The regularization coefficients and edge-preserving parameters.

Data ${\beta _t}$ ${\delta _t}$
Digital phantom 0.0010 0.0100 1.0000 1.0000
(for bone, muscle, adipose and air) 0.0100 0.1000 1.0000 0.1000
Catphan phantom 0.0600 0.0500 0.2500 0.2500
(for teflon, delrin, iodine, PMP, soft-tissue and air) 0.0250 0.0100 1.0000 0.6000
  0.0100 0.0700 0.6000 0.6000

5. Discussion

We propose an image domain MMD method for SECT using the iterative scheme. The sum-to-one constraint and each pixel containing at most two materials assumption are introduced into the decomposition model. The objective function is composed of a least-square data fidelity term and an edge-preserving regularization term to suppress the magnified noise in the decomposition. The proposed objective function is non-convex due to the two-material assumption. We apply the optimization transfer principle to form a PWSQS function in each iteration to decrease the objective function. To acquire a good initial value for the non-convex problem, we design the pixelwise direct matrix inversion method assisted by the two-material assumption.

The proposed TMA method achieves high decomposition accuracy and the proposed EP scheme further increases the decomposition accuracy and decreases the noise STD. Compared with the low-pass filtration method, the proposed EP scheme achieves decomposition accuracy enhancement and noise suppression while maintaining high image quality.

Although the proposed method has achieved promising results, there are several aspects for future improvement. Some materials have very close or even equal LACs in a specific energy and cannot be differentiated in SECT. The proposed MMD scheme is implemented in image-domain and thus these materials with close LACs cannot be further correctly decomposed using the proposed scheme. This limitation can be solved by utilizing the clinician's experience as an assistance to pre-define the material basis for the materials with low contrast.

Additionally, the two-material hypothesis is not feasible in some cases where three or more materials are contained in one pixel. It should be noted that this assumption is proposed to solve the ill-posed problem of MMD using single energy CT. For dual-energy CT or spectral CT, three or more than three material assumption which may be more feasible in clinic can be applied (Long and Fessler 2014). The two-material based MMD for SECT proposed in this paper is complimentary to the clinical application where DECT technique or spectral CT is not readily available. In the clinic practice of virtual non-contrast imaging, fat-liver quantification and etc, the proposed EP method using the two-material assumption still reveals its feasibility and practicality since only two material bases of interest are required in these clinical applications (Lamb et al 2015).

The proposed method is implemented in the image domain with the advantage of clinically practical and time-efficient. Nevertheless, the beam-hardening artifacts cannot be suppressed and may decrease the decomposition accuracy. We will further extend the proposed method into the projection domain to construct the non-linear inversion model to reconstruct basis materials directly from the projection data and alleviate the beam-hardening artifacts simultaneously (Long and Fessler 2014, Chen et al 2017). In addition, the scatter artifacts are commonly observed in CT images, especially in CBCT since the scatter photons increases with the enlarged illumination. The scatter artifacts contaminate the CT images and lead to incorrect decomposition. We presently narrow the collimation between the x-ray source and the object to limit the cone angle to alleviate the scatter photons. We will further apply effective scatter correction schemes to decrease the scatter artifacts in the future (Wu et al 2015, Yang et al 2017).

6. Conclusion

In this paper, we propose an image domain MMD method for SECT using the iterative scheme. The proposed method applies a least square data fidelity term and an edge-preserving regularization term to enforce the minimization between the linear combination of decomposed material image and the measured SECT image, and to meet the piecewise constant property of the material image, respectively. The proposed method is evaluated using the digital phantom, the Catphan phantom and the clinical patient data. In the phantom study, the proposed TMA method achieves high VFA of 79.64% and the proposed EP method further increases the VFA by 15.56% and decreases the decomposition STD by 81.51% compared with the TMA method. At the comparable noise level, the proposed EP method increases spatial resolution by an overall factor of 1.01 when the MTF magnitude is decreased to 50% compared with the low-pass filtration method. In clinical data study, the VNC image generated by the proposed method achieves RMSRE of 2.93% compared with the contrast-free ground truth image. The high decomposition image quality substantially facilitates the SECT-based MMD applications.

Acknowledgment

This work was supported by National Key R&D Program of China (2018YFE0114800), Natural Science Foundation of China (NSFC Grant No. 81871351). We thank Mr. Ye Zhang in the Information Technology Center, Zhejiang University for his technical assistance on computational hardware support.

Please wait… references are loading.
10.1088/1361-6560/ab7503