Model-independent Characterization of Charge Diffusion in Thick Fully Depleted CCDs

, , , and

Published 2011 August 3 © 2011. The Astronomical Society of the Pacific. All rights reserved. Printed in U.S.A.
, , Citation David Lawrence et al 2011 PASP 123 1100 DOI 10.1086/661948

1538-3873/123/907/1100

ABSTRACT

We present a new method to measure charge diffusion in charge-coupled devices (CCDs). The method is based on a statistical characterization of the shapes of charge clouds produced by low-energy X-rays using known properties of the two-dimensional Gaussian point-spread function (PSF). The algorithm produces reliable upper and lower bounds on the size of the PSF for photons converting near the entrance window of a device. It is optimally suited to the case of thick back-illuminated CCDs where the X-ray absorption length is smaller than the silicon thickness and the diffusion scale is of the same order as the pixel size. The only assumptions are that the charge cloud width is a monotonically increasing function of distance from the conversion point to the buried channel, and that the conversion probability is peaked at the surface. Otherwise, no physical models of carrier transport or knowledge of the electric field profile in the CCD are needed. In suboptimal conditions, the upper bound increases and the lower bound is unaffected, so confidence in the correctness of results is retained. The new method has been benchmarked against Monte Carlo simulations and tested on X-ray images measured on thick high-resistivity prototype CCDs for the Large Synoptic Survey Telescope. In Monte Carlo simulations of noiseless images having the optimal diffusion scale, the upper bound approximated the true PSF within 5%, increasing to 10% in simulations with added noise. Even with severely undersampled or truncated PSFs, the method brackets the true value to within 25%. Our method is accurate and computationally efficient and offers a fast and simple experimental setup.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

The charge-coupled device (CCD) is one of the most sensitive visible-light detectors available for use in modern astronomical instruments. Because of the increasing cosmological interest in extragalactic sources, considerable effort is devoted to pushing CCD sensitivity further into the near-infrared range. Thick fully depleted silicon CCDs (FDCCDs), originally developed for use in X-ray astrophysics, have been proposed for use in current and upcoming wide-field surveys, e.g., Panoramic Survey Telescope and Rapid Response System (PanSTARRS), Dark Energy Survey, HyperSuprime, and Large Synoptic Survey Telescope (LSST). These FDCCDs offer improved infrared sensitivity, but they experience increased charge diffusion as a result of their thickness. An accurate quantification of charge diffusion is therefore essential when characterizing such devices.

Several techniques for investigating charge diffusion in CCDs have been developed. Optical methods include a virtual knife-edge scan (Karcher et al. 2004) that measures the profile of a finely focused spot, measurement of the line-spread profile using a projected slit image angled with respect to the CCD columns (Robbins 2011), and modulation transfer function measurement using interferometrically projected fringe patterns of variable spatial frequency (Takacs et al. 2010). Other techniques use ionizing radiation, either in the form of X-ray photons from a radioisotope or accelerator sources (Janesick 2001; Kraft et al. 1995) or charged particles such as cosmic-ray muons (Robbins 2011). When a mask can be placed directly on the CCD's sensitive surface, the mesh or virtual pixel scan technique (Tsunemi et al. 1997) gives a detailed and accurate characterization of the point-spread function (PSF).

Rodney and Tonry (2006) introduced a methodology that is very similar to the one used here, relating the histogram of pixel values found in an image exposed to 55Fe events to the charge diffusion, and providing a simple statistic that can be used to infer charge diffusion given an average event. This article offers an alternate approach that has two advantages: we need not assume a specific model for how charge carriers diffuse through the silicon, and we can construct two different estimators that bound the true amount of charge diffusion.

In this analysis we work with images resulting from exposure of the CCD to 55Fe X-rays. The absorption of a single soft X-ray photon generates charge carriers in a tight cluster (Tsunemi et al. 1999) that diffuse laterally to produce a charge cloud with a Gaussian profile (Janesick 2001; Hopkinson 1987; Groom et al. 2000). The width σ of the cloud grows as carriers drift from the conversion point to the collecting electrodes. A single 55Fe image typically contains thousands of these charge clouds.

With a uniform flux of X-rays incident on the CCD's entrance window, photon conversions are distributed uniformly across the pixel array and with an exponential falloff in depth. (The attenuation length of Mn Kα photons is 29 μm in silicon.) The geometry is illustrated in Figure 1. In a back-illuminated device, photon conversions close to the entrance window result in the widest charge clouds, since the charge carriers they produce have the longest transit times to the potential wells on the front side. The profile of conversion depths therefore leads to a cluster width distribution with a sharp cutoff at σmax, corresponding to photons that convert just at the entrance window. This limiting width is significant because it predicts the amount of diffusion experienced by visible-light photons, which convert within the first few microns of silicon.

Fig. 1.—

Fig. 1.— Geometry of a CCD. Note that nonnormal incidence increases the prevalence of surface conversion.

Two difficulties hamper the analysis of X-ray images to extract σmax. First, the profiles of X-ray clusters are severely undersampled, given the pixel size and typical operating conditions. Second, it is necessary to examine the ensemble of cluster sizes and develop a robust estimator for only that subset having the largest amount of diffusion. For device characterization purposes, it is most desirable to obtain a strict upper bound on σmax.

In the following sections we report a new technique for extracting the maximum charge cloud width from an X-ray image. The method is computationally inexpensive, accurate for a wide range of PSF sizes, and avoids the need for any diffusion model. We begin in § 2 with a simple algorithm to select Mn Kα photon conversions within an image. In § 3, we characterize each event with a single number—the "difference ratio." Section 4 considers the aggregate of all difference ratios and proposes two different methods for determining the maximum PSF width. We test the algorithm using simulated data in § 5 and apply it to images from a high-resistivity device in § 6. Conclusions are presented in § 7.

2. PIXEL SELECTION ALGORITHM

Starting with a background-subtracted image file of an FDCCD exposed to 55Fe radiation (Fig. 2), we reconstruct events and select those originating from conversions of 5.988 keV Mn Kα photons, which produce 1590 electron-hole pairs in silicon at 173 K (Lowe and Sareen 2007). In our target FDCCDs the diffusion scale is on the order of the pixel size, so it is sufficient to consider events whose charge is contained in 2 × 2 pixel clusters (in the case of FDCCDs with larger PSFs, a straightforward Gaussian fit to the charge distribution would suffice and this technique would not be necessary). The algorithm proceeds as follows:

Fig. 2.—

Fig. 2.— Portion of an image taken in the presence of X-ray radiation. X-rays appear as localized clusters of dark pixels superimposed on background noise. Images are calibrated such that the background noise has a mean value of zero.

  • 1.  
    Calculate the total charge in every 2 × 2 pixel "box" within the image.
  • 2.  
    For every pair of overlapping boxes, immediately reject the one with the smaller total charge, leaving only the boxes that are local maxima.
  • 3.  
    Histogram the totals of the remaining boxes (Fig. 3).
  • 4.  
    Identify the window centered on the Mn Kα peak position with upper bound at the location of the minimum between the Kα and Kβ peaks, and remove from consideration all boxes with totals outside this window.
Fig. 3.—

Fig. 3.— Histogram depicting the total charge in 2 × 2 pixel boxes that are local maxima, taken from acquired images. The Kα and Kβ peaks are well-defined, and the Si fluorescence and escape peaks are visible as well.

This method of selection is unbiased in that no event will be represented by multiple boxes: an event may lie within multiple boxes, but only one of those can be a local maximum. The algorithm also tolerates overlapping events, since a box containing multiple events has a total outside the selection window and is ignored altogether.

3. CHARACTERIZING X-RAY EVENTS

Suppose that a single X-ray photon is absorbed at the point (x0,y0) in an infinite unpixelated plane. Since the diffusion PSF is Gaussian, the flux S is given by

where s is the total signal produced and σ is the width of the charge cloud. Let u and v be the total charge gathered in the positive- x and negative- x half-planes, respectively. Integrating equation (1), we find that

where erf is the error function.1

For any X-ray event, we define the difference ratio d1 such that

Note that d1 is closely related to the first moment of the charge distribution about the coordinate origin.

Alternatively, we may express the difference ratio in terms of x0 and σ:

which follows from equations (2) and (3). Note that the difference ratio does not depend on s.

In the pixelated case, consider a 2 × 2 box that has been selected by our algorithm in § 2. As before, suppose that B contains an X-ray event located at coordinates (x0,y0) relative to its center, and let the PSF width be σ. Using equation (4), we can calculate the ideal difference ratio for this box as d2(x0,σ).

If the X-ray charge is fully contained in B, then u and v are exactly equal to the summed fluxes b + d and a + c, respectively. When the PSF is small, this remains a valid approximation, and so u ≈ b + d and v ≈ a + c. Equation (3) can then be applied to this event as well, and we find that d2(x0,σ) ≈ d1(b + d,a + c) for every box B. We now have a practical method to compute the difference ratio of any event without knowledge of x0 or σ. (We have only considered the difference ratio in the x -axis direction. Clearly, an analogous quantity can be defined in the y -axis direction. Since charge diffusion is isotropic, we do not differentiate between the axes in our analysis, although considering the axes separately can be useful for quantifying anisotropy in nonideal CCDs.)

The primary utility of the difference ratio is that it provides a simple relationship between measured pixel values (a, b, c, and d) and purely theoretical Gaussian profile parameters (x0 and σ). The value of the difference ratio d for a single cluster does not provide enough information to unambiguously determine the cluster's width σ or centroid location x0. However, knowing that x0 is uniformly distributed in and that large σs are proportionately overrepresented in the population, we can estimate σmax from the distribution of d for the entire ensemble, as shown in the next section.

4. THE DIFFERENCE-RATIO HISTOGRAM

Once we have computed the x - and y -directed difference ratios for every 2 × 2 box, we plot them in a histogram with area normalized to unity (Fig. 4). At this point, data from multiple images may be combined to reduce statistical errors. The histogram is seen to have a pronounced peak close to the maximum difference ratio that levels off to a plateau as the difference ratio decreases. Two features of this histogram can be used to estimate σmax —the location of the peak and the height of the plateau.

Fig. 4.—

Fig. 4.— Difference-ratio histogram, taken from the same images as Fig. 3. Note the plateau of height 0.7 and the peak at location 0.9.

We can determine σmax from the location of the histogram's peak, which must come from those events with the widest PSF. The variable |x0| has a uniform distribution between 0 and , but since the error function is approximately linear for small arguments and asymptotically constant for large arguments, a greater density of difference ratios will be obtained when |x0| is as large as possible. Since σ has a sharp cutoff at σmax and |x0| has a sharp cutoff at , a pronounced peak is expected at the region of greatest density. Therefore, the histogram peak is located at . Solving for σmax, we obtain

where p is the location of the peak and erfinv is the inverse error function.

A second indicator of σmax is the value of the plateau. For difference ratios near zero, the error function is nearly linear, and . Inverting, we have . Then the number of points with a difference ratio between da and db is equal to the number of points with |x0| between and , where σ takes some value less than σmax. Since |x0| is uniformly distributed from 0 to , this is simply equal to , where n is the total number of points. Finally, a normalized histogram must have constant value z throughout the region where the error function is linear, and we have . Since the distribution of σ is biased toward σmax, we have

Equations (5) and (6) represent two estimators for σmax based solely on the shape of the difference-ratio histogram, which we will hereinafter refer to as σpeak and σplat. These estimators make only a few assumptions about the data: that charge diffusion results in Gaussian PSFs that are largely contained within 2 × 2 pixel clusters, that X-ray events are randomly distributed across the surface of the CCD, and that the largest PSF widths are the most common.

To estimate the location of the peak, we use Parzen windows to construct a continuous estimate of the difference-ratio probability density function (Parzen 1962). The weighting function is a Gaussian distribution with standard deviation 0.02, since this is found to produce the best estimate for sets of 5000 events (about 10 images). The peak location is then defined as the mode of the estimated probability density function.

To estimate the height of the plateau, we again use Parzen windows, but some adaptations are made. In order to avoid artificial falloff toward the negative side of the distribution, we construct our estimator on a modified data set comprised of all difference ratios together with the additive inverses of all difference ratios. The weighting function is a Gaussian distribution with standard deviation 0.1 (it can be wider because this part of the histogram is smoother). The plateau height is then given by evaluating the probability density function at 0.

In the following sections we examine the performance of these estimators and the validity of the simplifying assumptions used.

5. ALGORITHM CHARACTERIZATION

The difference-ratio analysis was applied to simulated images with predetermined σmax. Values of σ were simulated using a Monte Carlo procedure, which takes into account the absorption probability for 6 keV photons in silicon, the electric field in the CCD, and the transport and diffusion properties of electrons in silicon (Eremin and Li 1995). The simulated distribution of σ values (Fig. 5) was scaled linearly in order to simulate varying amounts of charge diffusion. Values of x0 and y0 were chosen uniformly at random between 0 and .

Fig. 5.—

Fig. 5.— Sample distribution of σ values, produced by assuming an exponential photon absorption profile and using the electron transport model given by Eremin and Li (1995). The graph is generated using a Monte Carlo simulation. The artificially sharp cutoff at σ/σmax = 1 will not affect the algorithm significantly.

Sets of 10 images, each containing 500 X-ray events, were simulated with and without noise for three values of σmax. The standard deviation of the noise was taken to be of the signal produced by a Kα event (the experimentally determined noise for the particular FDCCD analyzed in § 6). One hundred distinct data sets were created for each set of parameters and analyzed using several algorithms. We determined σpeak and σplat using the difference-ratio algorithms outlined in § 4. The average and standard deviation across the 100 trials for each test are given in Table 1 and Figure 6.

Fig. 6.—

Fig. 6.— Monte Carlo simulation of X-ray conversions where the surface PSF is forced to 0.15, 0.30, and 0.45 pixels. σpeak and σplat are the values of surface PSF found by the method of § 4.

The error in σpeak is caused by two factors: PSF truncation and incorrect determination of peak location. PSF truncation occurs when σmax is large and charge outside the 2 × 2 analysis box is excluded from the difference-ratio calculation. In the worst case, the difference ratio is calculated as

which is always less than the actual difference ratio of

Second, the histogram peak is sloped asymmetrically, so use of a Gaussian kernel density estimator shifts its apparent location—an effect that is compounded by the addition of Gaussian noise. Due to the behavior of the inverse error function, the algorithm is far more sensitive to errors in peak location for small σmax, but this error exists for large σmax as well. The histogram peak is sloped more steeply on the right, so peak location errors tend to shift it left. Therefore, both sources of error increase σpeak, and σpeak is always an upper bound on σmax.

Error in σplat stems from the assumption that σ ≈ σmax. Most values of σ are close to σmax, but all are at least slightly smaller. When we derive a value for σmax, we are actually deriving a value for some smaller average σ for the entire spectrum of events. Therefore, we underestimate σmax whenever we use this method. The exact amount of this underestimate depends on the distribution of σ, but is about 15% for our simulated distribution.

The rms variation between simulation runs is well below the absolute error, indicating that the upper and lower bounds are reproducible. (The lower bound is especially tolerant of noise, because noise does not distort the flat section of the histogram.) Using σpeak and σplat, we obtain a tight upper bound and a robust lower bound on σmax without assuming any specific distribution of σ.

6. RESULTS AND DISCUSSION

The methods described in this report have been used to analyze a next-generation FDCCD being developed for the LSST. The FDCCDs under development must balance two conflicting design goals: high quantum efficiency in the near-infrared and small PSF size (Radeka et al. 2009). Since the technology required is still relatively untested, the development of an accurate PSF measurement technique is paramount.

Prototype FDCCDs have been obtained and tested (O'Connor et al. 2008). Device 106-06 is a 100 μm thick backside-illuminated FDCCD with 13.5 μm pixels. The electron transport model given by Eremin and Li (1995), which takes velocity saturation and temperature dependence into account, was used to predict PSF size. Separately, sets of 50 fully depleted X-ray images were acquired at bias voltages ranging from -30 V to -50 V in 2 V increments. These images were analyzed using the Q -ratio method to give a PSF estimate and analyzed using the difference-ratio technique to give upper and lower bounds (Table 2 and Fig. 7).

Fig. 7.—

Fig. 7.— PSF size from X-ray measurements on high-resistivity LSST prototype FDCCD with 100 μm thickness. The dashed line represents a model with resistivity 2 kΩ·cm and the solid line represents a model with resistivity 3 kΩ·cm. Open symbols are computed using the Q -ratio method and solid symbols use the method of § 4. Recall that σQ is calculated using the Rodney and Tonry (2006) model and σQ' is calculated using the Eremin and Li (1995) model.

In previous work (Rodney and Tonry 2006), an alternative measure was proposed for determining σmax from 55Fe data. Those authors considered the quantity

for all 3 × 3 pixel boxes identified as containing Kα hits. Thus, 0 < Q < 1, and Q = 0 for an event whose charge is completely contained in a single pixel. The aggregate Q ratio for an image is formed by superimposing all Kα hits. To derive σmax from Q, they performed simulations using a simple two-parameter diffusion model, with (x,y,z) conversion locations drawn from the appropriate distributions. From multiple simulations the relation between σmax and Q can be tabulated. Values of σmax found in this way are denoted as σQ. This computation is also carried out using the diffusion model of Eremin and Li (Fig. 8), and the results are denoted by σQ'.

Fig. 8.—

Fig. 8.— Modeled charge diffusion PSF width as a function of depth of conversion. Device modeled has p -type substrate, resistivity of 3 kΩ·cm, thickness of 100 μm, temperature of 200 K, and applied bias of 35 V. Solid line: uniform electric field, constant electron mobility, and a focusing field near gates (Rodney and Tonry 2006). Dashed line: linearly varying electric field, velocity-field relation from Eremin and Li (199n5), and no focusing field.

The Q -ratio method provides an easily computed relative measure of diffusion that can be used for comparing devices of different thickness, doping, and substrate bias, as shown in Rodney and Tonry (2006). Its absolute calibration scale, however, relies on a simplified diffusion model (Rodney and Tonry's eq. [3]), which assumes that the diffusion width depends on the conversion point's height z above the front-side gates as , with a correction applied for conversions within some height above the gates for which σ = 0. There are some conditions for which the functional form of σ(z) will depart from the assumed model. An example is shown in Figure 8. The graph compares the σ(z) dependence from Rodney and Tonry (2006) with a model that includes the electric field z dependence and the nonlinear velocity-field characteristic as

where D is the transverse diffusion coefficient, v is the electron velocity from Eremin and Li (1995), and E(z) = V/d - e·NA(d - 2z)/2epsilon. The device modeled is just above full depletion, having thickness 100 μm, resistivity 3 kΩ·cm, barrier-field height 13.5 μm, and Vsub = -35 V. The surface diffusion σmax is 5.4 μm, and both models are normalized to this value at z = 0. Note that the full model has smaller σ(z) near z = 0, which will result in a higher estimate of σmax than the simplified model. In general, this discrepancy is largest for thick devices biased at low overdepletion voltages; for well-overdepleted sensors the agreement between the two models is much closer. The barrier-field focusing effect introduced near the gate has little effect for this device, as the absorption probability in this region is only 1.8%. Nonnormal X-ray incidence will increase the fraction of events converting at shallower depths in Si, which will increase the aggregate Q ratio, since those clusters have the widest diffusion.

The present model simply seeks to identify the maximum-size clusters, which can be recognized by the distribution of first moments as described previously, regardless of the details of the σ(z) and Pabs(z) functions. However, it provides only upper and lower bounds on σmax. While these bounds are shown to be robust, they leave a range of uncertainty that can be large for low-S/N images or for undersampled or truncated PSFs. A combination of difference ratio, Q ratio, and optical techniques may be advantageous when the most accurate results are required.

7. CONCLUSIONS

We have developed an algorithm to quickly and accurately estimate the contribution of charge diffusion to the PSF of thick fully depleted CCDs. The method uses a statistical analysis of the shapes of charge clusters in soft X-ray images to determine upper and lower bounds on the maximum PSF width, thereby bracketing the PSF expected for visible light. Simulations have been used to estimate the accuracy of the method over a range of PSF widths and with varying amounts of noise. All stages of the algorithm are computationally efficient—the analysis of a 100 Mpixel image takes less than 1 minute on a single-core desktop computer. Finally, the algorithm does not rely on a physical model of carrier transport and can be successfully applied to CCDs with widely varying charge diffusion characteristics. An analysis of an LSST prototype device shows good consistency with model predictions and with alternate methods of analysis. These properties make the difference ratio a useful tool for studying charge diffusion within CCDs.

Thanks go to Veljko Radeka, the Instrumentation Division Chair, and the Brookhaven National Laboratory Office of Educational Programs for allowing this research to proceed. The Large Synoptic Survey Telescope (LSST) is a public-private partnership. Funding for design and development activity comes from the National Science Foundation, private donations, grants to universities, and in-kind support at Department of Energy laboratories and other LSST Corporation Institutional Members. Support of the W. M. Keck Foundation for sensor development is gratefully acknowledged. This work is supported by in part the National Science Foundation under Scientific Program Order 9 (AST-0551161) and Scientific Program Order 1 (AST-0244680) through Cooperative Agreement AST-0132798. Portions of this work are supported by the Department of Energy under contract DE-AC02-76SF00515 with the Stanford Linear Accelerator Center and contract DE-AC52-07NA27344 with Lawrence Livermore National Laboratory.

Footnotes

  • The error function of n is defined as . Note that erf = 1 and erf - n = -erfn.

Please wait… references are loading.
10.1086/661948