Correlation Statistics of Quantized Noiselike Signals

Published 2003 December 12 © 2004. The Astronomical Society of the Pacific. All rights reserved. Printed in U.S.A.
, , Citation Carl Gwinn 2004 PASP 116 84 DOI 10.1086/381167

1538-3873/116/815/84

ABSTRACT

I calculate the statistics of correlation of two digitized noiselike signals, which are drawn from complex Gaussian distributions, sampled, quantized, correlated, and averaged. Averaged over many such samples, the correlation r approaches a Gaussian distribution. The mean and variance of r fully characterize the distribution of r. The mean corresponds to the reproducible part of the measurement, and the variance corresponds to the random part, or noise. I investigate the case of nonnegligible covariance ρ between the signals. Noise in the correlation can increase or decrease, depending on quantizer parameters, when ρ increases. This contrasts with the correlation of continuously valued or unquantized signals, for which the noise in phase with ρ increases with increasing ρ, and noise out of phase decreases. Indeed, for some quantizer parameters, I find that the correlation of quantized signals provides a more accurate estimate of ρ than would correlation without quantization. I present analytic results in exact form and as polynomial expansions, and compare these mathematical results with results of computer simulations.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

Nearly all signals from astrophysical sources can be represented as electric fields comprising Gaussian noise. These noiselike signals have zero mean. All information about the source is contained in the variance of the electric field and in covariances between different polarizations, positions, times, or frequency ranges. The intensity, for example, is simply the sum of the variances in two basis polarizations. More generally, all the Stokes parameters can be expressed in terms of the variances and covariances of these two polarizations. Similarly, in interferometry the covariance of electric fields at different positions is the visibility, the Fourier transform of source structure. In correlation spectroscopy the covariances of electric field at different time separations, expressed as the autocorrelation function, are the Fourier transform of the spectrum. Because the signals are drawn from Gaussian distributions, their variances and covariances completely characterize them. The single known exception to this rule of Gaussian statistics is radiation from pulsars, under certain observing conditions (Jenet, Anderson, & Prince 2001).

Particularly at wavelengths of a millimeter or more, covariances are usually estimated by correlation. (Actually, correlation is used at all wavelengths, but at wavelengths shortward of a millimeter quantum mechanical processes come into the picture, complicating it). Correlation involves forming products of samples of the two signals. The average of many such products approximates the covariance. In mathematical terms, for two signals x and y, the covariance is ρ = 〈xy〉, where the angular brackets 〈...〉 represent a statistical average over an ensemble of all statistically identical signals. Correlation approximates this enormously infinite average with a finite average over Nq samples of x and y: r = 1/Nqi = 1Nq xiyi. Here, the subscript reflects the fact that x and y are unquantized; their accuracy is not limited to a finite number of quantized levels. The subscript q indicates sampling at the Nyquist rate, as I assume (see Thompson, Moran, & Swenson 1986).

Because the number of samples in most measurements of correlation is large, the results of a finite correlation follow a Gaussian distribution. This is a consequence of the central‐limit theorem. Thus, one expects a set of identical measurements of r to be fully characterized by their mean, 〈r〉, and their standard deviation, 〈(r - 〈r〉)21/2. The mean is the deterministic part of the measurement; it provides an estimate of ρ. The standard deviation characterizes the random part of the measurement and is often called "noise" (but is to be distinguished from the noiselike signals x and y that are being correlated). In principle, the best measurement minimizes the random part while preserving the relation between the deterministic part and ρ. The signal‐to‐noise ratio (S/N) of the correlation, Script R = 〈r〉/〈(r - 〈r〉)21/2, provides a figure of merit that quantifies the relative sizes of deterministic and random parts (see, e.g., Thompson et al. 1986).

The electric field is commonly digitized before correlation. Digitization includes sampling and quantization. Sampling involves averaging the signal over short time windows; it thus restricts the range of frequencies that can be uniquely represented. For simplicity, in this paper I restrict discussion to "video" or "baseband" signals, for which a frequency range of 0 up to some maximum frequency is present; and I assume that they are sampled at the Nyquist rate, or at half the shortest period represented. I also assume that the signals are "white," in the sense that samples xi and yj are correlated only if i = j; and that the signals are stationary, so that the correlation of xi and yj is independent of i. These assumptions limit the influence of sampling. I will discuss spectrally varying signals elsewhere (C. Gwinn 2004, in preparation).

Quantization limits the values that can be represented so that the digitized signal imperfectly represents the actual signal. Quantization thus introduces changes in both the mean correlation 〈rM〉 and its standard deviation 〈(rM - 〈rM〉)21/2. Here the subscript "M" represents the fact that the quantized signal can take on M discrete values. The mean and standard deviation of rM can be calculated from the statistics of the quantized signals x ˆi and y ˆi, and the details of the quantization scheme.

A number of previous authors have addressed the effects of quantization on correlation of noiselike signals (see, e.g., Thompson et al. 1986, chap. 8, and references therein). Notably, Van Vleck & Middleton (1966) found the average correlation and the standard deviation for two‐level quantization—the case in which quantization reduces the signals to only their signs. Cooper (1970) found the average correlation and its standard deviation for small normalized covariance ρ≪1, for four‐level correlation: quantization reduces the signals to signs and whether they lie above or below some threshold v0. He found the optimal values of v0 and the relative weighting of points above and below the threshold n, as quantified by the S/N Script R4. Hagen & Farley (1973) generalized this to a broader range of quantization schemes and studied effects of oversampling. Bowers & Klingler (1974) examined Gaussian noise and a signal of general form in the small‐signal limit. They devise a criterion for the accuracy with which a quantization scheme represents a signal and show that this yields the highest S/N for ρ≪1. Most recently, Jenet & Anderson (1998) examined the case of many‐level correlators. They use a criterion similar to that of Bowers & Klingler (1974) to calculate the optimal level locations for various numbers of levels. Jenet & Anderson (1998) also find the mean spectrum for a spectrally varying source, as measured by an autocorrelation spectrometer; they find that quantization introduces a uniform offset to the spectrum and scales the spectrum by a factor, and they calculate this offset and factor. D'Addario et al. (1984) provide an extensive analysis of errors in a three‐level correlator.

In this paper, I calculate the average correlation and its standard deviation for nonvanishing covariance ρ. I provide exact expressions for these quantities, and approximations valid through fourth order in ρ. Interestingly, noise actually declines for large ρ for correlation of quantized signals. Indeed, the S/N for larger ρ can actually exceed that for correlation of an unquantized signal. In other words, correlation of a quantized signal can provide a more accurate measure of ρ than would correlation of the signal before quantization. This fact is perhaps surprising; it reflects that correlation is not always the most accurate way to determine the covariance of two signals.

The organization of this paper is as follows. In § 2, I review the statistics of the correlation of unquantized, or continuously variable, complex signals. In § 3, I present expressions that give the average correlation and the standard deviation, in terms of integrals involving the characteristic curve. I include statistics of real and imaginary parts, which are different. I present expansions of these integrals as a power series in ρ. In § 4 I discuss computer simulations of correlation to illustrate these mathematical results. I summarize the results in § 5.

2. CORRELATION OF UNQUANTIZED VARIABLES

2.1. Bivariate Gaussian Distribution

Consider two random, complex signals, x and y. Suppose that each of these signals is a random variable drawn from a Gaussian distribution. Suppose that the signals x and y are correlated so that they are, in fact, drawn from a Gaussian joint probability density function P(x,y) (Meyer 1975). Without loss of generality, I assume that each signal has variance of 2:

In this expression, the angle brackets 〈...〉 denote a statistical average; in other words, an average over all systems with the specified statistics. This choice for a variance of 2 for x and y is consistent with the literature on this subject, much of which treats real signals (rather than complex ones) drawn from Gaussian distributions with unit variance (Cooper 1970; Thompson et al. 1986). Of course, the results presented here are easily scaled to other variances for the input signal. I require that the signals themselves have no intrinsic phase; in other words, the statistics remain invariant under the transformation x→xe0 and y→ye0, where ϕ0 is an arbitrary overall phase. It then follows that

From these facts, one finds:

and similarly for y. Thus, real and imaginary parts are drawn from Gaussian distributions with unit variance. The distributions are circular in the complex plane for both x and y.

Without loss of generality, I assume that the normalized covariance of the signals ρ is purely real:

One can always make ρ purely real by rotating x (or y) in the complex plane: x→xex, and y→y. Note that because of the absence of any intrinsic phase,

so that

In other words, the real parts of x and y are correlated, and the imaginary parts of x and y are correlated, but real and imaginary parts are uncorrelated. In mathematical terms, real parts (or imaginary parts) are drawn from the bivariate Gaussian distribution

where (X,Y) stands for either (Re(x), Re(y)) or (Im(x), Im(y)). The distributions for real and imaginary parts are identical, but real and imaginary parts are uncorrelated. Therefore,

2.2. Correlation of Unquantized Signals

2.2.1. Correlation

Consider the product of two random complex signals, drawn from Gaussian distributions as in the previous section, that are sampled in time. Suppose that the signals are not quantized; they can take on any complex value. The product of a pair of samples ci = xiy*i does not follow a Gaussian distribution. Rather, the distribution of ci is the product of an exponential of the real part, multiplied by the modified Bessel function of the second kind of order zero of the magnitude of ci (Gwinn 2001). However, the average of many such products, over a large number of pairs of samples, approaches a Gaussian distribution, as the central limit theorem implies. In such a large but finite sum,

provides an estimate of the covariance ρ. Here the index i runs over the samples, which are commonly taken at different times. The total number of samples correlated is Nq. Hereafter, I assume that in all summations, indices run from 1 to Nq. The subscript on the correlation r again indicates that the correlation has been formed for variables xi and yi, which can take on any of an infinite number of values; in other words, it indicates that xi and yi have not been quantized.

2.2.2. Mean Correlation for Unquantized Signals

The mean correlation is equal to the covariance, in a statistical average

where I used the assumption that the phase of ρ is zero, equation (5). This can also be seen from the mean of the distribution of the products ci = xiy*i (Gwinn 2001), or simply by integrating over the joint distribution of xi and yi (eqs. [7] and [8]).

2.2.3. Noise for Correlation of Unquantized Signals

Because the distribution of r is Gaussian, the distribution of r is completely characterized by its mean (eq. [10]) and by the variances 〈rr*〉 and 〈rr〉. Suppose that the samples xi and yi are independent; in mathematical terms, suppose that

If they are not independent, the results will depend on the correlations among samples; this case is important if, for example, the signal has significant spectral structure. Jenet & Anderson (1998) find average spectrum in this case; I will discuss the noise in future work. Here I consider only independent samples. In that case,

where I have separated the terms with i = j from those with i≠j, and appealed to the facts that xi and yj are covariant only if i = j ("white" signals), and that for i = j the statistics are stationary in i. For Gaussian variables with zero mean a, b, c, and d, all moments are related to the second moments,

so that

Therefore,

An analogous calculation yields

and so

I combine these facts to find the means and standard deviations of the real and imaginary parts of the measured correlation r:

If the number of independent samples Nq is large, the central‐limit theorem implies that Re(r) and Im(r) are drawn from Gaussian distributions. The means and variances of these distributions, as given by equation (18), completely characterize r. The fact that the real part of r has greater standard deviation than the imaginary part reflects the presence of self‐noise or source noise. Sometimes this is described as the contribution of the noiselike signal to the noise in the result.

Commonly, and often realistically, astrophysicists suppose that ρ measures the intensity of one signal that has been superposed with two uncorrelated noise signals to produce x and y (see Bowers & Klingler 1974; Kulkarni 1989; Anantharamaiah et al. 1991). A change in ρ then corresponds to a change in the intensities 〈|x|2〉 and 〈|y|2〉 as well. Here, I suppose that 〈|x|2〉 = 〈|y|2〉≡1 while ρ varies. The results presented here can be scaled to those for the alternative interpretation.

2.2.4. S/N for Correlation of Unquantized Signals

The S/N for Re(r) is

Note that for a given number of observations Nq, S/N increases with ρ; the increase is proportional for ρ≪1.

A related quantity to S/N is the rms phase, statistically averaged over many measurements. The phase is ϕ(r) = tan -1[Im(r)/Re(r)]. When the number of observations is sufficiently large, and the true value of the phase is 0, as assumed here, the standard deviation of the phase is 〈ϕ(r)〉 = 〈Im(r)21/2/〈Re(r)〉. The inverse of the standard deviation of the phase (in radians) is analogous to the S/N for the real part (eq. [19]). This S/N for phase is

For constant Nq the S/N of the phase increases with ρ and increases much faster than proportionately for ρ→1.

3. QUANTIZED SIGNALS

Quantization converts the variables x, y to discrete variables x ˆ, y ˆ. These discrete variables depend on x and y through a multiple‐step function known as a characteristic curve. Each step extends over some range [vi,vi + 1] of the unquantized signal and is given some weight ni in correlation. The function X ˆ(X) denotes the characteristic curve, where again X stands for either Re(x) or Im(x). The complex quantized variable x ˆ is thus given by x ˆ = X ˆ(Re(x)) + i X ˆ(Im(x)). The same characteristic curve is applied to real and imaginary parts. I hold open the possibility that the characteristic curves X ˆ(X) and Y ˆ(Y) are different. I assume in this paper that the characteristic curves are antisymmetric: X ˆ(- X) = -X ˆ(X), and similarly for Y. D'Addario et al. (1984) describe effects of departures from antisymmetry and how antisymmetry can be enforced. Figure 1 shows a typical characteristic curve for four‐level (or two‐bit) sampling.

Fig. 1.—

Fig. 1.— Characteristic curve X ˆ(X) for four‐level quantization.

Systems with M levels of quantization can be described by analogous, more complicated characteristic curves, and corresponding sets of weights {ni} and levels {vix} and {viy} (Jenet & Anderson 1998). For M‐level sampling, the correlation of Nq quantized samples is

In practical correlators, deviations from theoretical performance can often be expressed as deviations of the characteristic curve from its desired form. In principle, these can be measured by counting the number of samples in the various quantization ranges and using the assumed Gaussian distributions of the input signals to determine the actual levels vix and viy. D'Addario et al. (1984) present an extensive discussion of such errors, and techniques to control and correct them.

Although the results presented below are applicable to more complicated systems, the four‐level correlator will be used as a specific example in this paper, with correlation r ˆ4. For four‐level sampling, commonly a sign bit gives the sign of X, and an amplitude bit assigns weight 1 if |X| is less than some threshold v0x, and weight n if |X| is greater than v0x. Together, sign and amplitude bits describe the four values possible for X ˆ(X). Other types of correlators, including two‐, three‐, or "reduced" four‐level (in which case the smallest product for | X ˆ | = 1, and | Y ˆ | = 1 is ignored), can be formed as special cases or sums of four‐level correlators (Hagen & Farley 1973).

3.1. Correlation of Quantized Signals: Exact Results

3.1.1. Mean Correlation: Exact Result

Ideally, from measurement of the quantized correlation r ˆM one can estimate the true covariance ρ. The statistical mean of r ˆM is

Because Re(x ˆ) depends only on Re(x) and Im(y ˆ) depends only on Im(y), and Re(x) and Im(y) are completely independent [and similarly for Im(x ˆ) and Re(y ˆ)],

Thus, the imaginary part of 〈 r ˆ 〉, which involves products of these statistically independent terms, has an average of zero (eq. [3]). For the real part,

where I use the assumption that the characteristic curves are identical for real and imaginary parts, and that real and imaginary parts of x and y have identical statistics (eqs. [3] and [5]). I use the bivariate Gaussian distribution for real and imaginary parts to find a formal expression for the statistical average of the correlation:

This integral defines ϒXY. For the assumed antisymmetric characteristic curves X ˆ(X) and Y ˆ(Y), one can easily show that ∂ϒXY/∂ρ>0. In other words, the ensemble‐averaged quantized correlation is an increasing function of the covariance ρ for completely arbitrary quantizer settings (so long as the characteristic curves are antisymmetric).

The discussion of this section reduces the calculation of the average quantized correlation to that of integrating P2(X,Y) over each rectangle in a grid, with the edges of the rectangles given by the thresholds in the characteristic curves (Kokkeler, Fridman, & van Ardenne 2001). The function ϒXY depends on ρ through P2(X,Y). This function is usually expanded through first order in ρ, because ρ is small in most astrophysical observations.

3.1.2. Simpler Form for ϒXY

The integral ϒXY and similar integrals can be converted into one‐dimensional integrals for easier analysis. If one defines

then the Fourier transform of P2(v0x,v0Y) is equal to that of ∂Q(v0x,v0Y)/∂ρ, as one finds from integration by parts. Thus,

The integral ϒXY is the sum of one such integral and one such constant for each step in the characteristic curve. This one‐dimensional form is useful for numerical evaluation and expansions. Kashlinsky, Hernández‐Monteagudo, & Atrio‐Barandela (2001) also present an interesting expansion of ϒXY in Hermite polynomials, in v0.

3.1.3. Noise: Exact Results

This section presents an exact expression for the variance of the correlation of a quantized signal when averaged over the ensemble of all statistically identical measurements. Real and imaginary parts of r ˆM have different variances. This requires calculation of both 〈 r ˆM r ˆ*M〉 and 〈 r ˆM r ˆM〉. Note that

where the sum over i≠j is simplified by the fact that samples at different times are uncorrelated, and by equation (22). I expand the first average in the last line:

where I have used the fact that the real part of x ˆ has zero covariance with the imaginary part of y ˆ, and vice versa. Because the real and imaginary parts are identical, this sum can be expressed formally in terms of the integrals:

These expressions define ϒX2Y2, AX2, and AY2. Thus,

Note that in these expressions AX2 and AY2 are constants that depend on the characteristic curve, but not on ρ, whereas ϒXY and ϒX2Y2 depend on ρ in complicated ways as well as on the characteristic curve.

Similarly,

I again expand the first sum in the last line:

where I omit the imaginary terms, all of which average to zero. Therefore,

Using the same logic as in the derivation of equation (18), equations (30) and (33) can be used to find the means and standard deviations of the real and imaginary parts of r ˆM:

Again, note that AX2 and AY2 are constants that depend on the characteristic curve, but not on the covariance ρ, whereas ϒXY and ϒX2Y2 depend on the actual value of ρ as well as the characteristic curve. For particular characteristic curves and particular values of ρ, these expressions nevertheless yield the mean correlation and the standard deviations of real and imaginary parts about the mean. Figures 2–5 show examples and compare them with the approximate results from the following section.

Fig. 2.—

Fig. 2.— Average correlation plotted with covariance ρ. Curves show 〈r〉 and approximations to 〈 r ˆ4〉 for v0 = 1.0 and 0.602, with n = 3. Heavy lines show the third‐order approximation of eq. (39), and light lines show the linear approximation of Cooper (1970). Circles show true values as computed by direct integration of eq. (25). Crosses show results of simulations (§ 4). The sharp rise in 〈 r ˆ4〉 for ρ ≈ 1, shown by the circles at far right, motivates the approximation of Jenet & Anderson (1998) that 〈 r ˆ4〉 varies proportionately with ρ for ρ<1, with a spike at ρ = 1.

Note that, because ϒXY is an increasing function of ρ, the standard deviation of the imaginary part decreases with increasing covariance ρ. This holds for arbitrary quantizer parameters so long as the characteristic curves are antisymmetric. In other words, the noise in the imaginary part always decreases when the correlation increases.

For uncorrelated signals, ρ = 0. One finds then that ϒXY = 0 and ϒX2Y2 = AX2AY2. In this case both real and imaginary parts have identical variances, as they must:

This recovers the result of Cooper (1970) and others for the noise in this limit.

If the characteristic curves are identical so that X ˆ(X) = Y ˆ(Y), then if the signals are identical (ρ = 1), one finds that ϒX2Y2 = AX2 = AY2. Under these assumptions then 〈Im(r ˆM)2〉 = 0.

3.2. Correlation of Quantized Signals: Approximate Results

3.2.1. Mean Correlation: Approximate Result

Unfortunately, the expressions for the mean correlation and the noise, for quantized signals, both depend, in a complicated way, on the covariance ρ, the quantity one seeks to measure. Often the covariance ρ is small. Various authors discuss the correlation r ˆM of quantized signals x ˆ and y ˆ to first order in ρ, as is appropriate in the limit ρ→0 (Van Vleck & Middleton 1966; Cooper 1970; Hagen & Farley 1973; Thompson et al. 1986; Jenet & Anderson 1998). Jenet & Anderson (1998) also calculate 〈r4〉 for ρ = 1. As they point out, this case is important for autocorrelation spectroscopy. D'Addario et al. (1984) present an expression for 〈 r ˆ 〉 for a three‐level correlator, and present several useful approximate expressions for the inverse relationship ρ(〈 r ˆM〉). Here I find the mean correlation 〈 r ˆM〉 through fourth order in ρ.

For small covariance ρ, one can expand P(X,Y) in equation (7) as a power series in ρ:

Note that the coefficient of each term in this expansion over ρ can be separated into two factors that depend on either X alone or Y alone. The extension to higher powers of ρ is straightforward, and the higher‐order coefficients have this property as well.

As noted above, I assume that the characteristic curve is antisymmetric: X ˆ(- X) = -X ˆ(X). The integral ϒXY (eq. [25]) involves first powers of the functions X ˆ(X) and Y ˆ(Y). In this integral, only terms odd in both X and Y match the antisymmetry of the characteristic curve and yield a nonzero result. Such terms are also odd in ρ, as is seen from inspection of equation (36). The first‐order terms thus involve the integrals

where again X and Y can stand for either real or imaginary parts of x and y. Here I consider terms up to order 3 in ρ. One thus encounters the further integrals

and the analogous expression for DY. Therefore, through fourth order in ρ,

For thresholds v0 ≈ 1, the linear approximation is quite accurate (Cooper 1970; Thompson et al. 1986; Jenet & Anderson 1998). However, for other values of v0 the higher order terms can become important. My notation differs from that of previous authors; my BXBY is equal to [(n - 1)E + 1] of Cooper (1970) and Thompson et al. (1986). It is equal to the A(σ ˆ22) of Jenet & Anderson (1998).

Figure 2 shows typical results of the expansion of equation (39) for a four‐level correlator, and compares this estimate for r ˆ4(ρ) with the results from direct integration of equation (25) over rectangles in the xy plane. In this example, v0x = v0y≡v0. For both v0 = 1 and v0 = 0.602, 〈 r ˆM〉 is relatively flat, with a sharp upturn very close to ρ = 1. However, in both cases, but especially for v0 = 0.602, the curve of r ˆ4 bends upward well before ρ = 1 so that the linear approximation is good only for relatively small ρ.

3.2.2. Noise: Approximate Results

Expressions for the noise in the integral involve the integral ϒX2Y2 (eq. [34]). This integral involves only the squares of the characteristic curves X ˆ(X)2 and Y ˆ(Y)2. Because the characteristic curves are antisymmetric about 0, their squares are symmetric: X ˆ(X)2 = X ˆ(- X)2 and Y ˆ(Y)2 = Y ˆ(- Y)2. Therefore, the only contributions come from terms in the expansion of P(x,y) (eq. [36]) that are even in X and Y. One thus encounters the integrals

and analogously for AY2, CY2, and EY2. Then, through fourth order in ρ,

I find the standard deviations of real and imaginary parts from equations (34), (39), and (41):

I have used the fact that 〈 r ˆM〉 is purely real; this is a consequence of the assumption that ρ is purely real.

Figure 3 shows examples of the standard deviations of Re(r ˆ4) and Im(r ˆ4) for two choices of v0. These are the noise in estimates of the correlation. Note that the noise varies with ρ. The quadratic variation of these quantities with ρ is readily apparent. The higher order variation is more subtle, although it does lead to an upturn of the standard deviation of Re(r ˆ4) near ρ ≈ 0.7 for v0 = 1. The series expansions become inaccurate near ρ = 1, as expected. The standard deviation of Re(r ˆ4) can also increase, instead of decrease, for large ρ. Such an increase is more common for parameter choices with v0>1. Again, note that the standard deviation of the imaginary part always decreases with increasing ρ.

Fig. 3.—

Fig. 3.— Standard deviation of the real part of correlation (left) and of the imaginary part (right), plotted with covariance ρ. Both are normalized for the number of samples by the factor (2Nq)1/2. Curves show 〈 r ˆ〉 and approximations to 〈 r ˆ4〉 for quantization with v0 = 1.0 and 0.602, with n = 3. Heavy lines show the fourth‐order approximation of eq. (42), and light lines show the approximation to second order. The values found by Cooper (1970) are the y‐intercepts (ρ = 0). Filled circles show true values as computed by direct integration of eq. (34). Crosses show results of simulations (§ 4).

3.2.3. S/N for Quantized Correlation

The S/N for a quantizing correlator is the quotient of the mean and standard deviation of r ˆM, the results of §§ 3.2.1 and 3.2.2. I recover the results of Cooper (1970) for the S/N for a quantizing correlator by using our approximate expressions through first order in ρ (see also Hagen & Farley 1973; Thompson et al. 1986; Jenet & Anderson 1998):

For a four‐level correlator, a S/N of Script R4 = 0.88115ρ is attained for n = 3, v0=1, in the limit ρ→0. Many four‐level correlators use these values. The maximum value for Script R4 is actually obtained for n = 3.3359, v0 = 0.9815, for which Script R4 = 0.88252ρ. This adjustment of quantization constants provides a very minor improvement in S/N.

For nonvanishing ρ, the optimum‐level settings depend upon the covariance ρ. The S/N is

This can be approximated using the expansions for ϒXY and ϒX2Y2; Figure 4 shows the results. Note that in the examples in the figure, the S/N for the quantized correlations actually curve above that for the unquantized correlation beyond ρ ≈ 0.5. This indicates that correlation of quantized signals can actually yield higher S/N than would be obtained from correlating the same signals before quantization. This results from the decline in noise with increasing ρ, visible in Figure 3.

Fig. 4.—

Fig. 4.— S/N Script R or Script R4, normalized for number of samples by (2Nq)-1/2, and plotted with covariance ρ. Curves mark the exact expression given by eq. (19) for Script R, or the approximate expression given by the expansions of eqs. (39) and (42) through fourth order in eq. (44). Filled circles give exact values as found from direct integration of eq. (34) in eq. (44). Crosses show results of computer simulations (§ 4). Quantization uses the characteristic curve in Fig. 1 with values of v0 = 1.0 or 0.602, and n = 3.

For a proper comparison of S/Ns, one must compare with the S/N obtained for nonquantized correlation, Script R(Re(r ˆM)) (eq. [19]). One finds

Figure 5 shows this ratio for two choices of v0. The ratio can exceed 1, again indicating that a quantized correlation provides a more accurate result than would correlation of an unquantized signal.

Fig. 5.—

Fig. 5.— S/N for quantized signals, normalized to the S/N of the unquantized signal: Script R4/Script R, plotted with covariance ρ. Curves mark the approximate expressions given by eqs. (39) and (42) for Script R4, normalized by Script R as given by eq. (18). Filled circles give exact expressions as found from direct integration of eq. (25). The y‐intercept for v0 = 1.0 is the standard S/N for a four‐level correlator, Script R4 = 0.88115, with v0 = 1.0 and n = 3, which is optimal for ρ = 0. Note that the ratio can be greater than 1. This indicates that quantized correlation can be more efficient than correlation of the original, unquantized signals.

The S/N for a measurement of phase for a quantized correlation is the inverse of the standard deviation of the phase, as discussed in § 19. For a quantized correlation, this is

Again, because ϒXY always increases with ρ, the S/N of the phase always increases with increasing covariance.

The ratio of the S/N for phase to that for correlation of an unquantized signal, Script RM(ϕ(r ˆM))/Script R(ϕ(r)) (see eq. [20]), provides an interesting comparison. For ρ→0, the statistics for the imaginary part of the correlation are identical to those for the real part (as they must be), and the highest S/N for the phase is given by the quantizer parameters that are optimal for the real part, traditionally v0 = 1, n = 3. This ratio is approximately constant with ρ up to ρ ≈ 0.6, and then decreases rather rapidly. Simulations suggest that quantized correlation is less efficient than unquantized for measuring phase. However, I have not proved this in general.

4. SIMULATIONS

Simulation of a four‐level correlator provides a useful perspective. I simulated such a correlator by generating two sequences of random, complex numbers, xi and yi. The real parts of xi and yi are drawn from one bivariate Gaussian distribution, and their imaginary parts from another independent one (see eq. [8]). These bivariate Gaussian distributions can be described equivalently as elliptical Gaussian distributions, with major and minor axes inclined to the coordinate axes X = Re(xi) and Y = Re(yi) and the corresponding axes for the imaginary parts.

Meyer (1975) gives expressions that relate the semimajor and semiminor axes and angle of inclination of an elliptical Gaussian distribution to the normalized covariance ρ and variances σ2X and σ2Y. For the special case of σX = σY = 1 used in this work, the major axis always lies at angle π/4 to both coordinate axes, along the line X = Y. The semimajor axis b1 and semiminor axis b2 are then given by

To form the required elliptical distributions, I drew pairs of elements from a circular Gaussian distribution, using the Box‐Muller method (see Press et al. 1989). I scaled these random elements so that their standard deviations were b1 and b2. I then rotated the resulting two‐element vector by π/4 to express the results in terms of X and Y. I repeated the procedure for the imaginary part.

I quantized the sequences xi and yi according to the four‐level characteristic curve shown in Figure 1, to yield x ˆi and y ˆi. Both the unquantized and the quantized sequences were correlated by forming the products xiy*i and x ˆi y ˆ*i, respectively, and results were averaged over Nq = 105 instances of the index i. This procedure yields one realization each of r and r ˆ4. I found that values for Nq that were smaller than about 105 could produce significant departures from Gaussian statistics for r ˆ4, particularly for larger values of ρ.

I repeated the process to obtain 4096 different realizations of r and r ˆ4. I found the averages and standard deviations for the real and imaginary parts for this set of realizations. Figures 2–5 show these statistical results of the simulations and compare them with the mathematical results of the preceding sections. Clearly, the agreement is good.

In graphical form, samples of the correlation form an elliptical Gaussian distribution in the complex plane, centered at the mean value of correlation 〈r〉 or 〈 r ˆM〉, as the case may be. The principal axes of the distribution lie along the real and imaginary directions (or, more generally, the directions in phase with ρ and out of phase with ρ). The lengths of these principal axes are the variances of real and imaginary parts.

5. DISCUSSION AND SUMMARY

5.1. Change of Noise with Covariance

The fundamental result of this paper is that a change in covariance ρ affects quantized correlation r ˆM differently from unquantized correlation r. For unquantized correlation, an increase in covariance ρ increases noise for estimation of signal amplitude. For quantized correlation, an increase in ρ can increase or decrease amplitude noise. For both quantized and unquantized correlation, an increase in ρ leads to a decrease in phase noise. In this work I arbitrarily set the phase of ρ to 0 so that amplitude corresponds to the real part, and phase to the imaginary part of the estimated correlation. The net noise (summed, squared standard deviations of real and imaginary parts) can decrease (or increase) with increasing ρ; noise is not conserved.

I present expressions for the noise as a function of quantization parameters, both as exact expressions that depend on ρ and on power‐series expansions in ρ. These expressions, and a power‐series expansion for the mean correlation, are given through fourth order in ρ.

The increase in noise with covariance ρ for analog correlation, sometimes called source noise or self‐noise, is sometimes ascribed to the contribution of the original, noiselike signal to the noise of correlation. This idea is difficult to generalize to comparisons among quantized signals, because such comparisons require additional assumptions about changes in quantizer levels and the magnitude of the quantized signal when the covariance changes. These comparisons are simpler for multiple correlations derived from a single signal (as, for example, for the correlation function of a spectrally varying signal), and I will discuss them in that context elsewhere (C. Gwinn 2004, in preparation). The discussion in this paper is limited to "white" signals, without spectral variation and with only a single independent covariance.

5.1.1. Increase in S/N via Quantization

One interesting consequence of these results is that the S/N of correlation can actually be greater for quantized signals than it would be for correlation of the same signals before quantization. At small covariance ρ, S/N is always lower for quantized signals, but this need not be the case for covariance ρ≳0.4. This appears to present a paradox, because the process of quantization intrinsically destroys information: the quantized signals x ˆi, y ˆi contain less information than did the original signals xi, yi. However, correlation of unquantized signals also destroys information: it converts xi and yi to the single quantity ci = (xiy*i). Different information is destroyed in the two cases.

Moreover, correlation does not always yield the most accurate estimate of the covariance ρ. As a simple example, consider the series {Xi} = {0.400, - 0.800,1.600} and {Yi} = {0.401, - 0.799,1.600}. Here Nq = 3. One easily sees that X and Y are highly correlated. If X and Y are known to be drawn from Gaussian distributions with unit standard deviation, equation (47) suggests that ρ ≈ 0.999. However, the correlation is r = 1 / 3 ∑i XiYi = 1.29. Clearly r is not an optimal measurement of ρ. I will discuss strategies for optimal estimates of covariance elsewhere (C. Gwinn 2004, in preparation).

5.1.2. Quantization Noise

Sometimes effects of quantization are described as "quantization noise"; an additional source of noise that, like "sky noise" or "receiver noise," reduces the correlation of the desired signal. However, unlike other sources of noise, quantization destroys information in the signals, rather than adding unwanted information. The discussion in the preceding section suggests that the amount of information that quantization destroys (or, more loosely, the noise that it adds) depends on what information is desired; and that correlation removes information as well. Unless the covariance ρ is small, effects of quantization cannot be represented as a one additional, independent source of noise in general.

5.1.3. Applications

The primary result of this paper is that for quantized correlation, noise can increase or decrease when covariance increases, whereas for continuous signals it increases. This fact is important for applications requiring accurate knowledge of the noise level; as for example in studies of rapidly varying strong sources such as pulsars, where one wishes to know whether a change in correlation represents a significant change in the pulsar's emission; or for single‐dish or interferometric observations of intraday variable sources, for which one wishes to know whether features that may appear and disappear are statistically significant.

A second result of this paper is that the S/N for quantized correlation can be quite different from that expected for a continuum source or for a continuum source with added noise. This effect is most important for large correlation, ρ≳0.5. Correlation this large is often observed for strong sources, such as the strongest pulsars, or maser lines, and for the strongest continuum sources observed with sensitive antennas. For example, at Arecibo Observatory a strong continuum source easily dominates the system temperature at many observing frequencies. The effect will be even more common for some proposed designs of the Square Kilometer Array.

Many sources show high correlation that varies dramatically with frequency. Such sources include scintillating pulsars and maser lines. Typically, observations of these sources involve determination of the full correlation function and a Fourier transform to obtain an estimated cross‐power or autocorrelation spectrum. I discuss the properties of noise for this analysis elsewhere.

5.1.4. S/N Enhancement?

An interesting question is whether one can take advantage of the higher S/N afforded by quantization at high ρ even for weakly correlated signals, perhaps by adding an identical signal to each of two weakly covariant signals and so increasing their covariance ρ, before quantizing them. The answer appears to be no. As a simple example, consider a pair of signals with covariance of ρ ≈ 0.01. After correlation of Nq = 2 × 106 instances of the signal using a four‐level correlator with v0 = 1 and n = 3, the S/N is 20, and one can determine at a level of 2 standard deviations whether ρ = 0.010 or 0.011. If a single signal with 4.6 times greater amplitude is added to both of the original signals, then these two cases correspond to ρ = 0.7004 or 0.7005. To distinguish them at 2 standard deviations requires a S/N of 1400, requiring Nq = 4 × 108 samples of the quantized correlation. Thus, the increase in S/N is more than outweighed by the reduction in the influence on the observable.

5.2. Summary

In this paper I consider the result of quantizing and correlating two complex noiselike signals, x and y, with normalized covariance ρ. The signals are assumed to be statistically stationary, "white," and sampled at the Nyquist rate. The correlation r provides a measurement of ρ. The variation of r about that mean, characterized by its standard deviation, provides a measure of the random part of the measurement, or noise.

I suppose that the signals x and y are quantized to form x ˆ and y ˆ. I suppose that the characteristic curves that govern quantization are antisymmetric, with real and imaginary parts subject to the same characteristic curve. I recover the classic results for the noise for ρ = 0 and for the mean correlation to first order in ρ in the limit ρ→0 (Van Vleck & Middleton 1966; Cooper 1970; Hagen & Farley 1973; Thompson et al. 1986). I find exact expressions for the mean correlation and the noise, and approximations that are valid through fourth order in ρ. I compare results with simulations. Agreement is excellent for the exact forms, and good for ρ that is not too close to 1, for the approximate expressions.

I find that for nonzero values of ρ, the noise varies, initially quadratically, with ρ. I find that the noise in an estimate of the amplitude of ρ can decrease with increasing ρ. This is opposite the behavior of noise for correlation of unquantized signals, for which noise always increases with ρ. The mean correlation can increase more rapidly than linearly with ρ. The S/N for correlation of quantized signals can be greater than that for correlation of unquantized signals, for ρ≳0.5. In other words, correlation of quantized signals can be more efficient than correlation of the same signals before quantization, as a way of determining the covariance ρ.

I am grateful to the Dominion Radio Astrophysical Observatory (DRAO) for supporting this work with extensive correlator time. I also gratefully acknowledge the VLBI Space Observatory Programme (VSOP) Project, which is led by the Japanese Institute of Space and Astronautical Science in cooperation with many organizations and radio telescopes around the world. I thank an anonymous referee for useful comments. The National Science Foundation provided financial support.

Please wait… references are loading.
10.1086/381167