This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.

A CLIPPING METHOD TO MITIGATE THE IMPACT OF CATASTROPHIC PHOTOMETRIC REDSHIFT ERRORS ON WEAK LENSING TOMOGRAPHY

, , , and

Published 2010 July 14 © 2010. The American Astronomical Society. All rights reserved.
, , Citation Atsushi J. Nishizawa et al 2010 ApJ 718 1252 DOI 10.1088/0004-637X/718/2/1252

0004-637X/718/2/1252

ABSTRACT

We use a mock catalog of galaxies based on the COSMOS galaxy catalog, including information on photometric redshift (photo-z) and spectral energy distribution types of galaxies, in order to study how to define a galaxy subsample suitable for weak lensing tomography feasible with optical (and near-IR) multi-band data. Since most useful cosmological information arises from the sample variance limited regime for upcoming lensing surveys, a suitable subsample can be obtained by discarding a large fraction of galaxies that have less reliable photo-z estimations. We develop a method to efficiently identify photo-z outliers by monitoring the width of the posterior likelihood function of redshift estimation for each galaxy. This clipping method may allow us to obtain clean tomographic redshift bins (here three bins are considered) that have almost no overlap, by discarding more than ∼70% of galaxies with ill-defined photo-zs corresponding to the number densities of remaining galaxies less than ∼20 arcmin−2 for a Subaru-type deep survey. Restricting the ranges of magnitudes and redshifts and/or adding near-IR data help us obtain a cleaner redshift binning. Using the Fisher information matrix formalism, we propagate photo-z errors into biases in the dark energy equation of state parameter w. We find that, by discarding most of the ill-defined photo-z galaxies, the bias in w can be reduced to a level comparable to the marginalized statistical error; however, the residual small systematic bias remains due to asymmetric scatters around the relation between photometric and true redshifts. We also use the mock catalog to estimate the cumulative signal-to-noise ratios (S/Ns) for measuring the angular cross-correlations of galaxies between finer photo-z bins, finding higher S/N values for the bins that include photo-z outliers.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

The bending of light by mass, gravitational lensing, causes images of distant galaxies to be distorted (see, e.g., Bartelmann & Schneider 2001, for a thorough review). These sheared source galaxies are mostly too weakly distorted to enable us to measure the effect in individual galaxies, but require surveys containing at least millions of galaxies to detect the signal in a statistical way (see, e.g., Fu et al. 2008, for the latest measurement result). This cosmic shear is now recognized as one of the most promising probes that allows a direct reconstruction of dark matter distribution as well as to constrain the properties of dark energy or to test the theory of gravity on cosmological scales (see, e.g., Hoekstra & Jain 2008; Massey et al. 2010; Huterer 2010, for recent reviews). In particular, by adding redshift information for source galaxies, the lensing geometrical information as well as the redshift evolution of dark matter clustering can be inferred, thereby significantly improving its facility to constrain cosmology (e.g., Hu 1999; Huterer 2002; Takada & Jain 2004).

To address questions about the nature of dark energy and/or the properties of gravity on cosmological scales, a number of ambitious wide-field optical and infrared imaging surveys have been proposed: the Panoramic Survey Telescope & Rapid Response System (Pan-STARRS4), the Dark Energy Survey (DES5), the Large Synoptic Sky Survey (LSST6), the space-based Joint Dark Energy Mission (JDEM7), and EUCLID. However, there are several sources of systematic errors inherent in weak lensing measurements, and understanding these errors is currently the most important issue for achieving the full potential of planned lensing surveys (e.g., Huterer 2010).

One of the most serious systematic errors is the uncertainty in estimating redshifts of source galaxies. Since it is practically infeasible to obtain spectroscopic redshifts for the huge number of imaging galaxies (108–109 galaxies for future surveys), these redshifts need to be estimated from multi-band photometry—the so-called photometric redshifts (hereafter photo-z). Both statistical errors and systematic biases in the relation between photometric and spectroscopic redshifts need to be well controlled (e.g., a sub-percent level for the bias for future surveys) in order not to have any serious biases in cosmological parameters comparable with the apparent statistical errors (Huterer et al. 2006; Ma et al. 2006). Understanding the properties of photo-z errors is also important for exploring an optimal survey design given the goal of achieving the desired level of cosmological constraints; depth versus number of filters versus area surveyed. Given this research background, there have been recent studies on photo-z requirement based on real data (Abdalla et al. 2008; Lima et al. 2008; Cunha et al. 2009).

In this paper, we focus on the issue of how to identify and remove catastrophic redshift errors—a case where photometric redshift is grossly misestimated (see Bernstein & Huterer 2009 for a similar study). This can be done by monitoring the posterior likelihood function of photo-z estimation for each galaxy. The important fact is that future surveys are planned to use the sample variance limited regime in cosmic shear information rather than the shot noise regime in order to constrain cosmology. Therefore, one can discard a large fraction of galaxies whose photo-z estimations are less reliable (Jain et al. 2007). Thus, it would be worth addressing how to construct a galaxy subsample suitable for lensing experiments. Having such a subsample of galaxies with reliable photo-z estimates may also relax requirements on a spectroscopic training set to calibrate the residual photo-z errors. In this paper, we will address these issues by using a mock catalog of photometric galaxies based on the COSMOS photo-z catalog (Ilbert et al. 2009) that provides the most reliable current photo-z catalog calibrated with 30 band data and a spectroscopic subsample.

This paper is organized as follows. In Section 2, we provide a brief overview of the theory of weak lensing. In Section 3, we provide details on how we made our simulated mock catalog of photometric galaxies based on the COSMOS data. In Section 4, we use the simulated catalog to assess the performance of photo-z estimation assuming survey parameters on depth and filter set, which are chosen to closely resemble the Subaru Hyper Suprime-Cam (HSC) Survey. In Section 5, we present the main results: we use the simulated photo-z catalog to implement a hypothetical weak lensing experiment, paying particular attention to how to construct a galaxy subsample which is defined such that photo-z errors have minimal impact on cosmological parameters. Section 6 is devoted to a summary and discussion. Unless explicitly stated we assume the concordance ΛCDM model consistent with the Wilkinson Microwave Anisotropy Probe (WMAP) 5-year results (Komatsu et al. 2009).

2. PRELIMINARIES

In this section, we briefly review the basics of cosmic shear tomography. Throughout this paper, we work in the context of a spatially flat cold dark matter model for structure formation.

2.1. Convergence Power Spectrum

Gravitational shear can be simply related to the lensing convergence: the weighted mass distribution integrated along the line of sight (see, e.g., Bartelmann & Schneider 2001 for a thorough review and references therein). Photometric redshift information on source galaxies allows us to subdivide galaxies into redshift bins, enabling more cosmological information to be extracted, which is referred to as lensing tomography (e.g., Hu 1999; Huterer 2002; Takada & Bridle 2007). In the context of cosmological gravitational lensing, assuming the flat-sky approximation and the Limber approximation, the lensing power spectrum of the ith and jth tomographic bins can be expressed as

Equation (1)

where H(z) is the Hubble expansion rate, χ is the comoving angular diameter distance up to redshift z, and Pδ(k, z) is the three-dimensional matter power spectrum at scale k and at redshift z. The lensing weight function W(i)(χ) in the ith tomographic redshift bin, defined to lie between the redshifts zi and zi+1, is given by

Equation (2)

and

where n(z) is the redshift distribution of galaxies, and $\bar{n}_i$ is the average number density of galaxies residing in the ith tomographic bin (or the redshift range z = [zi, zi+1]): $\bar{n}_i=\int _{z_i}^{z_{i+1}}dz^{\prime } n(z^{\prime })$.

In practice, the power spectrum measured from a galaxy survey has shot noise contamination arising from the finite sampling of galaxy images. Hence, the measured power spectrum becomes

Equation (3)

where σepsilon is the rms intrinsic ellipticities per component and δKij is the Kronecker delta symbols; δKij = 1 when i = j, otherwise δKij = 0.

Note that the distribution n(z) appearing in Equation (2) denotes the underlying true redshift distribution of galaxies used in lensing analysis. However, the distribution needs to be inferred from photo-z information of individual galaxies available from multi-color imaging data sets. This generally causes biases in the lensing power spectrum in the presence of photo-z errors. As long as tomographic redshift bins are broad enough, O(107) galaxies are available in each bin for a Subaru-type survey with ∼1000 deg2 sky coverage. Therefore, the statistical errors of photo-z are not problematic: the lensing power spectrum is primarily sensitive to the mean redshift of source galaxies. Instead, a precise knowledge of the mean redshift in each tomographic bin is required not to have any significant biases in best-fit parameters compared to the statistical errors, as studied by Huterer et al. (2006).

To assess the required photo-z accuracies for lensing tomography, an important fact we should keep in mind is that the lensing measurement for planned wide-field surveys is not shot noise limited. Hence a significant fraction of galaxies with ill-defined photo-zs can be discarded, without severely degrading parameter accuracies (Jain et al. 2007). With these considerations in mind, we will address in the following how to define an adequate subsample of galaxies for a given multi-color data set.

Figure 1 gives a quick summary of the impact of redshift uncertainty on the lensing power spectrum for the no-tomography case, i.e., a single redshift bin. Here, for simplicity, we assumed the redshift distribution given by the analytical form, n(z) ∝ z2exp [ − (z/z0)2] with z0 = 1, corresponding to the mean redshift $\langle z\rangle = 2z_0/\sqrt{\pi }\simeq 1.12$. (Note that the following results are all computed using simulated galaxy catalogs that have different redshift distributions). The plot shows that a 5% change in the mean redshift causes a 10% level change in the power-spectrum amplitude, and the amount of the change varies with multipoles due to the projection of the nonlinear matter power spectrum. This change can be compared with the effect of the dark energy equation of state and the statistical errors at each multipole bin expected for the power-spectrum measurement. Clearly such a bias in the mean redshift is problematic for planned surveys.

Figure 1.

Figure 1. Lensing shear power spectrum (solid curve) assuming the concordance ΛCDM model and the galaxy redshift distribution with mean redshift 〈z〉 = 1.13 (see the text for details). The dashed curve shows the resulting spectrum when the mean redshift is shifted by 5%, while the dotted curve shows the spectrum when the dark energy equation of state parameter is changed to w = −1.2. The shaded boxes around the fiducial power spectrum show the expected 1σ error at each multipole bin assuming ΩS = 2000 deg2, $\bar{n}_g=30$ arcmin−2, and σepsilon = 0.22 for survey area, the average number density of galaxies, and the rms intrinsic ellipticities, respectively. The bottom panel shows the relative differences of power spectra with respect to the fiducial spectrum.

Standard image High-resolution image

3. SIMULATION OF A PHOTOMETRIC GALAXY CATALOG

To assess the impact of photo-z errors on cosmic shear tomography, we use the following procedure. First, we simulate a mock catalog of galaxies that contains information on true redshifts, magnitudes in each filter, and spectral energy distribution (SED) for the survey parameters we consider. Then we estimate photometric redshifts for each simulated galaxy from its colors, yielding the photo-z catalogs.

A quick summary of the procedures used in making the mock photometric catalog is as follows.

  • 1.  
    Based on the results of the COSMOS photo-z catalog (Ilbert et al. 2009), we first model the redshift distribution of galaxies as a function of the i-band magnitudes down to i = 25.8 (see Section 3.1).
  • 2.  
    We then use the synthetic galaxy spectral model, GISSEL98, to generate a set of SED templates for each type of galaxy, where the age and star formation history are randomly varied (see Section 3.2).
  • 3.  
    We then use the HyperZ code8 to generate a mock photometric catalog of galaxies in which the SED and redshift are assigned to each galaxy. By doing this we impose conditions on the catalog so that it will satisfy the redshift–magnitude relation as well as reproduce an appropriate mixture of galaxy SED types which is consistent with the COSMOS galaxy population (see Section 3.3).
  • 4.  
    For a given set of filters, we compute apparent magnitudes in each filter for each simulated galaxy by taking into account the filter transmission curve and the redshifted spectrum (see Section 3.4). The statistical magnitude errors are also added to the magnitude of each filter (see Section 3.5).

To make a realistic simulated catalog, we assume survey parameters (depth, filter transmission curves, and so on) that are chosen to closely resemble the planned HSC survey. We also consider the external imaging data sets of u band and/or near-IR (NIR) to study how combining the different colors improves photo-z accuracies.

In this paper, we use a mock galaxy catalog containing about 105 galaxies in the range 20 < i < 25.8.

In the following subsections, we will describe the details of each procedure above; readers who are more interested in the results can skip these subsections and go to Section 4.

3.1. Magnitude–Redshift Relation

To make a mock catalog we need to properly take into account the redshift distribution of galaxies, which varies with the range of magnitudes considered. For example, fainter galaxies are preferentially at higher redshifts. This is the so-called magnitude–redshift relation. We use the magnitude–redshift relation estimated from the COSMOS catalog of galaxies selected by the Subaru i-band magnitudes (Ilbert et al. 2009). The COSMOS catalog provides the most accurate current photometric redshifts because the photo-z are estimated from 30 broad, intermediate, and narrowbands covering from UV, optical to mid-infrared. Also the photo-z estimates are calibrated by the spectroscopic subsample. Ilbert et al. (2009) studied subsamples of photo-z galaxies for different limiting magnitudes and showed that the resulting redshift distributions are well fitted by the polynomial form:

Equation (4)

where A, a, b, and c are the fitting parameters. The best-fit parameters for different magnitudes in the range i = [22, 25] are given in Table 2 in Ilbert et al. (2009). The COSMOS data is deep enough in the i band, and can be safely considered as a magnitude limit sample for i < 25. The COSMOS catalog also includes information on the angular number counts of galaxies for a given magnitude range as well as on the estimated galaxy SED type for each galaxy. To model a hypothetically deeper survey such as we are interested in, we extrapolate the fitting parameters to obtain the redshift distribution for fainter galaxies down to i = 25.8.

We thus generate a mock i-band photometric catalog of galaxies such that the resulting catalog satisfies the magnitude and redshift relation for different ranges of i-band magnitudes. Figure 2 shows the magnitude–redshift distributions for the simulated catalog containing about 105 galaxies.

Figure 2.

Figure 2. Redshift distribution of our simulated galaxies (containing about 105 galaxies) as a function of the i-band magnitude ranges as indicated by the labels. Note that the mock catalog is constructed so as to reproduce the redshift distribution found in the COSMOS galaxies with i < 25. The galaxies with 25 < i < 25.8 are simulated by extrapolating the COSMOS results down to the fainter magnitudes (see the text for details).

Standard image High-resolution image

3.2. Synthetic Spectral Models

For a given simulated galaxy labeled with some i-band magnitude and redshift z, we need to model the SED from which the apparent magnitudes can be computed for a given set of filters. We use the publicly available library GISSEL98 (Bruzual & Charlot 1993, 2003; Bolzonella et al. 2000) to model the synthetic galaxy spectrum. The galaxy SEDs are generated to represent SEDs from early- to late-types (elliptical, S0, Sa, Sb, Sc, Sd, Im, and starburst). To model these populations—composite stellar populations (CSPs)—the single stellar population (SSP) is convolved with a model star formation history:

Equation (5)

Note that the SSP is modeled in Bruzual & Charlot (1993), with the initial stellar mass function given in Miller & Scalo (1979). The function ψ(t) is the star formation rate (SFR) at galaxy age t. We assumed ψ(t) ∝ exp(−t/τ) with τ = 1, 2, 3, 5, 15, and 30 Gyr for elliptical, S0, Sa, Sb, Sc, and Sd galaxies, respectively. For a starburst galaxy, star formation occurs instantaneously, while ψ = constant for an irregular (Im) galaxy. The metallicity evolves self-consistently with galaxy age, and we checked that different models of metallicity have little effect on the photo-z estimates (also see Bolzonella et al. 2000). We randomly chose the age of each simulated galaxy from 221 different ages in the range of t = [0, 20] Gyr, where the age interval is determined according to GISSEL98.

Figure 3 demonstrates simulated SEDs for starburst, elliptical, irregular, and spiral galaxies at six different ages. The dust extinction is modeled following Calzetti et al. (2000) with AV in the range [0, 2.0].

Figure 3.

Figure 3. Synthetic galaxy spectra, fν = λ2fλ normalized at λ = 4000 Å. For each panel, from top to bottom at 10000 Å, galaxy ages are 10 Gyr, 5 Gyr, 1 Gyr, 100 Myr, 10 Myr, and 1 Myr, respectively. From the top-left to bottom-right panels, the galaxy SED types are starburst (SB), elliptical (Ell), irregular (Im), and spiral (Sc), respectively.

Standard image High-resolution image

3.3. Mock Galaxy Catalog

To make a mock galaxy catalog containing various galaxy populations, we used the publicly available code HyperZ. In doing this, we need an appropriate mixture of different galaxy SED types. We employed the composition (SB, E, S, Im) = (0.52, 0.035, 0.40, 0.045) over the entire redshift range, which was chosen so as to match the composition of best-fit galaxy SED types found in the COSMOS catalog with i < 25.9 For this purpose, the command make_catalog in HyperZ was slightly modified in such a way so that the resulting catalog satisfied the magnitude–redshift relations for each magnitude range in Section 3.1 and the assumed composition of galaxy SED types, because the unmodified make_catalog generates a catalog in which redshift, reference magnitude, age, galaxy SED type, and the amount of dust extinction are randomly assigned to each galaxy.

3.4. Photometric Magnitudes

Once the SED is specified for each simulated galaxy at redshift z, it is straightforward to compute the apparent magnitudes for a given set of filters taking into account the redshifted spectrum at observed wavelengths. The photo-z estimate is sensitive to the details of observational parameters: the wavelength coverage, the transmission curve of a given filter, the exposure time, the limiting magnitude, and so on. We consider the parameters that match those of the planned Subaru HSC survey: our default filter set is g'r'i'z'y' (hereafter the prime superscripts are sometimes omitted), and the 5σ limiting magnitudes (2'' aperture) are set to g = 26.5, r = 26.4, i = 25.8, z = 24.9, and y = 23.7, respectively, assuming an exposure time of 15 minutes for each passband, 3 days from new moon, and 1.2 air mass at the Subaru Telescope site.10

We also study how adding other bands, u-band data and NIR data, into the optical data above can improve photo-z accuracies. Having a wider wavelength coverage helps break degeneracies in photo-z estimates; more specifically, it helps discriminate the Lyman break and 4000 angstrom break from the multi-color data. We consider here the u-band data that can be delivered from the Canada–France–Hawaii Telescope (CFHT), and also the J, H, Ks(hereafter K) bands of the planned VIKING (VISTA Kilo-Degree Infrared Galaxy) survey. The 5σ limiting magnitudes are u = 25.0, J = 22.1, H = 21.5, and K = 21.2, respectively. The set of filters and the depths we consider in this paper are summarized in Table 1.

Table 1. Filters and Limiting Magnitudes (5σ)

Filter Survey λc (Å) FWHM (Å) AB (mag) Texp  (s)
u* CFHTLS 3752 740 25.0 900
g' HSC 4814 1120 26.5 900
r' HSC 6279 1370 26.4 900
i' HSC 7687 1500 25.8 900
z' HSC 9143 1330 24.9 900
y' HSC 9923 490 23.7 900
J VIKING 12578 1713 22.1 420
H VIKING 16581 2828 21.5 420
K VIKING 21790 2828 21.2 420

Download table as:  ASCIITypeset image

A detailed study of the optimal filter parameters, for example, the number of filters and filter resolution, in terms of minimizing the outlier fraction or photo-z scatters, are found in Jouvel et al. (2010).

3.5. Magnitude Errors

Finally, we include statistical errors in the apparent magnitudes. Assuming the sky noise limit, we simply model this magnitude error as Gaussian fluctuations with width given by

Equation (6)

where SN is the signal-to-noise ratio for a given galaxy; SN is computed once the galaxy's apparent magnitude and depth in the filter are given. The magnitude error is computed as follows. First, the sky noise is added to the observed flux of a galaxy, causing a deviation from the true flux as fobs = f0 + Δf = f0(1 + 1/SN). Then the magnitude error above is computed as Δm = −2.5log(1 + 1/SN) because m0 + Δm = −2.5log f0(1 + 1/SN) + constant. Strictly speaking, even for a Gaussian sky noise, the magnitude error does not obey a Gaussian distribution due to the log-mapping. However, the Gaussian approximation on Δm holds for galaxies with sufficiently high SN values, which we will assume for the following results.

Note that a galaxy which has its apparent magnitude near the limiting magnitude may be excluded from or included in the sample in the presence of the magnitude error. While our simulated galaxies are all i-band selected, some galaxies may have apparent magnitudes below the detection limit in other passbands. We use such an upper limit on the flux in the photo-z estimate, which improves the photo-zs to some extent. In doing this, the flux for such an undetected galaxy in a given filter is set to the magnitude corresponding to the halved limiting flux flim/2. Also, we note that the systematic offset of photometry may cause an additional uncertainty in magnitude measurement, which in this paper is ignored. Such a zero-point magnitude offset can be calibrated, for example, by using a spectroscopic redshift subsample (Ilbert et al. 2009).

According to the procedures described above we made a mock photometric catalog containing about 105 galaxies down to magnitude i = 25.8. Although we tried to make a realistic mock catalog based on the COSMOS catalog, some of our treatments may be still oversimplified: for example, we assumed an SSP for galaxy SEDs. The simplified assumptions may make our results somewhat optimistic. A more accurate method to overcome these obstacles would be to use the real data, including spectra, for a representative subsample of imaging galaxies. However, such spectroscopic data, especially for faint galaxies of interest, are still limited; this will be our future work.

4. METHOD: PHOTOMETRIC REDSHIFT AND PARAMETER BIAS

Now we use the mock photometric catalog of i-band-selected galaxies to assess the performance of photo-z estimates in the context of weak lensing tomography experiment.

4.1. Photometric Redshift Estimation

By combining multi-passband magnitudes of a given imaging galaxy, its redshift can be estimated without spectroscopic observation—the so-called photo-z. There are various techniques that have been developed: the template fitting method (Sawicki et al. 1997; Bolzonella et al. 2000), the template method combined with prior information (magnitude prior and so on; Benítez 2000; Mobasher et al. 2004), the method including a self-calibration based on a training spectroscopic set (Collister & Lahav 2004, and references therein).

In this paper, we use the publicly available code, Le Phare,11 which is a template fitting method. The photo-z for each galaxy is estimated based on the χ2 fitting:

Equation (7)

where fobsi is the observed flux in the ith filter, f(T, z, E) is the model flux which is given as a function of galaxy SED type (T), redshift (z), and the amount of dust extinction (E), and σi is the magnitude error. Note that galaxy SED type is modeled according to the method described in Section 3.2. The summation in Equation (7) runs over the number of filters considered (Nf). The extra factor parameter α, which is the same in all the filters, is introduced in Equation (7) because the photo-z is estimated only from colors, the relative amplitudes of fluxes in different filters, not from the absolute fluxes. Therefore, there are (Nf−1) constraints given the data of Nf filters. The best-fit redshift parameter, i.e., the best-fit photo-z, is obtained by minimizing the χ2 value with varying other model parameters.

If the location of spectral features such as the Lyman break and the 4000 Å break is captured given the wavelength coverage of observed filters and the magnitude depths taken, the redshift is robustly estimated. On the other hand, a misidentification of the spectral features causes a degeneracy in redshift estimation, often yielding multiple solutions at low and high redshifts. Hence the photo-z method based on broadband photometry generally yields a large fraction of outliers, where the best-fit photo-z can be far from the true redshift. To quantify the photo-z accuracy for each galaxy we will use the following two quantities: (1) the goodness-of-fit parameter for the template fitting, and (2) the width of likelihood function of redshift estimation.

The goodness of fit for the template fitting of a given galaxy may be defined as

Equation (8)

where χ2min is the minimum χ2 value for the best-fit model and redshift, zbf is the best-fit redshift, and Nf − 1 is the number of colors available. Note that the quantity above, χ2ν, is not the same as the reduced χ2, which is defined as the number of constraints minus the number of model parameters. The number of model parameters are equal for all the galaxies, so χ2ν gives a measure of the goodness of fit.

We also use the width of likelihood function of redshift estimation for each galaxy defined as

Equation (9)

where p(z) is the likelihood function given as p(z) ∝ exp [ − χ2(z)/2], which is normalized to satisfy ∫dzp(z) = 1. We compute the likelihood function p(z) by fixing other model parameters to their best-fit values. The normalization factor (1 + zbf) is introduced based on the fact that the photo-z accuracy scales with (1 + z). Compared to σ(z), the local standard deviation of photo-z estimation, which is obtained from Δχ2 ⩽ 1, the quantity Var(z) is sensitive to the outlier probability with |zzbf| ≫ 1 due to the weight (zzbf)2. Hence for galaxies whose likelihood function has multiple peaks, i.e., multiple redshift solutions, the quantity Var(z) tends to be larger. Similar quantities to Var(z) are also used in previous works (Mobasher et al. 2004; Wolf 2009) where the primary purpose was to improve the photo-z performance for a majority of galaxies. In this paper, we use the figure-of-merit quantity Var(z) mainly for identifying photo-z outliers.

Note that in the following results we use zbf to construct the redshift distribution of galaxies. An alternative method, which may be less sensitive to photo-z outliers, is summing the photo-z posterior likelihood function p(z) over all the sampled galaxies to obtain the overall redshift distribution (Wittman 2009; Cunha et al. 2009). Calibrating the mean redshift distribution is another important issue that needs to be carefully studied (Ma & Bernstein 2008; Bordoloi et al. 2009), but is beyond the scope of this paper.

Figure 4 demonstrates examples of the photo-z fitting for two simulated galaxies. It can be seen that a galaxy which has a ill-defined photo-z estimate, i.e., a wider likelihood function of redshift estimation, tends to have a larger value of Var(z). In particular, even if the likelihood function locally has a narrow peak around the best-fit redshift, therefore even if the photo-z error looks apparently small, the value Var(z) becomes larger if the likelihood has multiple peaks (i.e., the case of multiple redshift solutions), as demonstrated in the upper panel. On the other hand, a galaxy with reliable photo-z estimate has a small value of Var(z).

Figure 4.

Figure 4. Examples of the photo-z fitting for two simulated galaxies. The upper panel shows an example of the photo-z outlier, i.e., the ill-defined photo-z estimate, while the lower panel shows an example of the reliable photo-z estimate. In each panel, the solid curve shows the likelihood function of redshift parameter, p(z) ∝ exp [ − χ2/2], and the solid (red) and dashed (blue) arrows denote the best-fit redshift and the true redshift, respectively. The width of the likelihood is quantified by Var(z), defined by Equation (9), and the value for each simulated galaxy is denoted in the upper-right corner of each panel.

Standard image High-resolution image

While the quantity Var(z) is empirically defined as an indicator of photo-z outliers, Figure 5 gives a quantitative study of how Var(z) can characterize the photo-z likelihood function. The figure shows the distributions of simulated galaxies in the Var(z)–Δz plane, where Δz is the difference between true and photometric redshifts defined as Δz ≡ (zbfztrue)/(1 + ztrue) for each galaxy. According to the properties of their photo-z likelihood functions, we divide galaxies into two subsamples: one (thick black line) is defined from galaxies (about 25% of all the galaxies) that have a single peak and therefore a reliable photo-z estimate, while the other (thin gray line) is from galaxies of multiple peaks, respectively. Note that the second- and higher-order peaks are defined as the local maximum values of the likelihood if their heights are higher than 10% of the first peak height. One can find from the figure that the quantity Var(z) nicely separates galaxies that tend to have greater photo-z biases and multiple solutions of photo-zs, i.e., degenerate photo-z estimates; most of the galaxies where Var(z) ≳ 0.1 have multiple peaks.

Figure 5.

Figure 5. Probability distributions of simulated galaxies in parameter space of Var(z) and Δz, where Var(z) is defined in terms of the photo-z likelihood by Equation (9) and Δz denote the bias between photometric and true redshifts, Δz ≡ (zbfztrue)/(1 + ztrue). In each panel, thick black dots or lines show galaxies whose photo-z likelihood function has a single peak, while thin gray dots or lines show galaxies that multiple peaks in their likelihood. It is clear that galaxies, which have greater photo-z biases and have multiple photo-z solutions in their likelihood, tend to have greater values of Var(z).

Standard image High-resolution image

4.2. Fisher Matrix Formalism

We will use the Fisher matrix formalism to estimate accuracies of estimating parameters given the lensing power spectrum. The Fisher matrix is given by

Equation (10)

where pα denotes a set of cosmological parameters, the matrix C denotes the covariance matrix, and C−1 denotes the inverse matrix. In this paper, we simply use the covariance matrix given by the first term in Equation (9) in Takada & Jain (2009), assuming the Gaussian errors on power-spectrum measurements. The Gaussian error assumption is adequate for our purpose because the impact of non-Gaussian errors on parameter estimation is not significant as long as a multi-parameter fitting is considered as shown in Takada & Jain (2009). The marginalized 1σ error on the αth parameter pα is given by σ2(pα) = (F−1)αα, where F−1 is the inverse of the Fisher matrix. Throughout this paper, we set lmin = 5 and lmax = 3000 for the minimum and maximum multipoles as in the summation above. Note that all the parameter forecasts given below are for the lensing tomography combined with the expected Planck information, which is obtained by simply adding the two Fisher matrices of lensing and cosmic microwave background (CMB): ${\boldsymbol{F}}_{\rm WL+CMB}={\boldsymbol{F}}_{\rm WL}+{\boldsymbol{F}}_{\rm CMB}$.

As we explained for Equation (1), the lensing power spectrum is sensitive to the underlying true redshift distribution of galaxies, n(z). For a multi-color imaging survey, however, the distribution n(z) needs to be estimated from the available photo-z information. In this procedure, the photo-z errors affect weak lensing experiments. The most dangerous effect is a systematic bias in parameter estimations: if the inferred redshift distribution has a bias in the mean redshift compared to the true one, the redshift bias may cause significant biases in cosmological parameters. In order to quantify the biases in cosmological parameters caused by photo-z errors, we use the following method based on the Fisher matrix formalism in Huterer & Takada (2005) (see also Appendix B of Joachimi & Schneider 2009 for the detailed derivation):

Equation (11)

where ${\boldsymbol{F}}^{-1}_{\rm WL+CMB}$ is the inverse of the Fisher matrix, and δpα denotes a bias in the αth parameter, the difference between the best-fit and true values. In the equation above, the spectrum Cκmn(l) is the underlying true power spectrum, while Cκ,photo-zmn(l) is the spectrum estimated from the redshift distribution inferred based on the photo-z information. In the presence of the photo-z errors, generally CκijCκ,photo-zij, thereby causing a bias in parameter estimation. We can compute both spectra, Cκ and Cκ,photo-z, from a simulated galaxy catalog for a hypothetical lensing survey. Note that in Equation (11), for simplicity, we have not considered any other nuisance parameters that model other systematic effects such as shape measurement errors (e.g., Huterer et al. 2006) and the inability to make precise model predictions arising from nonlinear clustering and baryonic physics (Huterer & Takada 2005; Rudd et al. 2008; Zentner et al. 2008).

To compute the parameter forecasts, we need to specify a fiducial cosmological model and survey parameters. Our fiducial cosmological model is based on the WMAP 5-year results (Komatsu et al. 2009): the density parameters for dark energy, CDM, and baryon are Ωde(=0.74), Ωcdmh2(=0.1078), and Ωbh2(=0.0196) (note that we assume a flat universe); the primordial power-spectrum parameters are the spectral tilt, ns(=1), and the normalization parameter of primordial curvature perturbations, As ≡ δ2ζ(=2.3 × 10−9) (the values in the parentheses denote the fiducial model); and the dark energy equation of state parameter w0(= −1). We used the publicly available code CAMB developed in Lewis & Challinor (2006) (also see Seljak & Zaldarriaga 1996) to compute the transfer function, and use the fitting formula in Smith et al. (2003) to compute the nonlinear mass power spectrum from which the lensing power spectrum is computed over the relevant range of angular scales.

Our fiducial survey roughly resembles the planned Subaru Hyper-Suprime Cam (HSC) Survey (Miyazaki et al. 2006). We adopt the set of filters (grizy) and the depths in each filter as given in Section 3. We will also study how the results change when the hypothetical Subaru survey is combined with other surveys that deliver the u-band data or/and the NIR data, which especially help to improve the photo-z accuracies. The survey area is throughout assumed to be Ωs = 2000 deg2. The redshift distribution of galaxies is computed for an assumed subsample of galaxies based on the photo-z information.

4.3. Object Selection and Clipping of Photo-z Outliers

We may be able to construct a suitable subsample of galaxies in the sense that the impact of photo-z errors are minimized in order not to have more than 100% biases in cosmological parameters compared to the statistical errors. Hence a selection of adequate galaxies is important for weak lensing: this may be attained, for example, by discarding galaxies with ill-defined photo-zs. However, an important fact to keep in mind is that by discarding more galaxies, the statistical accuracy of parameter estimation is degraded due to the increased shot noise contamination in the power-spectrum measurement. Thus, there would be a trade-off point in defining a suitable galaxy subsample in terms of the parameter bias versus the statistical error for a given survey.

Throughout this paper, we work on i-band-selected galaxies assuming that the i-band data are used for the lensing shape measurement as was often done in previous lensing works. For our simulated galaxies, given the limiting magnitude i = 25.8 at 5σ significance, a sufficient number of photometric galaxies are available: the number density for the total galaxies is 80 arcmin−2. However, not all the galaxies are usable for lensing measurements. First, in order to obtain a reliable shape measurement, galaxies used in the lensing analysis need to be well resolved, requiring them to have sufficient S/N values (say more than 10σ). Second, galaxies with ill-defined photo-z estimates are not useful because including them may cause a significant bias in parameter estimation.

With the above considerations in mind, we will use the following object selections or their combinations to make parameter forecasts.

  • 1.  
    The restrictive range of i-band magnitudes (22.5 ⩽ i ⩽ 25). The range is a typical one used in the weak lensing analysis (e.g., Okabe et al. 2009). The faint-end magnitude cut may be imposed such that the selected galaxies have sufficiently high S/N values: i = 25 corresponds to S/N ≃ 10 in our simulations. The bright-end magnitude cut is not important, but usually imposed in practice to avoid galaxies with saturated pixels.
  • 2.  
    The photo-z selection. We select only galaxies that have reasonably good photo-z estimates by imposing a threshold on the goodness of fit of photo-z estimation, χ2ν ⩽ 2 (see Equation (8)). The clipping threshold χ2ν = 2 is not a unique choice; we selected the value as one working example.
  • 3.  
    The restrictive range of photo-zs (0.2 ⩽ zbf ⩽ 1.5). The spectral features of galaxies in this range of redshifts can be captured relatively well by the wavelength coverage of optical filters. The lower redshift cut is introduced, because there is a strong degeneracy between galaxies at such low redshifts as z ≲ 0.2 and those at higher redshifts, especially in cases where that the u-band data are not available or shallower than the optical data considered in this paper.
  • 4.  
    Clipping of photo-z outliers. By discarding galaxies with Var(z) (see Equation (9)) greater than a given threshold, which turn out to be mostly photo-z outliers, we define a subsample from the remaining galaxies that have relatively reliable photo-zs. In the following, we study the performance of this clipping method by varying the clipping threshold values of Var(z).

Table 2 gives the fraction of remaining galaxies compared to the original sample, where galaxies in each subsample are selected with object selection criteria described above for a given set of filters. The column denoted by "χ2ν ⩽ 2" shows the fraction of galaxies selected when imposing the threshold on the goodness of fit for each galaxy. It is clear that this clipping discards only a small fraction of galaxies for all the cases of filter combinations. The second and third columns show the fractions when further imposing the restricted ranges of photometric redshifts and magnitudes, respectively.

Table 2. Fractions of Galaxies Included in Each Subsample

Filters χ2ν < 2 ∩0.2 < zp < 1.5 ∩22.5 < i' < 25.0
grizy 0.94 0.61 0.50
ugrizy 0.94 0.61 0.50
grizyJHK 0.89 0.57 0.47
ugrizyJHK 0.89 0.58 0.48

Notes. The subsamples denoted as "χ2ν ⩽ 2" show included galaxies selected with χ2ν ⩽ 2 in the photo-z fitting for each combination of filters (see text for the details). The second and third columns show the results when imposing further conditions on the ranges of photo-zs and i-band magnitudes as denoted.

Download table as:  ASCIITypeset image

5. RESULTS

In this section, we show the main results of this paper using mock galaxy catalogs.

5.1. Photo-z Accuracy

Figure 6 shows the results using different galaxy catalogs with various combinations of the measured passbands (see Table 1), grizy, ugrizy, grizy+JHK, and ugrizy+JHK from the left- to right-column panels, respectively. Each panel in the top row shows the photo-z performance for the simulated objects with i' < 25.8 (>5σ), selected by imposing the condition that the goodness-of-fit χ2ν < 2 (see Equation (8)). It is clear that, with broadband photometry alone, the photo-z accuracy is limited: a significant contamination of outliers is unavoidable, even when including NIR- and/or u-band data.

Figure 6.

Figure 6. Lower panel: scatter plots between photometric and true redshifts for our simulated galaxies. The different columns correspond to the results for different sets of filters as indicated in the bottom panels, while the different rows correspond to different object selections (see Section 4.3 for details). The upper-row panels show the results of samples containing all the galaxies with i < 25.8 that have their photo-z fits quantified as χ2ν ⩽ 2 (see Equation (8)). The middle- and bottom-row panels show the results of subsamples obtained by discarding 40% and 70%, respectively, of the galaxies with ill-defined photo-z estimates. This is done by choosing the threshold value Var(z) (see Equation (9)) for each galaxy such that the desired fractions of galaxies remain in the resulting subsamples. Note that, for illustrative purpose, only 5% of the representative galaxies in each subsample are shown in each plot. Upper panel: ratio of remaining galaxies that have Var(z) values greater than a given threshold denoted on the vertical axis. The solid curve shows the result for the set of filters, grizy, while the dotted curve shows the result when the NIR data JHK are added (see Table 1 for details). Note that the results are almost unchanged by further adding the u-band data. The vertical thin dashed lines denote the selections used in the lower plot, using the criterion of discarding 40% and 70%, respectively, of the galaxies with poor photo-zs.

Standard image High-resolution image

The middle- and lower-row panels show the results obtained by further discarding photo-z outliers based on the clipping method with a given threshold on the quantity Var(z) (Equation (9)). The thresholds are chosen such that 40% or 70% of the objects in each top-row panel, which tend to have ill-defined photo-zs, are discarded, respectively. We found that this clipping method based on the photo-z likelihood function of each galaxy can efficiently discard galaxies with ill-defined photo-zs, especially when combined with the NIR and u-band data.

To be more explicit, the upper panel shows the fraction of galaxies included in the subsample after discarding galaxies with Var(z) greater than a given threshold denoted on the vertical axis. The solid and dotted curves show the results without and with the JHK data in addition to the optical data, grizy. Note that including the CFHT-type u-band data of the assumed depth, as given in Table 1, has little effect on the two results. For a given particular value of Var(z), the dotted curve has more remaining galaxies than the solid curve, implying that these NIR data help improve the photo-z accuracies on an individual galaxy basis, i.e., indicating that the shape of the photo-z likelihood function is narrowed by adding the NIR data for most of the galaxies.

Figure 7 shows a similar result to the previous figure, but for brighter galaxy samples, selected with i < 25 or equivalently with S/N values greater than 10σ in the i band. These brighter galaxies are more suitable for the accurate shape measurement as discussed in Section 4.3. We find that the photo-z accuracy is improved compared to Figure 6.

Figure 7.

Figure 7. Same as Figure 6, but for a different range of i-band magnitudes for object selection, 22.5 < i < 25.

Standard image High-resolution image

5.2. Simulating Lensing Tomography: The Impact of Photo-z Errors

We are now in a position to use the photo-z galaxy catalogs, constructed in the preceding sections, to make the trade-off analysis on the number of galaxies within a sample versus how "clean" the tomographic redshift intervals are. Then we study the impact of photo-z errors on parameter estimation assuming the hypothetical lensing tomography experiment.

Figure 8 shows the redshift distributions of each tomographic bin made by subdividing the photo-z galaxies into three intervals of photometric redshifts, zp < 0.8, and 0.8 < zp < 1.5, and zp > 1.5. The redshift intervals are chosen such that each redshift interval contains similar number densities for the original mock catalog of galaxies. Note that the redshift binning is fixed in the following analysis for simplicity, which helps us to compare the results of different galaxy catalogs. Also note that three redshift bins are a minimal choice of lensing tomography for constraining the dark energy equation of state parameter "w" to a reasonable accuracy by efficiently breaking parameter degeneracies in the lensing power spectrum (e.g., Takada & Jain 2004).

Figure 8.

Figure 8. Shaded histograms are the redshift distributions of galaxies based on their photo-z information, i.e., the sharp cuts imposed on the photo-zs: 0 < zp < 0.8, 0.8 < zp < 1.5 and zp > 1.5, indicated on the vertical dashed lines. In this case, the horizontal axis denotes photo-z values. The solid-line histograms are the underlying true distributions, therefore the horizontal axis denotes true redshifts in this case. The upper and lower panels are the results using the galaxy catalogs in Figures 6 and 7, respectively. The different panels are for different galaxy selections as in Figures 6 and 7.

Standard image High-resolution image

The upper plot in Figure 8 shows the tomographic redshift distributions constructed from different photo-z catalogs in Figure 6, where different panels correspond to the different sets of filters and the different clipping thresholds. The shaded regions in each panel show the photometric redshift distributions of galaxies, which have sharp cutoffs in the distributions due to the sharp redshift binning, while the solid line histograms show the underlying true redshift distributions. As can be found from the top-row panels, if all the galaxies are used, the resulting redshift distributions have significant overlaps between different redshift bins due to a significant contamination of photo-z outliers for any combinations of filters. On the other hand, the middle- and bottom-row panels show that, when 40% or 70% of photo-z outliers are discarded by imposing the corresponding thresholds on Var(z), respectively, the overlaps can be increasingly reduced. In particular, when the optical data are combined with the NIR data such as those expected from the VIKING survey, the resulting subsamples have almost no overlap, if about 70% of galaxies are discarded. We should note that such a clean redshift binning can greatly reduce a possible contamination of the intrinsic ellipticity alignments arising from the physically close pairs of galaxies in the similar redshifts (Takada & White 2004).

The lower plot shows similar results for higher S/N samples with 22.5 < i < 25 whose scatter plots are seen in Figure 7. Again, by using the clipping method with a wider coverage of wavelengths, a clean subsample with almost no overlap between redshift bins can be obtained.

As has been stressed, the weak lensing power spectra are, to the zeroth-order approximation, sensitive to the mean redshifts of each redshift bin, and less to the statistical errors of photo-zs or the detailed shape of redshift distribution. The level of systematic photo-z errors in each tomographic bin is quantified in Figure 9. The central values and error bars in this plot show the bias in mean redshift and the statistical error of the mean redshift, σ(〈Δz〉), where Δz is defined before (see Figure 5 caption) and the average 〈 ⋅ ⋅ ⋅ 〉 denotes the average over all the galaxies in the tomographic redshift bin.12 Note that, for illustrative purposes, the error bars are scaled for a survey area of 1 arcmin2, and therefore the corresponding errors for our fiducial survey area of 2000 deg2 are much smaller than plotted, by a factor of $\sqrt{2000\times 60^2}\simeq 2700$.

Figure 9.

Figure 9. Bias between photometric and true redshifts, 〈Δz〉, in each tomographic redshift interval. The error around each point is the statistical error of the mean redshift, σ(〈Δz〉), in each redshift bin. For illustrative purposes, the errors are for a survey area of 1 arcmin−2, and the errors are smaller by a factor $\sqrt{2000\times 60^2}\simeq 2700$ for our fiducial survey area of 2000 deg2. The panels in different columns show the results for different sets of filters as indicated on the horizontal axis. The upper-row panels are for the subsamples where galaxies with i < 25.8 are selected with only the condition χ2ν < 2. The middle- and bottom-row panels are for the subsamples where the condition 0.2 < zp < 1.5 or 22.5 < i < 25 is further imposed for the selection, respectively. The round symbols in each panel show the results for the whole galaxy sample, while the triangle and square symbols are the results for the subsamples discarding 40% and 70% of galaxies with ill-defined photo-zs, respectively, based on our clipping method. In each panel, the four symbols are for different redshift intervals: "bin1", "bin2", and "bin3" correspond to the lowest, medium, and highest redshift bins in Figure 8, and the leftmost symbols are for the case of no tomography, i.e., a single redshift interval.

Standard image High-resolution image

The middle- and bottom-row panels are the results of subsamples obtained by further imposing the condition 0.2 ⩽ zp ⩽ 1.5 or 22.5 ⩽ i ⩽ 25, respectively. It is clear that, without clipping ill-defined photo-z galaxies, the bias and errors are significant. Note that the redshift bias for the no tomography case sometimes becomes smaller than in some tomographic bins (especially the highest redshift bins), because photo-z outliers at low- and high-redshifts cancel out to some extent in the no-tomography case. As can be seen from the middle-row panels, when the restricted redshift range of 0.2 < z < 1.5 is considered, a subsample with very accurate photo-zs is obtained, because spectral features of galaxies, especially the Lyman and 4000 Å breaks, are well captured by the sets of filters in this redshift range.

We next propagate the errors of tomographic redshifts into the dark energy parameter w, expected from the lensing power-spectrum measurements, based on the Fisher matrix formalism described in Section 4.2. To do this we address the following issues.

  • 1.  
    The trade-off of dark energy constraint: the statistical accuracy of w, σ(w), versus the offset of the best-fit value from the true value, δw.
  • 2.  
    The dark energy trade-off against different subsamples of galaxies.

These can be studied using the simulated galaxy catalogs. The statistical error of w can be reduced by including a greater number of galaxies in the sample for a fixed range of working multipoles (5 ⩽ ℓ ⩽ 3000), because the shot noise is more suppressed. On the other hand, a bias in the best-fit w due to the photo-z errors can be reduced by discarding photo-z outliers, leaving a smaller number of galaxies in the subsample. Hence a trade-off point in σ(w) versus δw may be found by compromising these competing effects.

Figure 10 shows the marginalized error σ(w) and the amount of bias |δw| as a function of number densities of galaxies included in the corresponding galaxy subsamples. Again, note that we considered the lensing tomography with three tomographic bins for a sky coverage of 2000 deg2. The smaller number densities in the horizontal axis correspond to subsamples of galaxies where more galaxies with ill-defined photo-zs are discarded by imposing more stringent thresholds on Var(z), i.e., smaller threshold values of Var(z), in the clipping method. The different curves are for different combinations of filters. The error σ(w) is computed from the underlying true redshift distribution, i.e., for the case with perfect photo-zs, therefore specified by the number density in the horizontal axis. When δw ⩾ σ(w), the best-fit value of w can be different from the true value by more than the 1σ error; even if the true model has the cosmological constant (w = −1), the result of w ≠ −1 may be falsely inferred. Hence, a minimal requirement on photo-z accuracies can be assessed from the condition δw ⩽ σ(w).

Figure 10.

Figure 10. Predicted constraints on the dark energy equation of state parameter w as a function of number densities of galaxies included in the corresponding galaxy catalogs, expected for the lensing tomography experiment with survey area of 2000 deg2 in combination with the Planck CMB information. The three redshift bins are considered for each galaxy subsample as in Figure 8. The solid curve in each panel shows the marginalized error σ(w) assuming no photo-z errors. The other curves show the offset bias of the best-fit w from the true value (w = −1), computed by using the Fisher matrix formalism (see Equation (11)): the dotted, dashed, long-dashed, and dot-dashed curves are for combinations of filters, grizy, ugrizy, grizyJHK, and ugrizyJHK, respectively. The left, middle, and right panels are the results for different object selections as in Figure 8.

Standard image High-resolution image

First, the plot shows that, as the subsample is restricted to galaxies with more accurate photo-zs, i.e., the smaller number densities, the bias in w is reduced to some extent. On the other hand, the error σ(w) is only slightly degraded because the constraint comes mainly from the sample variance limited regime for a given range of working multipoles (5 ⩽ l ⩽ 3000).

It is also shown that the bias can be reduced by adding the NIR and/or u-band data. However, the broadband data alone may not be sufficient to reduce the bias. The optimal range of redshifts needs to be considered, and a brighter subsample whose galaxies have higher S/N values in each filter is preferred to sufficiently reduce the bias, as implied from the middle and right panels. Depending on the available set of filters, the compromise point can be obtained around the number densities $\bar{n}_g=[10,30]$ arcmin−2, i.e., more than 60% of ill-defined photo-z galaxies need to be discarded. It is also worth noting that combining the lensing constraints with other dark energy probes such as the baryon acoustic oscillation experiment may allow us to further calibrate photo-z errors by breaking parameter degeneracies.

We have so far paid special attention to how to eliminate photo-z outliers in order to obtain a subsample of galaxies suitable for tomographic lensing measurements. However, due to the limitation of photo-z accuracies, there may remain a residual bias in the tomographic redshift bins, even if photo-z outliers are completely removed. To study this, Figure 11 shows the results obtained by artificially discarding photo-z outliers according to the clipping criteria

Equation (12)

with t = 0.1, 0.3 and 0.5, respectively. The figure shows that a bias in w cannot be fully eliminated even if the outliers are completely discarded. This implies that there remains a residual bias in the mean redshift for each tomographic bin due to asymmetric photo-z errors around zp = zs, therefore the residual biases would need to be calibrated, e.g., by using a spectroscopic training subsample (e.g., Ma & Bernstein 2008).

Figure 11.

Figure 11. As in Figure 10, but with results obtained by artificially discarding photo-z outliers given as log(1 + zp)/(1 + zs) ⩾ ±0.1, 0.3 and 0.5 (the three dotted curves from top to bottom, respectively). The filter combination grizy is considered in this plot. For comparison the top dashed curve in each panel is the same as the dotted curve in Figure 10.

Standard image High-resolution image

5.3. Angular Cross-correlations of Galaxies Between Different Photo-z Bins

An alternative way to identify photo-z outliers is using angular cross-correlations of galaxies between different photo-z bins (Newman 2008; Erben et al. 2009; Zhang et al. 2009; Schulz 2009). As implied in Figure 8, photo-z errors cause overlaps of galaxies between different redshift bins. Therefore, photo-z errors may cause non-vanishing cross-correlations of galaxies between different photo-z bins, if the galaxies indeed have similar true redshifts and therefore are physically correlated with each other. In other words, the cross-correlations can, albeit statistical, be used to monitor the contamination of photo-z outliers. In this subsection, we use our simulated photo-z catalogs to estimate the expected S/N values for measuring the cross-correlations assuming the same survey parameters we have considered.

Assuming the Limber approximation, the angular power spectra of galaxies in the ith and jth photo-z bins are given as

Equation (13)

where ni(z) is the underlying true redshift distribution for the ith photo-z bin and $\bar{n}_i$ is its mean number density. In the following, we consider a sharp redshift binning in photo-z space, however, the underlying true distributions generally have overlaps due to photo-z errors. We simply assume that the galaxy distribution in the ith bin is related to the matter distribution via constant bias parameter bi, which is taken to bi = 1 for all the photo-z bins for simplicity. To make this assumption reasonable, we restrict the following analysis to a range of low multipoles l < 500. Note that the cross power spectra are not affected by shot noise.

The strength of cross-correlations or redshift leakages can be quantified by the cross-correlation coefficients at each multipole:

Equation (14)

The coefficient μij ≃ 1 implies significant cross-correlations between the ith and jth bins compared to their auto-spectra, while μij = 0 means no cross-correlation or no leakage of photo-z outliers into different bins. The total S/N values expected for measuring the cross-correlations can be estimated as

Equation (15)

Here we simply assume the Gaussian covariances to model the statistical errors in measuring cross power spectra from a survey. Note that the error covariance includes the shot noise contamination via the auto spectra Cii and Cjj. For the minimum and maximum multipoles used in the summation, we adopt ℓmin = 5 and ℓmax = 500, respectively.

Figure 12 studies the angular cross-correlations for our fiducial survey parameters with 2000 deg2 coverage. Here we consider the photo-z galaxy catalog selected with ugrizy and χ2ν < 2, and adopt 35 redshift bins over 0 < zp < 3.5 with the bin width dz = 0.1 corresponding to a typical photo-z error on individual galaxy basis. The upper-left triangle in each panel shows the correlation coefficients of cross power spectra between the two different photo-z bins, μij. The coefficients have large values around the diagonal terms, i.e., zpizpj, because the photo-z errors cause significant overlaps between neighboring redshift bins. Compared with the results in Figure 6, one can find that photo-z outliers cause some isolated regions with significant correlation coefficients.

Figure 12.

Figure 12. Angular cross-correlations between galaxies in two photo-z bins, denoted on the horizontal and vertical axes, for our fiducial survey with 2000 deg2, adopting 35 redshift bins over 0 < zp < 3.5 with the bin width dz = 0.1. As in Figure 6, the left panel shows the result for galaxy catalog selected with ugrizy and χ2ν < 2, while the middle- and right-panels show the results for the catalogs where 40% and 70% of galaxies with ill-defined photo-zs are discarded. The upper-left off-diagonal components in each panel show the correlation coefficients μij (see Equation (14)) at multipole ℓ = 100. Compared with Figure 6, one can find photo-z outliers cause "island" regions with high coefficients μij ≃ 1. The lower-right components show the cumulative S/N values expected in measuring the cross-power spectrum over a range of multipoles 5 ⩽ ℓ ⩽ 500. Again the cross-correlations between different bins, caused by photo-z outliers, show high S/N values greater than 10.

Standard image High-resolution image

The lower-right triangle shows the expected S/N, for measuring cross-correlations. Sufficiently high S/N values, say greater than 10, can be expected for redshift bins that have high correlation coefficients. Thus, monitoring the cross-correlations between different photo-z bins may allow us to further identify photo-z outliers, in a statistical sense. However, the genuine ability of the cross-correlation method to eliminate outliers or calibrate photo-z errors needs to be more carefully studied.

Finally, we remark on a more quantitative work done by Schulz (2009), which studied, based on mock simulations, the use of cross-correlations of photometric galaxies with an overlapping spectroscopic sample to calibrate the redshift distribution of the photometric galaxies without using the photo-z information. While using cross-correlations to identify photo-z outliers is not the main purpose of this paper, the promising result shown is that the redshift distribution can be reconstructed well by using a sufficiently large spectroscopic sample. However, one of the limiting factors found is that the reconstruction requires a sufficiently fine binning of spectroscopic redshifts, which tends to make the cross-correlation measurements noisy. Therefore, it would be interesting to study how the method can be further refined by combining the cross-correlation method and the photo-z information, and the combined method could relax requirements on the size of the spectroscopic calibration sample. We also note that, as pointed out in Bernstein & Huterer (2010), the cross-correlation method is affected by the lensing magnification bias, which may cause an apparent correlation between foreground and background galaxies even if there are no photo-z errors to cause redshift overlaps. This effect also needs to be included, and mock simulations would be useful for such a study of the cross-correlation method.

6. SUMMARY AND DISCUSSION

In this paper, we have studied how photo-z errors available from broadband multi-color data affect cosmological parameter estimation obtained from tomographic lensing experiments. To do this, we made a simulated mock galaxy catalog with photo-z information constructed from the COSMOS catalog. Since the photo-z errors are sensitive to survey parameters such as available filters, depths, and so on, we considered in this paper survey parameters resembling the planned Subaru Hyper Suprime-Cam survey, which is characterized by the optical multi-passband data (grizy) and the depth i ≲ 26. We also studied how the photo-z accuracy can be improved by combining the optical data with the u-band data expected from a CFHT-type telescope and the NIR (JHK) data from the VIKING-type survey. However, the method developed in this paper can be readily extended to other weak lensing surveys.

We particularly paid attention to how to construct a galaxy subsample suitable for weak lensing tomography. We showed that photo-z outliers can be efficiently identified by monitoring the posterior likelihood of redshift estimation for each galaxy: more exactly, the width of likelihood function defined by the second moment around the best-fit redshift parameter (see Equation (9)) was used as an indicator of the photo-z accuracy on an individual galaxy basis. It was also shown that the photo-z outliers can be more efficiently removed by restricting the ranges of working magnitudes and/or redshifts, and adding the u- and NIR bands (see Figures 69).

Using the Fisher matrix formalism, we estimated how the photo-z errors in a defined galaxy catalog cause biases in cosmological parameters, especially the dark energy equation of state parameter w. It was shown how the parameter biases can be reduced by discarding galaxies with ill-defined photo-z estimates. However, by discarding more galaxies, the statistical accuracies of parameters are degraded due to the increased shot noise contamination. We found that the trade-off point, where the parameter bias becomes similar or smaller than the marginalized statistical error, can be achieved if a large fraction of ill-defined photo-z galaxies (∼70%) are discarded. Combined with the u- and NIR-band data sets, we can reduce the number of galaxies to be discarded (Figure 10).

However, as demonstrated in Figure 11, even if photo-z outliers are completely eliminated, there may remain a non-negligible, residual bias in the mean redshift of each tomographic bin because the scatters around the relation between photometric and true redshifts, zp = zs, are not necessarily symmetric and therefore not perfectly canceled even after averaging galaxies in each redshift bin. Therefore, a careful calibration of the residual photo-z errors will inevitably be needed for any future surveys (Hearin et al. 2010).

A powerful method for photo-z calibration is using a training spectroscopic subsample. Naively, if a fair, representative spectroscopic subsample of imaging galaxies used in the lensing analysis is available, it allows a calibration of photo-z errors. However, the size of such a spectroscopic sample required for achieving the meaningful dark energy constraint becomes very large; for a lensing survey with sky coverage of more than 1000 deg2, containing more than 108 imaging galaxies, a subsample with more than 106 spectroscopic redshifts is required (Huterer et al. 2006; Ma et al. 2006). Note that the current largest redshift sample is given by the COSMOS project containing 104 redshifts. Thus a survey collecting 106 spectra, which is required for our sample fully calibrated, is observationally very expensive and almost infeasible, especially if redshifts of faint galaxies are needed (but see Bernstein & Huterer 2010, for relaxing this requirement).

On the other hand, there is a new method recently proposed (Mandelbaum et al. 2008; Lima et al. 2008; Cunha et al. 2009), using a spectroscopic subsample of smaller size, which is not necessarily a fair, representative subsample of imaging galaxies. First, spectroscopic galaxies are compared to imaging galaxies in multi-dimensional color space, rather than the photo-z space. Second, the ratio between number densities of spectroscopic and imaging galaxies is computed at each point in multi-color space. Third, the ratio is multiplied to the redshift distribution of the spectroscopic subsample to infer the underlying redshift distribution of imaging galaxies. Thus, this weighting method may allow photo-z calibration using a spectroscopic subsample of smaller size. However, there are still open issues in this method that need to be carefully investigated. For example, it is unclear how the calibration degrades if the spectroscopic subsample has significant sample variance fluctuations in the redshift distribution, e.g., due to clustering contamination at particular redshifts due to a finite area coverage.

Another calibration method is using cross-correlations of galaxies in photo-z bins, as partially studied in Figure 12. Again, the non-vanishing cross-correlations only arise when the photo-z errors cause leakages into different bins of true redshifts. Or spectroscopic galaxies in the same survey region, if available, can also be used to cross-correlate with imaging galaxies in order to calibrate the photo-z errors over a range of redshifts covered by the spectroscopic sample (Newman 2008; Schulz 2009). However, spectroscopic galaxies may only be correlated with particular types of galaxies, therefore, this method may have a limitation. Hence, it would be interesting to explore how to calibrate the redshift distribution of imaging galaxies down to the required accuracy level by combining various methods, including the method developed in this paper and the methods based on spectroscopic subsample or/and cross-correlation measurements.

Finally, we comment on another important contaminating effect, the intrinsic alignment in galaxy shapes (e.g., Hirata & Seljak 2004; Mandelbaum et al. 2006, 2009). As studied in detail in King & Schneider (2003) (also see Heymans & Heavens 2003; Takada & White 2004; Bridle & King 2007), accurate photo-z information is needed to calibrate and/or correct for the intrinsic alignment contamination in weak lensing tomography. A subsample with reliable photo-zs, constructed based on the method in this paper, may be also useful for this purpose.

We thank Rachel Mandelbaum and the members of the Hyper Suprime Cam weak lensing working group for useful discussions and comments. We acknowledge the use of the publicly available codes HyperZ, LePhare and CAMB. This work is supported in part by the Japan Society for Promotion of Science (JSPS) Core-to-Core Program "International Research Network for Dark Energy," by Grant-in-Aid for Scientific Research from the JSPS Promotion of Science (18072001,21740202), by Grant-in-Aid for Scientific Research on Priority Areas No. 467 "Probing the Dark Energy through an Extremely Wide & Deep Survey with Subaru Telescope," and by World Premier International Research Center Initiative (WPI Initiative), MEXT, Japan.

Footnotes

Please wait… references are loading.
10.1088/0004-637X/718/2/1252