Measurement and Calibration of Noise Bias in Weak Lensing Galaxy Shape Estimation
Abstract
Weak gravitational lensing has the potential to constrain cosmological parameters to high precision. However, as shown by the Shear TEsting Programmes (STEP) and GRavitational lEnsing Accuracy Testing (GREAT) Challenges, measuring galaxy shears is a nontrivial task: various methods introduce different systematic biases which have to be accounted for. We investigate how pixel noise on the image affects the bias on shear estimates from a MaximumLikelihood forward modelfitting approach using a sum of coelliptical Sérsic profiles, in complement to the theoretical approach of an an associated paper. We evaluate the bias using a simple but realistic galaxy model and find that the effects of noise alone can cause biases of order 110% on measured shears, which is significant for current and future lensing surveys. We evaluate a simulationbased calibration method to create a bias model as a function of galaxy properties and observing conditions. This model is then used to correct the simulated measurements. We demonstrate that this method can effectively reduce noise bias so that shear measurement reaches the level of accuracy required for estimating cosmic shear in upcoming lensing surveys.
keywords:
methods: statistical, methods: data analysis, techniques: image processing, cosmology: observations, gravitational lensing: weak1 Introduction
Weak gravitational lensing is an important cosmological probe, which has the greatest potential to discover the cause of the accelerated cosmic expansion (e.g. Peacock & Schneider, 2006; Albrecht et al., 2006, 2009). In the standard cosmological model dark energy affects both the expansion history of the universe and the rate of gravitational collapse of large scale structure. The rate of this collapse can be studied by observing the spatial distribution of dark matter at different times in the history of the universe. Gravitational lensing occurs when the path of light from distant galaxies is perturbed while passing through intervening matter. This phenomenon causes the images of galaxies to be distorted. The primary observable distortion is called gravitational shear, and typically causes the galaxy images to be stretched by a few percent. The scale of this effect is related to the amount of matter between the source and the observer, and to their relative geometry. Thus, cosmic shear can provide a valuable dataset for testing cosmology models (Kaiser, 1992; Hu, 1999).
Several upcoming imaging surveys plan to observe cosmic shear, including the KIloDegree Survey: KIDS, the Dark Energy Survey (DES)^{1}^{1}1http://www.darkenergysurvey.org, the Hyper SuprimeCam (HSC) survey^{2}^{2}2http://www.naoj.org/Projects/HSC/HSCProject.html the Large Synoptic Survey Telescope (LSST)^{3}^{3}3http://www.lsst.org, Euclid^{4}^{4}4http://sci.esa.int/euclid and WFIRST ^{5}^{5}5http://exep.jpl.nasa.gov/programElements/wfirst/. For these surveys, it is crucial that the systematics introduced by data analysis pipelines are understood and accounted for. The most significant systematic errors are introduced by (i) the measurements of the distance to the observed galaxies using photometric redshifts, (ii) intrinsic alignments of galaxies, (iii) modelling of the clustering of matter on the small scales in the presence of baryons, (iv) measurement of lensed galaxy shapes from imperfect images. In this paper, we focus on the latter.
To evaluate the performance of shear measurement methods, simulated datasets have been created and released in form of blind challenges. The Shear TEsting Programme 1 (STEP1: Heymans et al., 2006), was the first in this series, followed by STEP2 (Massey et al., 2007). Both challenges aimed to test endtoend shear pipelines and simulated galaxy images containing many physical effects including those stemming fom telescope optics and atmospheric turbulence. A modified approach was taken in the GREAT08 (Bridle et al., 2009, 2010) and GREAT10 (Kitching et al., 2010) challenges, which sought to isolate independent parts of the data analysis process. They explored the impact of different true galaxy and image parameters on the shear measurement, by varying them one at a time among various simulation realisations. These parameters included signal to noise ratio, galaxy size, galaxy model, Point Spread Function (PSF) characteristics and others. The results showed that the shear measurement problem is intricate and complex. Existing methods proved to be sufficient for current surveys, but there is room for improvement for the future.
For a well resolved, blurfree, noisefree image, the galaxy ellipticity can be calculated by taking the moments of the image (Bonnet & Mellier, 1995). However, a typical galaxy image used in weak lensing is highly affected by the observation process. The image degrading effects are (i) convolution with the PSF of the telescope, (ii) pixelisation of the image by the light buckets of the detector, (iii) pixel noise on the image due to the finite number of photons from the source and atmosphere (roughly Poisson) and detector noise (often assumed Gaussian), and (iv) galaxy colours being different from the stars used to map the PSF (Cypriano et al., 2010) and a function of position on the galaxy (Voigt et al., 2011).
Momentbased methods such as KSB (Kaiser, Squires, & Broadhurst, 1995), and most recently DEIMOS (Melchior et al., 2011), and FDNT (Bernstein, 2010) measure the quadrupole moment of the image, using a masking function (often Gaussian) to counter the effects of noise, and then correct for the PSF. Decomposition methods, e.g. shapelets or a GaussLaguerre expansion, (Refregier, 2003; Bernstein & Jarvis, 2002; Nakajima & Bernstein, 2007) use an orthogonal image basis set which can be easily convolved with the PSF. Noise is accounted for by regularisation of the coefficients matrix and truncating the basis set to a finite number of elements. Simple model fitting methods based on sums of Gaussians (Kuijken, 1999; Bridle et al., 2002), Sérsic profiles (Miller et al., 2007; Peng et al., 2002), create an ellipticity estimator from a likelihood function. Stacking methods (Lewis, 2009; Hosseini & Bethge, 2009), which have been demonstrated for constant shear fields, average a function of the image pixels to increase the signaltonoise ratio and then deconvolve the PSF.
All these methods introduce some level of systematic error. Bias on the shear can result from inaccurate centroiding of the galaxy, for example see Lewis (2009). Another source, model bias, results from using a galaxy model which does not span the true range of galaxy shapes. Voigt & Bridle (2010) quantified the shear measurement bias from using an elliptical isophote galaxy model on a galaxy with a a more complicated morphological structure in the presence of a PSF (see Lewis, 2009, for a general proof). Melchior et al. (2010) investigated the effectiveness of shapelets at representing more realistic galaxies. Viola, Melchior, & Bartelmann (2011) and Bartelmann et al. (2011) quantified biases on the KSB method and investigated possibilities to correct for it.
Pixel noise bias arises from the fact that ellipticity is not a linear function of pixel intensities in the presence of noise and PSF. Hirata et al. (2004) showed its effects on second order moment measurements from convolved Gaussian galaxy images. The bias due to pixel noise on parameters fitted using Maximum Likelihood Estimators (MLEs) for elliptical shapes was demonstrated by (Refregier et al., 2012, hereafter R12), for the case when the noise is Gaussian and the correct galaxy model is known. It presented a general expression for the dependency of the bias on the signal to noise ratio. It also demonstrated the consistency of analytical and simulated results for the bias on the width for a one parameter Gaussian galaxy model.
In this paper, we show the significance of this bias for weak lensing measurements using more realistic galaxy images. We find that the bias as a function of true input parameters is consistent with the theoretical framework derived in R12. Furthermore, we present a method to effectively remove this noise bias for realistic galaxy images. Using the Im3shape shear measurement framework and code (Zuntz et al. in prep), we use a forward model fitting, Maximum Likelihood (ML) approach for parameter estimation. We create a model of the bias as a function of galaxy and PSF parameters by determining their bias from various simulations that sample parameter space. We apply this model to the noisy MLEs and demonstrate that this procedure successfully removes the noise bias to the accuracy required by upcoming galaxy surveys. By performing a calibration that depends on the specific statistics of every recorded galaxy, this method is independent of the overall galaxy and PSF parameter distributions.
This paper is organised as follows. Section 2 summarises the equations governing the cosmic shear measurement problem and describes methods to quantify the biases on estimated parameters. We also discuss the requirements on those biases for lensing surveys, followed by a summary of the cause of bias arising from image noise. In Section 3, we show the results of bias measurements. A method for correcting the noise bias based on numerical simulations is presented in Section 4. We conclude and briefly discuss this approach and alternatives in Section 5. In the Appendices, we detail the method used for measuring the multiplicative and additive bias and tabulate our results and fit parameters.
2 Shear measurement biases in model fitting
We first discuss the parametrisation of shear measurement biases, and present an overview of the model fitting approach. We summarise recent work on noise bias in a simple case, and then describe our shear measurement procedure and simulation parameters.
2.1 Quantifying systematic biases in shear estimation
In weak gravitational lensing the galaxy image is distorted by a Jacobian matrix (see Bartelmann & Schneider, 2001; Bernstein & Jarvis, 2002; Hoekstra & Jain, 2008, for reviews)
M  (1) 
where is the convergence and is the complex gravitational shear.
For a galaxy with elliptical isophotes we can define the complex ellipticity as
(2) 
where is the galaxy minor to major axis ratio and is the orientation of the major axis anticlockwise from the positive axis. The postshear lensed ellipticity is related to the intrinsic ellipticity by
(3) 
for (Seitz & Schneider, 1997), where is the reduced shear. In the weak lensing regime , and . We assume throughout this paper.
Galaxies have intrinsic ellipticities which are typically an order of magnitude larger than the shear. For a constant shear and an infinite number of randomly orientated galaxies the mean lensed ellipticity is equal to the shear, to third order in the shear. In practice is averaged over a finite number of galaxies and the error on the shear estimate (referred to as ‘shape noise’) depends on the distribution of galaxy intrinsic ellipticities and the number of galaxies analysed.
The accuracy of a shape measurement method can be tested on a finite number of images in the absence of shape noise by performing a ‘ringtest’ (Nakajima & Bernstein, 2007). In the ringtest, the shear estimate is obtained by averaging the measured estimates from a finite number of instances of a galaxy rotated through angles distributed uniformly from 0 to 180 degrees. If is the measured lensed ellipticity, then the shear estimate is and the bias on the shear is
(4) 
where is the true shear. This bias on the shear is usually quantified in terms of multiplicative and additive errors and for both shear components such that
(5) 
assuming does not depend on , and vice versa (Heymans et al., 2006). The requirements on the level of systematic errors for current and future galaxy surveys are expressed in terms of in Amara & Réfrégier (2008) and are summarised in Table 1.
Survey  

Current  0.02  0.001 
Upcoming future  0.004  0.0006 
Far future  0.001  0.0003 
2.2 Galaxy shear from model fitting
A simple approach to measuring ellipticity is to use a parametric model. For galaxy fitting, models such as sums of Gaussians (Kuijken, 1999; Bridle et al., 2002), Sérsic profiles (Miller et al., 2007), and Gauss  Laguerre polynomials (shapelets) (Refregier, 2003; Bernstein & Jarvis, 2002; Nakajima & Bernstein, 2007) were used.
In general, model fitting methods are based on a likelihood function. Under uncorrelated Gaussian noise, this function is
(6)  
(7) 
where is a set of variable model parameters, is the observed galaxy image, is a model function, is the model image created with parameters , and the number of pixels in images and . These equations assume a known noise level on each pixel , which is often assumed constant . Sometimes a prior on the parameters is used to create a posterior function.
Usually an ellipticity estimator is derived from this likelihood function; so far maximum likelihood estimators (MLE; e.g. Im3shape, Shapelets), mean likelihood (Im2shape) and mean posterior (e.g. LensFit) have been used. We use the MLE in this paper.
Parametric models based on elliptical profiles typically use the following galaxy parameters: centroid, ellipticity, size, flux and a galaxy light profile parameter. Often a combination of two Sérsic profiles (Sérsic, 1963) is used to represent the galaxy bulge and disc components, with identical centroids and ellipticities.
The model also contains information about other effects influencing the creation of the image. These image parameters are not often a subject of optimisation: noise level , PSF kernel and the pixel integration kernel. SNR is often defined as and this definition will be used throughout this paper. This definition of SNR is the same as in GREAT08, but different to GREAT10: SNR=20 here corresponds to SNR=10 in GREAT10.
2.3 Noise bias
The bias of parameter estimation for MLEs in the context of galaxy fitting was first studied by R12. The authors derived general expressions for the covariance and bias of the MLE of a 2D Gaussian galaxy model convolved with a Gaussian PSF. For a nonlinear model, in the Taylor expansion of (in equation 7) the terms in even power of the noise standard deviation are found to contribute to the the estimator bias. The analytical results were confirmed by simulations using a single parameter toy model. It was also noted that the bias is sensitive to the chosen parametrisation, especially if the parameter space is bounded.
2.4 Im3shape pipeline
The analyses in this paper were performed using the Im3shape shear measurement framework and code. Here we outline the system, which will be described in more detail in Zuntz et al. (in prep).
Each simulated galaxy is fitted with a model containing two cocentric, coelliptical Sérsic components, one de Vaucouleurs bulge (Sérsic index=4) and one exponential disc (Sérsic index=1). The amplitudes of the bulge and disc were free but the ratio of the half light radii was fixed to 1.0. They are convolved with the true Moffat PSF model to produce a model image. Since there is high resolution structure in de Vaucouleurs bulges we made the models at a higher resolution than the final images. We use a resolution three times higher in the outer regions and 45 times higher in the central 33 pixels of the final image. Since very highly elliptical images are hard to simulate accurately we restrict the allowed space of models to those with .
We find the peak of the likelihood using the LevenbergMarquadt method (Lourakis, 2004) using numerical gradients of each image pixel in the likelihood. We tested the performance of the optimiser for variety of input galaxy and image parameters to ensure that the optimiser always converges to a local minimum by evaluating the likelihood in the neighbourhood of the found best fit point for multiple test noise realisations. In this nonlinear optimisation problem multiple likelihood modes are possible. However, for our simple model, we found that usually there was only one local minimum (i.e. the bias results did not depend on the starting parameters given to the minimiser). We will discuss this further in Zuntz et al. (in prep).
2.5 Simulation parameters
The galaxies used for this study were created using a two component model: a Sérsic profile of index 4 for the bulge and a Sérsic profile of index 1 for the disc. Both components have the same centroid, ellipticity and scale radius. The galaxy model used for fitting encompassed the one used to create the true galaxy image; therefore we are isolating the noise bias effect from the model bias effect in this study. The PSF was modelled as a Moffat profile with a FWHM of 2.85 pixels and Moffat parameter of 3 (see, e.g., Bridle et al., 2010 for a definition of the Moffat and the notation adopted here). We use the same PSF in the fit as in the simulated images to prevent any bias effects caused by incorrect modelling of the PSF. We fit a total of 7 parameters: galaxy centroid , ; galaxy ellipticity , ; galaxy size ; bulge flux ; and disc flux .
We expect variation in the following physical parameters to have the most significant influence on the noise bias, and therefore the bias will be evaluated as a function of:

Signaltonoise ratio (SNR),

Intrinsic galaxy ellipticity,

PSF ellipticity,

Size of the galaxy compared to the size of the PSF, expressed as , which is the ratio of the FWHM of the convolved observed object and the FWHM of the PSF. Note that this is not the same as the parameter we fit. This is because the noise bias strongly depends on the PSF parameters, and the galaxy radius parameter alone would not fully capture this dependence.

Light profile of the galaxy, described by the flux ratio , which is the flux of the bulge component divided by total flux of the galaxy. For a purely bulge galaxy, and for a disc galaxy . In our model, we allow the amplitudes of the components to be negative, so the flux ratio can take both values and . Therefore, for , the galaxy has a negative disc component, which results in the galaxy being less ‘peaky’ than a galaxy with , and the galaxy model image may even be more similar to a galaxy with . An alternative might be to use a more flexible radial profile, for example a larger number of Sersic components, or allowing the Sersic indices to be free parameters in the fit.
These parameters will be used to create a model for the noise bias. We expect these physical parameters to best encapsulate the main dependencies of the bias, although we are aware that there may exist other statistics that better capture bias variation.
We do not show the effect of the galaxy centroid on the bias, as no significant dependence on this parameter was found in our experiments. We measured the noise bias for a simulated galaxy image with identical model parameters, once located in the middle of a pixel and once on the edge of a pixel. We found no difference in ellipticity bias to our desired precision. In the simulations, the galaxy centroid is randomised.
The values for the simulation parameters are summarised in Table 2. Their choice is based on galaxies used in GREAT08. We define a fiducial parameter set and make departures D1 to D4 in one parameter at a time using the values given in the Table. We restrict our analysis to SNR values of 20 and greater because we find convergence of the minimiser does not pass our quality tests at lower values. However, the SNR values of most interest for upcoming surveys are low, and therefore we use the lowest SNR we can use with confidence by default for all simulations. We investigate a SNR value of 200 which matches that of the GREAT08 LowNoise simulation set, plus an intermediate value of 40 which is also used in GREAT08. By default, we use a galaxy with half the flux in a bulge and half in a disc. The two perturbations we consider are to pure bulge and pure disc. Finally we explore the dependence of noise bias on the PSF ellipticity, spanning the range from zero to 10%.
For the minimisation parameters used in this paper, Im3shape takes around one second per galaxy, which is typical for model fitting methods. To obtain our desired accuracy on noise bias we needed to simulate 2.5 million galaxies for each set of simulation parameters shown in Table 2. Therefore the computations shown in this paper took of order 1 year of CPU time. This computational burden limited the number of points we could show on the figures to 3 per varied parameter.
Parameter  Fiducial  Deviations  

D1  SNR  20  40, 200 
D2  1.62  1.41, 1.82  
D3  0.5  0 , 1  
D4  0.05  0, 0.1 
3 Evaluation of the noise bias effect
In this section we evaluate the noise bias as a function of galaxy and image parameters. We define the noise bias on an ellipticity measurement as
(8) 
We calculate the bias using the following procedure: we create a galaxy image with some true ellipticity, add a noise map and measure the MLE of the ellipticity. Then, we repeat this procedure with different noise realisations which results in a distribution of noisy MLE ellipticities. The difference between the mean of this distribution and the true galaxy ellipticity is the bias on ellipticity.
The histograms of ML estimates for 300 thousand noise realisations are plotted in Figure 1 to illustrate the nature of the noise bias. The galaxy and image had default parameters described in Section 2.5 and intrinsic ellipticities of , and , in the left and right upper and middle panels, respectively. The spread of values comes from the Gaussian noise added to the images to approximate the finite number of photons arriving on the detector. As discussed in Section 2.5, we assume a default SNR value of 20.
Two effects contribute significantly to the bias on ellipticity for the left hand panels in which the true ellipticity is , . The ellipticity distribution is slightly skewed away from being a Gaussian. There is a larger tail to high ellipticity values than to negative ellipticity values. The peak is shifted to lower ellipticities, which is also visible in the twodimensional histogram in the middleleft panel of Figure 1. Overall there is a net positive bias to larger ellipticity values, as shown by the vertical solid line which is to be compared with the vertical dashed line placed at the true value. Although this net positive bias is hard to see by eye, it is significant at the level of shear measurement accuracy required from future observations. This is discussed in more detail in the following sections.
Furthermore, the ellipticity parameter space is theoretically bounded at an ellipticity modulus of unity. This is exacerbated by any realistic measurement method which will break down just short of unity. The consequence of this effect is visible for a galaxy with true intrinsic ellipticity of , shown in the upperright and middleright panels. For this example, it counteracts the noise bias effect by reducing the amount of overestimation. For more noisy or smaller galaxies, which will have larger variance in the ellipticity MLEs, this effect will be stronger and may even cause the ellipticity to be underestimated, see 4 for an illustration of this.
Distributions of other fitted parameters are also biased and skewed, as discussed in R12. We show histograms of fitted galaxy size and galaxy light profile in the two bottom panels of Figure 1. The convolved galaxy to PSF size ratio peaks at lower values than the ones that are used in the input simulation but there is a tail to larger values. Overall the mean is biased low by around 10%. The flux ratio is skewed to larger values and overestimated by around 10%. Moreover, this distribution has two modes; one close to the truth, and one close to .
The shear measurement biases thus depend on the galaxy intrinsic ellipticity in a nontrivial way. However, this can be converted into the shear measurement bias for a population of galaxies at different orientations using the ring test. This is discussed in greater detail in Appendix A. We effectively perform a ringtest to obtain the shear calibration metrics described in Section 2.1.
For the default galaxy and image parameters we find a multiplicative shear measurement bias of a few per cent. For an intrinsic galaxy ellipticity of 0.3 we find which is an order of magnitude larger than the requirement for upcoming surveys. The additive shear measurement bias is around which is larger than the requirement for upcoming surveys, and around an order of magnitude larger than the requirement for farfuture surveys.
The multiplicative and additive shear measurement bias is shown as a function of galaxy and image parameters in Figure 2. Data points for those plots are listed in Table 3, and the functions we fitted are given in equations in Table 4, both in Appendix B.
The upper panels show the dependence on the image SNR. This demonstrates clearly that the bias we observe is truly a noise bias, since the biases tend to zero at high SNR. Indeed for a SNR of 200 the biases are well below the requirement even for farfuture surveys. The dependence on SNR is well described by a quadratic function, shown as a fitted line, as discussed anecdotally (Bernstein, priv. com.) and as expected from the derivations in Hirata et al. (2004) and R12.
The upper middle panels of Figure 2 show the dependence on the ratio of convolved galaxy to PSF size, as defined in Section 2.5. The derivations in R12 showed that for Gaussian functions, the bias on the size parameter increases with the size of the PSF (Eq. 17). In our simulations the bias on the shear has a similar trend, as we observe an increased bias with decreased galaxy size relative to the PSF. The bias is reduced by a factor of almost three when the convolved galaxy to PSF size increases from 1.41 to the default value of 1.62. We modelled this dependence by using inverse power expansion with terms in and .
The lower middle panels of Figure 2 show the bias as a function of the flux ratio. Both multiplicative and additive bias change signs when the galaxy light profile changes from bulge to disc. Bulges are underestimated and discs are overestimated. This peculiar behaviour of the bias demonstrates the complexity of this problem. We use a straight line to fit the points, and this works reasonably well.
The dependence on PSF ellipticity is shown in the bottom panels of Figure 2. As expected, e.g. from PaulinHenriksson et al. (2008), the dependence of the additive shear measurement bias is much greater than that of the multiplicative bias. The additive shear bias dependence is very close to linear (shown by the fitted lines). Rotational symmetries in the problem, also visible on Figure 4 indicate that there is very little dependence on the pixel orientation with respect to the PSF and galaxy. This essentially means that we can use results for the PSF aligned with the x  axis for any other PSF angle, by rotating the coordinate system.
4 Noise bias calibration
In this section we investigate how the bias measurements can be used to calibrate out the noise bias effect. First, we create a model of the bias on the ellipticity measurement as a function of four measured parameters: , , , , similar to Figure 4 (note that we do not directly use the functions presented on Figure 2, as they show a bias on shear in the form of and , instead of the bias on the ellipticity). We apply an additive correction predicted by our model directly to the measured ellipticity values. Finally we verify the accuracy of this procedure by testing it using a ring test consisting of 10 million noisy fiducial galaxies.
This approach will not provide a perfect calibration, as our model of biases is calculated for a set of galaxies with particular true galaxy and image properties. In practice we will only know the measured galaxy parameters, which are noisy, as illustrated in Figure 1. Therefore, if we read off the bias values from the measurements of the noisy measured galaxy parameters they will not be exactly the correct bias values for that galaxy. In this section, we investigate the scale of this effect.
The estimator of the ellipticity is biased, so that , where is the unbiased estimator. By definition averaged over noise realisations is equal to the true ellipticity, so that .
We estimate the true shear with an estimator in a ring test. We write the following equations to show mathematically what is happening when we do the correction on the individual galaxy ellipticities.
(9)  
(10) 
where is the lensed ellipticity, and subscripts and denote averages over noise realisations and around the ring respectively. Eq. 10 shows that the bias of the shear estimator will be equal to the bias on the lensed ellipticity , averaged over noise realisations and the ring. This is the bias we aim to calibrate.
We create a correction model which describes as a function of four galaxy parameters, i.e.
(11) 
Then we apply this correction to the noisy estimates , creating an estimator of the correction and we update our ellipticity estimate to be
(12) 
Using this correction in the ring test implies
(13) 
Because we are applying the correction to the noisy maximum likelihood estimates, the correction itself can be biased under noise, so that . Including this ‘bias on the correction’, we expect the the final bias on the shear after applying our calibration procedure to be
(14)  
(15)  
(16) 
Testing this procedure will include finding out how big the term in Eq. 15 is.
In practice we create the model of the bias (Eq. 11) using a learning algorithm based on Radial Basis Functions (RBF) Interpolation ^{6}^{6}6http://www.mathworks.com/matlabcentral/fileexchange/10056, trained on all our simulated results. Then we use Eq. 12 to correct the ellipticity estimates.
The calibration procedure was tested by generating nearly ten million galaxy images using the default galaxy parameters. The ring test was performed as follows: a set of galaxies was simulated with the galaxy intrinsic ellipticity angles equally spaced at 16 values from 0 to , (i) with no shear applied (ii) with a shear of applied. In total 300,000 galaxies were simulated at each angle in the ring, for each shear value. To compute the uncalibrated shear measurement bias, the measured ellipticity was averaged over all galaxies with a given shear to obtain a shear estimate for that population. Then a straight line was fitted to the resulting shear estimates as a function of input shear to obtain the usual and . To compute the calibrated shear measurement bias, the measured ellipticities were corrected using Eq. 12 before averaging to obtain the shear estimate.
The uncalibrated and calibrated shear measurement biases are presented in Figure 3. We see that the uncalibrated shear measurement biases are well outside the requirement for upcoming surveys, as discussed earlier. The calibration reduces the additive bias by a factor of around three, and the multiplicative bias by a factor of around ten. We find that the bias term in Eq. 15 is insignificantly small to the accuracy afforded by our simulations. Therefore the calibrated biases are now within the requirement for upcoming surveys for both additive and multiplicative shear biases.
5 Conclusions
In this paper we have investigated the effect of noise on shear measurement from galaxy images. We have found that this can significantly bias shear measurement from realistic images, even though the bias goes away completely for images with lower noise levels. This was previously studied in (Hirata et al., 2004) and R12, who demonstrated the existence of this noise bias effect. We quantified noise bias using images simulated from more realistic galaxy models and we used a forward fitting shear measurement method which fitted a matching set of galaxy models to the simulations (Im3shape, Zuntz et al. in prep). These models are based on observationallymotivated combinations of exponential disk and de Vaucouleurs bulge models and are broadly representative of the light profiles of realistic galaxies. They have also formed the basis of previous weak lensing simulation programmes (Heymans et al., 2006; Bridle et al., 2010; Kitching et al., 2012). We use a maximum likelihood estimator (MLE) to obtain galaxy ellipticity estimates from the images, and use these ellipticity estimates as our noisy shear estimates. We find that the shear measurement biases often exceed 1 and even approach 10 for the smallest galaxies and highest noise values we consider in this paper.
One feature of the simulations presented is that they are deliberately internal: test galaxies are generated using the same models and routines used later for fitting them, the only difference being the addition of noise. In this way we are able to explore the effects of noise biases in isolation from the contribution of underfitting or model bias (e.g. Melchior et al., 2010; Voigt & Bridle, 2010; Bernstein, 2010). The fact that the biases we detect are considerable, even when fitting with perfect knowledge of the parametric galaxy model, is striking. We conclude that, for many methods, bias from unavoidable noise in galaxy images must be considered an important potential source of systematic error when seeking shear inference at subpercent level accuracy. The existence of noise bias is likely to be a common feature to many shape measurement methods (Hirata et al., 2004; R12). Unless shape measurement methods are theoretically constructed to avoid noise bias, empirical calibration with simulations is necessary.
We quantified the noise bias as a function of image and galaxy parameters and found a strong dependence. We found that the dependence on image signaltonoise ratio is inverse square, as expected from symmetry arguments (e.g. see R12). The dependence on galaxy size is quite nonlinear and rises steeply as the galaxy size decreases relative to the PSF size. The bias depends on the galaxy profile in a complicated way. We find that for our fiducial parameters shears are overestimated for exponential disc galaxies and underestimated for de Vaucouleurs bulge galaxies. The dependence on bulge to total flux ratio is reasonably consistent with a linear relation. There is a good linear relation between the additive shear measurement noise bias and the PSF ellipticity.
Many shape measurement methods are potentially subject to noise bias, and for these methods this sort of calibration will be an important step in order to reduce systematic errors below the level required for upcoming survey datasets. We illustrate a correction scheme based on a model of the measured biases, as function of observed galaxy properties. Note that this is not expected to remove the bias completely because the observed galaxy properties are not the true galaxy properties and therefore we will be using slightly the wrong bias correction. This correction was able to reduce ellipticity estimator biases to lower levels than those required for the upcoming lensing surveys, for a fiducial galaxy with SNR=20 and a typical intrinsic ellipticity of magnitude 0.3.
There is a small residual bias remaining after this first level of correction. This is due to the scatter and bias in measured galaxy parameters about their true values. This scatter and bias is an output of the simulations and could therefore be propagated into a second level of bias correction which would reduce the residual bias yet further, into the realm of farfuture surveys.
The calibration scheme we proposed can only be applied to a method which, in addition to ellipticity, also produces estimates of other parameters; it will probably be difficult to use it with a method such as KSB, which primarily aims to estimate only the ellipticity parameters.
This calibration approach is extremely computationally expensive and would ideally be carried out for a large range and sampling of image and galaxy parameters. The resolution of our results was limited by the available computing time. The final results shown in this paper took over 1 year of CPU time.
These results use a simple galaxy model in both the simulations and the fit. In practice it will be necessary to investigate more complicated galaxy models for both. However, the presented results are encouraging. For future surveys the simulated data must be carefully constructed in order to recreate realistic observing conditions, and the realistic properties of the underlying galaxies (the latter requirement poses greater difficulties than the former). The deep imaging of the real sky is potentially an expensive overhead for future surveys, but may prove necessary for confidence in the final results. Accurate estimates of gravitational shear from methods affected by noise bias will rely on consistent strategies for measuring and correcting these systematic effects.
The presented calibration scheme does not use the information about the galaxy parameters distribution in the universe. We found that the measured galaxy parameters were a sufficiently good proxy for the true galaxy parameters that the noise bias could be corrected well enough for upcoming surveys. If this result were generally true then this places less stringent requirements on the simulations because the galaxy population demographics would not need to match exactly with reality, and the simulations would only have to span a realistic range of galaxy parameters. However, different calibration schemes could be created based on the distributions of galaxy parameters. The simplest solution would be to calculate one and for the whole population of galaxies, randomly drawing not only noise maps but also galaxy and image parameters from histograms of measured parameters from galaxies in the survey. Using this method is not limited to maximum likelihood fitting; potentially all shear measurements methods could be calibrated that way.
We have used a white Gaussian noise model. In general it should be possible to repeat this procedure for a case of correlated noise. It should also be possible to repeat the procedure for Poisson noise. Our bias results will also depend on the number of parameters used in the fitting. We have used seven free parameters and fixed the ratio of radii of the bulge and disc galaxy components to unity. We also assumed no constant background in the image, whereas this could also be included as a free parameter in the fit. An uncertain variable background level would complicate the analysis further.
Another approach would be to use a fully Bayesian analysis: use the full likelihood distribution (or samples) of ellipticity given the noisy images and propagate this uncertainty to the cosmological parameters. In this case the calibration would not be necessary.
Acknowledgements
TK, SB, MH, BR and JZ acknowledge support from the European Research Council in the form of a Starting Grant with number 240672. Part of BR’s work was done at the Jet Propulsion Laboratory, California Institute of Technology, under contract with NASA. We thank Gary Bernstein for suggesting the calibration approach and for many fruitful discussions. The authors acknowledge the use of the UCL Legion High Performance Computing Facility, and associated support services, in the completion of this work. We thank Dugan Witherick for help with Legion Cluster. We thank Cris Sabiu and Caroline Pung for helpful discussions.
References
 Albrecht et al. (2009) Albrecht A., et al., 2009, arXiv, arXiv:0901.0721
 Albrecht et al. (2006) Albrecht A., et al., 2006, astro, arXiv:astroph/0609591
 Albrecht & Bernstein (2007) Albrecht A., Bernstein G., 2007, PhRvD, 75, 103003
 Amara & Réfrégier (2007) Amara A., Réfrégier A., 2007, MNRAS, 381, 1018
 Amara & Réfrégier (2008) Amara A., Réfrégier A., 2008, MNRAS, 391, 228
 Bartelmann & Schneider (2001) Bartelmann M., Schneider P., 2001, PhR, 340, 291
 Bartelmann et al. (2011) Bartelmann M., Viola M., Melchior P., Schäfer B. M., 2011, arXiv, arXiv:1103.5923
 Bernstein (2010) Bernstein G. M., 2010, MNRAS, 406, 2793
 Bernstein & Jarvis (2002) Bernstein G. M., Jarvis M., 2002, AJ, 123, 583
 Bonnet & Mellier (1995) Bonnet H., Mellier Y., 1995, A&A, 303, 331
 Bridle et al. (2009) Bridle S., et al., 2009, AnApS, 3, 6
 Bridle et al. (2010) Bridle S., et al., 2010, MNRAS, 405, 2044
 Bridle et al. (2002) Bridle S., Kneib J.P., Bardeau S., Gull S., 2002, in Natarajan P., ed., The shapes of galaxies and their dark halos, Proceedings of the Yale Cosmology Workshop ”The Shapes of Galaxies and Their Dark Matter Halos”, New Haven, Connecticut, USA, 2830 May 2001. Edited by Priyamvada Natarajan. Singapore: World Scientific, 2002, ISBN 9810248482, p.38 Bayesian galaxy shape estimation. pp 38–+
 Clocchiatti et al. (2000) Clocchiatti A., et al., 2000, IAUC, 7549, 2
 Crittenden et al. (2002) Crittenden R. G., Natarajan P., Pen U.L., Theuns T., 2002, ApJ, 568, 20
 Cypriano et al. (2010) Cypriano E. S., Amara A., Voigt L. M., Bridle S. L., Abdalla F. B., Réfrégier A., Seiffert M., Rhodes J., 2010, MNRAS, 405, 494
 Fu et al. (2008) Fu L., et al., 2008, A&A, 479, 9
 Goldberg & Bacon (2005) Goldberg D. M., Bacon D. J., 2005, ApJ, 619, 741
 Graham & Worley (2008) Graham A. W., Worley C. C., 2008, MNRAS, 388, 1708
 Heymans et al. (2006) Heymans C., et al., 2006, MNRAS, 368, 1323
 Hirata et al. (2004) Hirata C. M., et al. 2004, MNRAS, 353, 529
 Hoekstra & Jain (2008) Hoekstra H., Jain B., 2008, ARNPS, 58, 99
 Hosseini & Bethge (2009) Hosseini R., Bethge M., 2009, Technical Report, Max Planck Institute for Biological Cybernetics
 Hu (1999) Hu, W., 1999, ApJL, 522, L21
 Huterer et al. (2006) Huterer D., Takada M., Bernstein G., Jain B., 2006, MNRAS, 366, 101
 Irwin et al. (2007) Irwin J., Shmakova M., Anderson J., 2007, Astrophys. J., 671, 1182
 Kaiser (1992) Kaiser N., 1992, ApJ, 388, 272
 Kaiser (2000) Kaiser N., 2000, ApJ, 537, 555
 Kaiser, Squires, & Broadhurst (1995) Kaiser N., Squires G., Broadhurst T., 1995, ApJ, 449, 460
 Kitching et al. (2010) Kitching T., et al., 2010, arXiv, arXiv:1009.0779
 Kitching et al. (2012) Kitching, T. D. et al., 2012, arXiv, arXiv:1202.5254
 Kitching et al. (2008) Kitching T. D., Miller L., Heymans C. E., van Waerbeke L., Heavens A. F., 2008, MNRAS, 390, 149
 Kuijken (1999) Kuijken K., 1999, A&A, 352, 355
 Lewis (2009) Lewis A., 2009, MNRAS, 398, 471
 Lourakis (2004) Lourakis M. I. A. http://www.ics.forth.gr/~lourakis/levmar
 Massey et al. (2007) Massey R., et al., 2007, MNRAS, 376, 13
 Massey & Refregier (2005) Massey R., Refregier A., 2005, MNRAS, 363, 197
 Massey et al. (2007) Massey R., Rowe B., Refregier A., Bacon D. J., Bergé J., 2007, MNRAS, 380, 229
 Melchior et al. (2010) Melchior P., Böhnert A., Lombardi M., Bartelmann M., 2010, A&A, 510, A75
 Melchior et al. (2011) Melchior P., Viola M., Schäfer B. M., Bartelmann M., 2011, MNRAS, 412, 1552
 Miller et al. (2007) Miller L., Kitching T. D., Heymans C., Heavens A. F., van Waerbeke L., 2007, MNRAS, 382, 315
 Nakajima & Bernstein (2007) Nakajima R., Bernstein G., 2007, AJ, 133, 1763
 Ngan et al. (2009) Ngan W., van Waerbeke L., Mahdavi A., Heymans C., Hoekstra H., 2009, MNRAS, 396, 1211
 PaulinHenriksson et al. (2008) PaulinHenriksson S., Amara A., Voigt L., Refregier A., Bridle S. L., 2008, A&A, 484, 67
 Peacock & Schneider (2006) Peacock, J. and Schneider, P., 2006, The Messenger, 125, 48
 Peng et al. (2002) Peng C. Y., Ho L. C., Impey C. D., Rix H.W., 2002, AJ, 124, 266
 Refregier & Bacon (2003) Refregier A., Bacon D., 2003, MNRAS, 338, 48
 Refregier (2003) Refregier A., 2003, MNRAS, 338, 35
 Refregier et al. (2012) Refregier A., Amara A., Bridle S. L., Kacprzak T., Rowe B., 2012, in prep.
 Rowe (2010) Rowe B., 2010, MNRAS, 404, 350
 Schneider et al. (2002) Schneider P., van Waerbeke L., Kilbinger M., Mellier Y., 2002, A&A, 396, 1
 Seitz & Schneider (1997) Seitz C., Schneider P., 1997, A&A, 318, 687
 Sérsic (1963) Sérsic, J. L. 1963, Boletin de la Asociacion Argentina de Astronomia La Plata Argentina, 6, 41
 Simard et al. (2002) Simard L., et al., 2002, ApJS, 142, 1
 Van Waerbeke et al. (2006) Van Waerbeke L., White M., Hoekstra H., Heymans C., 2006, APh, 26, 91
 Viola, Melchior, & Bartelmann (2011) Viola M., Melchior P., Bartelmann M., 2011, MNRAS, 410, 2156
 Voigt et al. (2011) Voigt L. M., Bridle S. L., Amara A., Cropper M., Kitching T. D., Massey R., Rhodes J., Schrabback T., 2011, arXiv, arXiv:1105.5595
 Voigt & Bridle (2010) Voigt L. M., Bridle S. L., 2010, MNRAS, 404, 458
 Zhang (2008) Zhang J., 2008, MNRAS, 383, 113
 Zuntz et al. in prep (2012) Zuntz J., Bridle S., Kacprzak T., Rowe B., Hirsch M., Voigt, L. 2012, ArXiv:
Appendix A Measurement of the bias on the shear
The multiplicative and additive bias was measured using the following procedure.

Evaluate the bias on a grid in observed ellipticity: A grid in observed ellipticity parameter was created for each test galaxy in Table 2. This grid consisted of 8 angles on a ring. At each angle, 15 ellipticity magnitudes were used in range . This grid is presented in Figure 4. For each point on this grid, we evaluate 20000 noise realisations, and average them to obtain the bias. The number of noise realisations is chosen so that the uncertainty on the mean was smaller than .

Create a model of the bias as a function of observed ellipticity: A third order 2D polynomial was fit to the surface of the bias. Not all terms in the 2D expansion were used to avoid overfitting of the data. In particular, we used for fitting the bias on , analogously for . This expansion takes into account the inherent rotational symmetry of the problem: rotating galaxy ellipticity and PSF ellipticity vectors results in the rotation of the bias vector.

Perform a ringtest to calculate and : The parametric model of the bias surface allows us to perform a ring test at any desired intrinsic ellipticity.
The bottom panels of Figures 4 present the grid (dots) and interpolated surface (colour scale) of the magnitude of bias as a function of true and for a circular and elliptical PSF. We note that for circular PSF within the modelled range, the bias surface has a circular symmetry which demonstrates that the problem is symmetric and that the effect of the pixel orientation with respect to the galaxy is not strong. The top panels of Figure 4 present cross sections of the above grid and surface for each angle.
Appendix B Parameters and functions used to create models of the bias on ellipticity and shear
fiducial  

disc  
bulge  
:= SNR  

:=  
:=  
:=  
fiducial  

disc  
bulge  
Table 3 contains the multiplicative and additive bias measurements for all galaxies used in this work. See Appendix A for details of how these values were calculated. Fiducial galaxy parameters were: , , , , , , . Table 4 contains equations of the functions in Figure 2. Table 5 contains the parameters of polynomial function fitted to the bias on ellipticity, for example in Figure 4. The equation used with these parameters is
(17) 
for accordingly with parameters .