Constraints on Modified Gravity from Sunyaev–Zeldovich Cluster Surveys
Abstract
We investigate the constraining power of current and future SunyaevZeldovich cluster surveys on the gravity model. We use a Fisher matrix approach, adopt selfcalibration for the massobservable scaling relation, and evaluate constraints for the SPT, Planck, SPTPol and ACTPol surveys. The modified gravity effects on the mass function, halo bias, matter power spectrum, and massobservable relation are taken into account. We show that, relying on number counts only, the Planck cluster catalog is expected to reduce current upper limits by about a factor of four, to (68% confidence level) while SPT, SPTPol and ACTPol yield about . Adding the cluster power spectrum further improves the constraints to for Planck and for SPTPol, pushing cluster constraints significantly beyond the limit where number counts have no constraining power due to the chameleon screening mechanism. Further, the combination of both observables breaks degeneracies, especially with the expansion history (effective dark energy density and equation of state). The constraints are only mildly worsened by the use of selfcalibration but depend on the mass threshold and redshift coverage of the cluster samples.
I Introduction
One of the most fascinating aspects of contemporary cosmology is the potential of constraining fundamental physics with the plethora of available data. Galaxy clusters constitute one of the major tools we can use to this aim. The biggest gravitationally bound objects in the Universe, they have formed fairly recently and several of their global properties such as abundance and clustering on large scales can be predicted accurately with theoretical models (e.g., Tinker et al. (2008); Sheth and Tormen (1999)). For this reason, they have been extensively used in the past in order to constrain fundamental parameters such as the total matter density and the matter power spectrum normalization (Viana and Liddle, 1999; Pierpaoli et al., 2001, 2003; Borgani et al., 2001; Allen et al., 2003a). When combined with other cosmological data at various redshifts, clusters can also be used to constrain particle physics and neutrino properties (Pierpaoli, 2004; Allen et al., 2003b; Vikhlinin et al., 2009).
Given that gravity is the only relevant force in the formation of structure in the Universe on large scales, cosmological observations are uniquely suited to test gravity on scales of Mpc, complementing Solar System tests on AU scales. In recent years, clusters have received considerable interest as a probe of gravity Martino et al. (2009); Diaferio and Ostorero (2009). Modifications to gravity generically change the growth of largescale structure (e.g., Jain and Khoury (2010); Clifton et al. (2011)), and clusters at the highmass tail of the mass function are especially sensitive to changes in the growth rate. This has been exploited in Schmidt et al. (2009a) who used a sample of Xray clusters to constrain gravity. Similarly, Lombriser et al. (2010) used an optical Sloan cluster sample. A consistency test of the General Relativity + smooth Dark Energy framework using clusters was done in Rapetti et al. (2010).
Here, we focus on the model of gravity, using the functional form proposed in Hu and Sawicki (2007). This model produces acceleration without a true cosmological constant, and is indistinguishable from CDM through geometric probes (CMB, Supernovae, , BAO measurements). However, gravitational forces are modified on smaller scales. Furthermore, the model includes the chameleon screening mechanism which restores General Relativity in highdensity environments. Thus, this model is able to satisfy all current constraints on gravity. Structure formation in this modified gravity model is now understood on all cosmological scales: the linear regime of structure formation in this model has been studied in Hu and Sawicki (2007). The nonlinear structure formation was investigated using dedicated Nbody simulations in Oyaizu (2008); Oyaizu et al. (2008); Schmidt et al. (2009b); Zhao et al. (2011). This allows for fully selfconsistent constraints and forecasts to be made for this model.
While cluster samples have mainly been selected in the optical and X–ray bands in the past, recent observations based on the Sunyaev–Zeldovich (SZ) effect are starting to produce new detections Williamson et al. (2011); Vanderlinde et al. (2010a); Marriage et al. (2011); Planck Collaboration et al. (2011a, b). The SZ effect consists in CMB photons inverse–Compton scattering off electrons in the intra–cluster medium. This process causes a distortion in the CMB blackbody spectrum, and a frequencydependent brightness change Birkinshaw (1999). What makes SZ clusters particularly interesting as cosmological probes is the unique, almost redshiftindependent sensitivity for detecting clusters. As a consequence, SZ surveys have the potential to discover clusters at high redshift where optical and Xray surveys are not very efficient. This new probe is receiving significant attention because of additional data expected from ongoing SZ surveys like Planck, the Atacama Cosmology Telescope (ACT), and the South Pole Telescope (SPT) in the near future.
In this paper, we explore to what extent these new cluster surveys are expected to constrain models through cluster number counts and clustering. The paper is organized as follows. We begin by presenting the surveys and expected cluster samples in Sec. II. This is useful as the modified gravity effects discussed throughout the paper depend sensitively on the characteristics of the cluster samples. In Sec. III we present the parametrization of modified gravity effects on the halo abundance and clustering. Sec. IV details the Fisher formalism employed here, as well as the fiducial cosmology adopted. The forecasted constraints are presented in Sec. V. We discuss our results in Sec. VI and conclude in Sec. VII.
Ii Cluster surveys
We will investigate the predictions for the four surveys described in the following. While we try to obtain as realistic survey specifications as possible, in particular for the mass limit as function of redshift , the lack of previous large samples of SZ clusters necessarily makes these quantities somewhat uncertain. In particular, the relation between cluster mass and SZ signal is still imperfectly known (e.g. Ameglio et al. (2009); Rasia et al. (2005); Nagai et al. (2007); Piffaretti and Valdarnini (2008)). The final mass limits as a function of redshift are shown in Fig. 1, and the resulting expected number of clusters for each survey is shown in Fig. 2.
ii.1 The Planck Catalog
Planck is imaging the whole sky with an unprecedented combination of sensitivity ( per beam at 100  217 GHz), angular resolution ( at 217 GHz), and frequency coverage ( GHz). The SZ signal is expected to be detected from a few thousand individual galaxy clusters. Planck will produce a cluster sample with median redshift (see Fig. 2, upper left panel). The SZ observable is the integrated Comptonization parameter out to a given radius. For Planck, a detection threshold ensuring high level of completeness (about 90%) corresponds to (Melin et al., 2006), where is the integrated comptonization parameter within , the radius enclosing a mean density of 200 times the critical density. The early release from the Planck Collaboration gives a sample of 189 high signaltonoise SZ clusters with detection. It is therefore likely that our assumed detection threshold will be eventually reached in future data releases. For an SZ survey, its flux limit can be translated into a limiting mass by using simulationcalibrated scaling relations Sehgal et al. (2007):
(1) 
In order to mitigate the effect of overestimation of unresolved clusters at low redshift, we further restrict to be at least at all . With all these criteria, the Planck survey is expected to detect clusters. The mass threshold we find with this approach is consistent with the one in Malte Schäfer and Bartelmann (2007). While we keep as our reference minimum value for presentation of the main results, we will also discuss predictions for a lower mass threshold, corresponding to . With such threshold, the completeness of the sample is reduced to about 70% and the total number of clusters is 2700.
ii.2 SPT and SPTpol
The SPT survey is currently observing the sky with a sensitivity of K/arcmin at 148 GHz, 218 GHz, and 277 GHz. This survey covers square degrees of the southern sky (between , ) with a projected survey size and cluster mass limit well matched to the Stage III survey specification of the Dark Energy Task Force Vanderlinde et al. (2010b). For the mass limits, we employ the calibrated selection function of the survey by Vanderlinde et al. (2010b). This is based on simulations and used to provide a realistic measure of the SPT detection significance and mass. Disregarding the scatter in the fitting parameters for this relation, we use here:
(2) 
where is the detection significance. For the SPT survey, we take clusters detected at which ensure a 90% purity level. Currently, the SPT team is setting a low redshift cut at in their released cluster sample, due to difficulties in reliably distinguishing lowredshift clusters from CMB fluctuations in single frequency observations. Nevertheless, with upcoming multifrequency observations, a lower cut will likely be attained. We therefore apply this cut in our work. With this, the SPT survey is expected to detect clusters.
In addition to this, we also consider the upcoming SPT polarization survey (hereafter SPTpol) which will have an increased sensitivity of K/arcmin at 150 GHz for a 3 year survey and sky coverage of 625 square degrees. We scaled the mass limits by a factor of in Eq. (2) to match with the expected mass limits of SPTpol clusters (Benson 2011, private communication). We again use , resulting in a total expected number of clusters. While these are the limits we use for our main results, we also discuss outcomes that consider a lower mass limit, corresponding to (80% purity). With this mass limit, SPT would find 800 clusters and SPTPol about 1400 clusters.
ii.3 ACTpol
The Atacama Cosmology Telescope (ACT) has been observing a portion of the southern sky since 2008 consisting of two strips of the sky, each 4 degrees wide in declination and 360 degrees around in right ascension, one strip is centered at , and the other is centered at Sehgal et al. (2007). With a sensitivity of K/arcmin, only about 100 clusters are expected to be detected. Instead, we turn to the newly developing dualfrequency (150 GHz and 220 GHz) polarization sensitive receiver (hereafter ACTpol Niemack et al. (2010) and reference therein) to be deployed on ACT in 2013. One of the three ACTpol observing seasons will have a wide survey covering to a target sensitivity of K/arcmin in temperature at 150 GHz. With the wide field, they aim to find clusters in the ACTpol survey. The survey is complete above a limiting mass of (Sehgal 2011, private communication), and we therefore assume this as our redshiftindependent mass limit for ACTpol. As in SPT, the ACT team also put a low redshift cut in their parameter determination works and we likewise take for ACTpol, resulting in a total expected number of clusters. We also present in the discussion section the results corresponding to a lower mass limit, , which would result in a catalog of about 1000 clusters.
Iii Theoretical modeling
iii.1 gravity
In the model (see Nojiri and Odintsov (2006); Sotiriou and Faraoni (2010) and references therein), the EinsteinHilbert action is augmented with a general function of the scalar curvature Carroll et al. (2004); Nojiri and Odintsov (2003); Capozziello et al. (2003),
(3) 
Here and throughout . This theory is equivalent to a scalartensor theory (if the function is nontrivial). The additional field given by mediates an attractive force whose physical range is given by the Compton wavelength . On scales smaller than , gravitational forces are increased by 4/3, enhancing the growth of structure.
A further important property of such models is the nonlinear chameleon effect which shuts down the enhanced forces in regions with deep gravitational potential wells compared with the background field value, Khoury and Weltman (2004); Hu and Sawicki (2007). This mechanism is necessary in order to pass Solar System tests which rule out the presence of a scalar field locally. Thus, Solar System tests constrain the amplitude of the background field to be less than typical cosmological potential wells today ().
In this paper, we will choose the functional form introduced by Hu & Sawicki Hu and Sawicki (2007):
(4) 
with two free parameters, , . Note that as , , and hence this model does not contain a cosmological constant. Nevertheless, as , the function can be approximated as
(5) 
with replacing as the second parameter of the model. Here we define , so that , where overbars denote the quantities of the background spacetime. Note that implies always, as required for stable cosmological evolution. If , the curvature scales set by and differ widely and hence the approximation is valid today and for all times in the past.
The background expansion history thus mimics CDM with as a true cosmological constant to order . Therefore in the limit , the model and CDM are essentially indistinguishable with geometric tests. The linear growth rate is identical to that of CDM on scales larger than , and becomes strongly scaledependent on smaller scales Hu and Sawicki (2007).
Note that we have chosen a model whose expansion history is close to CDM by construction. In general, there is sufficient freedom in the free function to emulate any given expansion history Song et al. (2007). Hence, below we will also allow the expansion history to vary, parametrized by effective dark energy parameters and . Further, while we choose a specific functional form for here, it is straightforward to map constraints onto different functional forms (see Ferraro et al. (2011) for details). In the following, for notational simplicity will always refer to the absolute value of the field amplitude today.
iii.2 Cluster abundance in
Studying structure formation in gravity beyond linear theory is complicated by the nonlinear field equation for the scalar field , the nonlinearity being responsible for the chameleon mechanism. The field equation needs to be solved simultaneously with the evolution of the matter density. This has been done in the selfconsistent Nbody simulations of Oyaizu (2008). The abundance of dark matter halos (mass function) and their clustering (halo bias) in the simulations was studied in Schmidt et al. (2009b).
Since these simulations are very timeconsuming, they cannot be used to exhaust the cosmological parameter space. Instead, we use a simple model developed in Schmidt et al. (2009b) based on spherical collapse and the peakbackground split in order to predict the cluster abundance and their linear bias.
In order to describe the effect of gravity on the halo mass function, we employ the ShethTormen prescription for the comoving number density of halos per logarithmic interval in the virial mass , given by
(6) 
where the peak threshold and
(7) 
Here is the variance of the linear density field convolved with a top hat of radius that encloses at the background density
(8) 
where is the linear power spectrum (either in CDM or in ) and is the Fourier transform of the top hat window. The normalization constant is chosen such that . The parameter values of , , and for the spherical collapse threshold have previously been shown to match simulations of CDM at the level. The virial mass is defined as the mass enclosed at the virial radius , at which the average density is times the mean density. We transform the virial mass to the desired overdensity criterion assuming a NavarroFrenkWhite Navarro et al. (1997) density profile Hu and Kravtsov (2003), and assuming the massconcentration relation of Bullock et al. (2001) (note that the rescaling depends very weakly on the assumed halo concentration for the values of used here). We thus obtain the mass function of halos in the ST prescription, , from .
The effects of modified gravity enter in two ways in this prescription: first, we use the linear power spectrum for the model in Eq. (8). Second, we assume modified spherical collapse parameters which were obtained by rescaling the gravitational constant by 4/3 during the collapse calculation as well as the corresponding linear growth extrapolation to obtain . This corresponds to the case where the collapsing region is always smaller than the Compton wavelength of the field. Schmidt et al. (2009b) showed that this case always underestimates the effects on the mass function and bias, and hence serves as conservative model. For our fiducial cosmology at , we obtain GR collapse parameters of , , while the modified parameters are given by , . The ShethTormen prescription itself does not provide a very accurate prediction for the abundance of clusters in CDM in the entire redshift range relevant for SZ surveys. Since more precise parametrizations are available, we only use the ST prescription to predict the relative enhancement of the cluster abundance in . Specifically, after rescaling to our adopted mass definition, we take the ratio of the two and multiply it by the CDM mass function from Tinker et al. (2008),
(9) 
where we use the parameters given in their Appendix B. Note that for small field values and at high masses, the predicted mass function in fact becomes smaller than that for CDM. Since this effect is not seen in the simulations, we conservatively set the mass function ratio to 1 whenever it is predicted to be less than 1.
Fig. 2 shows the number of clusters as a function of redshift expected from the four surveys considered in this work (see Sec. II), and the relative deviations of the model from the CDM model for different values of (dashed lines). The modifications are most prominent at low redshifts , since the changes in the linear power spectrum are restricted to progressively smaller scales towards higher redshifts. Further, for , we see the strongest effects for surveys with the lowest mass thresholds, in particular Planck (for ) and SPTpol. This is a consequence of the chameleon mechanism which suppresses the mass function enhancement above progressively lower masses as decreases. There are negligible differences between and CDM for at high halo masses. Hence, the mass threshold of a given survey determines what field values can be probed by number counts.
Further, we have to take into account the effect of modified gravity on the massobservable relation. The SZ effect is a dynamical mass measure, as the decrement is proportional to the velocity dispersion (pressure) of electrons. In modified gravity, dynamical mass estimates are generally different from the actual mass due to the presence of the additional gravitational force which enters the virial equation. As shown in Schmidt (2010), the dynamical mass is related to the true mass via
(10) 
where is a weighted integral of the force modification over the object which describes the effect on the virial equation. In principle, should be weighted by the SZ emissivity and observational window function. However in the interest of simplicity, and since we are only interested in an approximate forecast, we simply weight the modified forces by the matter density of the halo, assuming an NFW profile Navarro et al. (1997). Further, we assume the host halo is spherically symmetric. We then have
(11) 
where is the Newtonian potential of the halo, found by solving (see Schmidt (2010) for an explicit expression)
(12) 
and is the force modification. In order to calculate the force modification, we have to solve the chameleon field equation for an NFW halo Schmidt (2010). This calculation is computationally expensive, so we instead use a simple model which describes the exact results reasonably well Schmidt (2010); in fact it underpredicts the exact result for the force modification, and thus is a conservative estimate. Specifically,
(13) 
Here, is the outermost radius at which the condition is met. In the largefield limit this condition is never met, so that and throughout. Eq. (10) then yields . For sufficiently small fields, the chameleon mechanism becomes active so that for , thus modeling the screening of the modified force. In this case, will interpolate between and .
We show in Fig. 1 the mass threshold of the four cluster surveys in CDM (solid) and the dynamical mass effect to these thresholds (dashed). Fig. 2 also shows the dynamical mass effect on the observed cluster abundance (dashdotted lines). Note that the dynamical mass effect is not simply additive to the mass function enhancement, since the latter depends on mass as well. Due to the steepness of the halo mass function at the highmass end, the fact that is larger than the true mass significantly boosts the abundance of detected clusters above the mass threshold. The two effects of enhanced growth and increased both contribute to increase the observed cluster abundance. For , the mass function enhancement provides a significant contribution to the overall change in number counts, while at higher redshifts the increase in dynamical mass is the dominant effect.
iii.3 Halo clustering in
In addition to the halo abundance, modified gravity also affects the clustering of halos. This effect comes from two sources: first, the matter power spectrum is enhanced on small scales by the increased gravitational forces. Second, the linear bias of halos at a given mass is reduced, since at a fixed mass halos are less rare in than in GR. The power spectrum of clusters of mass is modeled as
(14) 
The halo bias is given by the peakbackground split. For the mass function used, it is given by Sheth and Tormen (1999)
(15) 
where are defined after Eq. (6). Note that is given in terms of the virial mass and thus, for a given cluster mass , it differs in the two collapse scenarios because of different values.
For the matter power spectrum in Eq. (14), we use the linear theory power spectrum for and CDM. As shown in Oyaizu et al. (2008), this describes the nonlinear power spectrum at measured in Nbody simulations up to scales . In order to minimize the impact of nonlinearities on the power spectrum and its covariance, we limit our Fisher matrix to modes with less than 0.1Mpc. Note that including smaller scales will further improve the constraints; however, a more sophisticated model including nonlinear and/or scaledependent bias, and the nonlinear matter power spectrum would be necessary in this case.
Thus, the effect on the cluster power spectrum is due to three combined effects: enhancement of the linear power spectrum , halo bias , and the dynamical mass effect . Fig. 3 shows the relative deviation of the cluster power spectrum in with respect to CDM for the Planck survey (Sec. II) as a function of redshift and wavenumber . Plots for the other surveys investigated here show similar and dependences, though the amplitude of each effect depends on the survey. Here, we have assumed one mass bin and . Similar to , we plot the total effect (upper left), and separately the effect due to (upper right), and (lower panel). For this field value, the dynamical mass effect is irrelevant since the clusters detectable by Planck are chameleonscreened. The departure from CDM is mainly driven by which shows a strong redshift dependence, and only mildly affected by which is dependent and only relevant on small scales. Given that the power spectrum is shotnoise dominated at all scales for the cluster samples considered, the effect on the linear halo bias in fact is the most important contribution to the constraints from the cluster power spectrum.
Iv Fisher matrix formalism
The Fisher information Matrix (FM hereafter) is defined as
(16) 
where is the likelihood of a data set, e.g. a cluster sample, written as a function of the parameters describing the model. The parameters comprise the cosmological model parameters as well as “nuisance” parameters related to the data set (e.g., mass calibration).
iv.1 Cosmological parameters
Throughout this paper, we assume a spatially flat () cosmology. Our model comprises a total of seven cosmological parameters and one model parameter which are left free to vary. The seven parameters and their fiducial values (in parenthesis, taken from the bestfit flat CDM model from WMAP 7yr data, BAO and measurements Komatsu et al. (2011)) are: baryon density parameter (0.0245); matter density parameter (0.143); dark energy density (0.73); power spectrum normalization (0.809); index of power spectrum (0.963); effective dark energy equation of state through , with fiducial values and . The Hubble parameter is then a derived parameter given by in the fiducial case. The modification can alternately be parametrized using the field amplitude at , or the Compton wavelength at (see Sec. IV.6). Our fiducial value is .
In the following, we first discuss the Fisher matrix for number counts and clustering of clusters, before describing the calibration parameters and CMB priors. Throughout, we divide the redshift range into bins of width . Further, we bin clusters in logarithmic mass bins of width from the minimum mass for each survey (Sec. II) up to a large cutoff mass of . Since the mass limit varies with redshift, the number of mass bins thus also varies somewhat across the redshift range.
iv.2 Number counts
The FM for the number of clusters within the th redshift bin and th mass bin is
(17) 
where the sum over and runs over intervals in the whole redshift range and cluster mass range . We can write the abundance of clusters expected in a survey, within a given redshift and mass interval, using the mass function as:
(18)  
where is the solid angle covered by the cluster survey, , and is the mass function given in Eq. (9). Following Lima and Hu (2005), we take into account the intrinsic scatter in the relation between true and observed mass, as inferred from a given mass proxy, by the factor which is the probability for a given cluster mass with of having an observed mass . Under the assumption of a log–normal distribution for the intrinsic scatter, with variance , the probability is
(19) 
where
(20) 
With these notations, we parameterize the relation, in addition to the intrinsic scatter, by a systematic fractional mass bias . With this prescription, the final expression for the number count FM is:
where erfc is the complementary error function.
iv.3 Power spectrum
We define the FM for the power spectrum of galaxy clusters as
(22) 
where the sum over runs over mass bins, while the sum in and runs over intervals in the whole redshift range and wavenumber with respectively. is the cluster crosspower spectrum for mass bins and , calculated for the given redshift and wavenumber through
(23) 
Here, is the mass function weighted effective bias,
(24) 
The effective volume for mass bins , wave number , and redshift is given by (see App. A)
(25)  
where is the comoving volume of the redshift slice covered by the given survey, and is the cluster number density for mass bin at redshift . The effective volume gives the weight carried by each bin in the space to the power spectrum Fisher matrix, and hence quantifies the amount of information contained in a given redshift and bin. Fig. 4 shows the redshift and scale dependence of the effective volume for the four cluster surveys. We find that for all redshifts and surveys considered, even when not binning in mass, hence the cluster power spectrum is shotnoise dominated for all surveys. As the lower panel of Fig. 4 illustrates, Planck is most limited by shot noise, while SPTpol is least limited, as expected from their respective mass limits and coverage.
iv.4 Calibration parameters
In selfcalibrating the true and observed cluster mass (Eq. (20)), we introduce four nuisance parameters which specify the magnitude and redshiftdependence of the fractional mass bias and the intrinsic scatter . Following Lima and Hu (2005), we assume the following parametrization:
(26) 
Therefore the four nuisance parameters are , , , and . A negative value for corresponds to an underestimation of mass. The mass bias accounts for the possibility of a systematic offset in the calibration of the observable mass scaling relation. We adopt fiducial values of , , , . In deriving the main results, we will not make any assumption on the four nuisance parameters and leave them free to vary. We will study the effect of assuming different priors on the four nuisance parameters on the constraints in Sec. V.4 .
iv.5 CMB Prior
In the following, we present results with the Fisher matrix for the Planck CMB temperature power spectrum added to the constraints from cluster counts and power spectrum. We calculate the full CMB fisher matrix with CAMB Lewis et al. (2000) and method described in Pritchard and Pierpaoli (2008). For the Planck experiment, we use the three frequency bands 100, 143 and 217 GHz, and the are calculated up to . Our fiducial parameter set for the CMB experiment is, as described in the DETF report Albrecht et al. (2006), , where is the primordial amplitude of scalar perturbations and is the optical depth due to reionization. After marginalizing over the optical depth, we transform the Planck CMB fisher matrix to our cluster survey parameter set by using the appropriate Jacobian matrix. The CMB imposes strong prior on the cosmological parameters. For example, is known to be measured with the CMB power spectrum to an exquisite precision, and this helps in breaking parameter degeneracies in the constraints from cluster surveys. As we shall see in Sec. V, the field amplitude parameter shows degeneracies with some of the cosmological parameters, so that the CMB prior also helps in further constraining .
iv.6 NonGaussian likelihood
An inherent assumption in the Fisher matrix approach is that the likelihood can be approximated as Gaussian around its maximum; in other words, that one can do a reasonably accurate Taylor expansion of in all parameters. Unfortunately, this is not the case for the parameter , as the derivatives of the likelihood with respect to diverge at the fiducial value (see Fig. 5 in Schmidt et al. (2009a)). Thus, we choose the Compton wavelength as a parameter instead of , where for the model and fiducial cosmology considered here,
(27) 
With this choice, becomes analytic at the fiducial value . Specifically, we calculate the derivatives numerically as
(28) 
where is the Compton wavelength evaluated at the chosen step size through Eq. (27), and denotes the likelihood from either or . Unfortunately, the likelihood is still strongly nonGaussian in the direction of , and the constraints depend on the step size chosen to evaluate the Fisher matrix elements in Eq. (16). In principle, one would have to evaluate the full likelihood with a MCMC approach, and then perform a marginalization to obtain proper forecasted constraints. Here, we opt instead for a simpler approach. We evaluate the Fisher matrix for a range of step sizes , and then quote the constraints for which is satisfied. One can easily show that this gives the correct answer in the ideal case where the likelihood is Gaussian in all other parameters. Note that while we always use as parameter in the Fisher matrix, we will quote constraints in terms of in order to facilitate comparison with the literature, using only to show parameter degeneracies.
V Results
We begin by discussing constraints from number counts (Sec. V.1) and power spectrum (Sec. V.2) separately, before moving on to combined constraints (Sec. V.3) and the impact of external priors on the nuisance parameters (Sec. V.4).
v.1 Number counts
As discussed in Sec. IV.6, the Fisher constraints depend on the value of adopted to evaluate the numerical derivatives in the Fisher matrix. Fig. 5 shows the projected constraints for the different surveys as a function of . The sharp upturn at (SPT and ACTPol), (SPTPol) and (Planck) signals the transition to the chameleonscreened regime, where the mass function enhancement becomes negligible Schmidt et al. (2009b). The shape of this transition depends on the mass limits of the different surveys, as more massive halos are screened for larger values of . The figure clearly shows that, with number counts alone, constraints cannot be tighter than . Nevertheless, this still constitutes an order of magnitude in improvement over current constraints. It should also be noted that the use of the dynamical mass in the calculations leads to a significant improvement in constraints in the largefield regime where the chameleon mechanism is not active.
The precise constraints obtained at the intersection are listed in Tab. 1, along with the step size used for each survey. The relative constraining power of the different surveys can easily be interpreted by looking at shown in Fig. 2. The best survey to constrain with number counts is Planck which shows prominent deviations in at low redshift, and yields a 68% CL constraint of . Although SPTpol shows significant differences in number counts out to large redshifts, the relatively small survey volume compared to Planck limits the performance in constraining to . It is interesting to notice that while their overall performance is similar, the constraints leverage on clusters in almost disjoint redshift ranges. Therefore these surveys provide complementary information on constraints from number counts, making the overall result less susceptible to specific issues related to either low or high redshift clusters. An investigation of whether the combination of both cluster samples yields a significant improvement on the expected constraints would be worthwhile, but is beyond the scope of this paper. The other two surveys also present results highly competitive with current constraints, and not very different from SPTPol (). A better investigation with a proper likelihood would be necessary in order to make more precise statements.
v.2 Power spectrum
Fig. 6 shows the constraints from the clustering of clusters alone as a function of step size. The constraints generally worsen as the step size decreases to very small values. This is because the likelihood around the fiducial model (CDM) scales as , where , and hence the derivatives go to zero as the step size decreases. However, constraints do not worsen dramatically as the step size crosses the chameleon threshold, because the modification to the halo bias in persists even if the halos are chameleon screened Schmidt et al. (2009b). Furthermore, the deviations in the matter power spectrum on small scales also persist for field values . As expected, the use of the dynamical mass does not affect the constraints for small field values where the entire cluster sample is chameleon screened.
The constraints from power spectrum only are summarized in the second column of Tab. 1. For Planck (as well as marginally for SPTPol) the constraint on from the cluster power spectrum is tighter than that from the abundance only. This is mainly because the power spectrum retains sensitivity to effects even when the halos are chameleon screened. For ACTPol and SPT the power spectrum yields slightly less constraining power than number counts, as the disadvantage of not having allsky coverage is not compensated by the relatively low mass threshold.
In order to investigate what cluster redshift range contributes to the constraints, we the constraints (for fixed) as function of the maximum cluster redshift considered in Fig. 7. For surveys with mass limits which decrease with redshift, i.e. SPT and SPTpol, constraints improve up to , while for Planck all the information is derived from clusters below , and for ACT the constraining power comes from clusters below . It is especially interesting to compare results from ACTpol and SPT, which detect a comparable number of clusters overall but with a different redshift distribution. Fig. 2 shows that ACTpol has a significantly higher number of clusters than SPT out to , and a lower mass limit out to . Yet the constraints from the cluster power spectrum are worse for ACTpol than SPT, due to the contribution from clusters for SPT (Fig. 7). How well each survey can realize their potential constraining power clearly depends on the precise achieved in the final cluster sample.
Parameter  +  

Planck  
ACTpol  
SPT  
SPTpol  
v.3 Combined constraints
Fig. 8 shows constraints on when combining both number counts and clustering, as a function of the step size . The dependence on is similar to the case of power spectrumonly and number countsonly constraints at small and large step size respectively. Combining the two probes helps to break degeneracies and better constrain the nuisance parameters. As a result, the constraints on show improvements with respect to those derived from power spectrum or number counts alone (third column in Tab. 1). While Planck reaches a constraint of , ACTpol, SPT and SPTPol achieve . Among the four surveys, the Planck survey thus yields the tightest constraints regardless of which cluster probe is being used. The relative merit of the Planck survey is due to its large area, which allows to detect massive clusters on the whole sky, and its ability to detect low redshift clusters. Fig. 2 shows that in the smallfield regime (), the low redshift clusters drive the constraints for Planck, while low mass clusters do so for SPTPol.
Up to now, we presented results with conservative mass limits, i.e. clusters are expected to be detected with for all the surveys. We also examined improvements in the constraint when using more optimistic mass limits for each survey, according to what is outlined in section Sec. II. For all surveys, the constraints from number counts only are hardly affected, since they are mainly set by the chameleon threshold. In each case, the larger cluster sample does improve the power spectrum constraints. However, only for Planck does this yield a significant improvement in the combined constraints (by a factor of 1.5 to ), while for ACTpol, SPT, and SPTpol, the improvement in combined constraints is marginal.
Fig. 9 illustrates the most important degeneracies of with standard cosmological parameters for the Planck survey. Here, we show instead of for purposes of presentation. The most prominent degeneracies are with the amount and equation of state of dark energy (, and ). Clearly, the combination of both observables yields a significant reduction in degeneracies in all cases. The degeneracy with dark energy parameters also explains why the combined constraints on are slightly better for SPT than for ACTPol, even though the constraints from number counts and clustering separately are very similar for the two surveys. By probing higher redshifts more effectively, SPT is able to better break degeneracies with dark energy parameters.
Constraints on modified gravity show little with the power spectrum normalization (see Fig. 9). This is due to the fact that the high number of clusters detected allows for good characterization of the shape of the mass function beyond its overall normalization. Similar but somewhat weaker degeneracies are present for the other surveys.
v.4 Uncertainties in scatter of mass observable relations
Throughout this work, we have assumed a functional form for the scaling relations and then allowed the data to calibrate the parameters that characterize it. This procedure is possible thanks to the large number of clusters that are expected to be detected in these surveys. Current strategies for deriving constraints from cluster surveys, however, rely on the calibration of scaling relations as obtained by a small subset of well studied clusters. In general, allowing more freedom to the scaling relation parameters may avoid biases induced by incorrect scaling relations but can also result in a degradation of the final result. In order to investigate the degradation of due to this selfcalibration, we repeat the forecasts assuming different priors on the four “nuisance” parameters. The result is summarized in Tab. 2 for the number counts, clustering, and combined, and for the four surveys.
Here, the “weak prior” case assumes priors on the nuisance parameters of and , as well as and , as suggested by comparison between Xray and lensing cluster mass measurement (e.g., the XMMNewton measurements presented in Zhang et al. (2010)). The combined constraints on are smaller than those for the default, no prior case, by about 25% for Planck, 80% for ACTpol, and 50% for SPT and SPTpol. The most prominent improvements are seen in number counts only constraints (e.g. a factor 3.8 for SPT).
The “strong prior” case assumes that all four nuisance parameters are fixed at their fiducial values. This assumption, which is anyway not realistic, would lead to improvements of about one order of magnitude with respect to the selfcalibration results.
This result suggests that although selfcalibration does not in general lead to major degradations in the constraints, good prior information on normalization and scatter in the massobservable relation can improve constraints considerably in partiuclar for the ACTpol and SPT/SPTpol surveys.
On the other hand, it is important to keep in mind that selfcalibration relies on a specific parametrization of the massobservable relation and its scatter, and external measurements are important to validate these assumptions. As a worstcase scenario, we also considered the case of a single mass bin for each survey, i.e. neglecting all mass information on individual clusters. The fully marginalized, combined constraints on (without any priors on bias and scatter) worsen by approximately a factor of four for Planck and ACTpol. On the other hand, both SPT and SPTpol constraints degrade by only a factor of three, since both surveys has a large lever arm in redshift. While these constraints are considerably worse than when using mass bins, the Planck and SPTpol (?) constraints with a single mass bin still improve over current upper limits.
Survey  (Mpc/h)  


weak  
Planck  1.00  1.02  1.23  1.02  1.01  1.11 
ACTpol  2.14  1.95  1.82  1.01  1.17  1.36 
SPT  3.80  1.02  1.48  1.03  1.01  1.21 
SPTpol  1.29  1.02  1.45  1.13  1.00  1.22 

Vi Discussion
It is worth comparing our forecasted constraints on with those obtained in Schmidt et al. (2009a) (Lombriser et al. (2010) obtained similar constraints). By combining 49 Chandra Xray clusters and using geometric constraints from CMB, supernovae, , and BAO, they found an upper limit of (95% CL), including only the statistical error. Our forecasted constraints are tighter by a factor of (ACTpol, SPT, SPTPol) and (Planck), respectively. The main reasons for the tighter constraints are: the significantly larger cluster samples yielded by these surveys, the use of the dynamical mass (which improves number count constraints), and the inclusion of the clustering of clusters as an observable. As shown in Sec. V, the latter in fact provides the dominant constraining power for these surveys in the small field limit.
Furthermore, the constraints in Schmidt et al. (2009a) are dominated by the systematic uncertainty in the cluster mass scale, and including this systematic increases the upper limit to . The constraints presented here are marginalized over the cluster mass scale, and hence already include this systematic. Indeed, the combination of power spectrum and number counts is essential in order to realize selfcalibration without loosing constraining power.
One interesting finding of our study is that the chameleon screening mechanism, a necessary ingredient in this modified gravity model in order to satisfy Solar System constraints, has a qualitative impact on the constraints. In particular, the number counts by themselves cannot push constraints below due to this effect, while they yield the tightest constraints for larger field values. Similarly, the importance of the dynamical mass effect is controlled by the chameleon threshold. This is expected to hold for other modified gravity scenarios as well, as long as the respective screening mechanism depends mainly on the host halo mass (or potential well) of the cluster. On the other hand, screening mechanisms that mainly depend on the average interior density, such as the Vainshtein mechanism employed in braneworld and galileon models, will show a qualitatively different behavior Schmidt et al. (2010); Schmidt (2010) (see Clampitt et al. (2011) for a study of the related symmetron mechanism). For such models, the utility of number counts will not be limited to certain parameter ranges. Thus, taking into account the screening mechanism is crucial for obtaining realistic constraints on any viable modified gravity model, both for forecasts and when using actual data.
All of the surveys considered here reach the limit set by the chameleon mechanism on the constraints from number counts. The Planck survey achieves the tightest constraints both due to its large volume, which reduces the sample variance especially in the cluster power spectrum, and due to its ability to detect clusters at . For example, if we limit the Planck cluster sample to , the combined constraints in degrade by a factor of four to . We thus expect that significant improvements in constraining power are achievable for groundbased SZ surveys if the minimum cluster redshift can be reduced.
Several improvements upon our treatment here are possible. First, our model for the effects on mass function and bias of halos is conservative. In order to investigate this, we repeated the forecast using the standard as opposed to modified spherical collapse parameters in the model prediction Schmidt et al. (2009b). In case of the Planck survey, the fully marginalized, combined constraint is tightened by a factor of , constraining to less than . This prescription overestimates the effects in the small field regime () and thus leads to overly optimistic constraints. Nevertheless, the improvement in constraints signals that it is worth developing a more accurate model for the effects on halo mass function and bias (e.g., along the lines of Li and Hu (2011)). Given the importance of the cluster power spectrum in the constraints, an accurate model for the modified halo bias will be crucial. Furthermore, a model for the cluster power spectrum on mildly nonlinear scales would also lead to tighter constraints by allowing to be increased above the value of adopted here.
Vii Conclusions
The large cluster samples expected from current and upcoming SZ surveys can be exploited to place tight constraints on modifications to gravity. We have shown that the Planck cluster sample will allow for more than one order of magnitude improvement in constraints on the field parameter over current observational constraints, even when marginalizing over the expansion history (parametrized by ) and bias and scatter in the massobservable relation. Similarly, SPT, SPTPol and ACTPol should provide improvements of about a factor 3–4. Using number counts only, the Planck cluster catalog should be able to reduce errors to in the near future. The inclusion of the cluster power spectrum as a probe greatly improves results especially in the small field limit. The best constraint we obtain is for Planck (combined constraints, ) and is mainly driven by the power spectrum. These constraints push into the regime not ruled out by Solar System tests Hu and Sawicki (2007). Even with selfcalibration, a good understanding of the cluster selection function will be necessary to realize this potential however. On the theoretical side, a better description of the modified gravity effects on halo mass function and bias should allow for further improvements. In addition, the use of a proper likelihood function would constitute an important validation of the results obtained here with the Fisher matrix approximation.
Acknowledgements.
EP and NM acknowledge support from NSF grant AST0649899. EP and DM were partially supported by NASA grant NNX07AH59G. EP also acknowledges support from JPLPlanck subcontract 1290790. She would like to thank the hospitality of the Aspen Center for Physics for hospitality during the preparation of this work. FS would like to thank Wayne Hu for helpful discussions. FS is supported by the Gordon and Betty Moore foundation at Caltech.Appendix A Covariance of Cluster Power Spectra
In this appendix we derive the Fisher matrix element for the cross and autopower spectra of clusters binned in mass. Let denote the crosspower spectrum between mass bins and . In this section we will suppress the explicit redshiftdependence for clarity. The variance of the crosspower spectrum measured in a narrow range is given by
(29) 
Here, denotes the comoving number density of clusters in mass bin , and the number of modes is given by
(30) 
where the factor of in front accounts for the fact that the density field is real, reducing the number of independent modes by one half. This factor is sometimes neglected in the literature. The volume is given by
(31) 
Using this, we can derive the general power spectrum Fisher matrix as