# Implications of multiple high-redshift galaxy clusters.

###### Abstract

To date, high-redshift () galaxy clusters with mass measurements have been observed, spectroscopically confirmed and are reported in the literature. These objects should be exceedingly rare in the standard CDM model. We conservatively approximate the selection functions of these clusters’ parent surveys, and quantify the tension between the abundances of massive clusters as predicted by the standard CDM model and the observed ones. We alleviate the tension considering non-Gaussian primordial perturbations of the local type, characterized by the parameter and derive constraints on arising from the mere existence of these clusters. At the confidence level, with cosmological parameters fixed to their most likely WMAP5 values, or (at confidence) if we marginalize over WMAP5 parameters priors. In combination with constraints from Cosmic Microwave Background and halo bias, this determination implies a scale-dependence of at . Given the assumptions made in the analysis, we expect any future improvements to the modeling of the non-Gaussian mass function, survey volumes, or selection functions to increase the significance of found here. In order to reconcile these massive, high-z clusters with an , their masses would need to be systematically lowered by or the parameter should be higher than CMB (and large-scale structure) constraints. The existence of these objects is a puzzle: it either represents a challenge to the CDM paradigme or it is an indication that the mass estimates of clusters is dramatically more uncertain than we think.

###### pacs:

cosmology## I Introduction

Recent developments in observational hardware and observing techniques have enabled the detection of many massive, high-redshift clusters (see, e.g. Bremer et al., 2006; Muchovej et al., 2007; Brodwin et al., 2010; Chiaberge et al., 2010), which seem to create some tension with the abundance predictions of the standard CDM paradigm (Jimenez & Verde, 2009; Holz & Perlmutter, 2010). Previous work (Matarrese et al., 2000; Lo Verde et al., 2008; D’Amico et al., 2011) have examined how the abundance of high-redshift massive clusters within the CDM model can be enhanced by allowing the primordial fluctuations, a relic of inflation, to deviate from a Gaussian random field. The most basic models of inflation predict a scale invariant power spectrum of density pertabations , described by a Gaussian random field . Probes of the very early Universe (e.g., Hinshaw et al., 2009) and the Large Scale Structure of the late Universe have shown that this description is a good approximation to first order. However, any deviations from the slow-roll, single field, adiabatic vacuum state inflation (and more complex inflationary models) predict deviations from Gaussianity (see e.g., Bartolo et al., 2004; Komatsu & et al, 2009; Byrnes & Choi, 2010, and refs. therein), which are of interest because they 1) Modify the number of high-redshift clusters, relieving tension between theory and observation, and 2) Allow an observational window into early universe physics. The non-Gaussian corrections may be charaterised by the coefficient (Salopek & Bond, 1990; Gangui et al., 1994; Verde et al., 2000; Komatsu & Spergel, 2001), which affects the initial potential field , as

(1) |

in the so-called local non-Gaussianity case.

Observations of the Cosmic Microwave Background (CMB) WMAP3 by Yadav & Wandelt (2008), measured to be within (at the confidence level). More recently, Komatsu & other (2011) find (at 95% C.L.), consistent with the above range but also consistent with zero. The CMB constrains at large scales (), but on smaller scales the Large Scale Structure (LSS) can also constrain through the clustering (see e.g., Verde, 2010; Sartoris et al., 2010, and refs. therein) and abundances of massive halos (see e.g., Matarrese et al., 2000; Lo Verde et al., 2008). Measurements of using LSS, provides complementary constraints to the CMB and probes any scale dependence of . Considering the scale-dependence on halo bias induced by local non-Gaussianity, Xia et al. (2010) obtain at , ( at 95% confidence) from the NVSS survey; this signal comes from scales .

The detection of the high-redshift cluster of galaxies XMMUJ2235.3+2557 (Mullis et al., 2005) and a Hubble Space Telescope weak lensing mass measurement (Jee, 2009), allowed Jimenez & Verde (2009) to show how the tension between CDM (which predicts such clusters) and this cluster could be alleviated with values of . Massive clusters abundance probes on scales corresponding to the Lagrangian radius of the halos; .

Holz & Perlmutter (2010) then calculated at which redshift and mass, the most massive cluster in the Universe was expected to be found, and how this changed with survey volume. They also found that XMMUJ2235.3+2557 was more than away from , CDM predictions.

Finally, Cayón et al. (2010) formally calculated the constraints which could be placed on using XMMUJ2235.3+2557. They computed the probability that the “most massive” cluster expected within the survey volume had a mass, 1) greater than the upper mass estimate of the cluster, 2) within the upper and lower bounds on the mass estimate, and 3) less than the lower bound on the clusters mass. They Poisson sampled from these abundances to obtain a probability that a cluster with the mass of XMMUJ2235.3+2557 was the “most massive” system. By exploring how values of modified cluster abundances (using Matarrese et al., 2000), they placed constraints on to be greater than zero at the significance level. We note that is only one possible explanation of the existence of high-redshift massive clusters (see, e.g., Baldi & Pettorino, 2011).

The above studies represent the latest results for constraining on () cluster scales, and have concentrated on the above single cluster at high-redshift. We extend these previous works by exploring the constraints on using high-redshift () spectroscopically confirmed galaxy clusters with masses measured in the literature.

The layout of the paper is thus; we begin by reviewing the theoretical form of the cluster mass function and the non-Gaussian correction to it, and continue by describing the compilation of a high-redshift cluster sample. Here we discuss our conservative assumptions about the selection functions and survey volumes. We then describe our analysis and find the best fitting cosmological parameters, followed by our conclusions and discussions. Throughout the paper, unless otherwise stated, we assume a flat CDM model with WMAP5 (Hinshaw et al., 2009) cosmological parameters (i.e, ), and quote using the LSS convention, e.g. (see, e.g., Verde, 2010).

## Ii The non-Gaussian cluster mass function

The theoretical cluster mass function was first written down by Press & Schechter (1974) who assumed spherically collapsed halos, and was later improved e.g., Sheth et al. (2001). Subsequently, large-volume, high resolution N-body simulations have been performed and mass functions fitting formulae have been found (see, e.g., Jenkins et al., 2001; Tinker et al., 2008; Bhattacharya et al., 2010). We use the spherical overdensity Gaussian mass function given by Jenkins et al. (2001), which determines the number of haloes as a function of mass as measured within a radius at which the density contrast is times the background matter density , and has the form,

(2) |

where is the rms variation of the density field, smoothed on scales . For a discussion of the minor differences between and mass functions see Tinker
et al. (2008). We use the icosmo^{1}^{1}1http://www.icosmo.org/ package (Refregier et al., 2008) to calculate , co-moving distances and other cosmology-dependent parameters, and use the functional form of (see Equ. B4 of Jenkins
et al., 2001) given by

(3) |

Non-Gaussian corrections to the mass function have been proposed in the literature (Matarrese et al., 2000; Lo Verde et al., 2008; Maggiore & Riotto, 2010; D’Amico et al., 2011), and over the mass and redshift ranges considered here, agree to within (see Fig. of D’Amico et al., 2011). These corrections are typically written as the ratio of the non-Gaussian to Gaussian mass functions , and are, for example, found by lineararising the -point expansion of the collapse density (as in Lo Verde et al., 2008), or by using saddle point approximations to non perturbatively account for higher order corrections (as in Matarrese et al., 2000) (MVJ), (although, see Maggiore & Riotto, 2010). We adopt the MVJ prescription to describe how the ratio of the non Gaussian to Gaussian mass functions change as a function of

(4) |

where describes the normalized skewness of the smoothed density field, and can be used to define a “skewness per unit” as . is given by

(5) | |||

where is the critical density for ellipsoidal gravitational collapse. Wagner et al. (2010) recently tested these predictions for generic non-Gaussianity, using a suite of N-body simulations, but due to difficulty in computing the initial conditions, they probed relatively low mass () systems. They found that the MVJ mass function may slightly over predict the abundances of massive clusters at high-redshift. If this result can be extrapolated to more massive clusters at even higher redshifts, then the over prediction of the MVJ non-Gaussian mass function will only strengthen the conclusions drawn from this work, as a larger value of will be required to fit the observed abundances of massive clusters using a more accurate model, implying this analysis is conservative.

After the publication of this work, Enqvist et al. (2010) found that the exponential fall in the Jenkins et al. (2001) mass function is not enough to counter the exponential increase in the non-Gaussian correction Matarrese et al. (2000) for very large values of and large masses (). They find that the Tinker et al. (2008) mass function is more well behaved for larger values of and masses, but still breaks down at very large scales. We stopped the mass function integration at just before the Jenkins et al. (2001) mass function breaks down. They additionally checked and confirmed the robustness of our method to the choice of the mass function and measure a mean value very close to that measured here, for the same sample of clusters, even after correcting for the mass function approximation. In what follows, we only place a lower constraint on the value of , and thus our approach is robust to the choice of mass function at these lower values of and masses considered.

## Iii Data

Cluster Name | Redshift | M | Method | Mass reference |
---|---|---|---|---|

’WARPSJ1415.1+3612’ | Velocity dispersion | Huang et al. (2009) | ||

’SPT-CLJ2341-5119’ | Richness | High & others. (2010) | ||

’XLSSJ022403.9-041328’ | X-ray | Maughan et al. (2008) | ||

’SPT-CLJ0546-5345’ | Velocity dispersion | Brodwin et al. (2010) | ||

’SPT-CLJ2342-5411’ | Richness | High & others. (2010) | ||

’RDCSJ0910+5422’ | X-ray | Mei et al. (2009) | ||

’RXJ1053.7+5735(West)’ | X-ray | Stott et al. (2010) | ||

’XLSSJ022303.0–043622’ | X-ray | Stott et al. (2010) | ||

’RDCSJ1252.9-2927’ | X-ray | Mei et al. (2009) | ||

’RXJ0849+4452’ | X-ray | Mei et al. (2009) | ||

’RXJ0848+4453’ | X-ray | Mei et al. (2009) | ||

’XMMUJ2235.3+2557’ | X-ray | Stott et al. (2010) | ||

’XMMXCSJ2215.9-1738’ | X-ray | Stott et al. (2010) | ||

’SXDF-XCLJ0218-0510’ | X-ray | Tanaka et al. (2010) |

We compile a list of high-redshift () spectroscopically confirmed clusters with masses measured or estimated in the literature, and present them in Table 1. We believe this list to represent all known spectroscopically identified clusters with mass measurements. We show the cluster’s name, the spectroscopic redshift, the cluster mass and mass error converted to (in units of , assuming an NFW profile (Navarro et al., 1996) if necessary) which is the mass enclosed within a radius at which the density is times that of the background matter density. and the reference to the mass measurement. We distinguish clusters detected by X-ray surveys and those found using the Sunyaev-Zel dovich (Sunyaev & Zeldovich, 1972, hereafter SZ) effect.

Here for each cluster we adopt the mass estimate that gave the least tension (best agreement) with CDM. For an illustrative example consider two cases; 1) A cluster mass has a large central value () with a large error () , and 2) a cluster has a slightly lower mass estimate () with a smaller error bar ) (see Brodwin et al., 2010). In our analysis, we find that case 1 is more likely to exist in an CDM, than case 2. Thus, we use case 1 to be conservative.

We note that mass measurements from different techniques typicaly agree well, e.g. XMMUJ2235.3+2557 had mass measurements using weak lensing of , and (Jee, 2009), and X-ray mass measurements of (Rosati et al., 2009) and (Stott et al., 2010).

We also note that potential high-redshift clusters have been detected, but not followed up spectroscopically (e.g. see Gladders & Yee, 2005), so their redshifts, and typically, masses are subject to larger uncertainties, if not unknown. This implies that our analysis can only place a lower limit on as the other clusters may have higher redshifts and/or be more massive than the clusters in our sample, which would further boost the required value of .

If any of these potential high-redshift clusters candidates were found to be less massive than those in our sample, or at lower redshifts, (and such smaller systems are expected in all , CDM cosmologies), they would not detract from these results using the present selection of clusters, as as our analysis only consider these “rare events” that have already been confirmed.

The ability to detect a cluster, measure its redshift and mass for any survey, can be described by the selection function. For believable upper and lower limits to be placed on cosmological parameters (including ) using galaxy clusters, the selection function must be understood. Our analysis uses heterogeneously selected clusters, so combining the selection functions is non trivial. We now describe how we conservatively model the selection functions for the X-ray and SZ surveys. We note that deviations from the conservative modeling, will only strengthen our conclusions.

### iii.1 Selection function

We split the cluster catalogues into two broad categories, those detected using the X-ray by the ROSAT and XMM satellites, and those found using the SZ effect at South Pole Telescope (Carlstrom et al., 2009, hereafter SPT).

#### iii.1.1 X-ray

Many of the X-ray surveys have partially overlapping footprints, differing flux limits and exposure times. This means that some clusters were multiply detected by distinct groups, e.g. XMMUJ2235.3+2557 was originally detected by the XMM-Newton Distant Cluster Project (Mullis et al., 2005), but was later redetected by the XMM Cluster Survey (Romer et al., 1999). The combination of all of the X-ray surveys, as performed here, makes the construction of the full survey volume and selection function non-trivial.

We continue conservatively, by assuming that all X-ray surveys had independent footprints (even if they did not) and uniform survey volumes (even if some were shallower than others), which we choose to be between (2.2 represents our estimate of the deepest survey limit). We find that our conclusions are stable to arbitrary increases of the maximum redshift assumed, but will depend on improvements to the modeling of the survey footprints and volumes. We reiterate that any improvements to the conservative selection function and footprints adopted here, will make any conclusions drawn from this analysis stronger, as a reduced survey volume (caused by a smaller footprint or exposure time), or a worse selection function (i.e. there are clusters in the volume that have not been found) will modify the number of observable clusters expected, which will, at best not change our results, but at worse, increase tension with CDM.

The conservative X-ray survey footprint is sq. degrees and is composed of sq. degrees from the XMM Cluster Survey, sq. deg. from the XMM-Large Scale Survey (Pierre & Consortium, 2001), sq. deg. from the XMM-Newton Distant Cluster Project, sq. deg. from the XMM Contiguous survey (Finoguenov et al., 2010), sq. deg. from the Wide Angle ROSAT Pointed Survey (Perlman et al., 2002), and sq. deg. from the ROSAT Deep survey (Hasinger et al., 1998).

#### iii.1.2 Sz

The SZ SPT survey has a well understood selection function, and was expected to detect all massive clusters above (Haiman
et al., 2001; Battye &
Weller, 2003), at all redshifts. We again assume a survey volume between and use the footprint of sq. degrees. To measure the redshifts of clusters detected with the SZ, one needs optical spectroscopic follow up. Not all the identified clusters have had their redshifts and masses measured (see High &
others., 2010), but we continue conservatively, by assuming that only clusters with follow-up were detected. This is conservative because future cluster measurements will not relieve the tension with found using the current collection of clusters.

Fig 1 is an Aitoff projection representing the survey footprints of the combined X-ray survey (shown as a green contiguous region, although note that the actual X-ray footprint covers many different directions across the full sky), the SZ SPT survey (in yellow), and we also show the Sloan Digital Sky Survey (Abazajian et al., 2009, hereafter SDSS) survey footprint for comparison (red). The high-redshift clusters compiled here, are represented by the crosses and triangles. This figure demonstrates how little of the high-redshift sky has been observed, and how much volume remains to find other potentially massive, high-redshift clusters, which may increase the tension with CDM.

## Iv Method and Results

Our analysis follows two approaches. First we build on the approach of (and refer the reader to §3 of Cayón et al., 2010), and define the “least probable” (i.e., a combination of most massive and highest redshift) clusters in each of the combined X-ray surveys and the SZ survey, which due to the high mass, should also be the easiest to find. We then extend the approach of Cayón et al. (2010), by using the existence of the compiled cluster sample, including the clusters full mass error distributions, to examine the probability that the ensemble of clusters could exist in a CDM universe, and probe how the probability increases with . Initially we keep the cosmological parameters fixed to WMAP5 peak values, and then relax this constraint and marginalize over WMAP5 priors.

We used the output of both Gaussian () and non-Gaussian (with ) N-body simulations (obtained from the authors of Wagner et al., 2010) at a snapshot corresponding to , to successfully blind test the code pipelines. We computed the relative values of needed to explain the existence, abundances, and masses of clusters above , after crudely assuming a survey footprint and a redshift slice i.e., a survey geometry. We found that at a fixed “probability of existing”, the recovered value of for the non-Gaussian simulation data was always greater, than that of the Gaussian simulation data. For the assumed survey geometry we found that the probability of the ensemble of clusters to exist was at in the Gaussian case and in the non-Gaussian case. In the non-Gaussian case, a value of is required to obtain a probability of existing to be . We reiterate that the exact recovered probability of existence at fixed values, depends on the crude conversion of the simulated snapshot volume at , to the assumed survey geometry, but the differences between the simulations required a value of similar to that inputted into the non-Gaussian simulations.

### iv.1 The least probable clusters

We begin by asking the question, “What is the least probable object to be found in each survey assuming ?”. This approach is analogous to determining the most massive system in the survey (e.g. Cayón et al., 2010), but generalized to include the redshift-dependence of the mass function.

Assuming the central value for the clusters mass, we find that the cluster XMMUJ2235.3+2557 is the least probable X-ray detected object, we expect over the full sky (and in the X-ray survey footprint) at and using our cosmology and theoretical mass function. We also find that SPT-CLJ0546-5345 is the least probable SZ detected cluster; we expect only over the full sky (and in the survey) with and .

Following Cayón et al. (2010), we calculate the probability that the mass of the “least probable” cluster in each survey falls within one of the following three mass bins; 1) less than the mass range of the cluster, 2) within the mass range of the cluster, and 3) greater than the mass range of the cluster. This is accomplished by calculating the theoretical cluster abundance within each mass bin, and then Poisson sampling from these three abundances times (using the same random number seed for each of the three bins), and recording the most massive bin which the Poisson samples is . This yields a probability that the “most massive” cluster exists is within the above mass bins, and within the survey volume.

We then gradually increase , which boosts the abundances of clusters, and Poisson sample from these new abundances to re-derive the above probabilities. This allows us to place constraints on using the least probable observed cluster in each survey.

In Fig. 2 we show the probability that each observed massive cluster is the (theoretically predicted) “least probable” system in the survey as a function of . We note that both clusters provide similar constraints, which, when combined, points to some tension with CDM. The constraints obtained here, are slightly different to that in Cayón et al. (2010), due to differences in the assumed survey footprint, mass function and cosmological parameters. Note that for example XMMUJ2235.3+2557 has another (weak lensing-based) mass estimate which has a higher central value and smaller error-bars. This makes our approach conservative.

### iv.2 All clusters

We proceeded by using the existence, masses and full error budgets, of the clusters in the sample. To model uncertainties we adopt the following Monte-Carlo approach. We log Gaussian random sample from each cluster’s mass and error times producing a set of sampled masses , and determine how many clusters one would expect to find above each sampled mass and above the redshift of the cluster out to edge of survey volume using the mass function expression. For each of the sampled masses , we Poisson sample from the predicted abundances , and noted if the Poisson sample , i.e. that a cluster more massive than this cluster with a redshift equal to or greater than this cluster could exist. This formed a probability , that each cluster , could exist (marginalized over its mass uncertainty), rather than forming a probability that the cluster is the “most massive”, as above i.e., the probability a cluster exists is . We then repeated this analysis for each of the clusters and multiplied the probabilities that each cluster could exist in the surveyed region, to produce a combined probability , that the ensemble of high-redshift clusters could exist in the modeled universe. We increased the value of and repeated the analysis to produce a probability distribution and stopped the analysis when the , i.e, that all the clusters were likely to exist in the cosmological model and survey volumes.

Fig. 3 shows the probability that each cluster could exist given the survey volumes and selection function. The X-ray and SZ identified clusters are distinguished in the figure, but combined in the analysis. We show how the probability for each cluster, varies if we change from (black symbols) to (red symbols).

We see that many clusters are unlikely to exist in a CDM universe, and by multiplying the probabilities, we find that the probability of the observed Universe being well described by this model is . When we note that each cluster is more likely to exist, and the combined probability (for our mass function), which suggests that this model is a better description to the observed Universe (although see, Enqvist et al., 2010, for a discussion of the validity of the chosen mass function).

In Fig. 4 we plot the combined probability that all the clusters could exist as a function of . We see that the model is a poor fit to the observed Universe, and by increasing we alleviate tension. We constrain at the confidence level using these clusters. We remind the reader that any improvement in the modeling of the survey volumes, footprints or theoretical mass function, or the detection of more massive, high-redshift clusters, will only increase this result.

#### iv.2.1 Varying cosmological parameters

We next simultaneously Gaussian random sample from the parameters times, using the WMAP5 priors (without imposing spatial flatness) and record the value of evaluated at , denoted here as , which describes the probability of observing our clusters in their surveys (i.e. the exsistence of these clusters in their surveys is allowed at C.L.). This procedure is totally analogous to the so-called “generalized p-value” for , where the uncertainty in the clusters mass and on cosmological parameters is effectively marginalized over by treating them as “nuisance parameters” with probability distributions given by the mass estimates and WMAP constraints.

In Fig. 5 we show the d distribution of (generalized) p values (so that ) as a function of . In other words Fig. 5 shows the frequency in our Monte Carlo procedure of each value of . We obtain () at () confidence.

In Fig. 6 we present a selection of two dimensional distributions, showing the values of for the sampled parameter values, against marginalized distributions of; left) the variance of the density field smoothed on scales , and right) the spectral index . The filled color contours show the (red) and (blue) significance levels, and we have marked the peaks in each of the distributions by crosses. When viewing these plots, one should keep in mind that they represents value distributions for ; thus these figures should not be interpreted as standard Markov Chain Monte Carlo plots.

We find that is degenerate with , but less degenerate with all the other varied parameters (we have shown only a selection). We can calculate the value of needed for by going to lower values, or extrapolating down the line of degeneracy using the left panel of Fig. 6, resulting in a value of . If we only vary and keep the other parameters fixed to their WMAP5 peaks values, we find when .

It is interesting to note that Actacama Cosmology Telescope found at 95% CL from upper limits on the SZ power spectrum (Fowler et al., 2010), and SPT found (Lueker et al., 2010). The SZ power spectrum signal depends very strongly on but not as strongly on as, for current observations, it is dominated by massive () but lower redshift () clusters (see Komatsu & Seljak (2002)). The latest WMAP results alone (combined with external data sets) give a more direct, cleaner, measurement () Komatsu & other (2011). The high value necessary to obtain is away from these constraints.

## V Conclusions and discussion

We compiled a list of high-redshift () galaxy clusters with mass measurements from the literature and used their existence to place constraints on the non Gaussianity parameter . The clusters were identified from X-ray surveys and the SZ SPT survey, and we conservatively assumed a selection function and survey volume. We used the theoretical Gaussian mass function of Jenkins et al. (2001) and the prescription for modifying the cluster abundance for non Gaussianities of Matarrese et al. (2000). We additionally used the output of the Gaussian and non Gaussian N-body simulations (obtained from the authors of Wagner et al., 2010) at , to successfully blind test the code pipelines.

We chose to use cluster mass estimates which were performed assuming a cosmology close to WMAP5 CDM, and to remain conservative, if more than one measurement technique had been used, we adopted the cluster mass and error measurement which allowed for the lowest sampled cluster mass.

We performed two sets of analysis. First we asked the question, which is the least probable cluster in each survey (this also turns out to be the most massive cluster) and asked how likely this cluster was to be the “most massive” system in each survey. We found that both massive clusters provide some tension with the WMAP5 CDM model, and that by multiplying the probabilities, we find that these two clusters have a probability of being observed of .

Using the existence of the clusters, their masses and full errors distributions, we then calculated the probability that each cluster could exist in the survey. We sampled from each cluster’s mass and error and calculating the expected (Jenkins mass function-predicted) abundance above each sampled mass and above the redshift of the cluster, and then Poisson sampled from the abundances^{2}^{2}2Subsequently, Enqvist
et al. (2010) showed that our results are robust to the choice of mass function for the lower bounds placed on reported here.. We recorded the frequency that the Poisson sampled number was greater than or equal to one, implying that at least one cluster with the sampled mass could exist above the redshift of the cluster in the survey volume. We used the frequency of existence to construct a probability that each cluster could exist. We then combined all probabilities, to obtain a final probability that the ensemble of clusters could be found in the modeled universe, and we showed how this probability changes with . We note that our method allows for only a lower limit to be placed on . This is because any new clusters, or improvements to the survey volumes, or selection functions, will increase tension with CDM with WMAP priors on cosmological parameters.

We found that the best fitting models bound to be greater than at the confidence level, when keeping the WMAP5 parameters fixed at their peak values. We also Gaussian random sampled from the cosmological parameters using the WMAP5 priors. For each realization, we calculated the value of , above which of the probability distribution lay. We find that the median value of is , and drops below , in only of realizations. This means that even after marginalizing over cosmological parameters assuming WMAP5 priors, we still find at the confidence level.

We have performed several checks: i) the signal is not driven by few objects (e.g., only clusters detected in X-rays or only those detected in SZ, or only clusters which mass estimate is obtained from X-rays etc.) ii) these rare events are not evidently clustered in a special patch of the sky iii) cosmological parameters degeneracies: the parameter is degenerate only with the parameter. To obtain that is allowed at 95% C. L., the value of would have to be larger than current cosmological (CMB alone and in combination with LSS) constraints. iv) all the cluster mass estimates would have had to be systematically overestimated by , regardless of the measurement technique used, to allow the ensemble to clusters to be fully compatible with CDM.

In Fig. 7, we compare the result obtained here with other works, using a modified version of Fig. of Verde (2010). We overplot the result on CMB scales (using the at of , at the confidence level by Yadav & Wandelt (2008) (dark green); of at by Komatsu & other (2011) (light green); the LSS results at scales of at by Cayón et al. (2010) (light salmon, but note that to apply an upper constraint, they assume that there will be no other clusters found in this footprint as massive or more massive than this cluster); our result of (so , dark salmon); the result using a measurement of the non Gaussian scale dependent bias at scales of at the C.L. and peaked at by Slosar et al. (2008) (light blue); and the result at ( at the 95% C.L.) by Xia et al. (2010) (dark blue). They also obtained a similar, fully consistent, constraint from the SDSS quasar sample (). For our application here we use the NVSS numbers.

We used these measurements to constrain the non Gaussian spectral index , defined by (Lo Verde et al., 2008),

(6) |

where the indicates the CMB pivot scale, . Note that this scale-dependence parameterization does not allow to change sign, so in the following approach only is sampled by our procedure. This (theoreticaly-imposed) prior is not too important as for only a small region with relatively low probability (recall that Komatsu & other (2011) finds at ).

Due to our inability to reliably place an upper constraint on (see the introduction to the data section for justification), we assumed a log normal distribution for with a mean of and .

We sampled from the measured values of , while keeping fixed to the central value, and found the best fitting curve (using MPFIT^{3}^{3}3http://cow.physics.wisc.edu/craigm/idl/idl.html) and recorded the value of at each pass. The distribution of is described by at , which is a detection of scale dependent bias, using Yadav &
Wandelt (2008), Slosar et al. (2008) and our result, or at , which is a detection of scale dependent bias, using Komatsu &
other (2011), Xia et al. (2010) and our result, or at , using Komatsu &
other (2011), Slosar et al. (2008) and our result. All of these constraints are in agreement with Cayón
et al. (2010). Since these sets of analysis are not independent, the differing results highlight some possible systematics effects. We show these lines of best fit on Fig. 7.

For a non flat distribution of objects, each with an observed error, we must account for more objects to be scattered into some part of the distribution than are scattered out. This is described by the Eddington bias, and occurs here because the number of expected very massive clusters above a mass , is exponentially smaller than the expected number of clusters with mass less than . This could allow lower mass clusters to masquerade as higher mass clusters, and potentially cause us to over estimate .

The Eddington bias is estimated to be only a fraction of the full mass error used in this work, and we have marginalized over the full mass error distribution and have therefore removed any of the Eddington bias effects.

As a worked example we present the cluster XMMU J2235.3-2557. To calculate the true Eddington bias, one should adopt the more robust cluster mass estimate not, as we have done here, the more conservative one. Typically, the more conservative mass estimate is the one with the largest mass error. E.g., Mortonson
et al. (2011) states the X-ray mass estimate of XMMU J2235.3-2557 to be . We find that the statistical correction to the mass , is with , and the correction for the Eddington bias is , which is indeed higher than the statistical correction (although less than ). Now, if we instead use the weak lensing mass estimate , of the same cluster, we obtain a statistical correction of with , and the corresponding Eddington bias correction is . The Eddington bias here is therefore times smaller than the statistical error of the X-ray estimate, which is that used in this work.

We conclude with the remarks that we have attempted to remain very conservative with our choices of selection functions and volumes, with the cluster mass estimates, and the modeling of the theoretical non Gaussian cluster mass function. Any future improvements in the modeling is expected to strengthen the conclusions of this work; if the survey volume decreases, or more clusters are followed up spectroscopically and found to be massive, or the theoretical non Gaussian mass function modeling is improved, the tension with WMAP5 CDM will, in all cases, increase. The existence of high-redshift massive clusters is a puzzle: it represent a challenge to the CDM paradigme if the clusters mass estimates reported in the literature (central values and errors) are taken face value. These objects grew too massive too fast compared to the gravitational instability picture in a CDM paradigm. Alternatively this is an indication that mass estimates of high-redshift clusters is dramatically more uncertain than currently believed. Weak lensing clusters mass estimate is an extremely promising approach to test this possibility as (e.g., Mandelbaum et al., 2010) robust and accurate mass estimates are possible. Such an observational effort would help address this “too big, too early” puzzle.

## Acknowledgments

BH would like to thank Christian Wagner for detailed discussions and making the results of his simulations available, and Shaun Hotchkiss for useful discussions and code comparisons, and LV thanks Carlos Penya Garay for discussions. The authors thank a anonymous referees for comments which improved the paper. BH acknowledges grant number FP7-PEOPLE- 2007- 4-3-IRG n 20218, and the Department of Mathematics and Applied Mathematics at the University of Cape Town for hospitality, LV and RJ are supported by MICINN grant AYA2008-0353. LV is supported by FP7-IDEAS-Phys.LSS 240117, FP7-PEOPLE-2007-4-3-IRGn202182.

## References

- Abazajian et al. (2009) Abazajian K. N., Adelman-McCarthy J. K., Agüeros M. A., Allam S. S., Allende Prieto C., An D., Anderson K. S. J., Anderson S. F., Annis J., Bahcall N. A., et al. 2009, ApJS, 182, 543
- Baldi & Pettorino (2011) Baldi M., Pettorino V., 2011, MNRAS, 412, L1
- Bartolo et al. (2004) Bartolo N., Komatsu E., Matarrese S., Riotto A., 2004, Physics Reports, 402, 103
- Battye & Weller (2003) Battye R. A., Weller J., 2003, PRD, 68, 083506
- Bhattacharya et al. (2010) Bhattacharya S., Heitmann K., White M., Lukić Z., Wagner C., Habib S., 2010, ArXiv e-prints:1005.2239
- Bremer et al. (2006) Bremer M. N., et al., 2006, MNRAS, 371, 1427
- Brodwin et al. (2010) Brodwin M., et al., 2010, ApJ, 721, 90
- Byrnes & Choi (2010) Byrnes C. T., Choi K., 2010, Advances in Astronomy, 2010
- Carlstrom et al. (2009) Carlstrom J. E., et al., 2009, ArXiv e-prints:0907.4445
- Cayón et al. (2010) Cayón L., Gordon C., Silk J., 2010, ArXiv e-prints:1006.1950
- Chiaberge et al. (2010) Chiaberge M., Capetti A., Macchetto F. D., Rosati P., Tozzi P., Tremblay G. R., 2010, ApJL, 710, L107
- D’Amico et al. (2011) D’Amico G., Musso M., Noreña J., Paranjape A., 2011, JCAP, 2, 1
- Enqvist et al. (2010) Enqvist K., Hotchkiss S., Taanila O., 2010, ArXiv e-prints:1012.2732
- Finoguenov et al. (2010) Finoguenov A., et al., 2010, MNRAS, 403, 2063
- Fowler et al. (2010) Fowler J. W., et al., 2010, ApJ, 722, 1148
- Gangui et al. (1994) Gangui A., Lucchin F., Matarrese S., Mollerach S., 1994, ApJ, 430, 447
- Gladders & Yee (2005) Gladders M. D., Yee H. K. C., 2005, ApJS, 157, 1
- Haiman et al. (2001) Haiman Z., Mohr J. J., Holder G. P., 2001, ApJ, 553, 545
- Hasinger et al. (1998) Hasinger G., Burg R., Giacconi R., Schmidt M., Trumper J., Zamorani G., 1998, Astron. & Astrophys., 329, 482
- High & others. (2010) High F. W., others. 2010, ApJ, 723, 1736
- Hinshaw et al. (2009) Hinshaw G., et al., 2009, ApJS, 180, 225
- Holz & Perlmutter (2010) Holz D. E., Perlmutter S., 2010, ArXiv e-prints:1004.5349
- Huang et al. (2009) Huang X., et al., 2009, ApJL, 707, L12
- Jee (2009) Jee M. J. a., 2009, ApJ, 704, 672
- Jenkins et al. (2001) Jenkins A., et al., 2001, MNRAS, 321, 372
- Jimenez & Verde (2009) Jimenez R., Verde L., 2009, PRD, 80, 127302
- Komatsu & et al (2009) Komatsu E., et al 2009, in astro2010: The Astronomy and Astrophysics Decadal Survey Vol. 2010 of Astronomy, Non-Gaussianity as a Probe of the Physics of the Primordial Universe and the Astrophysics of the Low Redshift Universe. pp 158–+
- Komatsu & other (2011) Komatsu E., other 2011, ApJS, 192, 18
- Komatsu & Seljak (2002) Komatsu E., Seljak U., 2002, MNRAS, 336, 1256
- Komatsu & Spergel (2001) Komatsu E., Spergel D. N., 2001, PRD, 63, 063002
- Lo Verde et al. (2008) Lo Verde M., Miller A., Shandera S., Verde L., 2008, JCAP, 4, 14
- Lueker et al. (2010) Lueker M., et al., 2010, ApJ, 719, 1045
- Maggiore & Riotto (2010) Maggiore M., Riotto A., 2010, Astrophys. J., 717, 526
- Mandelbaum et al. (2010) Mandelbaum R., Seljak U., Baldauf T., Smith R. E., 2010, MNRAS, 405, 2078
- Matarrese et al. (2000) Matarrese S., Verde L., Jimenez R., 2000, ApJ, 541, 10
- Maughan et al. (2008) Maughan B. J., et al., 2008, MNRAS, 387, 998
- Mei et al. (2009) Mei S., et al., 2009, ApJ, 690, 42
- Mortonson et al. (2011) Mortonson M. J., Hu W., Huterer D., 2011, PRD, 83, 023015
- Muchovej et al. (2007) Muchovej S., et al., 2007, ApJ, 663, 708
- Mullis et al. (2005) Mullis C. R., Rosati P., Lamer G., Böhringer H., Schwope A., Schuecker P., Fassbender R., 2005, ApJL, 623, L85
- Navarro et al. (1996) Navarro J. F., Frenk C. S., White S. D. M., 1996, ApJ, 462, 563
- Perlman et al. (2002) Perlman E. S., et al., 2002, ApJS, 140, 265
- Pierre & Consortium (2001) Pierre M., Consortium T., 2001, pp 185–+
- Press & Schechter (1974) Press W. H., Schechter P., 1974, ApJ, 187, 425
- Refregier et al. (2008) Refregier A., Amara A., Kitching T., Rassat A., 2008, ArXiv e-prints:0810.1285
- Romer et al. (1999) Romer A. K., Viana P. T. P., Liddle A. R., Mann R. G., 1999, ArXiv astro-ph/9911499
- Rosati et al. (2009) Rosati P., et al., 2009, Astron. & Astrophys., 508, 583
- Salopek & Bond (1990) Salopek D. S., Bond J. R., 1990, PRD, 42, 3936
- Sartoris et al. (2010) Sartoris B., Borgani S., Fedeli C., Matarrese S., Moscardini L., Rosati P., Weller J., 2010, MNRAS, 407, 2339
- Sheth et al. (2001) Sheth R. K., Mo H. J., Tormen G., 2001, MNRAS, 323, 1
- Slosar et al. (2008) Slosar A., Hirata C., Seljak U., Ho S., Padmanabhan N., 2008, JCAP, 8, 31
- Stott et al. (2010) Stott J. P., et al., 2010, ApJ, 718, 23
- Sunyaev & Zeldovich (1972) Sunyaev R. A., Zeldovich Y. B., 1972, Comments on Astrophysics and Space Physics, 4, 173
- Tanaka et al. (2010) Tanaka M., Finoguenov A., Ueda Y., 2010, ApJL, 716, L152
- Tinker et al. (2008) Tinker J., et al., 2008, ApJ, 688, 709
- Verde (2010) Verde L., 2010, Advances in Astronomy, 2010
- Verde et al. (2000) Verde L., Wang L., Heavens A., Kamionkowski M., 2000, MNRAS, 313, 141
- Wagner et al. (2010) Wagner C., Verde L., Boubekeur L., 2010, JCAP, 10, 22
- Xia et al. (2010) Xia J.-Q., et al., 2010, JCAP, 1008, 013
- Yadav & Wandelt (2008) Yadav A. P. S., Wandelt B. D., 2008, Physical Review Letters, 100, 181301