First measurement of the bulk flow of nearby galaxies using the cosmic microwave background
Abstract
Peculiar velocities in the nearby Universe can be measured via the kinetic SunyaevZel’dovich (kSZ) effect. Using a statistical method based on an optimised crosscorrelation with nearby galaxies, we extract the kSZ signal generated by plasma halo of galaxies from the Cosmic Microwave Background (CMB) temperature anisotropies observed by the Wilkinson Microwave Anisotropy Probe (WMAP). Marginalising over the thermal SunyaevZel’dovich contribution from clusters of galaxies, possible unresolved point source contamination, and Galactic foregrounds, we find a kSZ bulk flow signal present at the % confidence level in the sevenyear WMAP data. When only galaxies within 50 Mpc are included in the kSZ template we find a bulk flow in the CMB frame of km s, in the direction , , consistent with bulk flow measurements on a similar scale using classical distance indicators. We show how this comparison constrains, for the first time, the (ionised) baryonic budget in the local universe. On very large ( Mpc) scales, we find a 95% upper limit of 470 km s, inconsistent with some analyses of bulk flow of clusters from the kSZ. We estimate that the significance of the bulk flow signal may increase to 35 using data from the Planck probe.
1 Introduction
Peculiar velocities are the only probe of the largescale (10 – 1000 Mpc
Some recent peculiar velocity studies using classical distance indicators have suggested that there may be a cosmic bulk flow on scales of Mpc in excess of that expected in CDM (Watkins et al., 2009; Lavaux et al., 2010; Feldman et al., 2010; Colin et al., 2011), whereas other studies have found lower values (Nusser & Davis, 2011; Turnbull et al., 2012). Clearly, it would be useful to have an independent probe of the peculiar velocity field on these scales.
On much larger scales, the kSZ effect in rich clusters has recently been used to measure the bulk flow by Kashlinsky et al. (2008, hereafter KAKE), following (Kashlinsky & AtrioBarandela, 2000). However, subsequent studies using similar data sets (Keisler, 2009; Osborne et al., 2011; Mody & Hajian, 2012) have not confirmed a flow with statistical significance claimed by KAKE. While the results of the latter authors have in turn been questioned (AtrioBarandela et al., 2010; Kashlinsky et al., 2011, 2012), the lack of independent confirmation means that this verylarge scale “Dark Flow” remains controversial. The main challenge is to disentangle the kSZ signal from the primary fluctuations of the CMB and the instrumental noise. The kSZ signal is at the level of a few K, while the CMB is in tens of K and the noise may be in hundreds of K for WMAP (Jarosik et al., 2011). Previous kSZ work has focused on clusters of galaxies (Haehnelt & Tegmark, 1996; Kashlinsky et al., 2008; Osborne et al., 2011; Mody & Hajian, 2012) because clusters hold a significant fraction (but not most) of the ionised plasma in the local universe. Indeed recently Hand et al. (2012) have detected the signal of pairwise infall of clusters using the kSZ effect. Unfortunately there are relatively few clusters of galaxies within 100200 Mpc in the nearby Universe, and so, as shown by the studies cited above, the corresponding errors on the bulk flow are large.
In this paper, we use galaxies themselves to produce a kSZ template which we crosscorrelate with the CMB fluctuations. Galaxies, including spirals, are expected to have a large plasma halo (Fukugita & Peebles, 2006; Rasmussen et al., 2009), indeed this is a standard assumption of galaxy formation models for over 40 years (Rees & Ostriker, 1977; Silk, 1977; White & Rees, 1978). While much of the hot gas is expected to lie outside haloes, in the socalled warmhot intergalactic medium (WHIM, Davé et al., 2001; Shull et al., 2011), nevertheless, this WHIM gas should be correlated with the haloes of larger galaxies and thus may be detected statistically through crosscorrelations (see Sołtan, 2006, for a claimed detection). In the context of this paper, therefore, “plasma halo” refers to both the hot plasma in the dark matter halo as well as plasma in the WHIM that is correlated with the galaxy.
The use of galaxies as kSZ tracers has also been advocated by Ho et al. (2009) and Shao et al. (2011). In particular, using 2MASS galaxies as a probe of the KAKE bulk flow has been proposed by Zhang (2010). On the one hand, in a given volume there are far more galaxies than there are clusters of galaxies, e.g., within 200 Mpc, there are 60 000 galaxies in the 2M++ galaxy catalogue (Lavaux & Hudson, 2011), whereas in the ReflexeBCSCIZA catalogue (RBC, Kocevski & Ebeling, 2006), there are only 273 clusters of galaxies in the same volume. On the other hand, the density profile of the plasma halo is far more uncertain for galaxies than for the clusters of galaxies. Moreover, the plasma halo occupies a much smaller area on the sky, which requires an accurate modelling of beaming effects. Nevertheless there are expected to be more ionised electrons in or near galaxies than in clusters of galaxies, so smaller errors are expected from a galaxy sample.
The basic method used in this paper is to model the observed WMAP maps in all frequencies simultaneously using a combination of foreground templates to represent the kSZ effect as well as the thermal SunyaevZeldovich (tSZ) effect. We also model the contamination from radio point sources that may be present in the galaxies, as well as Galactic foregrounds. We then analytically marginalise all results with respect to primary CMB fluctuations, and additionally marginalise over independent monopoles and dipoles in each channel.
In Section 2, we describe the statistical method used to fit the templates. In Section 3, we describe the data and models used to build the templates for the thermal and kinetic SunyaevZel’dovich effects and point source contamination. In Section 4, we present and discuss our measurement of the kSZ effect. We compare the kSZ bulk flow with results using other methods and measure the baryon fraction in free electrons in Section 5. In Section 6, we discuss future improvements and forecast the errors on the components of the kSZ bulk flow from the upcoming Planck temperature maps. Finally, Section 7 concludes the paper.
2 Statistical method
Here, we adopt a fullsky, multichannel statistical model for WMAP7 data. We can model the observed sky temperature as a combination of multiple physical effects using a template based method, as in e.g. (Gorski et al., 1996; Jaffe et al., 2006). The templates that we consider in this work will model the two SunyaevZel’dovich effects, the Point Source contaminations, the Galactic foreground emission, the residual temperature monopoles and dipoles. Generically, the expected observed signal , in the frequency channel , may be written as:
(1) 
where is the beam of the instrument corresponding to channel , is an instrumental noise component, is the Cosmic Microwave Background (CMB) signal, are real scalars and is the template of signal in the channel (without beam). We note that is a matrix operation, which can include convolution by any kind of smoothing kernel modelling the instrument response, provided the response is linear. expresses our prior on the frequency dependence for the physical signal . In the case where the frequency dependence is not known, the operator is the identity, . This implicitly states that for the template , we have a number of free parameters corresponding to the number of frequency channels. In the case where the frequency dependence of the phenomenon is known, does not depend on , e.g. . This states that only one free parameter is required for all channels, e.g. . There may be other intermediate cases if some channels are at the same frequency. We may pick two concrete example that concern us in this work: the tSZ signal and the unresolved point source contamination map. In the case of tSZ, we know the frequency dependence, which is encoded in . We thus only need one parametre . In the case of the map of point sources, the frequency dependence is unclear so we are leaving it free. We will have three parametres: , , .
We assume that the primary CMB signal is a realisation of a Gaussian random field, and that the instrumental noise is also Gaussian. After marginalisation over the primary CMB fluctuations, we may write the total loglikelihood as (see Appendix A):
(2) 
with
(3) 
and
(4) 
In the above, and run over the available channels of WMAP, is the instrumental noise covariance matrix in the channel . The inverse covariance matrix does not use the noise covariance matrix (only its inverse) and it uses the angular CMB spectrum (and not its inverse). Thus, we do not have to worry about how to regulate the inverse of these two operators. In the above, we have used pseudoinversion and we have set
(5) 
with the pixel masking operator. In the case of WMAP observations, we assume that the noise covariance matrix is diagonal in pixel space, but we do not assume homogeneity of the noise. It takes the form , with the noise normalisation and the number of observations for each frequency channels as provided by the WMAP collaboration. The matrix is an additional masking operation acting in pixel space. All pixels which are masked are set to zero by this operator. In our case, the mask will correspond to a Healpix map, with value either one, for accepting the pixel, or zero, for rejecting it. This form has already been used in Wandelt et al. (2004) in the context of Gibbs sampling of the CMB fluctuations.
If we assume that the angular power spectrum of the primary CMB is kept fixed, the equation (2) leads to an analytic solution of the :
(6) 
with
(7) 
Computing requires applying times the operator . In this paper, we consider ten templates, and for one map per channel, three input maps. This results in applying thirty times. We do not require more because, for a fixed primary CMB spectrum, we may precompute a weighted template , which is sufficient to compute all the terms of the covariance matrix and the maximum likelihood estimate . The procedure is detailed in Appendix B.
For the practical purpose of the numerical implementation, we have separated the fitting of the Galactic foreground templates from the fluctuations due to kSZ, tSZ and point sources contamination. Doing so, the matrix , with elements, stays the same for all of our experiments on the SZ components and point sources. We have separated the fitting using Gibbs sampling on a set of parameters assuming the other set is fixed. Gibbs sampling provides samples of the posterior distributions. On the positive side, by selecting samples, we can impose a nonGaussian prior on the . On the negative side, we are required to run a sampling chain. If the templates are correlated, this chain may take time to converge. In practice though, for the cases of Section 4, the convergence is achieved in a few steps.
3 Template generation
In this section, we describe the physical models used to generate the templates of the three signals considered in this work: kinetic SunyaevZel’dovich (kSZ), the thermal SunyaevZel’dovich (tSZ) and radio point source contamination (PSC).
The signal from the kinetic SunyaevZel’dovich effect is our primary goal. This is derived from the galaxies belonging to 2M++ galaxy compilation (Lavaux & Hudson, 2011), which in turn is based on the 2MASS eXtended Sources Catalog (Skrutskie et al., 2006) and publiclyavailable redshift surveys.
The thermal SunyaevZel’dovich (tSZ) signal is produced mostly by clusters of galaxies. As clusters and galaxies are correlated, it is necessary to also model the tSZ signal and marginalise over it. We model the signal using the position and luminosity of clusters in the RBC (Kocevski & Ebeling, 2006) catalogue.
Finally, it may be expected that a subset of local galaxies host radio or submillimetre sources that contaminate WMAP data as unresolved point sources. The population of galaxies and of unresolved point sources may thus be correlated and must also be marginalised over. The contamination by unresolved point sources is obtained using the information contained in the 2MASS eXtended Sources Catalog (Skrutskie et al., 2006, hereafter 2MASSXSC).
For fitting the contamination of the temperature fluctuations due to MilkyWay emission, we have used the templates provided by the WMAP7 collaboration on Lambda.
We now describe the templates in detail.
3.1 kSZ model
We assume there is a massive halo of ionised plasma around galaxies. Fukugita & Peebles (2006) proposed an isothermal model for this plasma halo, which we further modify by introducing a hole at the centre of the galaxy. There are at least two reasons to do this. First, we reduce the possibility of correlating the kSZ signal with radio sources located at the centres of galaxies. Second, we expect that the plasma close to the centre of the haloes would cool and collapse to form the cold gas and stars in galaxies. Thus we adopt the following model:
(8) 
with the maximum extent of our gas profile, and kpc. In doing so, we do not attempt to fit the part of the galaxy dominated by stars and may avoid some point source contamination. The density of electrons is determined from the gas density, and the Helium abundance
(9) 
with the mass of the proton, and the total mass density of the gas, and (e.g. Komatsu et al., 2011).
The kSZ effect depends on the electron density and the velocity of the electrons with respect to the primary CMB, i.e. it is proportional to the momentum. The velocity of a galaxy in the 2M++ sample can be modelled using linear perturbation theory applied to the density field. Here instead, for simplicity, we model the flow field to lowest order, i.e. by a dipole or “bulk flow” with constant Cartesian components . The average kSZ temperature change due to a bulk flow is given by:
(10) 
with is given by Equation (12). After integration, we get:
(11) 
with the Heaviside step function, being the fraction of free electrons, , and km s the typical velocity dispersion of galactic haloes. is the projected distance from the centre of the galaxy:
(12) 
where is the angular distance of the galaxy at redshift , and is the unit vector in the direction of the galaxy. We set the kSZ template to zero at small radii . Finally, we scale according to the luminosity as:
(13) 
with as given for the best Schechter function fit in Lavaux & Hudson (2011). We take the normalisation from Fukugita & Peebles (2006). The parameter is introduced to increase the effective radius of the plasma halo. As noted above the “plasma halo” refers not only to the actual halo of a given galaxy but also to all correlated plasma. The choice of is selected so that all free electrons, including those in the WHIM or associated with low luminosity galaxies not in the 2M++ catalogue, are included in the kSZ template (see Appendix C). We adopt as a default, but consider other possibilities in Section 4.2.
As shown below (Section 5), for WMAP, we are mostly sensitive to nearby galaxies at a typical distance of 20 Mpc. The typical apparent scale of galaxies on the sky is thus Mpc degrees. For ( respectively), the scale is thus degrees ( degrees). In both cases, the modelled plasma halo is bigger than the beam size of WMAP, which is degrees.
We generate three kSZ templates, one for each bulk flow Cartesian axis, and then fit simultaneously all components. In Figure 1, we show the kSZ template obtained before multiplying by a dipolar bulkflow and normalised to 100 km s, in the inward direction. Equivalently, it would be the kSZ signal that would be produced if all selected galaxies were moving at the same 100 km s, speed towards us. Using a different colour scale, this same map corresponds to the optical depth. Averaging over prebeamed nonnull pixels, the average optical depth is for . For , this same average yields , a 44% increase.
The kSZ template exhibits smaller structures than the tSZ signal. Additionally, the kSZ signal does not depend on the frequency. These two features make the kSZ signal far more difficult to detect in the CMB data than the tSZ signal. Nevertheless, we note that the kSZ signal should dominate the tSZ signal on the scale of the plasma halo of a galaxy. The ratio between the tSZ signal and the kSZ is given by
(14) 
with the global velocity of the galaxy, the mean temperature of electrons in the halo, the mass of electrons. With K, keV and km s, we get . Additionally, the bulk flow gives a dipolar modulation of the galaxy distribution on the sky, whereas the tSZ signal would simply trace cluster (or galaxy) distribution. Consequently, we do not expect a systematic contamination by tSZ at the galaxy level.
3.2 tSZ model
The tSZ signal is generated by hot and dense ionised gas. These conditions are found in the intracluster medium, and clusters of galaxies have already been detected using this effect (for a review see e.g. Birkinshaw, 1999). We use the model given in the appendix of Afshordi et al. (2005) for the temperature and gas density profiles in clusters. Given the Xray luminosity, the RBC catalogue provides the virial mass from which we derive the virial radius , with , the gravitational constant and the Hubble constant. Using this mass estimate, the model predicts the two profiles assuming hydrostatic equilibrium. The temperature change due to the presence of a cluster in direction is given by:
(15) 
with and , respectively denoting the density and temperature of the free intracluster electrons at a distance from the centre of the cluster, the frequency at which the temperature is observed. The dependence on the frequency is
(16) 
is the projected distance from the centre of the cluster. It has the same expression as for galaxies, given in Equation (12). Note that we stop the integration at twice the virial radius of the cluster.
An example of the tSZ template in the W channel of WMAP is given in Figure 2. We have selected all 781 clusters with redshift , from RBC. With our normalisation the resulting signal is typically at a level of a fraction of a milliKelvin.
3.3 Point source model
We follow Afshordi et al. (2004) and Jarosik et al. (2011) in modelling the effective temperature of point sources in the CMB sky. We remind the reader of their main result:
(17) 
with is a conversion factor between physical flux and antenna temperature that depends on the channel , is the direction of the th pixel, and is the direction of observation of the galaxy . We also use , which is one if and corresponds to the same pixel of the CMB map, and zero otherwise. accounts for the difference between antenna temperature and thermodynamic temperature (Bennett et al., 2003):
(18) 
is the average point source luminosity in the channel . is a normalisation factor to account both for pixelisation and the normalisation of the beam. is the luminosity density of galaxies in the considered catalogue. We use the typical value derived by Lavaux & Hudson (2011) for normalising: . is the observed flux of the galaxy :
(19) 
with the intrinsic luminosity of the sun in the K band of 2MASS: . Page et al. (2003) gives the value for in the case of freefree emission (scale free spectrum with a slope ). The point source template includes only galaxies with apparent magnitude (with the extinction derived from the reddening (Schlegel et al., 1998)). The relation between the reddening and the extinction is
(20) 
where the constant of proportionality is obtained from the relation between extinction in band and in band (Cardelli et al., 1989).
3.4 Effective mask
Since WMAP data are contaminated by strong radio sources and foreground emission, it is necessary to mask parts of the sky that may contaminate the fitting procedure. We have used the “Extended Temperature Analysis” mask (dubbed KQ75 by the WMAP collaboration) to remove Galactic contamination.
In addition, we need to allow for the fact that, due to Galactic extinction, galaxies are not uniformly detected on the sky. To ensure the completeness of the 2MASSXSC sample, and thus of our template, we also require that the Galactic extinction .
The 2M++ sample has less uniform sampling at low redshift for Galactic latitudes because at lower latitudes only the galaxies are properly sampled. We thus add two further conditions: and to produce the effective mask. We use a conservative latitude cut to remove also as much galactic foreground as possible from the fitting. The final mask is shown in Figure 3. The operator in Equation (5) corresponds to multiplying a pixelated temperature data by this mask.
4 Results
In this Section, we present our measurement of the kSZ signal using WMAP7 data. We present the significance and the possible shortcomings of the measurement in Section 4.1.
4.1 Results from 2M++
Our “default” solution is based on the templates and mask described in Section 3, with the amplitude of the templates left free. We use the best fit angular power spectrum provided by the WMAP collaboration (Komatsu et al., 2011) for a CDM model obtained from WMAP7 data alone (Jarosik et al., 2011). Additionally, we independently fit the residual full sky dipole in WMAP7 maps, which could otherwise contaminate our fit of the kSZ signal, which is also dipolar, albeit localised at the positions of galaxies. We produce a set of kSZ templates with the 2M++ galaxy sample limited at different distances from the observer. All templates discard galaxies within 5 Mpc which is the minimum distance at which redshift information can be used as an average indicator of distance. We produce subsamples limited to 50 Mpc, 100 Mpc, 200 Mpc and fit each galaxy sample independently.
Line  Sky cut  kSZ depth  kSZ  

()  ( Mpc)  (km s)  (%)  
1  50  6.1  11  
2  100  2.5  47  
3  200  5.0  17 
Note: For all the above measurements, we have used the basic mask of Section 3.4. The reported has three degrees of freedom. There is a weak correlation between components that is accounted for in the evaluation of the . The correlation is 14% for , 2% for and 2.6% for . is the probability of the null hypothesis, namely that there is no bulk flow. An empty field means that the value is kept the same compared to the previous line. For all measurements we find consistent tSZ measurement with an amplitude of , though the bulk of the tSZ pixels are masked out. Without masking the amplitude of the tSZ is . The point source contamination signals are, in unit of Jansky, (Q channel), (V channel) and (W channel).
Line  Experiment  kSZ  

(km s)  (%)  
1  tSZ masked  6.1  11  
2  tSZ not masked  5.5  14  
3  NFW, tSZ not masked  5.4  15  
4  tSZ not masked,  6.4  10  
5  tSZ not masked, scaling,  3.8  29  
6  tSZ masked, scaling, limit to ,  7.3  6  
7  tSZ extended masking, scaling, limit to ,  6.2  10  
8  tSZ masked,  6.6  9  
9  tSZ not masked, kSZ monopole  5.7  13 
Note: In all the above, we have used a mask based on the KQ75 sky cut, as indicated in Section 3.4. The notation “tSZ masked” indicates that we have masked out pixels which were expected to have unbeamed tSZ signal from the cluster sample. “tSZ extended masking” indicates that all pixels with a contamination bigger than 0.1K, after beaming in the Q channel, have been masked out. All kSZ templates have been generated from the 2M++ limited to a depth of 50 Mpc.
4.2 Tests of potential systematics
In deriving our solution, we imposed the constraint that that the Galactic foreground corrections are positive. Technically, this corresponds to enforcing the constraint , for the templates corresponding to foregrounds. As a check on the foreground template fitting, we give in Table 3 our best fit parameters for the Galactic foregrounds (without forcing the parameter to be positive). We also show in Table 3 the value of the parameters found by the WMAP collaboration, which we have averaged by frequency channel. There is a good agreement between the two sets of parameters when we include all pixels at . However, there is some tension on the parameters of the when only pixels with are included, but we note that the WMAP collaboration fitted their models after degrading their map at , using all the pixels of the maps. The cut at is thus more consistent with their methodology. We conclude that our foreground fitting at full resolution is consistent with the findings of the WMAP collaboration.
Origin  Dust map  map  map  

(K R)  
Q  V  W  Q  V  W  Q  V  W  
WMAP7  0.20  0.47  1.28  0.23  0.047  0.00  1.22  0.78  0.43 
Note: We show the best fit amplitude for each of the templates provided by the WMAP collaboration (Gold et al., 2011), at a resolution of . The first two lines has been obtained in this work, using the indicated sky cuts. The last line has been derived from table 2 of Gold et al. (2011).
After subtracting the Galactic foreground components, we are left with a signal for pointsources which is consistent with zero for the V and W channels and marginally above zero for the Q channel. The tSZ template has an amplitude of , significant at , and consistent with unity. A value of unity means that the physical model adopted for the tSZ effect is exactly right.
In Table 2, we investigate how different cuts or models affect the results of line 1 of Table 1. The first line of this table is a copy of the first line of Table 1. We check the impact of pixel masking in line 2. For this experiment, we use all the pixels of the map and we assume that our model for the tSZ signal is correct in average. This operation introduces a marginal change in the kSZ signal.
Our default model for the plasma halo discussed in Section 3.1 is simple and it is possible that the plasma traces halos of different mass differently, or has a different radial dependence. In lines 3 to 6, we investigate the impact of the choice of the model for the density profile of the plasma halo. Using a NavarroFrenkWhite profile (Navarro et al., 1996) instead of an isothermal profile barely changes the result (line 3 vs. line 1). We have attempted to scale up the plasma halo radius by changing the multiplicative factor to the extreme value . Increasing this factor may be useful for including the presence of smaller haloes and/or filamentary WHIM that are clustered with the galaxy. This operation does not affect mean recovered velocity and slightly reduces the error bars by introducing more pixels in the correlation. This suggests for WMAP that it is difficult to measure the radial extent of clustered electrons via the strength of the kSZ effect
It is also possible that the mapping between stellar mass traced by band light and plasma halo mass differs from our fiducial choice for which the mass is proportional to . In lines 5 to 7, we have introduced the scaling of the velocity dispersions with the luminosity of the galaxy according to
(21) 
where . This choice effectively forces mass to be proportional to the luminosity. The first impact of such a choice is just to increase error bars (line 5), which is expected as the kSZ signal from small galaxies is strongly reduced. Limiting the sample to the 4450 galaxies within 50 Mpc which have an absolute magnitude brighter than and adjusting accordingly (Appendix C) leads to a strong change in the flow amplitude along the direction (line 6). The cause of this change is not understood. In line 7, we have tried to remove as much tSZ signal as possible by masking out pixels which were contaminated at a level larger than 0.1 K after beaming according to our model and the RBC cluster catalogue. The kSZ signal does not change compared to line 6.
In line 8, we have reduced the masked regions to include more temperature measurements at lower galactic latitudes. Compared to the fiducial experiment in line 1, the error bars are slightly smaller in all directions. The mean values per directions are in perfect agreement with the values in line 1.
For the case 9, we consider whether we may miss a kSZ monopole component and/or a tSZ component generated by the plasma halo around galaxies. Not accounting for this effect in our model may bias the results. To assess this effect, we add a new template that corresponds to a kSZ monopole. We find a monopole for the kSZ template equal to a mean (inward) peculiar velocity of km s. The value takes into account errors correlation between the component, which are introduced by the mask.
In Figure 6, we show the likelihood contours of the direction of the kSZ signal for two different mask cuts and . The test is run for a kSZ depth of 200 Mpc. There is no sign of strong variation of the peak of the likelihood by cutting 40% of the available pixels. The only visible impact is a widening the distribution.
Finally, in the left panel of Figure 7 we show the result of a jackknife test obtained by removing the part of the sky that corresponds to the indicated root pixel of the Healpix mesh. These root pixels are given in the right panel of Figure 7. This test shows that, at a given depth, the kSZ signal is stable. We note that for deeper measurements, the jackknife causes the recovered flow to scatter along the Xdirection and thus could be interpreted as a sign of contamination by the WMAP “haze” (Finkbeiner, 2004; Pietrobon et al., 2011). Correlations between galaxies and MilkyWay extinction properties are likely in the bulge and that could correlate the kSZ signal with the haze. We note that the effect of this haze would go in the opposite direction as what we obtain for the kSZ in all slices: it creates a positive temperature fluctuation which would push the kSZ signal towards negative X. In our case, the signal is pushed towards positive X. Additionally, the push would be affected by removal of the the galactic bulge part in the Jackknife test, which is not the case.
4.3 Results from 2MASSXSC
Experiment  Typical depth  kSZ  
( Mpc)  (km s)  (%)  (%)  
2MASSXSC  470  4.3  23  
Kashlinsky et al. (2010)  175260  7.1  7  9.1  
260380  14.7  0.2  16.3  
270530  15.2  0.16  16.1 
Note: , and its associated probability , represent the probability that the flow is zero. , and its associated probability , is the probability that the results of Kashlinsky et al. (2010) are in agreement with our result based on 2MASSXSC.
Finally, we have estimated the amplitude of the bulk flow using the entire 2MASSXSC limited to as the kSZ template. After removing nearby large galaxies and parts of the sky that could suffer strong reddening, we have 959 554 galaxies with . As the 2MASSXSC does not have redshifts, we randomly generated distances for each galaxy based on their apparent magnitude and an assumed Schechter function for the distribution of absolute magnitudes of galaxies. Specifically, we used the Schechter parameters given in Lavaux & Hudson (2011) for the 2M++: , . The result is not strongly dependent on these parameters. We estimated the kSZ bulk flow from an ensemble of ten realisations of those distances. The scatter in bulk flow from one realisation to the other is km s by component. The median depth of such a survey is 470 Mpc, with a mean distance of 550 Mpc because of the long tail of the distribution. The results are given in Table 4 and are consistent with no bulk flow on these very large scales, with a 95% upper limit of 520 km s, after correction for random error biasing. The standard deviation per component of the kSZ signal is 100 km s. We have not found any evidence for a bulk flow that would be inconsistent with CDM. However, our results disagree with the bulk flow found by Kashlinsky et al. (2010) at the 4.2 level
5 Bulk flows and the fraction of free electrons
While the bulk flow on very large scales is controversial, the existence of a bulk flow at depths of Mpc, due to local superclusters such as Virgo, the Great Attractor region and PerseusPisces, is not controversial: there is good agreement between peculiar velocity surveys as well as with the predictions of the density field. The kSZ signal is the product of the electron density times the velocity, i.e. the momentum. Therefore, we can take the existence of the bulk flow as a given, and instead determine the density of free electrons.
In order to do so, we must first consider the volume that is actually being sampled by combination of the kSZ template and the WMAP data. Although our template extends to 200 Mpc, nearby galaxy haloes have more weight, and so the bulk flow measured is not simply volume limited. We discuss the effective volume of the kSZ measurement below before discussing the fraction of free electrons.
5.1 Kinetic SunyaevZel’dovich effect window function
The amplitude and direction of the kSZ effect may be interpreted as a bulk flow obtained from the weighted average of the peculiar velocities of galaxies. The weighting depends both on the selection cut and the limitations of the instrument used to observe the CMB. The weighted average procedure may be modelled by the convolution of the peculiar velocity field by a kernel, called window function. We argue that the bulk flow estimate must have a similar window function to that used in a maximum likelihood analysis of galaxy distance surveys. We may assume, e.g., that galaxies have all the same physical size . Their sizes on the sky, in terms of number of pixels, are thus , with their redshifts and their angular distances. The number of pixels affected by each galaxy in the kSZ template is thus scaling as . If for simplicity we assume that the pixel noise is homogeneous, the weight for the peculiar velocity of each galaxy becomes proportional to the inverse squared distance of this galaxy. This is exactly the same case for bulk flow derived from maximum likelihood estimate from galaxy distances. In that case, the peculiar velocity errors scale as the distance , which leads to the same weight per galaxy as the kSZ effect.
We give a quantitative derivation of this window function in Appendix D where we also discuss the limitations of this calculation. Figure 8 shows the square of the window function of the bulk flow in the Galactic bulge direction ( axis). We also show the window function corresponding to the convolution with a Gaussian kernel of size defined as
(22) 
The size is determined such that the second derivatives of the actual window function and of the Gaussian kernel are equal. The measurement, based on galaxies within 50 Mpc, seems to correspond to a Mpc Gaussian window on large scales. Similarly, the measurement using galaxies within 200 Mpc corresponds to a Gaussian window of 31.4 Mpc. We have evaluated the window function for the Galactic and components of the bulk flow and we have obtained similar results for the probed scales. Because the masked galaxy distribution is anisotropic, the window functions are not strictly equal for the three components of the bulk flow. The important point is that the kSZ signal is dominated by galaxies on scales significantly nearer than the limiting depth.
5.2 Comparison with published bulk flows and the free electron fraction
When only galaxies within 50 Mpc are used in the kSZ template, we find km s (after correction for “error biasing”
Reference  Scale  Filter  

( Mpc)  (km s)  (km s)  (km s)  (km s)  ()  ()  
Watkins et al. (2009)  20  Gaussian  
SHALLOW  MLE  
SFI++  MLE  
Lavaux et al. (2010)  50  Tophat 
Note: In the case of “SHALLOW” and “SFI++”, we have used the Maximum Likelihood Estimator (MLE) values quoted in Watkins et al. (2009). The value given reported as 20 Mpc Gaussian is obtained using the minimum variance estimator applied on the “COMPOSITE” sample.
Given the agreement in direction, we may assume that the amplitude itself is significant. The amplitude of the kSZ signal, expressed as bulk flow components , may be modelled as the following:
(23) 
with , being the fraction of free electrons and modelling the unknown normalisation of the density profile of the galactic plasma halo. is taken as a free parameter for this subsection, the bulk flow components as determined by other methods listed in Table 5. We may derive the posterior probability of using Bayes identity, which, up to a normalisation, is:
(24) 
with
(25) 
and
(26) 
In the above expressions, we have used the notation for a three dimensional Gaussian probability, with the mean and the standard deviation for the same component. ( respectively) is given by the errors on the components quoted in Table 1 (Table 5 respectively). Effectively the posterior probability (24) may be written as a likelihood function
(27) 
We show the posterior distribution of the parameter in Figure 10 for different choices of the published bulk flows against one single of our kSZ measurement. We are taking the line 1 of Table 1 as the fiducial kSZ measurement. Assuming the bulk flow for the SHALLOW sample in Watkins et al. (2009), the posterior peaks at (68% confidence limit). For the bulk flow of the SFI sample, this value becomes . Assuming Lavaux et al. (2010) cut at 50 Mpc tophat, the posterior peaks at . For information, we have shown two extreme choices of choosing the amount of baryons in galaxies. The choice corresponds to considering that galaxies have a dark matter halo and all baryons are ionised in the plasma halo. For indicative purposes, we have shown in Figure 10, . We would measure such a value if the density of baryons were equal to the mean density of dark matter. We note that all posteriors are in agreement with a choice of . The ratio of the posterior at its maximum to its value for in the best case (green curve, SHALLOW sample) is . For the blue and red curve, this ratio is and respectively. According to Jeffreys (1961), a ratio of corresponds to substantial evidence against the null hypothesis there are no free electrons in galaxy halos .
6 Discussion and Future Work
There are several ways in which this measurement can be improved in the near future. First, improved CMB data is anticipated from SPT (Schaffer et al., 2011) and from the Planck mission. We discuss below the improvement in signaltonoise expected from Planck. Moreover, we anticipate a more detailed treatment of the peculiar velocity field, an improved understanding of the halo model and of the distribution of free electrons around galaxies.
6.1 Forecast for Planck
In this section, we estimate the noise covariance matrix of the three components of the bulk flows for the maps obtained from the Planck probe. The inverse covariance matrix is given by Equation (7). Neglecting the impact of the other foreground components, assuming the noise is homogeneous on the sky and no masking, the covariance matrix of the kSZ bulk flow becomes diagonal in spherical harmonic space and simplifies to:
(28) 
with the angular cross power spectrum of the kSZ template between the component and of the bulk flow. For a single channel, this reduces to:
(29) 
We give the forecast estimate for the channel W of WMAP7 in Table 6. In this same table, we also report the actual errors that we obtained using the full method. To make a forecast for the sensitivity that we expect using Planck maps, we consider the map at a frequency of 100 GHz. According to The Planck Collaboration (2006), the noise at resolution of is K for the temperature. The beam is modelled using a Gaussian with a FullWidthHalfMaximum equal to arcmins. The last column of the Table 6 gives the standard deviation expected for the velocity components of the bulk flow with these parameters. We have also tried our forecast with a more complex model based on the published Planck noise properties (Zacchei et al., 2011), which increases the errors reported in Table 6 by 20%. Also note that these are the errors for the 50 Mpc cut. Planck is able to reach much more distant galaxies.
There is a good agreement between the estimated value of the noise for WMAP data and the actual noise amplitude that we measure using the complete analysis of Section 2. This gives us some confidence that the projection for Planck is realistic. We note that, mainly due to the reduction of the beam width, the noise should go down by a factor of two to three by using Planck data compared to WMAP data. This estimate does not take into account possible additional foreground contamination. If the bulk flows stay at the same amplitude as we have estimated from WMAP data, that could give us a detection of the kSZ effect. If the bulk flow is at the level of that found for the SFI++ sample (Watkins et al., 2009), the significance could decrease to . It must be nonetheless noted that we have neglected all contributions from the multichannel analysis in this prediction. The highfrequency channels would clearly help to remove the contribution of the tSZ signal. Additionally, the temperature data from the Planck mission have sufficient angular resolution to probe deeper flows on scales up to 200 Mpc scale with better sensitivity. As a final note, we assume that the kSZ signal has the same amplitude as what is expected from the bulk flow given by the SHALLOW sample in Table 5. In this case, we forecast that the constraints on the baryon density in plasma haloes of galaxies from the Planck mission should be , instead of as obtained in Section 5.2.
Component  WMAP7  WMAP7  PLANCK 

obtained  forecast  forecast  
(km s)  (km s)  (km s)  
x  290  305  120 
y  285  277  111 
z  200  224  93 
6.2 Model improvements
There are several ways in which the model for the free electron momentum field will be improved.

The model for the flow field will be updated. In this paper, for simplicity, we have modelled the peculiar velocity field as a simple bulk flow, a low order moment of the peculiar velocity field. However, as suggested by Ho et al. (2009) and Shao et al. (2011), by using linear perturbation theory and an assumption about the relationship between mass and light in haloes (linear biasing or the halo model), we can predict the peculiar velocities of individual galaxies (Yahil et al., 1980; Strauss et al., 1992; Hudson, 1994) to an accuracy of km s (Willick & Strauss, 1998) or better using a nonlinear reconstruction (Lavaux et al., 2008, 2010). These predictions scale with the parameter where for CDM and is a bias factor, and so the kSZ effect will allow us to get an independent measurement of this parameter.

We are using galaxies as a tracer of the distribution of free electrons, but we can improve this model in several ways. First, the scaling of halo mass and radius with galaxy luminosity or stellar mass can be updated using results from abundance matching (Marinoni & Hudson, 2002; Yang et al., 2003; Guo et al., 2010) or weak lensing studies of low redshift galaxies (Mandelbaum et al., 2006; van Uitert et al., 2012). Second, highresolution hydrodynamical cosmological simulations can be used to get a better handle on the link between galaxy halo profiles and their crosscorrelations with the distribution of free electrons (as in e.g. Shao et al., 2011; Peirani et al., 2012).
7 Conclusion
We have developed and applied a method for estimating bulk flows from kSZ signals in temperature maps of the Cosmic Microwave Background. This method is based on a likelihood analysis of the amplitude of a set of template maps derived from catalogues of galaxies. Assuming an isothermal model for the galaxy plasma halo, we have derived template maps for the kinetic SunyaevZel’dovich from the 2M++ galaxy catalogue. The thermal SunyaevZel’dovich signal is derived in a similar fashion from the RBC galaxy cluster catalogue, assuming the model described in Afshordi et al. (2005) for the density and temperature of the gas. To estimate the extent of the contamination by point sources residing in galaxies, we have also generated a set of templates from the 2MASSXSC. We have fitted, independently in each channel, the standard WMAP7 foreground templates, the monopole and the dipole.
The amplitude of the tSZ signal is measured with a high accuracy at confidence. We have found evidence at the 90% confidence level for the presence of a kSZ signal in WMAP7 data. We have not detected any significant contamination by point sources. All the results are presented after having marginalised over the Galactic foreground emissions.
Interpreting the kSZ signal as the signature of a bulk flow, we derive an amplitude, when only galaxies within 50 Mpc are included in the template, of km s, in the direction , , in excellent agreement with the ”Seven Samurai” bulk flow (Dressler et al., 1987) and other recent determinations on similar scales. If we extend selection up to 200 Mpc, we find km s, in the direction , . The kSZ bulk flow at 50 Mpc is statistically in agreement with other measurements obtained from either galaxy peculiar velocities, observed or modelled. This comparison can be translated into a measurement of the fraction of baryon in galaxies. We found substantial evidence of baryons in the galaxy plasma halo, compatible with the mean density of baryons derived from WMAP7 (Komatsu et al., 2011). Planck data should be able to reduce the statistical error bars on the kSZ measurement by a factor two to three. If the signal stays at the present value, the evidence may be increased to , excluding improvements due to the high frequency channels for removing tSZ contamination. Realistically, if the bulk flow is as given by Watkins et al. (2009), then the evidence would be at the level.
Acknowledgements
NA and MJH acknowledge the preliminary work of Matthias Müller on a similar project.
This publication makes use of data products from the Two Micron All Sky Survey, which is a joint project of the University of Massachusetts and the Infrared Processing and Analysis Center/California Institute of Technology, funded by the National Aeronautics and Space Administration and the National Science Foundation.
GL acknowledges support from CITA National Fellowship and financial support from the Government of Canada PostDoctoral Research Fellowship and NSF Grant AST 0708849, AST 0908693. MH acknowledges support from NSERC.
Research at Perimeter Institute is supported by the Government of Canada through Industry Canada and by the Province of Ontario through the Ministry of Research and Innovation.
This work was made possible by the facilities of the Shared Hierarchical Academic Research Computing Network (SHARCNET:http://www.sharcnet.ca) and Compute/Calcul Canada.
We acknowledge the use of the Legacy Archive for Microwave Background Data Analysis (LAMBDA). Support for LAMBDA is provided by the NASA Office of Space Science.
Some of the results in this paper have been derived using the HEALPix
Appendix A Derivation of the likelihood from the model
In this Appendix, we derive the Equation (2) directly from the model assumptions. We start by expressing that the primary CMB fluctuations and the instrumental noise by channel are Gaussian. They have known covariance matrices , for the primary fluctuations, and , for the channel of the instrument. may include masking effect. , as defined in Equation (5) or may both be used in the following. The likelihood of the templates, assuming the amplitudes and the modes of the primary fluctuations and given the input data, is thus:
(30) 
with
(31) 
the residual. The above terms can be reordered such that appears only in one quadratic term:
(32) 
with
(33) 
and as defined in Equation (4). The likelihood is now split into two parts:
(34) 
The marginalisation according to primary CMB fluctuations consists in dropping the first part, which is the only one depending on . Equation (2) immediately follows.
Appendix B Accelerating the precomputation of the likelihood
For each template of the signal in the frequency channel , we define a weighted template
(35) 
The Equations (6) and (7) may be rewritten in terms of uniquely these weighted templates. We expand the Equation (3) in the Equations (6) and (7). After some algebra, we find the expression, numerically simpler, of the covariance matrix
(36) 
and for the maximum likelihood estimate