# The redshifted HI 21 cm signal from the post-reionization epoch: Cross-correlations with other cosmological probes

## Abstract

Tomographic intensity mapping of the H i distribution using redshifted 21 cm observations opens up a new window on cosmological background evolution and structure formation. This is a key science goal of several upcoming radio telescopes including the Square Kilometer Array (SKA). In this article we focus on the post-reionization signal and investigate the prospects of cross-correlating the 21 cm signal with other tracers of the large scale structure. We consider the cross-correlation of the post-reionization 21 cm signal with the Lyman-α forest, Lyman-break galaxies and late time anisotropies in the CMBR maps like weak lensing and the Integrated Sachs-Wolfe effect. We study the feasibility of detecting the signal and explore the possibility of obtaining constraints on cosmological models using it.


T. Guha Sarkar et al.

Department of
Physics, Birla Institute of Technology and Science, Pilani 333031,
India.

Department of Physics, Presidency University,
Kolkata, 700073, India.

National Centre for Radio Astrophysics,
Pune, 411007, India.

Department of Physics, Indian Institute of
Technology Kharagpur, 721302, India.

March 6, 2018

Cosmology: theory – large-scale structure of Universe – cosmology: diffuse radiation – cosmology: dark energy

## 1 Introduction

The tomographic intensity mapping of the neutral hydrogen (H i) distribution through observations of the redshifted HI 21-cm signal is an important probe of cosmological evolution and structure formation in the post-reionization epoch (Bharadwaj & Sethi, 2001; Wyithe & Loeb, 2009; Loeb & Wyithe, 2008; Chang et al., 2008). The astrophysical processes dominating the epoch of reionization are now believed to have been completed by redshift $z \sim 6$ (Fan et al., 2006). In the post-reionization era most of the neutral HI gas is housed in the Damped Ly-α (DLA) systems. These DLA clouds are the predominant source of the HI 21-cm signal. Intensity mapping involves low resolution imaging of the diffuse HI 21-cm radiation background without attempting to resolve the individual DLAs. Such tomographic imaging shall naturally provide astrophysical and cosmological data regarding the large scale matter distribution, structure formation and background cosmic history in the post-reionization epoch (Chang et al., 2008; Wyithe, 2008; Bharadwaj et al., 2009; Camera et al., 2013; Bull et al., 2015). Several functioning and upcoming radio interferometric arrays
like Giant Metrewave Radio Telescope
(GMRT) ^{3}^{4}^{5}^{6}

Cross-correlating the 21 cm signal with other probes may prove useful for mitigating the severe effect of foreground contaminants and other systematic effects which plague the signal. The main advantage of cross-correlation is that the cosmological origin of the signal can be ascertained only if it is detected with high statistical significance in the cross-correlation. Cosmological parameter estimation often involves a joint analysis of two or more data sets, and this requires not only the auto-correlation but also the cross-correlation information. Further, the two different probes may focus on specific modes with high signal to noise ratio, and in such cases the cross-correlation signal takes advantage of the different cosmological probes simultaneously. This has been studied extensively in the case of the BAO signal (Guha Sarkar & Bharadwaj, 2013). It is to be noted that if the observations of the distinct probes are perfect, there shall be no new advantage in using the cross-correlation. However, we expect the first generation observations of the redshifted HI 21 cm signal to have large systematic errors and foreground residuals (even after subtraction). For a detection of the 21 cm signal and subsequent cosmological investigations, these measurements can be cross-correlated with other large scale structure tracers to yield information from the 21 cm signal which may not be possible to obtain using the low SNR auto-correlation signal. In this article we consider the cross-correlation of the 21 cm signal with the Ly-α flux distribution. On large scales both the Ly-α forest absorbed flux and the redshifted 21-cm signal are believed to be biased tracers of the underlying dark matter (DM) distribution (McDonald, 2003; Bagla et al., 2010; Guha Sarkar et al., 2012; Villaescusa-Navarro et al., 2014). The clustering of these signals is then directly related to the underlying dark matter power spectrum.
We investigate the possibility of using the cross-correlation of the 21-cm signal and the Ly-α forest for cosmological parameter estimation, neutrino mass measurement, and the study of BAO features and the primordial bispectrum. We also investigate the possibility of correlating the post-reionization 21-cm signal with CMBR maps like the weak lensing and ISW anisotropies.

## 2 Cross-correlation between cosmological signals (General Formalism)

Consider two cosmological fields. These could, for example, represent two tracers of large scale structure. We define the cross-correlation estimator as follows

(1) |

We note that both fields can be complex. We are interested in the variance of the estimator

(2) |

Noting that , we have

(3) |

Further, the term can be dropped since

(4) |

This gives

(5) |

The variance is suppressed by a factor equal to the number of independent estimates. Thus, finally we have

(6) |
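The behaviour of such an estimator can be checked numerically. The sketch below is illustrative only: the field labels `a` and `b`, the function names and the toy power values are assumptions introduced here. The closed form it tests is the standard result for Gaussian random fields, $\sigma^2 = [P_{AB}^2 + P_{AA} P_{BB}]/(2N)$ for $N$ independent estimates of $\hat{E} = \frac{1}{2}(A B^{*} + A^{*} B)$; it is not a transcription of the numbered equations above.

```python
import numpy as np

def cross_variance(p_ab, p_aa, p_bb, n_modes=1):
    """Gaussian-field variance of E = (A B* + A* B)/2 averaged over n_modes."""
    return (p_ab**2 + p_aa * p_bb) / (2.0 * n_modes)

def simulate_estimator(p_ab, p_aa, p_bb, n_samples, seed=0):
    """Draw correlated complex Gaussian modes and return samples of E."""
    rng = np.random.default_rng(seed)

    def unit_complex(n):
        # Circular complex Gaussian with <|g|^2> = 1 and <g g> = 0.
        return (rng.standard_normal(n) + 1j * rng.standard_normal(n)) / np.sqrt(2.0)

    g1, g2 = unit_complex(n_samples), unit_complex(n_samples)
    r = p_ab / np.sqrt(p_aa * p_bb)  # correlation coefficient, |r| <= 1
    a = np.sqrt(p_aa) * g1
    b = np.sqrt(p_bb) * (r * g1 + np.sqrt(1.0 - r**2) * g2)
    return 0.5 * (a * np.conj(b) + np.conj(a) * b).real

if __name__ == "__main__":
    samples = simulate_estimator(p_ab=1.0, p_aa=2.0, p_bb=3.0, n_samples=200_000)
    # The sample mean should approach p_ab and the sample variance
    # should approach cross_variance(p_ab, p_aa, p_bb).
    print(samples.mean(), samples.var(), cross_variance(1.0, 2.0, 3.0))
```

The auto-power term $P_{AA} P_{BB}$ is where noise enters: any uncorrelated noise in either field adds to its auto-power and hence to the variance, while leaving the mean of the estimator unchanged.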

## 3 Cross-correlation of Post-reionization 21 cm signal with Lyman-α forest

Neutral gas in the post-reionization epoch produces distinct absorption features in the spectra of background quasars (Rauch, 1998). The Ly-α forest traces the HI density fluctuations along one dimensional quasar lines of sight. Ly-α forest observations find several cosmological applications (Croft et al., 1999b; Mandelbaum et al., 2003; Lesgourgues et al., 2007; Croft et al., 1999a; McDonald & Eisenstein, 2007; Gallerani et al., 2006). On large cosmological scales the Ly-α forest and the redshifted 21-cm signal are both expected to be biased tracers of the underlying dark matter (DM) distribution (McDonald, 2003; Bagla et al., 2010; Guha Sarkar et al., 2012; Villaescusa-Navarro et al., 2014). This allows one to study their cross clustering properties in n-point functions. Also the Baryon Oscillation
Spectroscopic Survey
(BOSS) ^{7}

Detecting these signals is observationally challenging. For the HI 21-cm signal, a detection requires careful modeling of the foregrounds (Ghosh et al., 2011; Alonso et al., 2015). Some of the difficulties faced by Ly-α observations include proper modelling of the continuum, fluctuations of the ionizing sources, poor modeling of the temperature-density relation (McDonald et al., 2001) and metal line contamination in the spectra (Kim et al., 2007). The two signals are tracers of the underlying dark matter distribution. Thus they are correlated on large scales. However, foregrounds and other systematics are uncorrelated between the two independent observations. Hence, the cosmological nature of a detected signal can only be ascertained in a cross-correlation. The 2D and 3D cross-correlation of the redshifted HI 21-cm signal with other tracers, such as the Ly-α forest and the Lyman break galaxies, has been proposed as a way to avoid some of the observational issues (Guha Sarkar et al., 2011; Villaescusa-Navarro et al., 2015a). The foregrounds in HI 21-cm observations appear as noise in the cross-correlation and hence a significant degree of foreground cleaning is still required for a detection.

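We use $\delta_T$ to denote the redshifted 21-cm brightness temperature fluctuations and $\delta_F$ to denote the fluctuation in the transmitted flux through the Ly-α forest. We write $\delta_F$ and $\delta_T$ in Fourier space as

(7) | $\delta_a(\mathbf{r}) = \int \frac{d^3 k}{(2\pi)^3}\, e^{i \mathbf{k} \cdot \mathbf{r}}\, \Delta_a(\mathbf{k})$

where $a = F$ and $T$ refer to the Ly-α forest transmitted flux and 21-cm brightness temperature respectively. On large scales we may write

(8) | $\Delta_a(\mathbf{k}) = C_a\, [1 + \beta_a \mu^2]\, \Delta(\mathbf{k})$

where $\Delta(\mathbf{k})$ is the dark matter density contrast in Fourier space and $\mu$ denotes the cosine of the angle between the line of sight direction $\hat{\mathbf{n}}$ and the wave vector ($\mu = \hat{\mathbf{k}} \cdot \hat{\mathbf{n}}$). $\beta_a$ is similar to the linear redshift distortion parameter. The corresponding power spectra are

(9) | $P_a(k, \mu) = C_a^2\, [1 + \beta_a \mu^2]^2\, P(k)$

where $P(k)$ is the dark matter power spectrum.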
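The large-scale model above has the familiar linear redshift-space form, in which each tracer $a$ carries an amplitude $C_a$ and a distortion parameter $\beta_a$, so that its power spectrum is $C_a^2 (1 + \beta_a \mu^2)^2 P(k)$ and the cross-power of two tracers is $C_a C_b (1 + \beta_a \mu^2)(1 + \beta_b \mu^2) P(k)$. A minimal sketch, assuming this form; the function name `tracer_power` and all numerical values are placeholders chosen for illustration and are not the fiducial values of this work.

```python
def tracer_power(p_dm, mu, c_a, beta_a, c_b=None, beta_b=None):
    """Linear redshift-space power for biased tracers of the dark matter.

    p_dm   : dark matter power spectrum P(k) at the wavenumber of interest
    mu     : cosine of the angle between the wave vector and the line of sight
    c_a, beta_a : amplitude and distortion parameter of tracer a
    c_b, beta_b : same for tracer b; omit them to get the auto-power of a
    """
    if c_b is None:
        c_b, beta_b = c_a, beta_a
    return c_a * c_b * (1.0 + beta_a * mu**2) * (1.0 + beta_b * mu**2) * p_dm

if __name__ == "__main__":
    p_dm = 100.0  # placeholder value of P(k), arbitrary units
    for mu in (0.0, 0.5, 1.0):
        auto = tracer_power(p_dm, mu, c_a=2.0, beta_a=0.5)
        cross = tracer_power(p_dm, mu, c_a=2.0, beta_a=0.5, c_b=-0.1, beta_b=1.0)
        print(mu, auto, cross)
```

A negative amplitude for one tracer, as used for the second tracer in the usage example, makes the cross-power negative. This is the situation for a flux field that decreases where the matter density increases.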

For the 21-cm brightness temperature fluctuations we have

(10) |

The neutral hydrogen fraction is assumed to be constant (Lanzetta et al., 1995; Péroux et al., 2003; Noterdaeme et al., 2009). For the HI 21-cm signal the parameter $\beta_T$ is the ratio of the growth rate of linear perturbations to the HI bias $b_T$. The 21 cm bias is assumed to be a constant. This assumption of linear bias is supported by several independent numerical simulations (Bagla et al., 2010; Guha Sarkar et al., 2012), which show that over a wide range of $k$ modes a constant bias model adequately describes the 21 cm signal. We have adopted a constant bias from simulations (Bagla et al., 2010; Guha Sarkar et al., 2012; Villaescusa-Navarro et al., 2014). For the Ly-α forest, $\beta_F$ cannot be interpreted in the usual manner as the ratio of the growth rate to a bias. This is because the Ly-α transmitted flux and the underlying dark matter distribution (Slosar et al., 2011a) do not have a simple linear relationship. The two Ly-α forest parameters are independent of each other.

We adopt approximate values of these parameters from the numerical simulations of the Ly-α forest (McDonald, 2003). We note that for cross-correlation studies the Ly-α forest has to be smoothed to the observed frequency resolution of the HI 21 cm frequency channels.

We now consider the 3D cross-correlation power spectrum of the HI 21-cm signal and the Ly-α forest flux. We consider an observational survey volume V, which consists of a patch on the sky plane with a finite line of sight thickness along the radial direction. We consider the flat sky approximation. The Ly-α flux fluctuations are now written as a 3-D field

(11) |

The observed quantity is , where the sampling function is defined as

(12) |

and is normalized to unity. The summation extends over all the quasars in the field. The weights shall in general be related to the pixel noise. However, for transmitted flux measured with high SNR, the effect of the weight functions can be ignored. With this simplification we have used uniform weights. In Fourier space we have

(13) |

One may relate to as . We have, in Fourier space

(14) |

where is the Fourier transform of and denotes a convolution defined as

(15) |

denotes a possible noise term. Similarly the 21-cm signal in Fourier space is written as

(16) |

where is the corresponding noise.

The cross-correlation 3-D power spectrum for the two fields is defined as

(17) |

Similarly, we define the two auto-correlation multi frequency angular power spectra, for the 21-cm radiation and for the Lyman-α forest flux fluctuations, as

(18) |

(19) |

We define the cross-correlation estimator as

(20) |

We are interested in the various statistical properties of this estimator. Using the definitions of the two observed fields, we have the expectation value of the estimator as

(21) |

We assume that the quasars are distributed randomly and are not clustered, and that the different noises are uncorrelated. Further, we note that the quasars are assumed to be at a redshift different from the rest of the quantities, and hence the quasar sampling function is uncorrelated with both the Ly-α flux and the 21-cm fluctuations. Therefore we have

(22) |

Noting that

(23) |

we have

(24) |

Thus, the expectation value of the estimator faithfully returns the quantity we are probing, namely the 3-D cross-correlation power spectrum.

We next consider the variance of the estimator defined as

(25) |

(26) |

We saw that

(27) |

and we note that

(28) |

where $N_T$ is the 21-cm noise power spectrum. We also have for the Ly-α forest

(29) |

where $N_F$ is the noise power spectrum corresponding to the Ly-α flux fluctuations. Using the relation

(30) |

we have

(31) |

or

(32) |

This gives

(33) |

Writing the summation as an integral we get

where $\bar{n}_Q$ is the angular density of quasars. We assume that the variance of the pixel noise contribution to the flux fluctuations is a constant and is the same across all the quasar spectra. A uniform weighting scheme for all quasars is a good approximation when most of the spectra are measured with a sufficiently high SNR (McQuinn & White, 2011). We have not incorporated quasar clustering, which is expected to be sub-dominant compared to Poisson noise. In reality, the clustering would enhance the Poisson term by a factor that depends on the angular power spectrum of the quasars (Myers et al., 2007).

For a radio-interferometric measurement of the 21-cm signal we have (McQuinn et al., 2006; Wyithe et al., 2008)

(34) |

Here $T_{sys}$ denotes the system temperature, $B$ is the observation bandwidth, $t_0$ is the total observation time, $r_\nu$ is the comoving distance to the redshift $z$, $n_b(U, \nu)$ is the density of baselines $U$, and $A_e$ is the effective collecting area of each antenna.

### 3.1 The cross correlation signal and constraints with SKA

We investigate the possibility of detecting the signal using the upcoming SKA1-mid telescope and future Ly-α forest surveys with very high quasar number densities. Two separate telescopes named SKA-low and SKA-mid, operating in two different frequency bands, will be constructed in Australia and South Africa respectively in two phases. For this work we consider the instrument SKA1-mid, which will be built in phase 1. The instrument specifications such as the total number of antennae, antenna distribution, frequency coverage, total collecting area etc. have not been fixed yet and might change in future. We use the specifications considered in the ‘Baseline Design Document’ and ‘SKA Level 1 Requirements (revision 6)’, which are available on the SKA website.

The choice of fiducial redshift is justified since it lies in the range where the quasar distribution peaks. Only a part of each quasar spectrum is used, to avoid contamination from metal lines and the quasar proximity effect. The cross-correlation can however only be computed in the region of overlap between the 21-cm signal and the Ly-α forest field.

The left panel of figure (1) shows the dimensionless redshift-space 21-cm power spectrum at the fiducial redshift. The power spectrum is not circularly symmetric in the $(k_\parallel, k_\perp)$ plane. The asymmetry is related to the redshift space distortion parameter. The right panel of figure (1) shows the 21-cm and Ly-α cross-power spectrum.

We first consider that a perfect foreground subtraction is achieved. The left panel of the figure (2) shows the contours of SNR for the 21-cm auto correlation power spectrum for a observation and total MHz bandwidth at a frequency . We have taken a bin . The SNR reaches at the peak ()at intermediate value of . We find that detection is possible in the range and . The range for the detection is and . At lower values of the noise is expected to be dominated by cosmic variance whereas, the noise is predominantly of instrumental origin at large .

The right panel of the figure (2) shows the SNR contours for the Ly- 21-cm cross-correlation power spectrum. For the 21 cm signal, a observation is considered. We have taken , and the Ly- spectra are assumed to be measured at a sensitivity level. We use to be and overall normalization factor consistent with recent measurements (Slosar et al., 2011b). Although the overall SNR for the cross power spectrum is lower compared to the 21-cm auto power spectrum, detection is ideally possible for the and . The SNR peaks ()at . The error in the cross-correlation can be reduced either by increasing the QSO number density or by increasing the observing time for HI 21-cm survey. The QSO number density is already in the higher side for the BOSS survey that we consider. The only way to reduce the variance is to consider more observation time for HI 21-cm survey and enhance the volume of the survey.

### 3.2 Parameter estimation using the cross-correlation

We now consider the precision at which we can constrain various model parameters using the Fisher matrix analysis. Figure (4) shows the confidence contours obtained using the Fisher matrix analysis for the model parameters. Table 1 summarises the errors on these parameters. The last two parameters in the table are constrained much better than the first two. The error projections presented here are for a single field of view radio observation. The noise scales as the inverse square root of the number of pointings.

| Parameters | Fiducial Value | Error (marginalized) | Error (conditional) |
|---|---|---|---|
| | 0.48 | 1.06 | 0.04 |
| | 1.11 | 1.55 | 0.05 |
| | 0.73 | 0.025 | 0.013 |
| | 0.114 | 0.01 | 0.002 |

We also consider conditional error on each of the parameters assuming that the other three are known. The projected error in and are and respectively for single pointing. For independent radio observations the conditional errors improve to , , and for , , and respectively. These constraints on the redshift space distortion parameters from our cross-correlation analysis are found to be quite competitive with other cosmological probes (Font-Ribera et al., 2012; Slosar et al., 2011a). Further, we note that higher density of QSOs and improved SNR for the individual QSO spectra shall also provide stronger constraints.
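The distinction between marginalized and conditional errors, and the scaling with the number of pointings, can be made concrete with a toy Fisher matrix. The sketch below assumes the usual definitions: the marginalized error on parameter $i$ is $\sqrt{(F^{-1})_{ii}}$, the conditional error (all other parameters held fixed) is $1/\sqrt{F_{ii}}$, and $N$ independent pointings multiply the Fisher matrix by $N$. The matrix entries and function names are hypothetical and are not taken from this analysis.

```python
import numpy as np

def marginalized_errors(fisher):
    """1-sigma errors when all other parameters are marginalized over."""
    return np.sqrt(np.diag(np.linalg.inv(fisher)))

def conditional_errors(fisher):
    """1-sigma errors when all other parameters are assumed known."""
    return 1.0 / np.sqrt(np.diag(fisher))

def scale_with_pointings(errors, n_pointings):
    """Independent pointings reduce each error by sqrt(n_pointings)."""
    return np.asarray(errors) / np.sqrt(n_pointings)

if __name__ == "__main__":
    fisher = np.array([[4.0, 2.0],
                       [2.0, 9.0]])  # toy 2x2 Fisher matrix
    print(marginalized_errors(fisher))
    print(conditional_errors(fisher))
    print(scale_with_pointings(conditional_errors(fisher), n_pointings=25))
```

The marginalized error is never smaller than the conditional one, and the gap grows with the degeneracy between parameters. This is the pattern seen in the table, where the marginalized errors are much larger than the conditional errors for the first two parameters.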

### 3.3 BAO imprint on the cross-correlation signal

The characteristic scale of the BAO is set by the acoustic horizon $s$ at the epoch of recombination. The comoving length-scale $s$ defines an angular scale $\theta_s = s\,[(1+z) D_A(z)]^{-1}$ in the transverse direction and a radial redshift interval $\Delta z_s = s H(z)/c$, where $D_A$ and $H(z)$ are the angular diameter distance and Hubble parameter respectively. At the fiducial redshift the comoving acoustic horizon scale thus corresponds to a definite angle $\theta_s$ and redshift interval $\Delta z_s$. Measurement of $\theta_s$ and $\Delta z_s$ separately allows the determination of $D_A(z)$ and $H(z)$ separately, and thereby constrains the background cosmological evolution. Here we consider the possibility of measuring these two parameters from the imprint of BAO features on the cross-correlation power spectrum.
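The two BAO observables can be evaluated for any background model. The sketch below assumes a flat ΛCDM universe, for which $(1+z) D_A(z)$ equals the comoving distance, and uses the standard relations $\theta_s = s/[(1+z) D_A]$ and $\Delta z_s = s H(z)/c$. The cosmological parameters, the sound horizon value and the redshift are placeholder assumptions for illustration, not the fiducial values of this work.

```python
import numpy as np

C_KM_S = 299792.458  # speed of light in km/s

def hubble(z, h=0.7, omega_m=0.27):
    """H(z) in km/s/Mpc for flat LCDM (radiation neglected)."""
    return 100.0 * h * np.sqrt(omega_m * (1.0 + z)**3 + 1.0 - omega_m)

def comoving_distance(z, h=0.7, omega_m=0.27, n_steps=2000):
    """Comoving distance in Mpc from a trapezoidal integral of c/H."""
    zs = np.linspace(0.0, z, n_steps + 1)
    integrand = C_KM_S / hubble(zs, h, omega_m)
    return float(np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(zs)))

def bao_scales(s_mpc, z, h=0.7, omega_m=0.27):
    """Return (theta_s in degrees, Delta z_s) for a comoving scale s_mpc."""
    r = comoving_distance(z, h, omega_m)  # equals (1+z) D_A in a flat universe
    theta_s = s_mpc / r
    delta_z = s_mpc * hubble(z, h, omega_m) / C_KM_S
    return float(np.degrees(theta_s)), float(delta_z)

if __name__ == "__main__":
    theta_deg, delta_z = bao_scales(s_mpc=150.0, z=2.5)
    print(theta_deg, delta_z)
```

Because $\theta_s$ depends on the integrated expansion history while $\Delta z_s$ depends on $H(z)$ at a single redshift, the two measurements respond differently to changes in the background model.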

The Fisher matrix is given by (Guha Sarkar & Bharadwaj, 2013)

(35) |

where refer to the cosmological parameters to be constrained. This BAO signal is mainly present at small (large scales) with the first peak at roughly . The subsequent oscillations are highly suppressed by which is within the limits of the and integrals. We use to isolate the purely baryonic features, and we use this in . Here, is the CDM power spectrum without the baryonic features. This gives

(36) |

where and denotes the scale of ‘Silk-damping’ and ‘non-linearity’ respectively. We have used and from (Seo & Eisenstein, 2007). The quantity where and corresponds to and in units of distance. is an overall normalization constant. The value of is well constrained from CMBR data. Changes in and manifest as the corresponding changes in the values of and respectively, and thus the fractional errors in and correspond to fractional errors in and respectively. We choose and as the cosmological parameters to be constrained, and determine the precision at which it will be possible to measure these using the BAO imprint on the in the cross-correlation power spectrum. We use the formalism outlined in (Seo & Eisenstein, 2007), whereby we construct the Fisher matrix

(37) |

(38) |

The Cramér-Rao bound is used to calculate the minimum theoretical error in each parameter. A combined distance measure $D_V$, also referred to as the “dilation factor” (Eisenstein et al., 2005)

(39) | $D_V(z)^3 = (1+z)^2 D_A(z)^2 \, \frac{c\,z}{H(z)}$

is often used as a single parameter to quantify BAO observations. We use the Fisher matrix to obtain the relative error in $D_V$. The dilation factor is known to be particularly useful when the individual measurements of $D_A$ and $H(z)$ have low signal to noise ratio.
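The fractional error on the dilation factor follows from simple error propagation. With the definition $D_V^3 = (1+z)^2 D_A^2\, cz/H$ of Eisenstein et al. (2005), one has $\ln D_V = \frac{2}{3} \ln D_A - \frac{1}{3} \ln H + \mathrm{const}$, so the variance of $\ln D_V$ is a quadratic form in the covariance of $(\ln D_A, \ln H)$. The sketch below assumes this parameterisation, which may differ in sign convention from the one used for the Fisher matrix in this work, and the covariance entries are made-up numbers.

```python
import numpy as np

def dilation_factor(d_a, hubble_z, z, c=299792.458):
    """D_V in Mpc for D_A in Mpc and H(z) in km/s/Mpc."""
    return ((1.0 + z)**2 * d_a**2 * c * z / hubble_z) ** (1.0 / 3.0)

def dilation_fractional_error(cov_ln):
    """Fractional error on D_V from the covariance of (ln D_A, ln H)."""
    weights = np.array([2.0 / 3.0, -1.0 / 3.0])  # d ln D_V / d(ln D_A, ln H)
    return float(np.sqrt(weights @ np.asarray(cov_ln) @ weights))

if __name__ == "__main__":
    # Toy covariance: 3% error on D_A, 2% on H, no correlation.
    cov = np.array([[0.03**2, 0.0],
                    [0.0, 0.02**2]])
    print(dilation_fractional_error(cov))
```

The combination weights $D_A$ twice as heavily as $H$, so for comparable individual errors the transverse measurement dominates the error budget on $D_V$.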

The Fisher matrix formalism is used to determine the accuracy with which it will be possible to measure cosmological distances using this cross-correlation signal.

The limits and , which correspond to and , set the cosmic variance limit. In this limit, where the SNR depends only on the survey volume corresponding to the total field of view we have , and which are independent of any of the other observational details. The fractional errors decrease slowly beyond or . We find that parameter values and , attainable with BOSS and SKA1 mid are adequate for a accuracy, whereas and are adequate for a accuracy in measurement of . With a BOSS like survey is possible to achieve the fiducial value from the cross-correlation at . The error varies slower than in the range to . We have and at and at respectively. The errors do not significantly go down much further for , and we have at .

### 3.4 Constraints on Neutrino mass

Free streaming of neutrinos suppresses power on small scales. This suppression of the dark matter power spectrum shall imprint itself on the cross-correlation of the Ly-α forest and the 21 cm signal (Pal & Guha Sarkar, 2016). We have suggested this as a possible way to constrain the neutrino mass. We have considered a BOSS-like Ly-α survey, characterised by its quasar density and the average sensitivity of the measured spectra. We have also assumed a 21 cm intensity mapping experiment at the fiducial redshift using a SKA1-mid like instrument with 250 dishes. We have assumed fiducial parameter values from Planck Collaboration et al. (2014) for this analysis. A two-parameter Fisher matrix analysis shows that for a 10,000 hr radio observation distributed over 25 pointings of 400 hrs each, both parameters are measurable [see figure (4)]. We find it significant that instead of a deep long duration observation in one small field of view, it is much better to divide the total observation time over several pointings and thereby increase the survey volume. With 100 pointings the measurement is close to the cosmic variance limit at the fiducial redshift for the given observations. In the ideal limit the measurement translates into a constraint on the total neutrino mass $\sum m_\nu$.
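The size of the effect being targeted can be estimated with a well-known linear-theory rule of thumb, which is not the calculation performed in this work: on scales well below the free-streaming scale the matter power spectrum is suppressed by roughly $\Delta P / P \approx -8 f_\nu$, where $f_\nu = \Omega_\nu / \Omega_m$ and $\Omega_\nu h^2 \approx \sum m_\nu / 93.14\ \mathrm{eV}$. The function names and the input values below are assumptions chosen for illustration.

```python
def neutrino_fraction(sum_mnu_ev, omega_m_h2):
    """f_nu = Omega_nu / Omega_m for a total neutrino mass in eV."""
    omega_nu_h2 = sum_mnu_ev / 93.14  # standard conversion, mass in eV
    return omega_nu_h2 / omega_m_h2

def small_scale_suppression(sum_mnu_ev, omega_m_h2):
    """Approximate fractional change of P(k) well below the free-streaming scale."""
    return -8.0 * neutrino_fraction(sum_mnu_ev, omega_m_h2)

if __name__ == "__main__":
    for sum_mnu in (0.06, 0.1, 0.3):
        print(sum_mnu, small_scale_suppression(sum_mnu, omega_m_h2=0.14))
```

Since the suppression is linear in $f_\nu$, a fractional error on $f_\nu$ maps directly onto the same fractional error on $\sum m_\nu$ at fixed $\Omega_m h^2$.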

## 4 Cross-correlation with Lyman break galaxies

The cross-correlation between the HI 21-cm signal and the Lyman break galaxies is another important tool to probe the large scale structure of the Universe in the post-reionization epoch. This has been studied recently (Villaescusa-Navarro et al., 2015b) using a high resolution N-body simulation. Prospects for detecting such a signal using the SKA1-mid and SKA1-low telescopes together with a Lyman break galaxy spectroscopic survey covering the same volume have also been investigated. It is seen that the cross power spectrum can be detected with a higher SNR than the HI 21-cm auto power spectrum. As with other cross power spectra, the Lyman break galaxy and HI 21-cm cross power spectrum is expected to be extracted more reliably from the much stronger but spectrally smooth foreground contamination than the HI 21-cm auto power spectrum.

## 5 Cross-correlation of HI 21 cm signal with CMBR

### 5.1 Weak Lensing

Gravitational lensing has the effect of deflecting the CMBR photons. This forms a secondary anisotropy in the CMBR temperature anisotropy maps (Lewis & Challinor (2006)). The weak lensing of the CMBR is a powerful probe of the universe at distances far greater than those reached by any galaxy survey. Measurement of the secondary CMBR anisotropies often uses the cross-correlation of some relevant observable (related to the CMB fluctuations) with some tracer of the large scale structure (Hirata et al. (2004a); Smith et al. (2007); Hirata et al. (2004b)). For weak lensing statistics the ‘convergence’ and the ‘shear’ fields quantify the distortion of the maps due to gravitational lensing. Convergence ($\kappa$) measures the lensing effect through its direct dependence on the gravitational potential along the line of sight and is thereby a direct probe of cosmology. The difficulty in precise measurement of lensing is the need for very high resolution in the CMBR maps, since typical deflections over cosmological scales are only a few arcminutes. The non-Gaussianity imprinted by lensing on smaller scales allows a statistical detection for surveys with low angular resolution. Cross-correlation with tracers limits the effect of systematics and thereby increases the signal to noise. The weak lensing observables like convergence are constructed using various estimators involving the CMBR maps (T, E, B) (Seljak & Zaldarriaga (1999); Hu (2001); Hu & Okamoto (2002)). The reconstructed convergence field can then be used for cross-correlation.

We have probed the possibility of using the post-reionization HI as a tracer of large scale structure to detect the weak lensing effects (Guha Sarkar, 2010). We have studied the cross-correlation between the fluctuations in the 21-cm brightness temperature maps and the weak lensing convergence field. We can probe the one dimensional integral effect of lensing at any intermediate redshift by tuning the observational frequency band for the 21-cm observation. The cross-correlation power spectrum can hence independently quantify the cosmic evolution and structure formation at post-reionization redshifts. The cross-correlation power spectrum may also be used to independently compare the various de-lensing estimators.

The distortions caused by the deflection are the quantity of study in weak lensing. At the lowest order, magnification of the signal is contained in the convergence. The convergence field is a line of sight integral of the matter overdensity given by (Van Waerbeke & Mellier (2003))

(40) |

and is given by

(41) |

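Here, $D_+$ denotes the growing mode of the density contrast $\delta$, and the line of sight integral extends up to the conformal time at the epoch of recombination. The comoving angular diameter distance is $D(\chi) = \chi$ for a flat universe, $\sin(\sqrt{K}\chi)/\sqrt{K}$ for $K > 0$ and $\sinh(\sqrt{|K|}\chi)/\sqrt{|K|}$ for a $K < 0$ Universe. The convergence power spectrum is defined as $\langle a_{\ell m} a^{*}_{\ell' m'} \rangle = C^{\kappa}_{\ell}\, \delta_{\ell \ell'} \delta_{m m'}$, where $a_{\ell m}$ are the expansion coefficients in the spherical harmonic basis. The convergence auto-correlation power spectrum for large $\ell$ can be approximated as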

(42) |

The cross correlation angular power spectrum between the post-reionization H i 21-cm brightness temperature signal and the convergence field, is given by

(43) |

where is dark matter power spectrum at , and

(44) |

We note that the convergence field $\kappa$ is not directly measurable in CMBR experiments. It is reconstructed from the CMBR maps through the use of various statistical estimators (Hanson et al. (2009); Kesden et al. (2003); Cooray & Kesden (2003)). The cross-correlation angular power spectrum does not de-lens the CMB maps directly. It uses the reconstructed cosmic shear fields, and is thereby very sensitive to the underlying tools of de-lensing and the cosmological model. The cross-correlation angular power spectrum may provide a way to independently compare various de-lensing estimators.

The cross-correlation power spectrum follows the same shape as the matter power spectrum. The signal peaks at a particular $\ell$, which shifts when the redshift is changed. The angular distribution of power clearly follows the underlying clustering properties of matter. The amplitude depends on several factors which are related to the cosmological model and the H i distribution at the redshift of observation. The angular diameter distances also depend directly on the cosmological parameters. The cross-correlation signal may hence be used independently for joint estimation of cosmological parameters.

We shall now discuss the prospect of detecting the cross-correlation signal assuming a perfect foreground removal. The error in the cross-correlation signal has the contribution due to instrumental noise and sample variance. Sample variance however puts a limiting bound on the detectability. The cosmic variance for is given by

(45) |

where $f_{sky}$ is the fraction of the sky common to both observations and $N_c$ denotes the number of independent estimates of the signal.

In the ideal hypothetical case of a full sky 21 cm survey, the predicted SNR for individual multipoles is not high enough for a detection. Binning the multipoles raises the SNR. This establishes that, with full sky coverage and negligible instrumental noise, the binned cross-correlation power spectrum is not cosmic variance limited and is detectable. The estimate is based on H i observation at only one frequency. The 21-cm observations allow us to probe a continuous range of redshifts. This allows us to further increase the SNR by collapsing the signal from various redshifts. In principle, a broad band 21-cm experiment may further increase the SNR.

The SNR may be improved by collapsing the signal from different scales, which tests the feasibility of a statistically significant detection. The cumulative SNR up to a multipole $\ell$ is given by

(46) |

The noise power spectra for the $\kappa$ and H i observations also enter this expression. Ignoring the instrument noises, we note that there is a significant increase in the SNR by cumulating over multipoles. This implies that a statistically significant detection is possible and the signal is not limited by cosmic variance. It is important to push instrumental noise to the limit set by cosmic variance for a detection of the signal. At the relevant redshifts of interest, it is possible to reach such low noise levels with SKA. It is however important to scan large parts of the sky and thereby increase the survey volume.
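The accumulation of SNR over multipoles can be sketched as follows. The code assumes the standard Gaussian variance for a cross angular power spectrum, $\sigma_\ell^2 = [C_\ell^{\times\,2} + (C_\ell^{\kappa} + N_\ell^{\kappa})(C_\ell^{HI} + N_\ell^{HI})]/[(2\ell+1) f_{sky}]$, and sums $(C_\ell^{\times}/\sigma_\ell)^2$ up to each multipole. The function name, argument names and the flat toy spectra are assumptions made here for illustration; the exact normalisation used in this work may differ.

```python
import numpy as np

def cumulative_snr(ells, c_cross, c_kappa, c_hi, n_kappa=0.0, n_hi=0.0, f_sky=1.0):
    """Cumulative SNR of a cross angular power spectrum up to each multipole.

    ells             : multipoles, one entry per spectrum value
    c_cross          : cross power spectrum of kappa and the 21-cm field
    c_kappa, c_hi    : signal auto power spectra
    n_kappa, n_hi    : noise power spectra (scalars or arrays)
    f_sky            : sky fraction common to both observations
    """
    ells = np.asarray(ells, dtype=float)
    c_cross = np.asarray(c_cross, dtype=float)
    c_kappa = np.asarray(c_kappa, dtype=float)
    c_hi = np.asarray(c_hi, dtype=float)
    variance = (c_cross**2 + (c_kappa + n_kappa) * (c_hi + n_hi)) / ((2.0 * ells + 1.0) * f_sky)
    return np.sqrt(np.cumsum(c_cross**2 / variance))

if __name__ == "__main__":
    ells = np.arange(2, 1001)
    flat = np.ones_like(ells, dtype=float)
    # Toy spectra with a fixed 10% cross-correlation amplitude.
    snr = cumulative_snr(ells, 0.1 * flat, flat, flat, f_sky=0.5)
    print(snr[0], snr[-1])
```

Even when the SNR of a single multipole is well below unity, as in this toy case, the cumulative value grows roughly in proportion to the maximum multipole, which is why summing over scales can turn an undetectable per-mode signal into a significant one.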

Instrumental noise plays an important role at large multipoles (small scales). For a typical CMB experiment, the noise power spectrum (Marian & Bernstein (2007); Smith et al. (2006)) is given by $C^{N}_{\ell} = \sigma^2_{pix}\, \Omega_{pix}\, W_{\ell}^{-2}$, where different pixels have uncorrelated noise with variance $\sigma^2_{pix} = s^2/t_{pix}$. Here $s$ and $t_{pix}$ are the pixel sensitivity and ‘time spent on the pixel’ respectively. $\Omega_{pix}$ is the solid angle subtended per pixel and we use a Gaussian beam $W_{\ell} = \exp[-\ell^2 \theta^2_{FWHM}/(16 \ln 2)]$.
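The way the beam amplifies pixel noise at high multipoles can be illustrated with a short function. It assumes the commonly used form $C^N_\ell = \sigma^2_{pix}\, \Omega_{pix}\, W_\ell^{-2}$ with $\sigma^2_{pix} = s^2/t_{pix}$ and a Gaussian beam $W_\ell = \exp[-\ell^2 \theta_{FWHM}^2/(16 \ln 2)]$. The function name, the units and the instrument numbers in the usage example are hypothetical and do not describe any particular experiment.

```python
import numpy as np

def cmb_noise_power(ell, s, t_pix, omega_pix, theta_fwhm):
    """Beam-deconvolved white-noise power spectrum of a CMB map.

    ell        : multipole (scalar or array)
    s          : pixel sensitivity (temperature units times sqrt(seconds))
    t_pix      : time spent on each pixel (seconds)
    omega_pix  : solid angle per pixel (steradians)
    theta_fwhm : beam full width at half maximum (radians)
    """
    ell = np.asarray(ell, dtype=float)
    sigma_pix_sq = s**2 / t_pix
    beam = np.exp(-ell**2 * theta_fwhm**2 / (16.0 * np.log(2.0)))
    return sigma_pix_sq * omega_pix / beam**2

if __name__ == "__main__":
    theta = np.radians(7.0 / 60.0)  # hypothetical 7 arcmin beam
    ells = np.array([10, 500, 1000, 2000])
    print(cmb_noise_power(ells, s=100.0, t_pix=50.0, omega_pix=theta**2, theta_fwhm=theta))
```

At low multipoles the noise is flat at $\sigma^2_{pix} \Omega_{pix}$. Beyond $\ell \sim 1/\theta_{FWHM}$ it rises exponentially, which sets the smallest scales that can usefully contribute to the cross-correlation.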

For H i observations, the quantity of interest is the complex visibility, which is used to estimate the power spectrum (Ali et al. (2008)). For a radio telescope with N antennae, system temperature $T_{sys}$, operating at a frequency $\nu$ with bandwidth $B$, the noise correlation is given by