LyC leakage versus quenching with the JWST

Lyman continuum leakage versus quenching with the James Webb Space Telescope: The spectral signatures of quenched star formation activity in reionization-epoch galaxies

Christian Binggeli, Erik Zackrisson, Kristiaan Pelckmans, Rubén Cubo, Hannes Jensen, Ikko Shimizu
Department of Physics and Astronomy, Uppsala University, Box 515, SE-751 20 Uppsala, Sweden
Department of Information Technology, Division of Systems and Control, Uppsala University, Box 337, SE-751 05 Uppsala, Sweden
Theoretical Astrophysics, Department of Earth and Space Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka 560-0043, Japan
Accepted 2018 April 20. Received 2018 April 04; in original form 2018 January 24

In this paper, we study the effects of a recent drop in star formation rate (SFR) on the spectra of epoch of reionization (EoR) galaxies, and the resulting degeneracy with the spectral features produced by extreme Lyman continuum leakage. In order to study these effects in the wavelength range relevant for the upcoming James Webb Space Telescope (JWST), we utilize synthetic spectra of simulated EoR galaxies from cosmological simulations together with synthetic spectra of partially quenched mock galaxies. We find that rapid declines in the SFR of EoR galaxies could seriously affect the applicability of methods that utilize the equivalent width of Balmer lines and the ultraviolet spectral slope to assess the escape fraction of EoR galaxies. In order to determine if the aforementioned degeneracy can be avoided by using the overall shape of the spectrum, we generate mock NIRCam observations and utilize a classification algorithm to identify galaxies that have undergone quenching. We find that while there are problematic cases, JWST/NIRCam or NIRSpec should be able to reliably identify galaxies with redshifts that have experienced a significant decrease in the SFR (by a factor 10-100) in the past 50-100 Myr with a success rate . We also find that uncertainties in the dust-reddening effects on EoR galaxies significantly affect the performance of the results of the classification algorithm. We argue that studies that aim to characterize the dust extinction law most representative in the EoR would be extremely useful.

galaxies: high-redshift – dark ages, reionization first stars.
pubyear: 2018pagerange: Lyman continuum leakage versus quenching with the James Webb Space Telescope: The spectral signatures of quenched star formation activity in reionization-epoch galaxiesLyman continuum leakage versus quenching with the James Webb Space Telescope: The spectral signatures of quenched star formation activity in reionization-epoch galaxies

1 Introduction

The epoch of reionization (EoR) represents an extremely important yet poorly understood stage in the evolution of the Universe. During the EoR, the Universe was flooded by energetic hydrogen-ionizing Lyman continuum radiation (LyC) which ionized the neutral intergalactic medium (IGM), but the source of this radiation remains to be determined. Star-forming galaxies have emerged as the main candidate for producing the bulk of the ionizing photons required to sustain reionization, but it is not clear whether the early star-forming galaxies produced and leaked enough such radiation.

In the low- and intermediate-redshift Universe, studying the leakage of LyC photons can be done directly by detecting the escaping LyC photons. There are now several studies that detect leaking LyC with (e.g. Bergvall et al., 2006; Leitet et al., 2013; Borthakur et al., 2014; Izotov et al., 2016b, a) at redshifts . There are also studies that claim extreme () leakage (Vanzella et al., 2016; Shapley et al., 2016; Bian et al., 2017; Izotov et al., 2018). Matthee et al. (2017) also present very high escape fractions () for a number of individual sources in a large sample of Lyman- and H emitters at . When stacking the whole sample they find escape fractions for both median and mean stacking. However, detecting the LyC photons directly becomes impossible at due to absorption in the increasingly neutral IGM as we approach the EoR. A possible workaround for this is to indirectly determine the LyC escape fraction () using parts of the spectrum that are unaffected by IGM absorption. A couple of methods for indirectly determining the escape fraction have been suggested. A method using the strengths of absorption lines to probe the line-of-sight covering fraction of neutral hydrogen was proposed by Jones et al. (2013) and Leethochawalit et al. (2016). The method has recently been successfully applied to known LyC leakers by Gazagnes et al. (2018) and Chisholm et al. (2018). This method is, however, limited by signal-to-noise requirements at sufficient spectral resolution. It has been suggested that high line ratios could be indicative of Lyman continuum leakage through density-bounded regions in local compact star-forming galaxies (Jaskot & Oey, 2013). Low-redshift star-forming galaxies selected for high ratios and compactness have been found to show significant escape fractions (Izotov et al., 2016b, a, 2018), see also Chisholm et al. (2018). Meanwhile, Stasińska et al. (2015) have shown that high ratios are not necessarily an indicator of leaking LyC.

Verhamme et al. (2015) propose that the shape of the Lyman- spectral line may hold information about the escape fraction, which observations of Lyman- profiles of local LyC emitters seem to support (Verhamme et al., 2015; Verhamme et al., 2017). However, as the neutral fraction of the IGM increases with redshift, the shape of the Lyman- line may change due to Lyman- transmission effects. Thus, it is uncertain how applicable this method will be to reionization-epoch galaxies (Verhamme et al., 2017).

Another method was suggested by Zackrisson et al. (2013), and later tested on simulated galaxies from cosmological simulations in Zackrisson et al. (2017). This method exploits the fact that ionizing radiation produced by a young hot population of stars will become reprocessed to longer wavelengths in the neutral hydrogen gas surrounding newly formed stars. Thus, the LyC photons will leave an imprint in the rest-frame non-ionizing UV/optical parts of the spectrum. The authors argue that this could in the future be observed using the James Webb Space Telescope (JWST). By measuring the equivalent width of the Balmer beta line () while simultaneously measuring the ultraviolet spectral slope, one should be able to identify cases of extreme LyC leakage () from reionization-epoch galaxies (Zackrisson et al., 2017). In Jensen et al. (2016), this method was further expanded upon using machine learning algorithms in order to increase the number of spectral features used to predict the escape fraction using mock JWST/NIRSpec observations. For a discussion on the caveats of this method, see Zackrisson et al. (2017)

Since there are indications that star formation rates (SFR) in galaxies in the EoR are generally increasing over time (e.g. Finlator et al., 2011), optical recombination lines of these galaxies are expected to be strong. The method by Zackrisson et al. (2013) utilizes this, and therefore, any weak emission lines in galaxies with blue UV slopes will be interpreted as a telltale sign of Lyman continuum leakage. This could, however, make the method sensitive to rapid changes in star formation. Since the emission in these lines depends on the reprocessing of LyC, and LyC production is dominated by the short-lived O and B stars, a drop in the SFR can lead to weaker lines while not necessarily leading to a significantly redder UV slope. While many cosmological simulations predict star formation histories with fairly modest fluctuations in SFR in the early Universe (Finlator et al., 2013; Shimizu et al., 2014; Gnedin, 2014; Gnedin & Kaurov, 2014), there are simulations that do feature such variations even for galaxies with stellar masses above (Kimm et al., 2015; Ma et al., 2015, 2018). Furthermore, the recent discovery of a galaxy that may have undergone a significant drop in the SFR (Hashimoto et al., 2018) raises the question of how common these types of galaxies may be in the early Universe.

If one would catch a galaxy soon after it has undergone a rapid drop in SFR, this object could be incorrectly identified as a galaxy with high LyC leakage by the method described in Zackrisson et al. (2013); Zackrisson et al. (2017); Jensen et al. (2016).

We utilize simulated galaxies from the Shimizu et al. (2014) cosmological simulation in combination with mock-galaxies that have experienced a recent drop in star formation activity in order to study the effect of quenching on the applicability of using emission lines and the UV slope as a probe of the LyC escape fraction. We assess the possibility of identifying quenched galaxies during the EoR using JWST/NIRCam and MIRI photometry. Finally, we utilize linear discriminant analysis (LDA)(Hastie et al., 2009, pp.106-112) as a classification algorithm in order to assess how well we will be able to distinguish between galaxies that have experienced a recent decline in SFR and star-forming galaxies with high escape fractions. We also asses in which cases the information in the spectral energy distribution is insufficient to realistically distinguish the two types. Note that we do not strictly distinguish between ‘attenuation’ and ‘extinction’, but use the terms interchangeably when talking about dust-reddening effects.

2 Model

Here, we present the models used in this paper and some important properties of these models. More in-depth properties of the model are discussed in Zackrisson et al. (2017).

Figure 1: Top: Star formation histories binned in 10 Myr bins over cosmic time ( to the age of the Universe at ) for two typical (black and yellow) and one extremely stochastically star-forming galaxy (red) from the Shimizu et al. (2014) simulations. Bottom: Stellar masses for the same three galaxies over cosmic time.

2.1 Simulated galaxies and SED modeling

We use the cosmological hydrodynamic simulation by Shimizu et al. (2014), which constitutes a subset of the galaxies used in Zackrisson et al. (2017). The simulation is based on the smoothed particle hydrodynamic code GADGET-3 which is an improved version of the public GADGET-2 code (Springel, 2005). In the simulation, dark matter and gas particles are simulated in a comoving volume of Mpc cube while considering star formation, supernova feedback and chemical enrichment following Okamoto et al. (2008). The simulation does not resolve single stars, but rather collections of stars with masses with a Salpeter IMF (Salpeter, 1955), which leads to a minimum stellar mass for the simulated galaxies of around . We extract the star formation histories, internal metallicity distribution, and the dust content of 406 galaxies with stellar masses at redshift and dust-free magnitudes in the range to (corresponding to apparent magnitudes at ) from the simulation.

The information about the star formation history and metallicity stored in star particles representing a collection of stars with mass of , while the dust content is calculated for each galaxy as a whole in the simulation and given as a prediction of the extinction at 1500 Å ().

The average stellar metallicity of the galaxies in the simulation ranges from to , while single SPH particles can have metallicities . In Figure 1, we show the star formation histories and total stellar mass as a function of cosmic time for three galaxies from the simulations. On average, the galaxies show increasing SFRs with small variations (factor 2-3) on 10 Myr time-scales once the galaxies reach stellar masses around , with of galaxies displaying larger variations (factor 5-10). This is shown in Figure 1, where the black and yellow lines show two typical galaxies, while the red line shows the most extreme galaxy extracted at . This galaxy has a rapid change in SFR (about a factor 10 drop and shortly thereafter a factor 20 increase) at around 0.74 Gyr after Big Bang. This type of galaxy represents an extremely rare type of galaxy in the Shimizu et al. (2014) simulation. However, variations of this size and even more extreme cases, with fluctuations in SFR of the order of over Myr, are seen more routinely in other simulations (Kimm & Cen, 2014; Kimm et al., 2015; Ma et al., 2015, 2018; Trebitsch et al., 2017). Variations of this kind would also be a consequence of a star formation duty cycle of % as advocated by Wyithe et al. (2014).

In this paper, the term ‘quenching’ is used to describe sudden a drop in the SFR, in contrast to the more regular usage for the term describing a total shut-down in star formation in a galaxy (Harker et al., 2006; Faber et al., 2007) . The drop in star formation can be explained for example by the removing of gas due to feedback effects after a recent starburst event (Ma et al., 2018)

In order to generate quenched galaxies, we extract simulated galaxies from the Shimizu et al. (2014) set and lower the star formation with a fixed factor 5,10 and 100 and simulate continuous star formation for 10-100 Myr by aging the already existing population while adding a less massive population of stars with ages distributed evenly over the time after the quenching takes place and metallicities equal to the average of the simulated stellar population. Thus, the quenched galaxies will have a range of internal metallicity distributions and varying star formation histories up until a significant drop in the SFR occurs. The values for the drop in the SFR and the duration of the quenching were selected to be roughly of the same size as those observed in simulations by Kimm et al. (2015) and Ma et al. (2015, 2018), and should be comparable to to the kind of SFR fluctuations seen in simulations by Kimm & Cen (2014); Trebitsch et al. (2017).

The resulting galaxies are fainter than the simulated ones with dust-free magnitudes in the range to . In the simulation by Shimizu et al. (2014), this kind of galaxy that is caught in the low SFR stage after quenching is rarer than one in a hundred. In principle, we expect that the effect of this type of quenching on the SED will not be dissimilar to a less massive young starburst on top of a massive underlying aged population, as discussed in Zackrisson et al. (2013). In that study, the authors argue that, for dust-free galaxies, continuum measurements with NIRSpec and MIRI could be used to identify such cases. This could however be complicated further when the difference in age between the massive and less massive population is smaller or when taking more complex star formation histories and dust and into consideration.

Figure 2: Top: Star formation histories binned in 10 Myr size bins over cosmic time ( to age of Universe at ) for a mock quenched galaxy shown with a factor of 5 (yellow), 10 (red) and a factor of 100 (black) drop in star formation over 50 Myr. Bottom: Stellar masses for the same galaxies over cosmic time. Note that the way the mock galaxies are created they have a mismatch in age with the galaxies from the simulations, which grows with time after the quenching occurs.

In Figure 2, we show the star formation history and total stellar mass assembly history for three quenched mock galaxies. We assume a constant SFR once the quenching has occurred.

To get spectral energy distributions (SEDs) from the collection of star particles and the dust content, we use a grid of synthetic spectra for metallicity ranging from to generated with the Yggdrasil spectral evolutionary code (Zackrisson et al., 2011). This way, we can assign a spectrum to each star particle and sum over the star particles in the galaxies. To find the closest relevant spectrum for each particle, we perform an interpolation in and . The grid of spectra is generated using the BPASS v2.0 binary stellar evolutionary models (Eldridge & Stanway, 2009; Stanway et al., 2016) in combination with spectra from Raiter et al. (2010) for extremely metal poor stars (). We calculate the Lyman continuum photon production efficiency (the number of ionizing photons produced per unit UV luminosity, , in ) for the simulated galaxies. We find values of for the individual galaxies, with a mean value for the simulated galaxy set of . This result is consistent with several observational studies (e.g. Stark et al., 2015; Bouwens et al., 2015, 2016), motivating the use of the BPASS binary stellar evolutionary models 111Note that the newer version of the BPASS binary evolution code does predict lower values of (Eldridge et al., 2017). The nebular emission associated with each star particle is calculated using the Cloudy photo-ionization code (Ferland et al., 2013) while taking the escape fraction into account and under the assumption that the nebular metallicity is equal to the stellar metallicity. See Zackrisson et al. (2013); Zackrisson et al. (2017) for more in-depth discussion of the procedure. To account for dust, we use the simulated dust amount from the Shimizu et al. (2014) simulations and apply Calzetti et al. (2000) reddening with the same attenuation for the nebular and stellar component () as the fiducial model. With the amount of dust given in the simulations, this gives an average UV slope for the whole sample, which is consistent with observations of galaxies (Bouwens et al., 2014). The mean optical extinction in the V-band for these galaxies is with a few galaxies reaching as high as . In Section 3.2 we discuss the effects of other attenuation and extinction laws, and we there use the standard Calzetti et al. (2000) attenuation law (where ) and the Pei (1992) SMC extinction law. For the Pei (1992) SMC law, the mean optical extinction in the V-band is . While the Calzetti attenuation law gives UV slopes consistent with observations of galaxies, and there are observations of Lyman break galaxies at that point towards a Calzetti-like attenuation law (Koprowski et al., 2018), we note that there are also recent observations that favor a steeper extinction curve for galaxies at (Smit et al., 2018).

In this paper, we assume that all leakage of LyC radiation is occurring through holes in the neutral gas envelope surrounding star-forming regions. This is what is described in Zackrisson et al. (2013) as the ionization-bounded nebula with holes (a.k.a. the picket-fence model). In this type of geometry, the Lyman continuum escape fraction is simply where is covering fraction of neutral gas. This basically means that the nebular emission will be scaled according to the escape fraction. We calculate spectra for . This gives us a total of quenched mock galaxies and a total of simulated galaxies with varying escape fractions and different star formation histories.

2.2 Mock observations

In order to test whether we can realistically distinguish between quenched and normally star-forming galaxies with variable escape fractions we create mock JWST/NIRCam and MIRI photometry observations of the galaxies and add observational noise to these. For the noise we assume Gaussian noise with a fixed signal-to-noise ratio of in all filters independent of their sensitivity.

In reality, the will of course depend on the sensitivity in a given filter. For the NIRCam observations, we expect this level of in all filters for around 30 minutes exposure time for a galaxy with 222Calculated using the JWST exposure time calculator ( For MIRI, we expect a signal-to-noise ratio of at least 10 for galaxies with with an exposure of hours in the F560W filter. Observations of this type will be performed within the JWST guaranteed time observations 333For example, see GTO programs 1180 and 1283, In total, we use mock observations in six NIRCam filters (F115W, F150W, F200W, F277W, F356W and F444W) as well as the F560W MIRI filter.

2.3 Linear discriminant analysis (LDA)

The classification algorithm linear discriminant analysis (LDA) is used in order to classify the galaxies as quenched or as normally star-forming (Hastie et al., 2009, pp.106-112).

A classification algorithm as LDA aims to recover the probability of an object belonging to a class given input features . LDA is based on Bayes theorem, and makes the parametric assumption that the involved probabilities can be modeled as multivariate Gaussian distributions with common covariances but class specific means. Given those quantities, a sample with input features is assigned to class or according to the log-likelihood ratio:


where and are the probabilities of an observation or object belonging to class and respectively, given input features . are the prior probabilities of the classes, are the means and denotes the common covariance matrix. Equation 1 implies that the decision boundary between the classes (where the probabilities are equal) is linear in x. Classification to a class and is done according to the maximum conditional probability of belonging to a class.

The training problem amounts to estimating the quantities and that best describe the classes given some training set of input features. This is performed using maximum likelihood. In this work, the features are NIRCam and MIRI magnitudes and the classes are galaxies that have undergone quenching and those that have not. To avoid skewed data, we under-sample the quenched set of galaxies so that we end up with balanced sets of simulated star-forming galaxies and mock quenched galaxies.

3 Quenched galaxies and observables

Figure 3 shows the general effect that quenching has on the SED. As the number of Lyman continuum photons produced drops, the number of such photons that are reprocessed into optical and non-ionizing UV goes down. The emission lines become weaker and the continuum drops in the UV while remaining more or less unchanged at longer wavelengths. The net effect is a reddening of the continuum. 10 Myrs after quenching, has dropped a factor while the UV slope remains more or less the same as in the non-quenched case. 50 Myrs after quenching the UV slope has changed significantly while has dropped further. However, the UV slope does not experience extreme reddening due to the quenching. The average reddening of the UV slope over time is only and for a factor of 10 and 100 drop in SFR over 100 Myr, respectively. While this slight change in the UV slope could be used to identify cases of quenching if dust effects are ignored, the reddening of the UV slope by dust if one assumes a Calzetti attenuation law for the galaxies is of the same order.The quenched galaxies would thus be indistinguishable from extreme leakers using only and the UV slope as diagnostics for escape fraction.

Figure 3: Synthetic spectrum of a single star-forming galaxy () from the simulations (black) and the same galaxy after quenching with a factor 100 has taken place for 10 (red) and 50 (yellow) Myr. The spectra have been normalized at 12000 Å

Examples of this degeneracy between extreme leakers and quenched galaxies are shown in Figure 4. This figure displays three pairs of simulated and quenched galaxies, where the galaxies in each pair have basically identical and UV slopes. The pairs in a) and b) in this figure show excess flux in the continuum in JWST NIRSpec and NIRCam wavelengths. This is seen especially around 4000 Å. In c), the galaxies look virtually identical in this wavelength range while the dustier star-forming galaxy is brighter in the rest-frame near-infrared relative to the quenched galaxy. In Section 3.1 we test whether or not these differences in the continuum are sufficient to identify quenched galaxies when observational noise is taken into account.

Figure 4: Synthetic spectra showing three examples of the degeneracy between escape fraction and SFR in dusty star-forming galaxies (black) with extreme escape fractions and quenched galaxies (red) with zero escape fractions and lower dust content. a): Simulated star-forming galaxy with , and mock galaxy quenched with a factor of 5 with . b): Simulated star-forming galaxy with , and mock galaxy quenched with a factor 10 and . c): Simulated star-forming galaxy with , and mock galaxy quenched with a factor 100 and . All quenched galaxies are observed 40 Myr after the quenching takes place and the Calzetti attenuation law, where was used for all galaxies. For clarity, the spectra have been normalized at 1250  Å, a) and b) have been shifted upwards.

3.1 NIRCam+MIRI to identify quenched galaxies

In order to determine whether we can identify if a galaxy has undergone recent quenching or not, we use the calculated magnitudes in NIRCam and MIRI. The distribution of quenched and simulated galaxies in three filters is shown as a color-color diagram in Figure 5. In the noise-free case (panel 1, Figure 5) the quenched galaxies are largely separated from the normal galaxies. However, when adding noise in this sample, the overlap between the two classes becomes large (panel 2, Figure 5). In this case we assume that the extinction law is well characterized. We perform 10-fold cross validation with LDA on the whole set with all the available NIRCam filters and the MIRI F560W filter. This yields a mean accuracy or total accuracy of ( error), where the accuracy for one validation iteration is defined as

Figure 5: Color-color diagrams showing the noise-free (left) distribution and the noisy () distribution (right) of the quenched galaxies (red) and simulated (star-forming) galaxies (black).

We define the recall, or true positive rate (TPR), the precision, or positive predictive value (PPV), where we use quenched as positive


And the corresponding quantities, specificity, or true negative rate (TNR), and negative predictive value (NPV) for the simulated star-forming, or ‘normal’ galaxies


where and are true quenched and normal, respectively, (those galaxies that are correctly classified as either quenched or normal), and and are false quenched and normal, respectively (i.e. those that are falsely identified as quenched and normal). Table 1 shows the confusion matrix when the model is applied to a test set. This is constructed by randomly taking out 20% of the objects from the training data, meaning that the algorithm has never “seen” these objects before. In this case, 262 ‘normal’ star-forming galaxies from the simulations are misclassified, while 1439 are correctly classified. The corresponding numbers for the quenched galaxies are 377 and 1333, which indicates that the algorithm performs slightly better on the normal galaxies with a specificity of approximately 85% and a NPV of 79%, while the recall for the quenched galaxies is 78% and the precision is 84%. This means that about 15% of the normal galaxies will be incorrectly identified as quenched and about 22% of the quenched galaxies will be incorrectly classified as normal. The receiver operating characteristic curve (ROC curve) in Figure 6 displays how the TPR (recall) and the false positive rate (FPR) changes when changing the threshold used for the classification, together with the area under the curve (AUC). Ideally, one would want zero FPR and maximal TPR, which would mean you have a perfect classifier (i.e. you have no false positive and thus all your positives are true). The dashed line in the ROC curve (Figure 6) is the line of no discrimination, getting a ROC curve of a classifier along this line would mean our classifier is classifying at random. Having an AUC of unity corresponds to perfect classification, while 0.5 would correspond to a classifier which makes random guesses.

Figure 6: ROC curve for the fiducial set of galaxies. The red line gives the TPR versus the FPR (1-specificity) for the fiducial set as the threshold used for classification is changed. The dashed line displays the line of no-discrimination. The area under the ROC curve (AUC) is given in the lower right side of the figure.

If testing is done only on galaxies that have Å (which corresponds to galaxies with a 50% escape fraction according to the method of Zackrisson et al. (2013); Zackrisson et al. (2017)) we get the confusion matrix shown in Table 2. We see that the specificity (and NPV) in this case is approximately 87% (82%), while the recall (and precision) are 80% (86%).

The general trend of increasing the number of photometric filters used to train and test the algorithm is to increase the total accuracy. However, excluding the MIRI F560W filter from the classification does not significantly affect the result, indicating that it holds very little additional information about the quenching.

In order to identify the most problematic cases of quenching, and for identifying when the algorithm would break down, we create testing sets for each quenching factor and quenching duration. These sets are not used in the respective training phase, hence the algorithm has not ‘seen’ the objects before. The algorithm is once again trained on a set of mixed quenched and simulated star-forming objects but now tested on a set consisting only of quenched galaxies with a certain quenching factor and quenching duration. The results of this test are shown in Figure 7. Not surprisingly, the algorithm performs better when quenching is stronger and the quenching time scale is longer, i.e. when the galaxies exhibit clearer signs of aging. If the galaxies are observed only 10 Myr after quenching takes place, the model only achieves a recall of 44-59% depending on the level of quenching, meaning that these galaxies will generally not be distinguishable from simulated star-forming galaxies. Furthermore, it appears that galaxies that have undergone a weak quenching (factor 5 drop in SFR) are problematic. However, for the most extreme cases of quenching (factor 100), a recall of 94% is achieved even after a relatively short quenching duration ( 30 Myr). The recall rate for galaxies that have undergone strong quenching (factor 10-100) after 50 Myr is .

Normal Quenched
True Normal 1439 262
Quenched 377 1333
Table 1: Confusion matrix giving the true class (true) versus the class that the LDA algorithm predicts (predicted) for a testing set consisting of 20% of the total set.
Normal Quenched
True Normal 429 63
Quenched 97 395
Table 2: Confusion matrix showing the true class (true) versus the class that the LDA algorithm predicts (predicted) when the test set is limited to galaxies that have Å
Figure 7: The recall (TPR) of the LDA model as a function of quenching duration for the different factors of quenching. Results shown here are for the fiducial set of objects where no limits have been put on the .

3.2 Effects of dust handling

If we have access to JWST/NIRSpec observations, a correction for dust reddening could in principle be done for galaxies using and for . However, in cases with high escape fractions (), is relatively weak ( Å) for a majority of the simulated galaxies. Furthermore, as mentioned in Section 2.1, the new version of BPASS binary evolution model predicts lower values of , which would lead to weaker emission lines. This and the fact that will only be detectable out to makes it unclear whether or not we can actually reliably correct for dust in these galaxies. We therefore assume that we cannot apply any dust correction.

In Table 3 we show results for the recall, precision, specificity and NPV when the algorithm is applied to sets with different attenuation/extinction laws. We see no significant impact on the performance when switching from testing and training on the Calzetti law where to the standard Calzetti law where . Furthermore, there is no significant difference in performance if the model is trained on either Calzetti law, and tested on the other. Meanwhile, the difference in performance between training on any of the Calzetti laws and testing on the steeper SMC law is larger, giving a slight increase in performance. We see an overall increase in performance if the model is trained and tested on the SMC law, implying that the steepness of the extinction law leads to less confusion between quenched and ‘normal’ galaxies. However, we also see a significant decrease in the performance if one assumes a extinction law that is too steep, ie. if we train on the SMC law and apply the model to a population which is actually experiencing dust-reddening effects by a flatter Calzetti law. In this case, the algorithm will achieve a TNR of 66-67%, which means that 33-34% of ‘normal’ star-forming galaxies will be wrongly identified as quenched.

(%) (%) (%) (%)
T:Cal P: Cal 78 84 85 79
T:Cal P: Cal 80 85 86 81
T:Cal P: SMC 84 87 88 85
T:Cal P: Cal 80 82 83 80
T:Cal P: Cal 80 86 87 81
T:Cal P: SMC 87 83 83 76
T:SMC P: Cal 89 72 66 86
T:SMC P: Cal 90 73 67 87
T:SMC P: SMC 89 89 88 89
Table 3: Recall (TPR), precision (PPV), specificity (TNR) and negative predictive value (NPV) for the LDA model when it is trained (T) and tested (P) on different dust laws. ‘Cal’ is the Calzetti law where , ‘Cal’ is the standard Calzetti law and ‘SMC’ is the Pei (1992) SMC extinction law

4 Discussion

The previous sections provide evidence for the statement that rapid variations in the SFR in reionziation epoch galaxies will have an impact on the applicability of methods that utilize the emission lines and UV slope to estimate the escape fraction of LyC photons. Quenching of the type discussed in this paper will leave imprints on the SED that are in some degree degenerate with those produced by an extreme escape fraction of LyC photons. However, the effect that quenching has on the overall shape of the SED should in cases of large decreases in SFR (by a factor of 10-100) in the past 50-100 Myr, allow us to statistically identify quenched galaxies using JWST NIRCam and/or NIRSpec observations. For example, consider a situation where our algorithm correctly classifies galaxies in approximately 85% of cases. If we then observe 100 galaxies and find that 15 of those are identified as quenched, it is hard to say anything about how common quenching is. However, if we observe 100 objects and find that 25-35 of those are identified as quenched, it is very likely that there exists a quenched sub-population. Furthermore, by considering a subsample of those galaxies that have H equivalent width ( Å) we should be able to statistically distinguish between extreme LyC leakage and a recent decrease in SFR. It is still unclear how common this type of quenching is in the early Universe, but if there exists a population of EoR galaxies with star formation histories similar to the the one recently claimed by Hashimoto et al. (2018) for a object, JWST should be able to identify this.

While a sub-population with strong variations in the SFH could be detectable using the JWST after roughly 50-100 Myr, distinguishing between cases where weak quenching has taken place (or cases where we catch a galaxy shortly after quenching) and a high escape fraction will be challenging. If we are able to identify a number of strongly quenched galaxies, it is however, not unlikely that there are galaxies that experience weaker quenching but that are not correctly identified. In principle, this should tell us how applicable the method presented in Zackrisson et al. (2013), Jensen et al. (2016) and Zackrisson et al. (2017) is. If quenching is relatively common, it is likely that there will be cases in which the UV slope and emission lines will lead us to wrongly assume large leakage of LyC photons, and these cases may actually not be distinguishable using only JWST observations.

We have shown that the Zackrisson et al. (2013) method is subject to a degeneracy between the LyC escape fraction and star formation activity. Other methods to determine the escape fraction such as high line ratios Jaskot & Oey (2013), using UV absorption lines to probe the line-of-sight covering fraction (Jones et al., 2013; Leethochawalit et al., 2016) or constraining the escape fraction using Lyman- profiles (Verhamme et al., 2015) are not subject to the same degeneracy. These methods do, however, have other limitations that are important to consider when applying the methods to EoR galaxies, as discussed in Section 1.

Meanwhile, identifying a quenched population of galaxies using JWST in itself also depends on our understanding of what dust extinction/attenuation law is most representative for the galaxy population at EoR. Even with the JWST/NIRSpec it not clear if we can reliably perform dust corrections for galaxies at . A possible solution that has not been discussed here yet is the combination of MIRI and NIRSpec spectroscopic data. While NIRSpec will not be able to get spectra with H at redshifts , MIRI may be able to provide us with these data. Another possibility could be to characterize dust-reddening effects at lower redshifts, where NIRSpec will have coverage to simultaneously observe H, H and possibly H and assume that this dust law is representative even at higher redshifts. In short, any method that can characterize the dust-reddening effects and the variations in the reddening in galaxies during EoR will be extremely useful. If it is the case that there is no extinction law that is representative for the population of galaxies in the early universe, and in principle every galaxy has a different type of reddening at work, this will strongly limit the applicability of the method that we discuss here. Furthermore, determining which galaxies have undergone quenching also requires that we have a good understanding of how stars are formed and evolve from stellar evolutionary models.

We find that removing the MIRI F560W filter does not significantly affect the performance of the algorithm. The reason for this is most likely that the F560W filter does not probe a significantly different region than the NIRCam F444W filter, and thus provides little additional information about the overall shape of the SED. It is possible that the redder MIRI filters hold information that may improve the identification. However, the limited sensitivity in the F770W and redder filters makes it unclear if these will actually provide information that help the identification of quenched galaxies at these redshifts.

The classification algorithm can also produce results in terms of continuous class memberships (the likelihood for each single galaxy of belonging to a certain class). Computing this for an observed sample of galaxies, one could test how likely we are to find the observed distribution given only a distribution of non-quenched galaxies. This could be used to understand if a quenched sub-population exists in the observed sample of galaxies. However, performing formal significance tests (in the form of P-values) to these statements is challenging as it would involve estimating the underlying distribution of non-quenched galaxies.

We have also tried using a nonlinear support vector classifier (Hastie et al., 2009, pp. 417-422) with a radial basis function kernel and tuned parameters, and find no significant difference in performance compared to LDA.

5 Summary & Conclusions

  • We demonstrate that reionization-epoch galaxies that have experienced a recent decrease in star formation activity give rise to JWST spectra that are to some extent degenerate to those produced by extreme LyC leakage. This could seriously affect the applicability of the Zackrisson et al. (2013), Jensen et al. (2016) and Zackrisson et al. (2017) method for identifying cases of high-LyC leakage based on low emission line equivalent widths with JWST/NIRSpec.

  • We show that if we have a good understanding of the general properties of the galaxy population during EoR, we should be able to statistically identify luminous galaxies that have undergone a significant decrease in SFR (by a factor of 10–100) in the past 50-100 Myr using JWST/NIRCam and/or NIRSpec observations. Cases with more moderate SFR fluctuations are more difficult to single out, but if there exists a significant population of galaxies with star formation histories similar to that recently claimed by Hashimoto et al. (2018) for a object, then JWST should easily be able to pick up on this. Conversely, it should be possible to – in a statistical sense – distinguish galaxies with extreme LyC leakage from objects that have experienced a significant drop in star formation activity.

  • While the UV/optical dust extinction is generally assumed to be quite low for galaxies, the slope of the dust extinction law represents one of the main uncertainties in attempts to break the degeneracy between extreme LyC leakage and quenched star formation. Hence, observations attempting to quantify the details of the dust-reddening effects at should be considered a high-priority task for JWST.


EZ acknowledges research grants from Swedish National Space Board and stiftelsen Olle Engkvist Byggmästare. IS is supported in part by the JSPS KAKENHI Grant Number JP26247022 and JP17H01111. Numerical simulations of IS were performed on Cray XC30 at at the Center for Computational Astrophysics, National Astronomical Observatory of Japan.


  • Bergvall et al. (2006) Bergvall N., Zackrisson E., Andersson B.-G., Arnberg D., Masegosa J., Östlin G., 2006, A&A, 448, 513
  • Bian et al. (2017) Bian F., Fan X., McGreer I., Cai Z., Jiang L., 2017, ApJ, 837, L12
  • Borthakur et al. (2014) Borthakur S., Heckman T. M., Leitherer C., Overzier R. A., 2014, Science, 346, 216
  • Bouwens et al. (2014) Bouwens R. J., et al., 2014, ApJ, 793, 115
  • Bouwens et al. (2015) Bouwens R. J., Illingworth G. D., Oesch P. A., Caruana J., Holwerda B., Smit R., Wilkins S., 2015, ApJ, 811, 140
  • Bouwens et al. (2016) Bouwens R. J., Smit R., Labbé I., Franx M., Caruana J., Oesch P., Stefanon M., Rasappu N., 2016, ApJ, 831, 176
  • Calzetti et al. (2000) Calzetti D., Armus L., Bohlin R. C., Kinney A. L., Koornneef J., Storchi-Bergmann T., 2000, ApJ, 533, 682
  • Chisholm et al. (2018) Chisholm J., et al., 2018, preprint, (arXiv:1803.03655)
  • Eldridge & Stanway (2009) Eldridge J. J., Stanway E. R., 2009, MNRAS, 400, 1019
  • Eldridge et al. (2017) Eldridge J. J., Stanway E. R., Xiao L., McClelland L. A. S., Taylor G., Ng M., Greis S. M. L., Bray J. C., 2017, Publ. Astron. Soc. Australia, 34, e058
  • Faber et al. (2007) Faber S. M., et al., 2007, ApJ, 665, 265
  • Ferland et al. (2013) Ferland G. J., et al., 2013, Rev. Mex. Astron. Astrofis., 49, 137
  • Finlator et al. (2011) Finlator K., Oppenheimer B. D., Davé R., 2011, MNRAS, 410, 1703
  • Finlator et al. (2013) Finlator K., Muñoz J. A., Oppenheimer B. D., Oh S. P., Özel F., Davé R., 2013, MNRAS, 436, 1818
  • Gazagnes et al. (2018) Gazagnes S., Chisholm J., Schaerer D., Verhamme A., Rigby J. R., Bayliss M., 2018, preprint, (arXiv:1802.06378)
  • Gnedin (2014) Gnedin N. Y., 2014, ApJ, 793, 29
  • Gnedin & Kaurov (2014) Gnedin N. Y., Kaurov A. A., 2014, ApJ, 793, 30
  • Harker et al. (2006) Harker J. J., Schiavon R. P., Weiner B. J., Faber S. M., 2006, ApJ, 647, L103
  • Hashimoto et al. (2018) Hashimoto T., et al., 2018, Nature, 557, 392
  • Hastie et al. (2009) Hastie T., Tibshirani R., Friedman J. H., 2009, The elements of statistical learning: data mining, inference, and prediction, 2nd edn. No. Book, Whole, Springer, New York, NY
  • Izotov et al. (2016a) Izotov Y. I., Schaerer D., Thuan T. X., Worseck G., Guseva N. G., Orlitová I., Verhamme A., 2016a, MNRAS, 461, 3683
  • Izotov et al. (2016b) Izotov Y. I., Orlitová I., Schaerer D., Thuan T. X., Verhamme A., Guseva N. G., Worseck G., 2016b, Nature, 529, 178
  • Izotov et al. (2018) Izotov Y. I., Schaerer D., Worseck G., Guseva N. G., Thuan T. X., Verhamme A., Orlitová I., Fricke K. J., 2018, MNRAS, 474, 4514
  • Jaskot & Oey (2013) Jaskot A. E., Oey M. S., 2013, ApJ, 766, 91
  • Jensen et al. (2016) Jensen H., Zackrisson E., Pelckmans K., Binggeli C., Ausmees K., Lundholm U., 2016, ApJ, 827, 5
  • Jones et al. (2013) Jones T. A., Ellis R. S., Schenker M. A., Stark D. P., 2013, ApJ, 779, 52
  • Kimm & Cen (2014) Kimm T., Cen R., 2014, ApJ, 788, 121
  • Kimm et al. (2015) Kimm T., Cen R., Devriendt J., Dubois Y., Slyz A., 2015, MNRAS, 451, 2900
  • Koprowski et al. (2018) Koprowski M. P., et al., 2018, MNRAS,
  • Leethochawalit et al. (2016) Leethochawalit N., Jones T. A., Ellis R. S., Stark D. P., Zitrin A., 2016, ApJ, 831, 152
  • Leitet et al. (2013) Leitet E., Bergvall N., Hayes M., Linné S., Zackrisson E., 2013, A&A, 553, A106
  • Ma et al. (2015) Ma X., Kasen D., Hopkins P. F., Faucher-Giguère C.-A., Quataert E., Kereš D., Murray N., 2015, MNRAS, 453, 960
  • Ma et al. (2018) Ma X., et al., 2018, MNRAS,
  • Matthee et al. (2017) Matthee J., Sobral D., Best P., Khostovan A. A., Oteo I., Bouwens R., Röttgering H., 2017, MNRAS, 465, 3637
  • Okamoto et al. (2008) Okamoto T., Nemmen R. S., Bower R. G., 2008, MNRAS, 385, 161
  • Pei (1992) Pei Y. C., 1992, ApJ, 395, 130
  • Raiter et al. (2010) Raiter A., Schaerer D., Fosbury R. A. E., 2010, A&A, 523, A64
  • Salpeter (1955) Salpeter E. E., 1955, ApJ, 121, 161
  • Shapley et al. (2016) Shapley A. E., Steidel C. C., Strom A. L., Bogosavljević M., Reddy N. A., Siana B., Mostardi R. E., Rudie G. C., 2016, ApJ, 826, L24
  • Shimizu et al. (2014) Shimizu I., Inoue A. K., Okamoto T., Yoshida N., 2014, MNRAS, 440, 731
  • Smit et al. (2018) Smit R., et al., 2018, Nature, 553, 178
  • Springel (2005) Springel V., 2005, MNRAS, 364, 1105
  • Stanway et al. (2016) Stanway E. R., Eldridge J. J., Becker G. D., 2016, MNRAS, 456, 485
  • Stark et al. (2015) Stark D. P., et al., 2015, MNRAS, 454, 1393
  • Stasińska et al. (2015) Stasińska G., Izotov Y., Morisset C., Guseva N., 2015, A&A, 576, A83
  • Trebitsch et al. (2017) Trebitsch M., Blaizot J., Rosdahl J., Devriendt J., Slyz A., 2017, MNRAS, 470, 224
  • Vanzella et al. (2016) Vanzella E., et al., 2016, ApJ, 825, 41
  • Verhamme et al. (2015) Verhamme A., Orlitová I., Schaerer D., Hayes M., 2015, A&A, 578, A7
  • Verhamme et al. (2017) Verhamme A., Orlitová I., Schaerer D., Izotov Y., Worseck G., Thuan T. X., Guseva N., 2017, A&A, 597, A13
  • Wyithe et al. (2014) Wyithe J. S. B., Loeb A., Oesch P. A., 2014, MNRAS, 439, 1326
  • Zackrisson et al. (2011) Zackrisson E., Rydberg C.-E., Schaerer D., Östlin G., Tuli M., 2011, ApJ, 740, 13
  • Zackrisson et al. (2013) Zackrisson E., Inoue A. K., Jensen H., 2013, ApJ, 777, 39
  • Zackrisson et al. (2017) Zackrisson E., et al., 2017, ApJ, 836, 78
Comments 0
Request Comment
You are adding the first comment!
How to quickly get a good reply:
  • Give credit where it’s due by listing out the positive aspects of a paper before getting into which changes should be made.
  • Be specific in your critique, and provide supporting evidence with appropriate references to substantiate general statements.
  • Your comment should inspire ideas to flow and help the author improves the paper.

The better we are at sharing our knowledge with each other, the faster we move forward.
The feedback must be of minimum 40 characters and the title a minimum of 5 characters
Add comment
Loading ...
This is a comment super asjknd jkasnjk adsnkj
The feedback must be of minumum 40 characters
The feedback must be of minumum 40 characters

You are asking your first question!
How to quickly get a good answer:
  • Keep your question short and to the point
  • Check for grammar or spelling errors.
  • Phrase it like a question
Test description