Perturbative QCD effects and the search for a signal at the Tevatron
Abstract:
The Tevatron experiments have recently excluded a Standard Model Higgs boson in the mass range 160 GeV 170 GeV at the confidence level. This result is based on sophisticated analyses designed to maximize the ratio of accepted signal to background. In this paper we study the production of a Higgs boson of mass GeV in the channel. We choose a set of cuts like those adopted in the experimental analysis and compare kinematical distributions of the final state leptons computed in NNLO QCD to lowerorder calculations and to those obtained with the event generators PYTHIA, HERWIG and MC@NLO. We also show that the distribution of the output from an Artificial Neural Network obtained with the different tools does not show significant differences. However, the final acceptance computed with PYTHIA is smaller than those obtained at NNLO and with HERWIG and MC@NLO. We also investigate the impact of the underlying event and hadronization on our results.
1 Introduction
Clarifying the role of Higgs bosons in the breaking of electroweak symmetry is of paramount importance in improving our understanding of elementary particle interactions. The discovery of the Standard Model Higgs boson, or its equivalent in theories beyond the Standard Model, is a principal objective of the highenergy collider experimental program.
Now is a very exciting time in the Higgs search since the Tevatron experiments have become sensitive to potential signals from a Higgs boson with production crosssections of roughly the magnitude predicted by the Standard Model (SM). Both CDF [1, 2] and DØ [3] have presented studies where a crosssection about times the SM prediction for a Higgs boson with a mass about twice the Wmass can be excluded by the two experiments independently with a confidence level of . Recent preliminary combinations [4, 5] of the two experiments exclude a SM Higgs boson in the mass range with a confidence level. In this mass range, a signal is predominantly produced in the SM via the gluon fusion process .
Higher order calculations for the background processes and especially the signal crosssection are indispensable for the study or exclusion of high mass Higgs bosons at the Tevatron. The magnitude of QCD corrections for the dominant signal process is extraordinarily large; the inclusive crosssection at nexttonexttoleading order (NNLO) for gluon fusion is about three times larger than the leading order (LO) crosssection. This is an even larger factor than that obtained at LHC energies. The Tevatron experiments are sensitive to a Standard Model Higgs boson signal in the mass region where gluon fusion is dominant, due to this particularly large factor and the small theoretical uncertainty which is attained with calculations at higher orders in perturbation theory.
Nexttoleading order (NLO) corrections have been computed in [8, 9, 10], where the anomalously large higher order effects in gluon fusion were first shown. The NNLO corrections for the inclusive crosssection have been computed in [11, 12, 13]. The NNLO computation has been consistently improved by resumming the softgluon contributions up to nexttonexttoleading logarithmic (NNLL) accuracy [14]. The result is an additional increase of the cross section that amounts to about at the Tevatron. The NNLL result is nicely confirmed by the computation of additional soft terms to NLO [15, 16, 17] (see also [18, 19, 20]). A calculation based on a somewhat different approach has been presented in Ref. [21].
The calculations of threshold effects have provided an invaluable argument that by now the bulk of the higher order corrections is accounted for. Smaller theoretical effects can also be important for setting a precise exclusion limit on the Higgs boson crosssection at the Tevatron. Twoloop electroweak corrections (of about ) for gluon fusion Higgs production have been computed in [22, 23, 24] , and the full twoloop amplitude has been presented in [25, 26]. Mixed QCD and Electroweak corrections have been computed in [27]. Recent predictions for the inclusive crosssection taking into account these effects have been presented in [27, 28]; these theoretical results for the inclusive gluon fusion crosssection have been used in setting the exclusion limits of Ref. [5].
Electroweak corrections of for the decay have been computed in [29]. These reach for the partial decay width of the decay of a Higgs boson to four leptons with mass of . In the region just above , which is relevant for the recent Tevatron exclusion limits, Ref. [29] reported important differences with the prediction of the program HDECAY [30]. These corrections have not yet been included into estimates of the Higgs branching ratio to leptons, and they are not considered in the experimental analysis of Refs. [4, 5].
The extraction of limits on the Higgs boson crosssection from the study of the process requires a sophisticated analysis. Background processes, such as direct production as well as and (multi)jet production, are dominant. Experimental cuts can suppress background and enhance the signal by vetoing hadronic radiation and by exploiting characteristic differences in lepton angular distributions as well as the large missing transverse momentum in signal events [31].
At the Tevatron, a cutbased analysis alone is not sufficient. Additional methods that exploit efficiently the kinematic features of signal and background processes in their finest detail are required. CDF [2] and DØ [3] apply cuts on missing energy and jet activity and impose lepton isolation criteria only for purposes of a first rough selection which biases the data samples towards signal events. After this first cutbased selection, background processes remain dominant and processing of real data and MonteCarlo simulations with Artificial Neural Network (ANN) methods follows. It is easy to appreciate the importance of ANN techniques for setting exclusion limits at the Tevatron. For example, in a data sample analyzed passing first cut selection in Ref. [3], the data model employed there predicts only signal versus background events.
Given the sensitivity of the gluon fusion crosssection to higher order effects, it is important to establish that the sophisticated methods used in the Tevatron analysis [5] account for these effects within the estimated uncertainties. Already the first cut selection may change the relative importance of higher order corrections for signal and background crosssections with respect to the perturbative patterns observed in the inclusive crosssections. A complete simulation of the experimental analysis with NLO and NNLO perturbative corrections is not practically feasible. In this paper, we aim to provide precise predictions for the production cross section in conditions close to those of the actual experiments by combining knowledge of higherorder effects from fixedorder perturbation theory and parton shower event generators.
First, we provide fixedorder predictions which are sufficiently detailed to permit assessment of the sensitivity of the acceptance due to selection cuts. For such a study, fully differential crosssections of the signal at NNLO are required. Unlike NLO computations, NNLO differential calculations are a rarity due to their substantial technical complications. The first differential distribution at NNLO was computed in 2003 [32, 33], and fully differential NNLO crosssections appeared in 2004 [34, 35]. At an electronpositron collider NNLO differential crosssections are known only for two [34, 36] and three jet production [37, 38, 39, 40, 41, 42, 43, 44] crosssections. At hadron colliders fully differential crosssections have been computed only for Higgs production in gluon fusion [35, 45, 46, 47, 48], and the DrellYan process [51, 52, 53].
In this paper we compute accepted crosssections and kinematic distributions at NNLO using the programs FEHIP [45, 47] and HNNLO [48]. We find an excellent agreement between the predictions of the two programs. We remark that the methods used [54, 46] for constructing these fully differential NNLO programs are independent and very different in their conception. Both programs produce kinematic distributions in the form of bin histograms. All NNLO results for crosssections and kinematic distributions calculated with both programs and presented here are in agreement within the expectations of statistical integration errors. The NNLO acceptance of the selection cuts can be directly compared with the predictions from the modeling of data as it has been performed by the CDF [2] and DØ [3] collaborations. The kinematic distributions we present here are input for their ANN analyses.
Two selection cuts require special care in estimating their acceptance for the Tevatron studies: (i) a jet veto on two or more central jets and (ii) isolation of leptons from hadronic activity. These cuts are used in DØ [3] in order to define the entirety of the data sample and at CDF [2] that part of the sample with a potential Higgs boson signal. At CDF [2], a further division of the data sample into zero and onejet multiplicities is made. The relative magnitude of the perturbative corrections at NLO and NNLO with respect to LO (factor) is smaller after applying selection cuts.
The same observation was made in earlier NNLO studies [47, 48] of the process at LHC centerofmass energy of 14 TeV, with similar cuts, such as a jetveto. In a separate paper [55], these NNLO predictions at the LHC were compared to results obtained from (i) a resummation of logarithms in the transverse momentum of the Higgs boson at NNLL accuracy [56] and (ii) the MonteCarlo event generator MC@NLO [57]. They were found to be in good agreement with each other over the phasespace regions singled out by the event selection cuts. On the contrary, a fixedorder calculation at nexttoleading order (NLO) accuracy provided a rather poor approximation for the required distributions and cut efficiencies.
In this paper we compare the selection cut acceptance and the shape of kinematic distributions for leptons at NNLO with the event generators MC@NLO [57], HERWIG [58], and PYTHIA8 [59] for Tevatron collisions.
The simulation of the gluon fusion process in Refs. [2, 3], is performed with PYTHIA6 [60]. In these analyses, the uncertainty in the acceptance after selection cuts is estimated with other means rather than a direct NNLO calculation. In the CDF analysis [2], PYTHIA events are reweighted [61, 62] to match either the (N)NLO Higgs or the NNLO Higgs rapidity spectrum [45, 48]. The systematic uncertainty in the acceptance is computed from the differences between the original PYTHIA and the reweighted versions. In DØ, acceptance uncertainties are estimated by comparing the spectrum of PYTHIA with that of other generators, such as MC@NLO [57] and SHERPA [63]. We believe that neither of the two methods can substitute for a direct comparison with the acceptance at NNLO. We will discuss this point in Section 4.
Multivariate techniques and distributions of ANN variables have so far been “terra incognita” for theoretical calculations at higher orders in perturbation theory. To the best of our knowledge, there has been no calculation of NLO and NNLO corrections for such observables. So far, the systematic uncertainty due to higher order effects on the shape of ANN distributions has been estimated indirectly. ANNs construct composite output variables that maximize the differences between signal and background crosssections. An error estimate on the shape of the ANN output distribution is obtained by varying the input kinematic distributions within their uncertainty range. However, if the input variables have a proper definition at the parton level, there is no obstacle to computing the corrections directly at fixed order in perturbation theory in the same fashion as for any other simple partonic variable. This should provide a more reliable theoretical estimate of the uncertainty on the ANN distribution.
We demonstrate such a calculation of an ANN output distribution through NNLO in this paper. We train an ANN with a PYTHIA simulated data sample that satisfies the selection in Ref. [1], which is a similar but somewhat simpler selection than that in Refs. [3, 2]. As input we use kinematic distributions of the leptons in the final state. We have deliberately refrained from using variables such as the transverse momentum of the Higgs boson or jets, since these distributions may differ substantially in event generators, and they are not defined in their full physical range in fixed order perturbation theory. We compare the predictions of PYTHIA, MC@NLO and HERWIG for this ANN output, and find reasonable agreement if the observed discrepancies at the cut selection levels are already accounted for.
The organization of the paper is as follows: We first present in Section 2 our predictions for the inclusive cross section at various orders of perturbation theory, then in Section 3 we define the experimental observables to which selection cuts are applied. Section 4 is devoted to a discussion of the Higgs spectrum and of the jet multiplicities. In Section 5 we give the results for the accepted cross section, i.e. after applying cuts on the various observables, and discuss the impact of the higher order corrections and scale variations on the selection efficiency. Next we present detailed comparisons of kinematical distributions, calculated at different orders of perturbative QCD (Section 6) and by using parton shower Monte Carlo models (Section 7). In Section 8 we compare for the first time fixedorder and parton shower Monte Carlo predictions for the output of an artificial neural network (ANN) similar to that used by the experimental groups. We comment on the stability and accuracy of these perturbative predictions, depending on the type of input variables to the ANN. Our conclusions are summarized in Section 9.
2 Inclusive cross section
Recent updates on the inclusive cross section for Higgs boson production at hadron colliders have been presented in Refs. [27, 28].
In Ref. [27], the fixed order NNLO crosssection [11, 12, 13] is recomputed with MSTW2008 parton densities [64]. Twoloop electroweak corrections from [25, 26] and exact finite top and bottom mass effects [10, 65] are included. Mixed QCD electroweak effects are taken into account by computing the relative magnitude of the correction with respect to the leading twoloop electroweak contribution by means of an effective theory. The central value of the crosssection is determined at a factorization and renormalization scale . The scale variation error for Higgs mass values in this fixed order calculation is . The corresponding parton density error is .
In Ref. [28], the NNLL calculation [14] with the appropriate matching to the fixed order NNLO result [11, 12, 13] is repeated using the MSTW2008 parton densities [64]. Twoloop electroweak corrections from [25, 26] and exact finite top and bottom mass effects [10] are included. Mixed QCD electroweak effects are taken into account by multiplying the inclusive twoloop electroweak contribution [25, 26] with the QCD factor. The central value of the crosssection is determined at a factorization and renormalization scale . The scale variation error for Higgs mass values in this resummed calculation is . The corresponding parton density error is .
The central values of the crosssections in Refs [27, 28] agree within a percent, and they have been used in setting the exclusion limits in [5]. CDF assigns a global theoretical uncertainty on the inclusive crosssection of [2], adding in quadrature a scale variation error and a parton density error. In our opinion, the combination of the two errors in quadrature requires further justification. DØ assigns a somewhat smaller theoretical uncertainty of to the inclusive Higgs boson crosssections. Even justifying a combination in quadrature of the uncertainties from parton densities and scale variations, the uncertainties on the total rate appear underestimated, compared to those of Refs. [27, 28].
The main goal of the present paper is to assess the theoretical uncertainties on the shapes of kinematic distributions and the acceptance after selection cuts. We have used MRST2001 PDFs at LO and MRST2004 PDFs at NLO and NNLO. All the fixedorder results in this paper have been obtained independently with the FEHiP [35, 45, 47, 55] and HNNLO [46, 48] programs, by first calculating factors in the limit of a very heavy topquark and then by multiplying these factors with the exact leading order gluon fusion cross section for Higgs production via a topquark (ignoring bottom contributions). The total width is computed using the program HDECAY [30]. electroweak corrections for the Higgs partial decay to leptons [29] have also been ignored. This is justified for studies of shapes and acceptances.
LO  NLO  NNLO  

The crosssection for , with no cuts applied and using the setup described in the last paragraph, is presented in Table 1 for illustration. We have chosen to study the signal crosssection for a Higgs mass value , and we vary the renormalization () and factorization () scales simultaneously in the interval . In the same Table, we also present the factors for the inclusive cross section,
(1) 
The perturbative corrections are very large, and there is still a substantial increase, depending on the scale choice, of the crosssection at NLO upon including the NNLO corrections. The smallest factors occur for smaller values of the renormalization and factorization scale.
3 Observables and selection cuts
The very sophisticated and complex experimental analyses of the two Tevatron collaborations cannot be reproduced fully at a perturbative level. In addition, there are important differences between the CDF and DØ analyses, both at the event selection level and in the usage of ANNs, and they evolve with time and the acquisition of more data. Nevertheless, we believe the analysis presented here captures the essential features that allow us to study the likely sensitivity of the results to higher order effects.
Our particle and event selection follows along the lines of the CDF analysis of Ref. [1] and proceeds through the following steps:

Lepton selection: in the CDF experiment, the experimental acceptances for electrons and muons are different. In this publication, we only consider the final state with two muons, simulating only the muon acceptance. The differences in the CDF cuts for muons and electrons are rather geometric, and should not alter the convergence pattern of the perturbative corrections. In our analysis, one of the finalstate leptons (the ‘trigger lepton’) must have a transverse momentum and pseudorapidity . In order to pass a further lepton selection, a second lepton must be found with and .

Two oppositesign leptons have to be found, fulfilling the requirements discussed above.

Both leptons have to be isolated, i.e. the additional transverse energy in a cone with radius around the lepton has to be smaller than 10 % of the lepton transverse momentum.

In order to reduce the background from b resonances, the invariant mass of the lepton pair has to be .


We define the missing transverse energy (MET) as the vectorial sum of the transverse momenta of the two neutrinos. We define the variable as
(2) where is the angle in the transverse plane between MET and the nearest charged lepton or jet. We require GeV, which suppresses the background from DrellYan lepton pairs and removes contributions from mismeasured leptons or jets.
The jet veto that we apply here is different from that used in our corresponding LHC studies [47, 55, 48], where all events with any number of central jets with a higher than a certain minimum value are vetoed. The cuts in the present study allow for events with a single high jet. This type of jet veto is used in the DØ analysis [3] in order to define the data sample with a potential Higgs signal. A stricter jet veto is applied in the CDF analysis [2], where three data samples are defined according to whether events have zero, one, or more central jets.
4 Higgs spectrum and jet multiplicities
One of the most important distributions for a Higgs boson produced at hadron colliders is its transverse momentum spectrum. A good description of the spectrum implies a good understanding of the QCD radiation recoiling against the Higgs.
It is well known that the Higgs spectrum is not physical when computed at fixed order, since it diverges to or at any fixed order in . When large logarithmic contributions of the form appear that must be resummed to all orders. In Ref. [56] the resummation of these logarithmically enhanced terms has been performed analytically up to NNLL accuracy, and the result has then been matched to the fixed order calculation up to . The integral of the ensuing spectrum coincides with the total NNLO cross section. In Fig. 1 (left) we compare the normalized spectrum of the Higgs computed at fixed order, to the one obtained with the numerical program of Ref. [56]. We see that the fixed order calculation diverges to as . The resummed calculation is instead well behaved as . We also note that the two calculations are in good agreement for larger than about 30 GeV.
Standard Monte Carlo event generators effectively perform the transversemomentum resummation, and thus obtain a well behaved spectrum. A comparison of the spectra from PYTHIA and MC@NLO to the resummed calculation is presented in Fig. 1 (right). We notice that the PYTHIA spectrum is softer than those of MC@NLO and the NNLL calculation. This difference is more marked at LHC energies [66].
We now comment on the procedure used by CDF and DØ to estimate the systematic uncertainties on the signal acceptance. The simulation of the gluon fusion process in Refs [2, 3], is performed with PYTHIA [60]. In these analyses, the uncertainty in the acceptance after selection cuts is estimated with different methods. In the CDF analysis [2], PYTHIA events are reweighted [61, 62] to match either the (N)NLO Higgs or the NNLO Higgs rapidity spectrum [45, 48]. The systematic uncertainty in the acceptance is computed from the differences between the original PYTHIA and the reweighted versions. In DØ, acceptance uncertainties are estimated by comparing the spectrum of PYTHIA with that of other generators, such as MC@NLO [57] and SHERPA [63]. We believe that neither of the two methods can substitute for a direct comparison with the acceptance at NNLO.
The reweighting technique used by CDF is based on the NNLO spectrum of the Higgs boson. As shown in Fig. 1, the fixedorder Higgs spectrum is divergent as . A reweighting procedure based on the NNLO Higgs spectrum makes sense only if the signal rate is integrated over a reasonably large region (as is done, for example, in Ref. [62]). In this respect, it would be better to reweight using a resummed calculation [56], or to use different kinematic variables.
Note also that a reweighting based on the Higgs and rapidity is insensitive to the jet multiplicity of the event, which is used in order to divide the data sample. Another important aspect is that the relative weight of the one and twojet sample is enhanced by the CDF cuts, with respect to the zerojet sample. If a jet is required in all events, the calculation includes matrix elements through NLO only. If two jets are required, only LO matrixelements are taken into account. An NLO prediction for 1jet samples and a LO prediction for 2jet samples ^{1}^{1}1 An NLO calculation for jets via gluon fusion can be found in [67]. can be obtained with FEHIP and HNNLO using the corresponding NLO and LO and parton densities. More importantly, we find it inconsistent to use the theoretical uncertainty from the inclusive NNLO gluon fusion crosssection as the uncertainty of the samples with defined jet multiplicities other than zero.
To demonstrate this point further, we follow an analogous procedure as in [2] and divide the signal crosssection into three bins according to the number of central jets. Jets are defined using the algorithm, with minimum GeV and maximum rapidity . We compute the inclusive crosssections for GeV and vary the renormalization and factorization scale simultaneously in the interval . In Table 2 we compute the three bin crosssections, using either NNLO or NLO or LO parton densities and evolution from the recent MSTW2008 fit [64].
LO (pdfs, )  NLO (pdfs, )  NNLO (pdfs, )  

0jets  
1jet  
2jets 
The total crosssection with NNLO pdfs varies around the default scale value by . From Table 2 we see that about of the events contain zero jets, one jet only, and contain more than one jets. Notice, however, that the scale variation in the three jet bins is significantly different and deteriorates with increasing jet multiplicity. This is a consequence of the fact that in the 1jet and 2jet bins the fixed order calculation is only accurate through NLO and LO, respectively. The resulting scale dependence of the inclusive cross section is made up as follows:
(3) 
The application of different selection cuts in the three jet bins leads to a theoretical error estimate of the number of signal events which is different from the theoretical error of the inclusive NNLO crosssection. Specifically, from Tables 13 of Ref. [2] we observe that, after preselection, of gluon fusion events belong to the 0jets bin, to the 1jet bin, and to the 2jet bin.
We now examine how this modification of the jet multiplicities with the experimental cuts affects the scale variation for the total number of events. With the exception of the jetveto, all other cuts used in the CDF preselection [2] do not affect the scale variation of the total crosssection significantly. We can then estimate the scale variation of the total number of signal events using the scalevariations for each jetmultiplicity in Table 2 and the expected composition of jetmultiplicities for the signal [2]. Using NNLO pdf’s and NNLO evolution for all jet bins, we find that:
(4) 
The resulting scale variation is therefore larger than the corresponding scale variation of for the inclusive crosssection.
Notice that in Eq. 4 we used a scale variation for the onejet and twojet bins corresponding to NNLO pdfs and evolution. A more consistent approach would be to estimate the number of events in the 1jet and 2jet bins using NLO and LO pdfs and evolution correspondingly. In this way we obtain:
(5) 
The relative population of the jet bins is very important for the determination of the theoretical error on the total number of events. The contribution of the different jet multiplicities to the total error can be altered also after the preselection cuts, since, in general, the independent multivariate methods for discriminating signal from background events in various jet bins should have different discriminating efficiency. In conclusion, the theoretical error for the number of events at various jet multiplicities should not be estimated collectively from the scale variation of the total crosssection.
5 Signal cross section and preselection efficiency at fixed order
We present in Table 3 the LO, NLO, and NNLO cross sections, as obtained with FEHIP and HNNLO after applying the selection cuts in Section 3, for a default Higgs boson mass value .
LO  NLO  NNLO  

Comparison of the results of Table 3 with those of Table 1 shows that the impact of QCD radiative corrections is significantly reduced when selection cuts are applied. Indeed, for the NLO and NNLO factors are reduced by and , respectively. As a consequence, the acceptance is also reduced, since it is defined as the ratio of the crosssection after cuts to the inclusive cross section. At LO about of the events are accepted. At NLO, the efficiency drops to and at NNLO to , depending on the scale choice.
An important observation is that the scale dependence of the efficiency becomes stronger when we increase the perturbative order. The lepton isolation and jetveto cuts do not change the LO cross section, since the isolation requirement gives a nonvanishing contribution only at NLO and the veto on the number of central jets is effective only beyond NLO. We will comment further on the jetveto later when comparing to the MonteCarlo event generators, PYTHIA, HERWIG and MC@NLO.
The increased scale variation of the acceptance at NNLO is a reflection of the reduction of the scale variation for the cross section after cuts are applied. In Fig. 2, we present the cross section as a function of the renormalization and factorization scale , before and after cuts, at LO, NLO, and NNLO. Before cuts, the NNLO cross section varies by over the interval . This drops to a scale variation after cuts, and consequently the ratio varies by . We note that the scale variation in a different but also commonly used range is for the inclusive crosssection and for the accepted crosssection after cuts. In this second scale variation range, the accepted crosssection is not monotonic and develops a maximum.
6 Kinematic distributions at fixed order perturbation theory
We now turn to a more detailed study of the kinematical properties of the accepted events. The applied cuts provide a rough discrimination of the Higgs signal over the background. The experimental analysis proceeds further by exploiting the differences in the kinematical distributions between signal and background and is typically based on Artificial Neural Network (ANN) techniques. It is thus important to check that these distributions are stable against radiative corrections.
In Fig. 3 we present a set of distributions that are commonly used in the experimental analysis, computed at LO, NLO and NNLO. The uncertainty bands are obtained by varying between and . Figures 3(a) and (b) show the transverse momentum spectrum of the leading and trailing lepton, and . Figure 3(c) shows the invariant mass distribution of the lepton pair, , Fig. 3(d) shows the MET distribution and, finally, Fig. 3(e) shows the azimuthal separation of the two charged leptons in the transverse plane, . Overall, these plots show that the distributions are quite stable when going from NLO to NNLO. The NNLO band generally lies on top of the NLO band and nicely overlaps with the latter. The scale uncertainty of the distributions is consistent with that quoted in Table 3 and is about in the peak region of the distributions.
We also observe that the perturbative corrections from NLO to NNLO are smaller when the scale is used.
7 Comparison with parton shower event generators
In the next step we compare the accepted crosssection and kinematic distributions after selection cuts, obtained in fixed order perturbation theory, with the predictions of the parton shower event generators HERWIG, PYTHIA and MC@NLO. We consider this as a very important validation for the current studies at the Tevatron, which rely on event generator predictions.
[]  

LO  
(HERWIG)  
(PYTHIA)  
NLO  
MC@NLO  
(HERWIG)  
(PYTHIA)  
NNLO  
(MC@NLO)  
(HERWIG)  
(PYTHIA) 
In Table 4 we present the predictions of the fixed order calculations through NNLO, as well as the predictions of PYTHIA, HERWIG and MC@NLO. At this stage we do not include a simulation of hadronization and the underlying event, in order to make a more direct comparison with the fixed order predictions, which cannot take into account such effects. The total inclusive crosssection of PYTHIA and HERWIG corresponds to a pure LO computation, and that of MC@NLO is correct at NLO accuracy. Since we are interested in a comparison of the efficiencies and not of absolute crosssections, we multiply the results of the event generators with appropriate scaling factors, such that they match the result of the total inclusive crosssection given by our fixed order computation. Because of the different parton distribution functions and approximation for the top loop used in the various calculations, these scaling factors to match the leading fixed order calculations [45, 48] are different from unity at the level of 5%.
After applying the selection cuts, we find a relatively good agreement among the NNLO, MC@NLO and HERWIG results for the accepted crosssections, with the MC@NLO and HERWIG predictions rescaled to reproduce the fixed order NNLO inclusive cross section before cuts, as explained above. The MC@NLO result is smaller than the NNLO prediction by , depending on the scale choice. HERWIG results differs from the NNLO prediction by to . On the contrary, the accepted cross section and consequently the selection efficiency obtained with PYTHIA appears to be somewhat smaller. Depending on the scale choice the difference ranges between 12 and 21%.
For a better understanding of the results of Table 4, we now analyze the efficiency of individual cuts, applied in turn. The results are given in Table 5, where the efficiencies due to a specific cut only (after all previous cuts have been applied) are presented between parentheses.
Trigger  JetVeto  Isolation  All Cuts  

NNLO ()  ()  ()  
NNLO ()  ()  ()  ()  
MC@NLO ()  ()  ()  ()  
MC@NLO ()  ()  ()  ()  
HERWIG  ()  ()  ()  
PYTHIA  ()  ()  () 
We observe the following:

When only the cuts for lepton selection are applied (“Trigger”), we generally find very good agreement of all calculations for the corresponding efficiency. In detail, MC@NLO and NNLO yield almost identical efficiencies, while HERWIG and PYTHIA give a slightly higher efficiency.

The veto on two or more central jets is a rather critical cut for the achievable accuracy of the efficiency estimation. This “preselection” cut has been used in the recent Tevatron studies. The efficiency of the jetveto alone, after trigger cuts, varies significantly at NNLO by about . As discussed in Section 5, this is due to the larger scale variation of the inclusive NNLO crosssection, while at the same time the accepted crosssection after the jetveto application is more stable. For , where the fixed order expansion demonstrates a faster convergence, we find that the jetveto efficiency at NNLO is in very good agreement with MC@NLO and HERWIG. PYTHIA, which is the main tool used in the Tevatron analysis, predicts an efficiency which is smaller by for .

PYTHIA also predicts a smaller efficiency by about for the isolation cut. HERWIG, MC@NLO and the NNLO calculation are consistent, taking into account scale variations.

The efficiency of the remaining cuts is very similar in all computations. After preselection, PYTHIA, rescaled with an inclusive NNLO factor, predicts between 12 and 21% less signal events than the NNLO computation.
The sensitivity of event generator predictions to cuts that restrict hadronic activity requires careful investigation. In particular, it is important to study the effect of hadronization and the underlying event on the efficiency. We have performed such an analysis in the case of MC@NLO, where hadronization effects are modeled by HERWIG and the underlying event is simulated by interfacing to the JIMMY package [68], and also for PYTHIA, where the hadronization and underlying event modelling is inbuilt. The results are given in Table 6. We find only minor differences at the 1% level, which leads us to the conclusion that the differences observed with PYTHIA are rather related to its matrix element and parton shower implementation.
Trigger  JetVeto  Isolation  All Cuts  

MC@NLO (Parton)  
MC@NLO (Hadron)  
MC@NLO (Had + UE)  
PYTHIA (Parton)  
PYTHIA (Had + UE) 
8 Artificial Neural Network
The current experimental analysis at the Tevatron attempts to distinguish a very small number of events from a considerably larger background. For such a task, the use of advanced statistical methods is necessary. An integral part of the experimental studies are distributions of discrimination variables, defined via artificial neural networks (ANN). It is clear that such techniques will become an indispensable tool in many future studies at the Tevatron and at the LHC, optimizing the sensitivity of the experiments.
To the best of our knowledge, so far there has been no study of how the distributions of ANN outputs are modified at higher orders in perturbation theory. Here we present for the first time an ANN output distribution, computed at fixed order in perturbation theory, beyond the leading order.
In order to study these higherorder effects on the outcome variable, we have built an ANN with the tool TMVA v. 3.9.2. [6] based on the Data Analysis Framework ROOT v. 5.21.02 [7]. In the construction (the socalled training) of the ANN the user has to provide a set of signal and background events, as well as a list of input variables. In our study we use the variables defined in Section 6 as input variables. Based on the techniques of Multilayer Perceptrons, the ANN then builds an output variable, which is basically a nonlinear function of the input variables. Since our study is based on Monte Carlo truth information, we restrict the set of background to processes that have the same final state signature as the signal, i.e. to continuum WW background and toppair production. For both of these processes, as well as for the signal process, we generate a large enough event sample ( events after preselection) with the LO partonshower Monte Carlo PYTHIA8 [60], and use half of the samples to train the ANN. The other half is used in the socalled testing step.
Fig. 6 shows the ANN output for the signal and background samples, for the training (left) and the testing (right) samples. The comparison of the training and testing distributions serves as a verification that the ANN has not been overtrained, i.e. tuned to statistical effects in the training sample. The discrimination power of the ANN variable can be seen clearly. While the background distribution peaks at low values,^{2}^{2}2The background peaks around and arise mainly from and respectively. the signal events populate the high value range. Distributions like this can serve to distinguish background from possible signal events, either by cutting on the ANN output, or by using the shape of the distribution to decide whether the observed event set consists of background only or background and signal events.
In Fig. 7 we present the distribution of the ANN output for the signal in fixedorder perturbation theory, computed at LO, NLO and NNLO. We find significant radiative corrections, which however are consistent in magnitude with those for the accepted cross section after preselection cuts.
In Fig. 8 we compare the ANN distribution obtained at NNLO and with MC@NLO. Again we find a very good agreement between the fixed order calculation and the MC@NLO prediction, when the latter is rescaled with a factor in order to reproduce the total inclusive crosssection.
Finally, in Fig. 9 we compare the ANN distribution obtained at NNLO QCD and with PYTHIA. We see that PYTHIA, even after rescaling with an inclusive factor, yields predictions which are smaller by 1220%, depending on the chosen bin. This difference can be traced back to the difference in efficiency already observed at the level of the selection cuts placed on the kinematic input distributions.
Note that we have not included any hadronic variable as an input to the ANN. It is clear that stable perturbative patterns are obtained as long as we apply cuts on “leptonic” variables only. However, adding a hadronic variable to the list of ANN inputs could produce results that are very sensitive to the details of the modeling of the hadronic activity in the event generators used for the training of the network.
9 Conclusions
In this paper we have studied higherorder QCD effects in the search for a Higgs boson of mass GeV at the Tevatron. We have considered a definite set of preselection cuts that we believe capture the essential features of the CDF and DØ analyses. We have studied the impact of higher order corrections on a set of kinematical distributions of the final state leptons. We have then compared these distributions, computed up to NNLO in QCD perturbation theory, to those obtained with the PYTHIA, HERWIG and MC@NLO event generators. The comparison of distributions does not show significant differences, and this is confirmed by a more sophisticated analysis we have performed based on the training of our own ANN.
For the ANN analysis we used only leptonic input variables, in order to reduce sensitivity to the modelling of hadronic activity and to allow perturbative evaluation of the output distribution. These features are also necessary for the reliable estimation of theoretical uncertainties in experimental ANN analyses.
We have also compared the efficiency of the experimental cuts obtained in NNLO QCD to those obtained with PYTHIA, HERWIG and MC@NLO. The efficiencies obtained with HERWIG and MC@NLO are consistent with that obtained at NNLO. The MC@NLO acceptance is slightly smaller than the NNLO acceptance, by , while the acceptance of HERWIG differs from the NNLO prediction by to . In contrast, we find that the acceptance computed with PYTHIA is between and smaller than the NNLO acceptance, depending on the choice of the factorization and renormalization scale. This result is not significantly altered by hadronization and underlying event and appears instead to be related to the matrix element and parton shower implementation in PYTHIA itself. Since the Tevatron analyses are based on PYTHIA, we believe that this effect could be important and requires a more detailed investigation within the framework of the full experimental analysis.
Relevant to the experimental analysis, we have remarked that the combination in quadrature of the theoretical errors due to the parton distributions and scale variations in Refs. [2, 3] implies that the theoretical uncertainty on the total cross section used there is likely to be underestimated. We also pointed out that a reweighting of parton shower MonteCarlos to match the fixedorder Higgs distribution is not appropriate for events with a low Higgs value. Finally, we have demonstrated that a reliable estimation of the theoretical uncertainty for Higgs signal crosssections with defined jet multiplicities requires dedicated fixed order computations for each multiplicity. The theoretical uncertainty in each jetbin is different from the theoretical uncertainty of the total crosssection.
Acknowledgments
We thank Stefan Bucherer, Frank Petriello, Uli Haisch, Zoltan Kunszt and Giulia Zanderighi for usefull discussions.
References
 [1] T. Aaltonen et al. [CDF Collaboration], Phys. Rev. Lett. 102, 021802 (2009) [arXiv:0809.3930 [hepex]].
 [2] “Search for production at CDF using 3.0 fb of data”, CDF conference note 9500.
 [3] “Search for Higgs boson production in dilepton plus missing transverse energyy final states with 3.0–4.2 fb of collisions at ”, D0 conference note 5871.
 [4] G. Bernardi et al. arXiv:0808.0534 [hepex].
 [5] Tevatron New Phenomena Higgs Working Group and CDF Collaboration and D0 collaboration, arXiv:0903.4001 [hepex].
 [6] TMVA: Toolkit for Multivariate Data Analysis with ROOT, http://tmva.sourceforge.net/
 [7] ROOT, A Data Analysis Framework, http://root.cern.ch
 [8] S. Dawson, Nucl. Phys. B 359, 283 (1991).
 [9] A. Djouadi, M. Spira and P. M. Zerwas, Phys. Lett. B 264, 440 (1991).
 [10] M. Spira, A. Djouadi, D. Graudenz and P. M. Zerwas, Nucl. Phys. B 453, 17 (1995) [arXiv:hepph/9504378].
 [11] R. V. Harlander and W. B. Kilgore, Phys. Rev. Lett. 88, 201801 (2002) [arXiv:hepph/0201206].
 [12] C. Anastasiou and K. Melnikov, Nucl. Phys. B 646, 220 (2002) [arXiv:hepph/0207004].
 [13] V. Ravindran, J. Smith and W. L. van Neerven, Nucl. Phys. B 665, 325 (2003) [arXiv:hepph/0302135].
 [14] S. Catani, D. de Florian, M. Grazzini and P. Nason, JHEP 0307, 028 (2003) [arXiv:hepph/0306211].
 [15] S. Moch and A. Vogt, Phys. Lett. B 631, 48 (2005) [arXiv:hepph/0508265].
 [16] E. Laenen and L. Magnea, Phys. Lett. B 632 (2006) 270 [arXiv:hepph/0508284].
 [17] A. Idilbi, X. d. Ji, J. P. Ma and F. Yuan, Phys. Rev. D 73 (2006) 077501 [arXiv:hepph/0509294].
 [18] V. Ravindran, Nucl. Phys. B 746 (2006) 58 [arXiv:hepph/0512249].
 [19] V. Ravindran, Nucl. Phys. B 752 (2006) 173 [arXiv:hepph/0603041].
 [20] V. Ravindran, J. Smith and W. L. van Neerven, Nucl. Phys. B 767 (2007) 100 [arXiv:hepph/0608308].
 [21] V. Ahrens, T. Becher, M. Neubert and L. L. Yang, arXiv:0809.4283 [hepph].
 [22] G. Degrassi and F. Maltoni, Phys. Lett. B 600, 255 (2004) [arXiv:hepph/0407249].
 [23] U. Aglietti, R. Bonciani, G. Degrassi and A. Vicini, Phys. Lett. B 595, 432 (2004) [arXiv:hepph/0404071].
 [24] U. Aglietti, R. Bonciani, G. Degrassi and A. Vicini, “Twoloop electroweak corrections to Higgs production in proton proton arXiv:hepph/0610033.
 [25] S. Actis, G. Passarino, C. Sturm and S. Uccirati, “NLO Electroweak Corrections to Higgs Boson Production at Hadron Phys. Lett. B 670, 12 (2008) [arXiv:0809.1301 [hepph]].
 [26] S. Actis, G. Passarino, C. Sturm and S. Uccirati, Nucl. Phys. B 811, 182 (2009) [arXiv:0809.3667 [hepph]].
 [27] C. Anastasiou, R. Boughezal and F. Petriello, arXiv:0811.3458 [hepph].
 [28] D. de Florian and M. Grazzini, arXiv:0901.2427 [hepph].
 [29] A. Bredenstein, A. Denner, S. Dittmaier and M. M. Weber, Phys. Rev. D 74, 013004 (2006) [arXiv:hepph/0604011].
 [30] A. Djouadi, J. Kalinowski and M. Spira, Comput. Phys. Commun. 108, 56 (1998) [arXiv:hepph/9704448].
 [31] M. Dittmar and H. K. Dreiner, Phys. Rev. D 55, 167 (1997) [arXiv:hepph/9608317].
 [32] C. Anastasiou, L. J. Dixon, K. Melnikov and F. Petriello, Phys. Rev. Lett. 91, 182002 (2003) [arXiv:hepph/0306192].
 [33] C. Anastasiou, L. J. Dixon, K. Melnikov and F. Petriello, Phys. Rev. D 69, 094008 (2004) [arXiv:hepph/0312266].
 [34] C. Anastasiou, K. Melnikov and F. Petriello, Phys. Rev. Lett. 93, 032002 (2004) [arXiv:hepph/0402280].
 [35] C. Anastasiou, K. Melnikov and F. Petriello, Phys. Rev. Lett. 93, 262002 (2004) [arXiv:hepph/0409088].
 [36] S. Weinzierl, Phys. Lett. B 644, 331 (2007) [arXiv:hepph/0609021].
 [37] A. D. Ridder, T. Gehrmann, E. W. N. Glover and G. Heinrich, arXiv:0903.4658 [hepph].
 [38] A. GehrmannDe Ridder, T. Gehrmann, E. W. N. Glover and G. Heinrich, Phys. Rev. Lett. 100, 172001 (2008) [arXiv:0802.0813 [hepph]].
 [39] G. Dissertori, A. GehrmannDe Ridder, T. Gehrmann, E. W. N. Glover, G. Heinrich and H. Stenzel, JHEP 0802, 040 (2008) [arXiv:0712.0327 [hepph]].
 [40] A. GehrmannDe Ridder, T. Gehrmann, E. W. N. Glover and G. Heinrich, JHEP 0712, 094 (2007) [arXiv:0711.4711 [hepph]].
 [41] A. GehrmannDe Ridder, T. Gehrmann, E. W. N. Glover and G. Heinrich, Phys. Rev. Lett. 99, 132002 (2007) [arXiv:0707.1285 [hepph]].
 [42] S. Weinzierl, arXiv:0904.1145 [hepph].
 [43] S. Weinzierl, arXiv:0904.1077 [hepph].
 [44] S. Weinzierl, Phys. Rev. Lett. 101, 162001 (2008) [arXiv:0807.3241 [hepph]].
 [45] C. Anastasiou, K. Melnikov and F. Petriello, Nucl. Phys. B 724, 197 (2005) [arXiv:hepph/0501130], http://www.phys.hawaii.edu/kirill/FEHiP.htm.
 [46] S. Catani and M. Grazzini, Phys. Rev. Lett. 98 (2007) 222002 [arXiv:hepph/0703012].
 [47] C. Anastasiou, G. Dissertori and F. Stockli, JHEP 0709, 018 (2007) [arXiv:0707.2373 [hepph]].
 [48] M. Grazzini, JHEP 0802, 043 (2008) [arXiv:0801.3232 [hepph]].
 [49] S. Catani, Y. L. Dokshitzer, M. H. Seymour and B. R. Webber, Nucl. Phys. B 406, 187 (1993).
 [50] S. D. Ellis and D. E. Soper, Phys. Rev. D 48, 3160 (1993) [arXiv:hepph/9305266].
 [51] K. Melnikov and F. Petriello, Phys. Rev. D 74, 114017 (2006) [arXiv:hepph/0609070].
 [52] K. Melnikov and F. Petriello, Phys. Rev. Lett. 96, 231803 (2006) [arXiv:hepph/0603182].
 [53] S. Catani, L. Cieri, G. Ferrera, D. de Florian and M. Grazzini, arXiv:0903.2120 [hepph].
 [54] C. Anastasiou, K. Melnikov and F. Petriello, Phys. Rev. D 69, 076010 (2004) [arXiv:hepph/0311311].
 [55] C. Anastasiou, G. Dissertori, F. Stockli and B. R. Webber, JHEP 0803 (2008) 017 [arXiv:0801.2682 [hepph]].
 [56] G. Bozzi, S. Catani, D. de Florian and M. Grazzini, Nucl. Phys. B 737, 73 (2006) [arXiv:hepph/0508068].
 [57] S. Frixione and B. R. Webber, JHEP 0206, 029 (2002) [arXiv:hepph/0204244]; arXiv:0812.0770 [hepph].
 [58] G. Corcella et al., JHEP 0101, 010 (2001) [arXiv:hepph/0011363]; arXiv:hepph/0210213.
 [59] T. Sjostrand, S. Mrenna and P. Skands, Comput. Phys. Commun. 178 (2008) 852 [arXiv:0710.3820 [hepph]].
 [60] T. Sjostrand, S. Mrenna and P. Skands, JHEP 0605, 026 (2006) [arXiv:hepph/0603175].
 [61] G. Davatz, G. Dissertori, M. Dittmar, M. Grazzini and F. Pauss, JHEP 0405, 009 (2004) [arXiv:hepph/0402218].
 [62] G. Davatz, F. Stöckli, C. Anastasiou, G. Dissertori, M. Dittmar, K. Melnikov and F. Petriello, JHEP 0607, 037 (2006) [arXiv:hepph/0604077].
 [63] T. Gleisberg, S. Hoche, F. Krauss, A. Schalicke, S. Schumann and J. C. Winter, JHEP 0402, 056 (2004) [arXiv:hepph/0311263].
 [64] A. D. Martin, W. J. Stirling, R. S. Thorne and G. Watt, arXiv:0901.0002 [hepph].
 [65] C. Anastasiou, S. Beerli, S. Bucherer, A. Daleo and Z. Kunszt, JHEP 0701, 082 (2007) [arXiv:hepph/0611236].
 [66] C. Balazs, M. Grazzini, J. Huston, A. Kulesza and I. Puljak, arXiv:hepph/0403052.
 [67] J. M. Campbell, R. K. Ellis and G. Zanderighi, JHEP 0610, 028 (2006) [arXiv:hepph/0608194].
 [68] J. M. Butterworth, J. R. Forshaw and M. H. Seymour, Z. Phys. C 72, 637 (1996) [arXiv:hepph/9601371].