Surrogate model of hybridized numerical relativity binary black hole waveforms

Surrogate model of hybridized numerical relativity binary black hole waveforms

Vijay Varma Theoretical Astrophysics, California Institute of Technology, Pasadena, CA 91125, USA    Scott E. Field Department of Mathematics, Center for Scientific Computing and Visualization Research, University of Massachusetts, Dartmouth, MA 02747, USA    Mark A. Scheel Theoretical Astrophysics, California Institute of Technology, Pasadena, CA 91125, USA    Jonathan Blackman Theoretical Astrophysics, California Institute of Technology, Pasadena, CA 91125, USA    Lawrence E. Kidder Center for Radiophysics and Space Research, Cornell University, Ithaca, New York 14853, USA    Harald P. Pfeiffer Max Planck Institute for Gravitational Physics (Albert Einstein Institute), Am Mühlenberg 1, Potsdam 14476, Germany Canadian Institute for Theoretical Astrophysics, 60 St. George Street, University of Toronto, Toronto, ON M5S 3H8, Canada
July 20, 2019

Numerical relativity (NR) simulations provide the most accurate binary black hole gravitational waveforms, but are prohibitively expensive for applications such as parameter estimation. Surrogate models of NR waveforms have been shown to be both fast and accurate. However, NR-based surrogate models are limited by the training waveforms’ length, which is typically about 20 orbits before merger. We remedy this by hybridizing the NR waveforms using both post-Newtonian and effective one body waveforms for the early inspiral. We present NRHybSur3dq8, a surrogate model for hybridized nonprecessing numerical relativity waveforms, that is valid for the entire LIGO band (starting at ) for stellar mass binaries with total masses as low as . We include the and spin-weighted spherical harmonic modes but not the or modes. This model has been trained against hybridized waveforms based on 104 NR waveforms with mass ratios , and , where () is the spin of the heavier (lighter) BH in the direction of orbital angular momentum. The surrogate reproduces the hybrid waveforms accurately, with mismatches over the mass range . At high masses (), where the merger-ringdown is more prominent, we show roughly two orders of magnitude improvement over existing waveform models. We also show that the surrogate works well even when extrapolated outside its training parameter space range, including at spins as large as 0.998. Finally, we show that this model accurately reproduces the spheroidal-spherical mode mixing present in the NR ringdown signal.


I Introduction

The era of gravitational wave (GW) astronomy has been emphatically unveiled with the recent detections Abbott et al. (2016a, 2017a, b, 2017b, 2017c, 2017d, 2018a) by LIGO Aasi et al. (2015) and Virgo Acernese et al. (2015). The detection of gravitational wave signals from compact binary sources is expected to become a routine occurrence as the advanced detectors reach their design sensitivity Abbott et al. (2016, 2018b). The possible science output from these events crucially depends on the availability of an accurate waveform model to compare against observed signals.

Numerical relativity (NR) is the only ab initio approach that accurately produces waveforms from the merger of a binary black hole (BBH) system. However, because NR simulations are computationally expensive, it is impractical to use them directly for applications such as parameter estimation, which can require upwards of waveform evaluations. Therefore, the GW community has developed several approximate waveform models Khan et al. (2018); Pan et al. (2014); London et al. (2018); Cotesta et al. (2018); Khan et al. (2016); Bohé et al. (2017); Hannam et al. (2014); Taracchini et al. (2014); Pan et al. (2011); Mehta et al. (2017), some of which are fast to evaluate. These models make certain physically-motivated assumptions about the underlying phenomenology of the waveforms, and they fit for any remaining free parameters using NR simulations.

Surrogate modeling Field et al. (2014); Pürrer (2014) is an alternative approach that doesn’t assume an underlying phenomenology and has been applied to a diverse range of problems Field et al. (2014); Pürrer (2014); Lackey et al. (2017); Doctor et al. (2017); Blackman et al. (2015, 2017a, 2017b); Huerta et al. (2018); Canizares et al. (2015); Galley and Schmidt (2016). NR Surrogate models follow a data-driven approach, directly using the NR waveforms to implicitly reconstruct the underlying phenomenology. Three NR surrogate models have been built so far Blackman et al. (2015, 2017a, 2017b), including a 7-dimensional (mass ratio and two spin vectors) model for generically precessing systems in quasi-circular orbit Blackman et al. (2017b). Through cross-validation studies, these models were shown to be nearly as accurate as the NR waveforms they were trained against.

Despite the success of the surrogate modeling approach, existing surrogate models have two important limitations: (1) Because they are based solely on NR simulations, which typically are only able to cover the last orbits of a BBH inspiral, they are not long enough to span the full LIGO band for stellar mass binaries. (2) Apart from the first non-spinning model Blackman et al. (2015), these models have been restricted to mass ratios  111We use the convention , where and are the masses of the component black holes, with .. There are two reasons for this: (i) The 7d parameter space is vast, requiring at least a few thousand simulations to sufficiently cover it. (ii) Because of the smaller length scale introduced by the lighter black hole, NR simulations become increasingly more expensive with mass ratio.

In this work we address these limitations in the context of nonprecessing BBH systems. First, to include the early inspiral we “hybridize” the NR waveforms : each full waveform consists of a post-Newtonian (PN) and effective one body (EOB) waveform at early times that is smoothly attached to an NR waveform at late times. Second, since we restrict ourselves to the 3-dimensional space of nonprecessing BBHs, fewer simulations are necessary compared to the 7-dimensional case, and therefore we can direct computational resources to simulations with higher mass ratios. The resulting model, NRHybSur3dq8, is the first NR-based surrogate model to span the entire LIGO frequency band for stellar mass binaries; assuming a detector low-frequency cut-off of , this model is valid for total masses as low as . This model is based on 104 NR waveforms in the parameter range , and , where () is the dimensionless spin of the heavier (lighter) black hole (BH).

The plus () and cross () polarizations of GWs can be conveniently represented by a single complex time-series, . The complex waveform on a sphere can be decomposed into a sum of spin-weighted spherical harmonic modes  Newman and Penrose (1966); Goldberg et al. (1967), so that the waveform along any direction (,) in the binary’s source frame is given by


where are the spin weighted spherical harmonics, is the inclination angle between the orbital angular momentum of the binary and line-of-sight to the detector, and is the initial binary phase. can also be thought of as the azimuthal angle between the axis of the source frame and the line-of-sight to the detector. We define the source frame as follows: The axis is along the orbital angular momentum direction, which is constant for nonprecessing BBH. The axis is along the line of separation from the lighter BH to the heavier BH at some reference time/frequency. The axis completes the triad.

The terms typically dominate the sum in Eq. (1), and are referred to as the quadrupole modes. Studies Varma and Ajith (2017); Capano et al. (2014); Littenberg et al. (2013); Calderón Bustillo et al. (2017); Brown et al. (2013); Varma et al. (2014); Graff et al. (2015); Harry et al. (2018) have shown that the nonqudrupole modes, while being subdominant, can play a nonnegligible role in detection and parameter estimation of GW sources, particularly for large signal to noise ratio (SNR), large total mass, large mass ratio, or large inclination angle . For the first event, GW150914 Abbott et al. (2016a), the systematic errors due to the quadrupole-mode-only approximation are generally smaller than the statistical errors Abbott et al. (2017e, 2016), although higher modes may lead to modest changes in some of the extrinsic parameter values Kumar et al. (2018). However, as the detectors approach their design sensitivity Abbott et al. (2016), one should prepare for high-SNR sources (particularly at larger mass ratios than those seen so far), where the quadrupole-mode-only approximation breaks down. In addition, nonqudrupole modes can help break the degeneracy between the binary inclination and distance, which is present for quadrupole-mode-only models (see e.g. London et al. (2018); O’Shaughnessy et al. (2014); Usman et al. (2018)).

In this work, we model the following spin-weighted spherical harmonic modes: and (5,5), but not the (4,1) or (4,0) modes 222Because of the symmetries of nonprecessing BBHs (see Eq. (23)), the modes contain the same information as the modes, and do not need to be modeled separately.. Several inspiral-merger-ringdown waveform models Cotesta et al. (2018); London et al. (2018); Pan et al. (2011); Mehta et al. (2017) that include nonquadrupole modes have been developed in recent years; however, compared to those models we show an improved accuracy and we include more modes.

The rest of the paper is organized as follows. In Sec. II we choose the parameters at which to perform NR simulations, which will be used for training the surrogate model. Sec. III describes the NR simulations. Sec. IV describes our procedure to compute the waveform for the early inspiral using PN and EOB waveforms. Sec. V describes our hybridization procedure to attach the early inspiral waveform to the NR waveforms. Sec. VI describes the construction of the surrogate model. In Sec. VII, we test the surrogate model by comparing against NR and hybrid waveforms. We end with some concluding remarks in Sec. VIII. We make our model available publicly through the easy-to-use Python package gwsurrogate Blackman et al. ().

Ii Training set generation

ii.1 Greedy parameters from PN surrogate model

We do not know a priori the distribution or number of NR simulations required to build an accurate surrogate model. Furthermore, we hope to select a representative distribution that will allow for an accurate surrogate to be built with as few NR simulations as possible. Therefore, we estimate this distribution by first building a surrogate model for PN waveforms; we find that parameters suitable for building an accurate PN surrogate are also suitable for building an NR or a hybrid NR-PN surrogate.

We use the same methods to build the PN surrogate as we use for the hybrid surrogate (cf. Sec. VI). We use the PN waveforms described in Sec. IV.1; however, for simplicity we only model the (2,2) mode. In addition, we restrict the length of the PN waveforms to be , terminating at the innermost-stable-circular-orbit’s orbital frequency, , where is the total mass of the binary.

We determine the desired training data set of parameters as follows. We begin with just the corner cases of the parameter space; for the 3d case considered here, that consists of 8 points at . We build up the desired set of parameters iteratively, in a greedy manner: At each iteration we build a PN surrogate using the current training data set and test the model against a much larger ( times) validation data set. The validation data set is generated by randomly resampling the parameter space at each iteration. Since the boundary cases are expected to be more important, for 30% of the points in the validation set we sample only from the boundary of the parameter space, which corresponds to the faces of a cube in the 3d case. We select the parameter in the validation set that has the largest error (cf. Eq. (2)), and add this to our training set (hence the name greedy parameters). We repeat until the validation error reaches a certain threshold.

In order to estimate the difference between two complexified waveforms, and , we use the time-domain mismatch,


where indicates a complex conjugation, and indicates the absolute value. Note that in this section, we do not perform an optimization over time and phase shifts. In addition, we assume a flat noise curve.

Figure 1: Largest mismatch of the surrogate (over the entire validation set) as a function of number of greedy parameters used to train the PN surrogate. The PN surrogate is seen to converge to the validation waveforms as the size of the training data set increases.

Figure 1 shows how the maximum validation error decreases as we add greedy parameters to our training data set. For our case, we stop at 100 greedy parameters (at which point the mismatch is ) and use those parameters to perform the NR simulations. Note that we don’t expect 100 NR simulations to produce an NR surrogate with comparable accuracy, , for two reasons. First, unlike the PN waveforms used here, the NR simulations also include the merger-ringdown part, which we expect to be more difficult to model. Second, the NR numerical truncation error is typically higher than in mismatch, therefore the numerical noise will limit the accuracy.

Iii NR simulations

The NR simulations for this model are performed using the Spectral Einstein Code (SpEC) SpE (); Pfeiffer et al. (2003); Lovelace et al. (2008); Lindblom et al. (2006); Szilágyi et al. (2009); Scheel et al. (2009) developed by the SXS SXS () collaboration. Of the 100 cases determined in Sec. II, only 91 simulations were successfully completed333The main reason for failure is large constraint violation as the binary approaches merger. We believe a better gauge condition may be needed for some of these simulations.. These simulations have been assigned the identifiers SXS:BBH:1419 - SXS:BBH:1509, and will be made publicly available in the upcoming update of the SXS public catalog Boyle et al. (2019). For cases with equal mass, but unequal spins, we can exchange the two BHs to get an extra data point. There are 13 such cases, leading to a total of 104 NR waveforms. These are shown as circular markers in Fig. 2.

Figure 2: The parameter space covered by the 104 NR waveforms (circle markers) used in the construction of the surrogate model in Sec. VI. We also show the 9 long NR waveforms (square markers) used to test hybridization in Sec. VII.2, and the 8 NR waveforms (triangle markers) used to test extrapolation in Sec. VII.3. The axes show the mass ratio and the spin on the heavier BH, while the colors indicate the spin on the lighter BH. The black rectangle indicates the bounds of the training region: , .

The start time of these simulations varies between and before the peak of the waveform amplitude (defined in Eq. (38)), where is the total Christodoulou mass measured after the initial burst of junk radiation. The algorithm for choosing a fiducial time at which junk radiation ends is discussed in Ref. Boyle et al. (2019). The initial orbital parameters are chosen through an iterative procedure Buonanno et al. (2011) such that the orbits are quasicircular; the largest eccentricity for these simulations is , while the median value is . The waveforms are extracted at several extraction surfaces at varying finite radii form the origin and then extrapolated to future null infinity Boyle and Mroué (2009). Finally, the extrapolated waveforms are corrected to account for the initial drift of the center of mass Boyle (2016, a). The time steps during the simulations are chosen nonuniformly using an adaptive time-stepper Boyle et al. (2019). We interpolate these data to a uniform time step of ; this is dense enough to capture all frequencies of interest, including near merger.

Iv Inspiral waveforms

While NR provides accurate waveforms, computational constraints limit NR to only the late inspiral, merger, and ringdown phases. Fortunately, PN/EOB waveforms are expected to be accurate in the early inspiral. Hence we can “stitch” together an early inspiral waveform and an NR waveform, to get a hybrid waveform Santamaría et al. (2010); Ohme (2012); Ohme et al. (2011); MacDonald et al. (2011, 2013); Ajith et al. (2012); Varma et al. (2014); Boyle (2011); Bustillo et al. (2015) that spans the entire frequency range relevant for ground-based detectors. In this section, we describe the waveforms we use for the early inspiral, leaving the hybridization procedure for the next section.

iv.1 PN waveforms

We first generate PN waveforms as implemented in the GWFrames package Boyle (b). For the orbital phase we include nonspinning terms up to 4 PN order Blanchet et al. (2004); Blanchet (2014); Jaranowski and Schäfer (2013); Bini and Damour (2013, 2014) and spin terms up to 2.5 PN order Kidder (1995); Will and Wiseman (1996); Bohe et al. (2013). We use the TaylorT4 Boyle et al. (2007) approximant to generate the PN phase; however, as described below, we replace this phase with an EOB-derived phase. For the amplitudes, we include terms up to 3.5 PN order Blanchet et al. (2008); Faye et al. (2012, 2015).

The spherical harmonic modes of the PN waveform can be written (after rescaling to unit total mass and unit distance) as Blanchet et al. (2008); Blanchet (2014),


where is the symmetric mass ratio, is the characteristic speed that sets the perturbation scale in PN, is the (real) orbital phase, and are the complex amplitudes of different modes. Note that we ignore the tail distortions Blanchet and Schäfer (1993); Arun et al. (2004) to the orbital phase as these are 4 PN corrections (see e.g. Barkett et al. (2018)).

The complex strain is obtained as a time series from GWFrames. We can absorb the complex part of the amplitudes into the phases and rewrite the strain as


where and are the real amplitude and phase of a given mode, and is an offset that captures the complex part of . Note that Eqs. (6) and (7) together imply ; contains complex terms starting at 2.5PN, but these appear as 5PN corrections in the phase (see e.g. Barkett et al. (2018)), which we can safely ignore.

At this stage, , , and are functions of time. But they can be recast as functions of the characteristic speed by first computing


where the derivative is performed numerically, and then inverting Eq. (8) to obtain . Then we define


Note that the PN waveform is generated in the source frame defined such that the reference time is the initial time. This also ensures that the heavier BH is on the positive axis at the initial time, and the initial orbital phase is zero.

To summarize: From the GWFrames package, we obtain the complex time series (Eq. (5)). We compute the orbital phase (Eq. (7)), the real amplitudes (Eq. (9)), and the phase offsets (Eq. (10)). These three quantities are obtained as a time series but can be represented as functions of the characteristic speed using Eq. (8).

Figure 3: NR, PN (Sec. IV.1), and EOB-corrected PN (Sec. IV.2) waveforms for an example case. We show the and modes. The binary parameters are shown at the top of the plot. The EOB-corrected PN waveform stays faithful to the NR waveform until much later times, compared to the pure PN waveform.

iv.2 EOB correction

As was done in previous works Varma et al. (2014); Varma and Ajith (2017), we find that the accuracy of the inspiral waveform can be improved by replacing the PN phase with the phase derived from an NR-calibrated EOB model. For this work we use SEOBNRv4 Bohé et al. (2017).

SEOBNRv4 is a time domain model that includes only the mode, which we can decompose as follows:


where and are the real amplitude and phase of the mode. These are functions of time, but following the same procedure as earlier, they can be recast in terms of the characteristic speed:


where the derivative is performed numerically, and we invert Eq. (13) to obtain . We replace in Eqs. (9) and (10) to get, respectively, the EOB-corrected amplitudes and phase offsets:


Note that in practice, computing and is accomplished via an interpolation in : and as computed in Eqs. (9) and (10) are known only at particular values of , which are where are the times in the PN time series; we interpolate and to the points where are the times in the EOB time series. We use a cubic-spline interpolation scheme as implemented in Scipy Jones et al. (2001–).

Following Eq. (6), the EOB-corrected phases are given by:


where we use the EOB orbital phase from Eq. (12). Finally, our EOB-corrected inspiral waveform modes are given by:


Fig. 3 shows an example of PN and EOB-corrected waveforms along with the corresponding NR waveform. All three waveforms have the same starting orbital frequency and their initial orbital phase is set to zero. We see that the PN waveform becomes inaccurate at late times, as expected. The EOB-corrected waveform, on the other hand, remains faithful to the NR waveform until much later times.

V Hybridization

In this section we describe our procedure to “stitch” together an inspiral waveform (described in Sec. IV) to an NR waveform (described in Sec. III).

We start by generating inspiral and NR waveforms with the same component masses and spins. We note that the spins measured in SpEC simulations agree well with PN theory Ossokine et al. (2015). However, the PN and NR waveforms are typically represented in different coordinate systems that need to be aligned with each other as follows. The two coordinate systems are related to each other by a possible time translation and a possible rotation by three Euler angles: inclination angle , initial binary phase , and polarization angle . For nonprecessing BBH the first angle , is trivially specified by requiring that the z-axis is along the direction of orbital angular momentum. This leaves us with the freedom to vary and . We choose the hybridization frame and time shifts by minimizing a cost function in a suitable matching region; this is described in more detail below.

v.1 Choice of cost function

We use the following cost function when comparing two waveforms, and , in the matching region:


where and denote the start and end of the matching region, to be defined in Sec. V.3, and the sum does not include modes for reasons described in Sec. V.2. This cost function was introduced in Ref. Blackman et al. (2017a) and is shown to be related to the weighted average of the mismatch over the sky.

We minimize the cost function by varying the time and frame shifts between the NR and inspiral waveforms


We perform the time shifts on the inspiral waveform so that the matching region always corresponds to the same segment of the NR waveform. The frame shifts are performed on the NR waveform so as to preserve the initial frame alignment of the inspiral waveform (cf. Sec. IV.1). This alignment gets inherited by the hybrid waveform, and is important in the surrogate construction.

v.2 modes

We find that the modes of the inspiral waveforms do not agree very well with the NR waveforms. There are several possible reasons for this Favata (2010): (1) The NR waveform does not have the correct “memory” contribution since this depends on the entire history of the system starting at , while the NR simulation covers only the last few orbits. (2) The extrapolation to future null infinity does not work as well for these modes Boyle et al. (2019). This could be improved in the future with Cauchy Characteristic Extraction Handmer et al. (2015, 2016); Winicour (2009); Bishop et al. (1996). (3) The amplitude of these modes is very small except very close to merger; therefore the early part of the NR waveform where we compare with the inspiral waveforms is contaminated by numerical noise.

Therefore, when constructing the hybrid waveforms, we set the entire inspiral waveform to zero for these modes,


When computing the cost function (Eq. (18)), we ignore the modes.

This means that our hybrid waveforms for these modes are equivalent to the NR waveforms. In addition, the main contribution for these modes comes from the region close to merger, which does not correspond to a memory signal, but instead is due to axisymmetric excitations near merger (cf. bottom panel of Fig. 5).

v.3 Choice of matching region

There are several considerations to take into account when choosing a matching region for the cost function (Eq. (18)): (1) The NR and inspiral waveforms should agree with each other reasonably in this region; at early times the NR waveform is contaminated by junk radiation while at late times the inspiral waveform deviates from NR (cf. Figs. 3 and 5). (2) The matching region should be wide enough that the cost function is meaningful.

Our matching region starts at after the start of the NR waveform; we find that this is necessary to avoid noise due to junk radiation in some of the higher order modes. The length of the NR waveforms from the start of the matching region to the peak of the waveform amplitude varies between and . The width of the matching region is then chosen to be equal to the time taken for 3 orbits of the binary. We use the phase of the (2,2) mode of the NR waveform to determine this. This choice ensures the width of the matching region scales appropriately with the NR starting frequency, so that we get wider matching regions when the NR waveform starts early in the inspiral.

v.4 Allowed ranges for frame and time shifts

The allowed range for is [0,]. For nonprecessing binaries the allowed values for can be restricted by taking into account the symmetries of the system. We will show that this restriction is a consequence of the well-known relationship


between the modes and the modes for nonprecessing binaries. We compute the shifted waveform


Eq. (24) implies that the only allowed values for are and 444 is also allowed, but it is degenerate with .. If the inspiral waveform and the NR waveform have the same sign convention, then . Unfortunately, not all NR catalogs and PN-waveform codes use the same sign convention, so we allow the possibility of to account for this.

To set the allowed range for , we begin by computing the orbital frequency of the inspiral waveform, , as half the frequency of the (2,2) mode. Similarly, we compute the orbital frequency of the NR waveform, . We first time-align the NR and inspiral waveforms such that their frequencies match at the start of the matching region. This gives us a good starting point to vary the time shift.

We also define,


where is the NR frequency at the start of the matching region. The allowed range for time shifts is restricted to lie in the interval [, ], where , and are the times at which is equal to , and , respectively. In other words, the allowed range for is a region near . is the case when the frequencies of the inspiral and the NR waveforms match at , the start of the matching region. The lower (upper) limit for is chosen such that the inspiral waveform has a frequency equal to () times the NR frequency at .

The factors in Eqs. (26) and (27) are chosen such that the time shift that minimizes the cost function is always well within the range of allowed time shifts. Hence, choosing a wider range (i.e. values of these factors farther from unity) does not improve the hybridization procedure. Note also that, like the width of the matching region in Sec. V.3, setting the range of time shifts based on the orbital frequency ensures that it scales appropriately with the start frequency of the NR waveform.

The minimization in Eq. (19) is performed as follows. We vary the time shift over 500 uniformly spaced values in the above mentioned time range 555We find that increasing the number of time samples results in no noticeable improvement; the typical values of the cost function after minimization with 500 samples are , and using 1000 samples results in changes only of order .. For each of these time shifts , we try both allowed values of . For each and , we minimize the cost function over using the Nelder-Mead down-hill simplex minimization algorithm as implemented in Scipy Jones et al. (2001–). To avoid local minima in the minimization, we perform 10 searches with different initial guesses, which are sampled from a uniform random distribution in the range .

Figure 4: Left: The real part (top) and frequency (bottom) of the mode using the inertial frame stitching described in Sec. V.5.1. The binary parameters are shown on the top of the plot. The vertical red dashed lines indicate the matching region. Note that this plot shows the inspiral and NR waveforms after the time and frame shifts are performed. Right: Same, but using the amplitude-frequency stitching described in Sec. V.5.2. Now we see that the frequency of the hybrid waveform agrees much better with the NR and inspiral data.
Figure 5: An example hybrid waveform used in this work. We show the modes of the inspiral, NR and hybrid waveforms. The binary parameters are shown on the top of the plot. The vertical red dashed lines indicate the matching region. Note that this plot shows the inspiral and NR waveforms after the time and frame shifts are done.

v.5 Stitching NR and inspiral waveforms.

Having obtained the right frame and time shifts between the NR and inspiral waveforms, the final step is to smoothly stitch the inspiral waveform to the shifted NR waveform. The stitching is done using a smooth blending function:


where and take on the same values as those appearing in Eq. (18). Different blending functions have been proposed in the literature Ajith et al. (2008); MacDonald et al. (2011); Santamaría et al. (2010); Ajith et al. (2012). Our choice is equivalent to the blending function defined in Ref. MacDonald et al. (2011). We find that our results are not sensitive to the choice of blending function.

In what follows, for brevity, we drop the hybridization parameters , , with the understanding that the models are stitched together after transforming into hybridization frame,


Given the shifted waveforms and the blending function, there are still several ways in which one can stitch the waveforms together.

v.5.1 Inertial frame stitching

One could work with the complex waveform strain and define:


With this choice, by construction, the complex strain transitions smoothly from the inspiral part to the NR part over the matching region. However, the transition is more complicated for the frequency, since it involves time derivatives of the complex argument of the strain; the time derivatives of the blending function do not behave like a smooth blending function. This is demonstrated in the left panel of Fig. 4: the inspiral and NR frequencies agree well in the matching region but the frequency of the hybrid waveform deviates from this.

v.5.2 Amplitude-Frequency stitching

To avoid the undesirable artifacts described above, we choose to perform the inspiral-NR stitching using the amplitude and frequency rather than the inertial frame strain.

We begin by decomposing the NR and inspiral waveforms into their respective amplitude and phase:


The frequency of each mode


is then numerically computed from 4th-order finite difference approximations to the time derivative. Finally, we stitch the amplitude and frequency of each mode to get their hybrid versions:


To get the inertial frame strain we first need to integrate the frequency to get the phase. However, we already know the phase in the region before (only inspiral) and after (only NR) the matching region. So, we integrate the hybrid frequency


in the matching region using a 4th-order accurate Runge-Kutta scheme.

Finally, we set the phase of the hybrid waveform to,


where and are chosen such that is continuous at and .

Since, by construction, the frequency transitions smoothly from the inspiral-waveform to NR data, we eliminate the artifact seen in the bottom left panel of Fig 4 (dashed line), as demonstrated in the right panel of Fig. 4.

We note that since the modes are purely real/imaginary and nonoscillatory for nonprecessing systems, they do not have a frequency associated with them, therefore we use the inertial frame stitching of Sec. V.5.1 for these modes. For these modes the waveform goes from zero to the NR value over the matching region.

In the hybridized waveform we include the and modes, but not the or modes. For the and modes we find that the inspiral and NR waveforms do not agree very well. An example of the final NR, inspiral and hybrid waveforms is shown in Fig. 5.

Vi Building the surrogate model

Starting from the 104 NR waveforms described in Sec. II and Sec. III, we construct hybrid waveforms as described in Sec. V. In this section we describe our method to construct a surrogate model for these hybrid waveforms.

vi.1 Processing the training data

Before building a surrogate model, we process the hybrid waveforms as follows.

vi.1.1 Time shift

We shift the time arrays of the hybrid waveforms such that the peak of the total amplitude


occurs at for each waveform.

vi.1.2 Frequency and mass ranges of validity

The length of a hybrid waveform is set by choosing a starting orbital frequency for the inspiral waveform; we use for all waveforms. However, for the same starting frequency, the length in time of the waveform is different for different mass ratios and spins. Since we want to construct a time-domain surrogate model, we require a common time array for all hybrid waveforms. The initial time for the surrogate is determined by the shortest hybrid waveform in the training data set; this waveform begins at a time before the peak. We truncate all hybrid waveforms to this initial time value.

The largest starting orbital frequency among the truncated hybrid waveforms is , which sets the low frequency limit of validity of the surrogate. For LIGO, assuming a starting GW frequency of , the mode of the surrogate is valid for total masses . The highest spin-weighted spherical harmonic mode we include in the surrogate model is , for which the frequency is times that of the mode. Therefore, all modes of the surrogate are valid for . This coverage of total mass is sufficient to model all stellar mass binaries of interest for ground based detectors; for an equal mass binary neutron star system, the total mass is .

vi.1.3 Downsampling and common time samples

Because the hybrid waveforms are so long, it is not practical to sample the entire waveform with the same step size we use for the NR waveforms (). Fortunately, the early low-frequency portion of each waveform requires sparser sampling than the later high-frequency portion. We therefore down-sample the time arrays of the truncated hybrid waveforms to a common set of time samples. We choose these samples so that there are 5 points per orbit for the above-mentioned shortest hybrid waveform in the training data set, except for we choose uniform time samples separated by . This ensures that we have a denser sampling rate at late times when the frequency is higher. We retain times up to , which is sufficient to capture the entire ringdown.

Before downsampling, we first transform the waveform into the co-orbital frame, defined as:


where is the inertial frame hybrid waveform, is the orbital phase, and is the phase of the mode. The co-orbital frame can be thought of as roughly co-rotating with the binary, since we perform a time-dependent rotation given by the instantaneous orbital phase. Therefore the waveform is a slowly varying function of time in this frame, increasing the accuracy of interpolation to the chosen common time samples. For the mode we save the downsampled amplitude and phase , while for all other modes we save . We find that this down-sampling results in interpolation errors (defined in Eq. (18)) for all hybrid waveforms.

vi.1.4 Phase alignment

After down-sampling to the common temporal grid of the surrogate, we rotate the waveforms about the z-axis such that the orbital phase is zero at . Note that this by itself would fix the physical rotation up to a shift of . When generating the inspiral waveforms for hybridization, we align the system such that the heavier BH is on the positive axis at the initial frequency; this fixes the ambiguity. Therefore, after this phase rotation, the heavier BH is on the positive axis at for all waveforms666Here the BH positions at are defined from the waveform at future null infinity, using a phase rotation relative to the early inspiral where the BH positions are well-defined in PN theory; these positions do not necessarily correspond to the (gauge-dependent) coordinate BH positions in the NR simulation..

vi.2 Decomposing the data

It is much easier to build a model for slowly varying functions of time. Therefore, rather than work with the inertial frame strain , which is oscillatory, we work with simpler “waveform data pieces” , as explained below. We build a separate surrogate for each waveform data piece. When evaluating the full surrogate model, we first evaluate the surrogate of each data piece and then recombine the data pieces to get the inertial frame strain.

A common choice in literature when working with nonprecessing waveforms has been to decompose the complex strain into an amplitude and phase, each of which is a slowly varying function of time:


However, when and , the amplitude of odd- modes becomes zero due to symmetry. This means that the phase becomes meaningless, so one has to treat such cases separately. For example, Ref Blackman et al. (2015) used specialized basis functions for the odd- modes that captured the divergent behavior of the phase in the equal-mass limit.

To avoid this issue, instead of using the amplitude and phase we use the real and imaginary parts of the co-orbital frame strain , defined in Eq. (39), for all nonquadrupole modes. The co-orbital frame strain is always meaningful: in the special, symmetric case mentioned above, the co-orbital frame strain for the odd- modes just goes to zero, rather than diverge. For the mode we use the amplitude777Note that for the (2,2) mode . and phase .

As mentioned above, our hybrid waveforms are very long, typically containing orbits. This presents new challenges that are not present for pure-NR surrogates. For instance, sweeps over radians for a typical hybrid waveform. We find that the accuracy of the surrogate model at early times improves if we first subtract a PN-derived approximation to the phase, model the phase difference rather than , and then add back the PN contribution when evaluating the surrogate model. In particular, we use the leading order TaylorT3 approximant Damour et al. (2001). For this approximant, the phase is given as an analytic, closed-form, function of time. Therefore, even though TaylorT3 is known to be less accurate than some other approximants Buonanno et al. (2009), its speed makes it ideal for our purpose as we only need it to capture the general trend. At leading order, the TaylorT3 phase is given by:


where is an arbitrary integration constant, , is an arbitrary time offset, and is the symmetric mass ratio. Note that diverges at . We choose , long after the end of the waveform (recall that the peak is at ), to ensure that we are always far away from this divergence. We choose such that at ; this is the same time at which we align the hybrid phase in Sec. VI.1.4.

Instead of modeling , we model the residual


after removing the leading-order contribution . By construction, goes to zero at . We find that after removing the leading order TaylorT3 phase, the scale of for a typical hybrid is radians, as compared to radians for . In essence, this captures almost all of the phase evolution in the early inspiral, simplifying the problem of modeling the phase to the same as modeling the phase of late-inspiral NR waveforms. We stress that the exact form of (or its physical meaning) is not important, as long as it captures the general trend, since we add the exact same to our model of when evaluating the surrogate. In fact, we find that adding higher order PN terms in Eq. (43) does not improve the accuracy of the surrogate.

To summarize, we decompose the hybrid waveforms into the following waveform data pieces, each of which is a smooth, slowly varying function of time: (, ) for the (2,2) mode, and the real and imaginary parts of for all other modes888For modes of nonprecessing systems, is purely real (imaginary) for even (odd) , so we ignore the imaginary (real) part for modes..

vi.3 Building the surrogate

Once we have the waveform data pieces, we build a surrogate model for each data piece using the procedure outlined in Refs. Blackman et al. (2017a); Field et al. (2014), which we only briefly describe here. Note that the steps below are applied independently for each waveform data piece.

vi.3.1 Greedy basis

We first construct a greedy reduced-basis Field et al. (2011) such that the projection errors (cf. Eq. (5) of Ref. Blackman et al. (2017a)) for the entire data set onto this basis are below a given tolerance. For the basis tolerances we use for the data piece, for , and for all other data pieces. These are chosen through visual inspection of the basis functions to ensure they are not noisy, and based on the expected truncation error of the NR waveforms. For instance, we expect the error in phase to be about radians.

The greedy procedure is initialized with a single basis function as described in Ref. Blackman et al. (2017a). Then at each step in the greedy procedure, the waveform with the highest projection error onto the current basis is added to the basis. Previous work has shown that the resulting greedy reduced-basis is robust to different choices of initialization Herrmann et al. (2012). When computing the basis projection errors, we only include data up to after the peak. We find that this helps avoid noisy basis functions. This is particularly important for the phase data piece as this becomes meaningless at late times, when the waveform amplitude becomes very small.

vi.3.2 Empirical interpolation

Next, using a different greedy procedure, we construct an empirical interpolant Barrault et al. (2004); Maday et al. (2009); Hesthaven, Jan S. et al. (2014) in time. This picks out the most representative time nodes, where the number of time nodes is the same as the number of greedy basis functions.. We require that the start of the waveform always be included as a time node for all data pieces. This is a useful modeling choice because the magnitude of the waveform data pieces in the very early inspiral can be smaller than the basis tolerances mentioned above. By requiring the first index to be an empirical time node, we enforce an anchor point that ensures the waveform data piece has the right magnitude at the start of the waveform. Furthermore, we do not allow any empirical time nodes at times , since we expect this part to be dominated by noise (especially for the phase data piece).

Figure 6: Errors in NRHybSur3dq8 and SEOBNRv4HM when compared against hybrid waveforms. For NRHybSur3dq8, we show out-of-sample errors. Mismatches are computed at several points in the sky of the source frame using all available modes in each waveform: For the hybrid waveforms and NRHybSur3dq8, that is and , but not or . For SEOBNRv4HM that is , and . Left: Mismatches computed using a flat noise curve, but including only the late inspiral part of the waveforms, starting at before the peak. Therefore, we are essentially comparing only to the NR part of the hybrid waveforms. For comparison, we also show the NR resolution error, obtained by comparing the two highest available resolutions. The histograms are normalized such that the area under each curve is when integrated over (Mismatch). Right: Mismatches as a function of total mass, computed using the Advanced LIGO design sensitivity noise curve. Here we compare against the full hybrid waveforms. The solid (dashed) lines show the percentile (median) mismatch values over points on the sky as well as different hybrid waveforms.

vi.3.3 Parametric fits

Finally, for each time node, we construct a fit across the parameter space. The fits are done using the Gaussian process regression fitting method described in the supplemental material of Ref. Varma et al. (2018). Following Ref. Varma et al. (2018), we parameterize our fits using , , and . Here is the spin parameter entering the GW phase at leading order Khan et al. (2016); Ajith (2011); Cutler and Flanagan (1994); Poisson and Will (1995) in the PN expansion,


and is the “anti-symmetric spin”,


The fit accuracy, and as a result the accuracy of the surrogate model, improves noticeably when using , compared to or .

Figure 7: The plus polarization of the waveforms for the cases that result in the largest mismatch for NRHybSur3dq8 (top) and SEOBNRv4HM (bottom) in the left panel of Fig. 6. We also show the corresponding hybrid waveforms (labeled as NR as only the late part is shown). Each waveform is projected using all available modes for that model, along the direction which results in the largest mismatch for NRHybSur3dq8 (SEOBNRv4HM) in the top (bottom) panel. Note that NRHybSur3dq8 is evaluated using trial surrogates that are not trained using these cases. The binary parameters and the direction in the source frame are indicated in the inset text. All waveforms are time shifted such that the peak of the total waveform amplitude occurs at (using all available modes, according to Eq. (38)). Then the waveform modes are rotated about the axis such that the orbital phase is zero at .

vi.4 Evaluating the surrogate

When evaluating the surrogate waveform, we first evaluate each surrogate waveform data piece. Next, we compute the phase of the mode,


where is the surrogate model for and is given in Eq. (43). If the waveform is required at a uniform sampling rate, we interpolate each waveform data piece from the sparse time samples used to construct the model to the required time samples, using a cubic-spline interpolation scheme. Finally, we use Eqs. (39), (40), and (41) to reconstruct the surrogate prediction for the inertial frame strain.

Vii Results

In order to estimate the difference between two waveforms, and , we use the mismatch, defined in Eq. (2), but in this section instead of Eq. (3) we use the frequency-domain inner-product


where indicates the Fourier transform of the complex strain , indicates a complex conjugation, indicates the real part, and is the one-sided power spectral density of a GW detector. We taper the time domain waveform using a Planck window McKechan et al. (2010), and then zero-pad to the nearest power of two. We further zero-pad the waveform to increase the length by a factor of eight before performing the Fourier transform. The tapering at the start of the waveform is done over cycles of the mode. The tapering at the end is done over the last . Note that our model contains times up to after the peak of the waveform amplitude, and the signal has essentially died down by the last .

We compute mismatches following the procedure described in Appendix D of Ref. Blackman et al. (2017a): the mismatches are optimized over shifts in time, polarization angle, and initial orbital phase. Both plus and cross polarizations are treated on an equal footing by using a two-detector setup where one detector sees only the plus and the other only the cross polarization. We compute the mismatches at 37 points uniformly distributed on the sky in the source frame, and we use all available modes of a given waveform model.

When computing flat noise mismatches (), we take to be the frequency of the mode at the end of the initial tapering window, and , where is the frequency of the mode at its peak. This choice of ensures that we capture the peak frequencies of all modes considered in this work, including the mode, whose frequency has the highest multiple of the mode frequency of all the modes we model. We also compute mismatches with the Advanced-LIGO design sensitivity Zero-Detuned-HighP noise curve LIGO Scientific Collaboration (2011). For this we use and .

vii.1 Surrogate errors

We evaluate the accuracy of our new surrogate model, NRHybSur3dq8, by computing mismatches against hybrid waveforms. For this, we compute “out-of-sample” errors as follows. We first randomly divide the 104 training waveforms into groups of waveforms each. For each group, we build a trial surrogate using the remaining training waveforms and test against these five validation ones. We also compute the mismatch between an existing higher-mode waveform model, SEOBNRv4HM Cotesta et al. (2018), and the hybrid waveforms.

Figure 6 summarizes mismatches of both NRHybSur3dq8 and SEOBNRv4HM versus the hybrid waveforms. We use all available modes for each waveform model. In the left panel we show mismatches computed using a flat noise curve over the NR part of the hybrid waveforms (to do this, we truncate the waveforms and begin tapering at ). We see that the mismatches for NRHybSur3dq8 are about two orders of magnitude lower than that of SEOBNRv4HM. We compare this with the truncation error in the NR waveforms themselves, by computing the mismatch between the two highest available resolutions of each NR waveform. The errors in the surrogate model are well within the truncation error of the NR ssimulations. Note that NR error estimated in this manner is a conservative estimate; if we treat the high resolution simulation as the fiducial case, the NR curve in Fig. 6 can be thought of as the error in the lower resolution simulation. This explains why the errors in the surrogate are lower than the NR errors. We suspect that the error of the high resolution simulations is close to the surrogate model’s error.

The right panel of Fig. 6 shows mismatches computed using the Advanced LIGO design sensitivity noise curve. The mismatches are now dependent on the total mass of the system, so we show mismatches for masses starting at the lower limit of the range of validity of the surrogate: . percentile mismatches for NRHybSur3dq8, are always below in the mass range . At high masses (), where the merger-ringdown is more prominent, our model is more accurate than SEOBNRv4HM by roughly two orders of magnitude, in agreement with the left panel of Fig. 6.

For high masses only the last few orbits of the hybrid waveforms are in the LIGO band, and the hybrid waveforms are effectively the same as the NR waveforms. For low masses, the errors in the right panel of Fig. 6 quantify how well different models reproduce the hybrid waveforms. However, this comparison cannot account for the errors in the hybridization procedure itself. We provide some evidence for the fidelity of the hybrid waveforms in Sec. VII.2, by comparing against some long NR waveforms.

Fig. 7 shows NRHybSur3dq8 and SEOBNRv4HM waveforms for the cases leading to the largest errors in the left panel of Fig. 6. The surrogate shows very good agreement with the NR waveform, even for its worst case. SEOBNRv4HM shows a noticeably larger deviation that cannot all be accounted for with a time and/or phase shift. Note that we align the time and orbital phase of the waveforms in Fig. 7.

We note the main improvement over SEOBNRv4HM is not due to the inclusion of more modes. We find that the agreement between SEOBNRv4HM and the NR/hybrid waveforms in Figs. 6 and 7 improves only marginally when restricting the NR/hybrid waveforms to the same set of modes as SEOBNRv4HM.

vii.2 Hybridization errors

The errors described in Sec. VII.1 are computed by comparing the surrogate against hybrid waveforms, hence they do not include the errors in the hybridization procedure or the errors from EOB-corrected-PN waveforms (cf. Sec. IV.2) we use for the early inspiral. To estimate these errors, we compare the surrogate against a few very long NR simulations 999Note that for these long NR simulations, the outer boundary location is chosen based on the length of the simulations Boyle et al. (2019) so as to avoid unphysical center-of-mass accelerations seen in earlier long-duration runs Szilágyi et al. (2015).. We perform five new simulations that are long and two that are long. These have been assigned the identifiers SXS:BBH:1412 - SXS:BBH:1418, and will be made publicly available in the upcoming update of the SXS public catalog Boyle et al. (2019). In addition, we use two simulations of length from Ref. Kumar et al. (2015). These nine simulations are represented as square markers in Fig. 2, and have not been used in training the surrogate. The surrogate was trained against hybrid waveforms whose NR duration varied between and . Therefore, comparing against long NR waveforms, which include the early inspiral, is a good way to estimate the hybridization error.

Figure 8: Comparisons between the NRHybSur3dq8 surrogate model and a few NR waveforms of in duration. We also show the NR resolution error. percentile mismatches (over points in the sky) are shown as a function of total mass. The inset text indicates the mass ratio and component spins. Mismatches are computed using the Advanced LIGO design sensitivity noise curve. To best assess the error introduced by the hybridization procedure we use the same set of modes for the NR waveforms as the surrogate. At low masses, the hybridization errors (red circles) become less reliable measures of accuracy due to the large NR resolution error (black circles) itself. Fig. 9 describes a refined comparison to improve the assessment at low masses.
Figure 9: Errors in the NRHybSur3dq8 surrogate model against long NR waveforms, but only looking at segments of length individually. Each point represents one segment that ends at a specified number of orbits before the waveform peak, as plotted on the horizontal axis; Therefore, going from left to right in the figure, we plot segments that start earlier in the inspiral. We also show the NR resolution error in the same segments. The inset text indicates the mass ratio and component spins. We show percentile mismatches (over points in the sky), computed using a flat noise curve. We use the same set of modes for the NR waveforms as the surrogate. We find that, in general, the surrogate error is lower than or comparable to the NR resolution error throughout the inspiral.

We begin by repeating the mismatch computation from the right panel of Fig. 6, using the long NR waveforms. This is shown in Fig. 8. We also show the errors in the NR simulations, estimated by comparing the two highest available NR resolutions. We find that the mismatches between the surrogate and the long NR waveforms for are below , in agreement with Fig. 6. For lower masses, the mismatches quickly increase and can be as high as . However, this increase in mismatch is accompanied by an increase in the error of the NR waveforms. This is expected, since for very long NR waveforms the accumulated phase error is a dominant source of numerical error, which becomes increasingly relevant for low mass systems as more of the waveform moves in-band. Therefore, in Fig. 8, at low masses, the comparison between the surrogate and NR waveforms is largely dominated by the numerical resolution error of the long NR waveforms themselves.

We find that a better test of the hybridization procedure, one that is less sensitive to NR phase accumulation errors, is to compare against different segments of the NR waveform. Since the phase errors accumulate over a large number of cycles, by looking at smaller segments we ensure that this contribution is not the dominant error. To be precise, we compare the surrogate and the NR data, using segments of length ending at a particular number of orbits before the peak of the waveform. For each segment we compute mismatches at several points in the sky using a flat noise curve. By varying the number of orbits to the peak, we can cover the entire NR waveform including the early inspiral region where the surrogate depends on the hybridization procedure. These errors are shown in Fig. 9. We find that in each segment, the mismatch between the surrogate and the NR data is, in general, lower or comparable to the NR resolution error. Therefore, the surrogate reproduces the NR data accurately in the early inspiral and the hybridization errors are smaller than or comparable to the NR resolution error for these cases. We note that the surrogate errors in Fig. 9 depend on the length of the segment considered and are only meaningful when compared to the NR errors in the same segment.

Unfortunately, long NR simulations such as these are not available at regions of the parameter space where both mass ratio and spin magnitudes are large. These are the cases where PN is expected to perform poorly, so we expect larger hybridization errors for these cases.

vii.3 Extrapolation outside the training range

Figure 10: Errors in NRHybSur3dq8 when evaluated outside its training range. percentile mismatches (over points in the sky) are shown as a function of total mass for different extrapolated cases. These are computed using the Advanced LIGO design sensitivity noise curve. To best assess the error introduced by the extrapolation, we use the same set of modes for the NR waveforms as the surrogate. The labels indicate the mass ratio and component spins (). For comparison we reproduce the percentile mismatches for NRHybSur3dq8 within its training range from the right panel of Fig. 6.

We now investigate the efficacy of NRHybSur3dq8 to extrapolate beyond its training parameter range by comparing against SpEC NR simulations SXS Collaboration (); Kumar et al. (2015); Chu et al. (2016); Scheel et al. (2015); Chatziioannou et al. (2018) at larger mass ratios () and/or larger spin magnitudes ( or ). These NR simulations are represented as triangle markers in Fig. 2.

Fig. 10 shows mismatches for NRHybSur3dq8 when compared against these simulations. We find that surrogate extrapolates remarkably well, with the mismatch always below for all cases, which include mass ratios up to and spin magnitudes up to . However, the extrapolation errors can be about half an order of magnitude larger than errors within the training range. Note that NR simulations with both high mass ratios and high spin magnitudes are not currently available, and the ones used here represent the most extreme cases found in the SXS Catalog. We do not hybridize these simulations before comparing to NRHybSur3dq8 because several of them are too short. In Fig. 10, the minimum mass for each case is chosen to be the lowest mass at which all used modes of the NR simulation lie fully in the LIGO band with a low frequency cut-off of .

At much higher mass ratios than those tested here, such as , we find that the waveforms generated by the surrogate can have “glitches” in the time series. Therefore, we recommend the surrogate be used for and . However, we advise caution with any extrapolation in general.

vii.4 Mode mixing

Figure 11: Mode mixing between spherical harmonic modes is clearly seen in the ringdown signal of the NR waveform and is accurately reproduced by the surrogate. The absolute values of the Fourier transform of different spherical harmonic modes are shown as solid (dashed) curves for the surrogate (NR). The dotted vertical lines indicate the frequencies of the fundamental QNM overtone of these modes. The component parameters as well as the remnant mass and spin are shown in the text above the figure.

Numerical relativity waveforms are extracted as spin-weighted spherical harmonic modes Newman and Penrose (1966); Goldberg et al. (1967). However, in the ringdown regime, the natural basis to use is the spin-weighted spheroidal harmonic basis Teukolsky (1973, 1972). A spherical harmonic mode can be written as a linear combination of all spheroidal harmonic modes with the same index Berti and Klein (2014). Therefore, during the ringdown, we expect leakage of power between different spherical harmonic modes with the same . This is referred to as mode mixing.

Since the surrogate accurately reproduces the spherical harmonic modes from the NR simulations, it also captures this mode mixing. We demonstrate this for an example case in Fig. 11. Here we compute the Fourier transform of different spherical harmonic modes in the ringdown stage of the waveform. Before computing the Fourier transform, we first drop all data before , where corresponds to the peak of the waveform amplitude (cf. Eq. (38)). Then, we taper the data between and , as well as the last of the time series, using a Planck window McKechan et al. (2010). The tapering width at the start is chosen such that the remaining signal is dominated by the fundamental quasi-normal mode (QNM) overtone. Fig. 11 shows the absolute value of these Fourier transforms for different modes, for both the surrogate and the NR waveform. In addition, we show the frequency of the fundamental QNM overtone for each mode Berti and Klein ().

Note that the mode and the mode have the same index, the condition required for mode mixing. We see that the peak of the mode agrees with the QNM frequency as expected. For the mode however, while there are features of a peak at the expected QNM frequency, there is a much larger peak at the frequency of the mode. This is because some of the energy of the stronger mode has leaked into the mode due to mode mixing. Mode mixing can also be seen for the and modes, which also have the same index. Fig. 11 shows that not only does the surrogate agree with NR in the ringdown, it also reproduces the mode mixing present in the NR data.

vii.5 Evaluation cost

Figure 12: Evaluation cost for NRHybSur3dq8 including the cost of a Fast Fourier Transform. We show the cost for evaluating all 11 modes modeled by NRHybSur3dq8, as well as the cost of a single mode. These are compared to the evaluation cost of SEOBNRv4_ROM which includes only the mode. The evaluation cost is computed by averaging over 64 points uniformly distributed in the parameter space, and .

Figure 12 shows the evaluation cost for NRHybSur3dq8, at different total masses, starting at , and using a sampling rate of . This suggests that NRHybSur3dq8 is fast enough for direct use in parameter estimation. We also show the evaluation cost per mode. This is compared to the evaluation cost of SEOBNRv4_ROM Bohé et al. (2017), a Fourier domain Reduced Order Model (ROM) version of SEOBNRv4. Note that SEOBNRv4_ROM models only the mode. Comparing the cost for SEOBNRv4_ROM to the single-mode cost of NRHybSur3dq8 suggests that the evaluation cost of NRHybSur3dq8 can be reduced by a factor of by building a Fourier domain ROM. These tests were performed on a single core on a 3.1 GHz Intel Core i5 processor. SEOBNRv4_ROM was evaluated using the C implementation in the LIGO Algorithm Library LIGO Scientific Collaboration (2018). NRHybSur3dq8 was evaluated using the Python implementation in gwsurrogate Blackman et al. (), therefore further speed-up could be possible with a C implementation.

Viii Conclusion

We present NRHybSur3dq8, the first NR-based surrogate waveform model that spans the entire LIGO bandwidth, valid for stellar mass binaries with total masses . This model is trained on 104 NR-PN/EOB hybrid waveforms of nonprecessing quasicircular BBH systems with mass ratios , and spin magnitudes . The parametric fits for this model are performed using Gaussian process regression. This model includes the following spin-weighted spherical harmonic modes: and , but not or . We make our model available publicly through the easy-to-use Python package gwsurrogate Blackman et al. ().

Through a cross-validation study, we show that the surrogate accurately reproduces the hybrid waveforms. The mismatch between them is always less than for total masses . For high masses (), where the merger-ringdown is more prominent, we show roughly a 2 orders of magnitude improvement over the current state-of-the-art model with nonqudrupole modes, SEOBNRv4HM Cotesta et al. (2018).

By comparing against several long NR simulations, we show that the errors in our hybridization procedure are comparable or lower than the resolution error in current NR simulations. In addition, by comparing against available NR simulations at higher mass ratios and spins, we show that our model extrapolates reasonably well outside its training range. Based on these tests, we are cautiously optimistic that the surrogate can be used for and , and we leave a more detailed investigation for future work.

viii.1 Future work

While our tests of the hybridization procedure are encouraging, long NR simulations are available only for low mass ratios and low spin magnitudes. Therefore, we have no means to test hybridization at high mass rations and/or high spins, where PN is expected to perform poorly. An improved surrogate model and refined study of the hybridization errors will require longer inspiral waveforms with greater coverage of the parameter space.

Another extension of interest is towards larger mass ratios and spin magnitudes. While the surrogate extrapolates very well when compared to available simulations at larger mass ratios and spins, no NR simulations are available with both large mass ratios () and large spins (). Therefore, our model is untested in that parameter space and it might be necessary to add training points there. The model could also be extended to include precession and/or eccentricity, however this is more challenging because of the enlarged parameter space as well as more complicated hybridization.

Finally, as mentioned in Sec. VII.5, the evaluation time of NRHybSur3dq8 can likely be reduced by constructing a Fourier domain ROM Pürrer (2014) of the time-domain model.

We leave these explorations to future work.

We thank Matt Giesler for helping carry out the new SpEC simulations used in this work. We thank Michael Boyle, Kevin Barkett, Matt Giesler, Yanbei Chen, and Saul Teukolsky for useful discussions. We thank Patricia Schmidt for careful and detailed feedback on an earlier draft of this manuscrtipt. V.V. and M.S. are supported by the Sherman Fairchild Foundation, and NSF grants PHY–170212 and PHY–1708213 at Caltech. S.E.F is partially supported by NSF grant PHY-1806665. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1548562. This research is part of the Blue Waters sustained-petascale computing project, which is supported by the National Science Foundation (awards OCI-0725070 and ACI-1238993) and the state of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications. Simulations were performed on NSF/NCSA Blue Waters under allocation NSF PRAC–1713694; on the Wheeler cluster at Caltech, which is supported by the Sherman Fairchild Foundation and by Caltech; and on XSEDE resources Bridges at the Pittsburgh Supercomputing Center, Comet at the San Diego Supercomputer Center, and Stampede and Stampede2 at the Texas Advanced Computing Center, through allocation TG-PHY990007N. Computations for building the model were performed on Wheeler and Stampede2.



Comments 0
Request Comment
You are adding the first comment!
How to quickly get a good reply:
  • Give credit where it’s due by listing out the positive aspects of a paper before getting into which changes should be made.
  • Be specific in your critique, and provide supporting evidence with appropriate references to substantiate general statements.
  • Your comment should inspire ideas to flow and help the author improves the paper.

The better we are at sharing our knowledge with each other, the faster we move forward.
The feedback must be of minimum 40 characters and the title a minimum of 5 characters
Add comment
Loading ...
This is a comment super asjknd jkasnjk adsnkj
The feedback must be of minumum 40 characters
The feedback must be of minumum 40 characters

You are asking your first question!
How to quickly get a good answer:
  • Keep your question short and to the point
  • Check for grammar or spelling errors.
  • Phrase it like a question
Test description