Enabling high confidence detections of gravitationalwave bursts
Abstract
With the advanced LIGO and Virgo detectors taking observations the detection of gravitational waves is expected within the next few years. Extracting astrophysical information from gravitational wave detections is a wellposed problem and thoroughly studied when detailed models for the waveforms are available. However, one motivation for the field of gravitational wave astronomy is the potential for new discoveries. Recognizing and characterizing unanticipated signals requires data analysis techniques which do not depend on theoretical predictions for the gravitational waveform. Past searches for shortduration unmodeled gravitational wave signals have been hampered by transient noise artifacts, or “glitches,” in the detectors. In some cases, even high signaltonoise simulated astrophysical signals have proven difficult to distinguish from glitches, so that essentially any plausible signal could be detected with at most 23 level confidence. We have put forth the BayesWave algorithm to differentiate between generic gravitational wave transients and glitches, and to provide robust waveform reconstruction and characterization of the astrophysical signals. Here we study BayesWave’s capabilities for rejecting glitches while assigning high confidence to detection candidates through analytic approximations to the Bayesian evidence. Analytic results are tested with numerical experiments by adding simulated gravitational wave transient signals to LIGO data collected between 2009 and 2010 and found to be in good agreement.
I Introduction
When the LIGO Abbott et al. (2009a) and Virgo Accadia et al. (2012) observatories make their first detection of gravitational waves (GW) it will represent a major achievement. Making a claim of a significant discovery requires exceptional evidence. In the field of particle physics, a common practice for declaring detection of a new particle is a “5sigma” level of confidence, meaning that there is probability of less than of the observation arising from sources other than the claimed discovery.
Detailed theoretical predictions for the gravitational wave signal helps reduce the false alarm (or false positive) rate due to glitches (Abbott et al., 2009b; Blackburn et al., 2008; Abadie et al., 2010; Aasi et al., 2012) but searches for generic transient signals, known as GW bursts, have been hampered by an inability to distinguish nonGaussian noise artifacts, or “glitches,” and astrophysical signals at high confidence (e.g. Ref. (Abbott et al., 2009c)). Background distributions for burst searches, determined by timeshifting the data from multiple detectors so that no coherent gravitational signals are in the data, show a long tail to high signal to noise ratio (SNR), meaning that even a very strong gravitational wave signal would be consistent with having arisen from a glitch. In both the first and second joint LIGOVirgo observation runs, simulated signals intentionally added to the data and included in the final analysis were recovered by Burst algorithms with false alarm probabilities of order 10% and could not be identified as statistically significant events without using physically motivated waveform models (Abbott et al., 2009d; Abadie et al., 2012).
In preparation for the advanced detector era several new approaches to the burst detection problem have been developed. Thrane and Coughlin Thrane and Coughlin (2015) have demonstrated the capability to make highconfidence detections of longduration () burst signals in nonstationary, nonGaussian noise by searching for excess power found along parameterized curves through a timefrequency representation of the data. In an independent effort, the Bayesian parameter estimation analysis library LALInference Veitch et al. (2015), originally designed for the characterization of compact binary signals, has been adapted for burst analyses by using a sineGaussian waveform as the gravitational wave template Essick et al. (2014); Lynch et al. (). LALInference differentiates between signals and glitches using a “coherence test” where the “coherent” signal hypothesis uses a templatebased analysis assuming the data streams from multiple detectors contain a coherent gravitational wave signal while the “incoherent” glitch hypothesis treats each data stream independently. The incoherent model uses the same template waveform as the signal model but optimizes its parameters independently for each detectors’ data Veitch and Vecchio (2010).
Recently we proposed BayesWave–a Bayesian algorithm to followup short duration ( s) candidate gravitational wave transient events, separate signals from glitches, and provide robust signal characterization for arbitrary burst waveforms Cornish and Littenberg (2015). BayesWave uses a variabledimension model for signals and/or glitches enabling the analyses to adapt the complexity of the waveform model to match what is present in the data instead of imposing a template waveform and searching for best fit parameters. For a detection candidate BayesWave computes the relative evidence of the event being produced by a GW signal, an instrument artifact, or statistical fluctuations of the detector’s Gaussian noise. In the event that the candidate is of astrophysical origin, BayesWave also produces posterior distributions for the source skylocation and orientation, accurate waveform reconstruction, and metrics to characterize the signal such as duration, bandwidth, signal energy, etc. In all instances, BayesWave characterizes the instrument behavior including spectral estimation for the background Gaussian noise and glitch reconstructions which can then be used to feedback into the neverending effort to improve the interferometers’ performance. Analysis of the Gaussian component of the instrument noise is handled by BayesWave’s sibling algorithm, BayesLine Littenberg and Cornish (2015). During the first Advanced LIGO observing run BayesWave is being utilized as a followup analysis to candidate and background events found by the coherent WaveBurst algorithm Klimenko et al. (2008).
In this paper we will demonstrate BayesWave’s potential by analyzing data from the sixth LIGO science run (S6) which took place from 20092010. Our results are achieved by analyzing data known to contain glitches which contributed to the longtailed background distribution for the burst search, and by adding simulated gravitational wave signals to detector noise. In addition to this study using archived data, we present an analytical framework for understanding the performance of the pipeline. A companion paper uses BayesWave and the flagship burst search algorithm, coherent WaveBurst Klimenko et al. (2008), in an endtoend demonstration of how burst detection efficiency is improved by the joint analysis Kanner et al. (2015).
In section II we briefly describe the BayesWave algorithm, Bayesian model selection, and our model for the data. In section III we go through a simple analytic calculation to give insight into how BayesWave is able to distinguish signals and glitches, and use BayesWave’s performance on simulated signals added to real data to support the analytic approximations. Section IV uses the intuition built from the analytics to estimate background rates for glitches to be considered signals by BayesWave, and connects the Bayes factor to false alarm rates for detections. We summarize the work in section V. The appendix contains a more detailed derivation of the analytic approximation to the evidence.
Ii Method
Searches for Burst signals have been based on frameworks that employ detection statistics to measure the likelihood that Gaussian noise could produce the data (Chatterji et al., 2004; Chatterji, 2005; Chatterji et al., 2006; Klimenko et al., 2008; Sutton et al., 2010). While stationary Gaussian noise is often a good description for LIGO/Virgo data, the approximation breaks down with much higher regularity than the arrival of detectable gravitational waves. Any data analysis method must account for the possibility of nonstationary nonGaussian noise. Most existing analysis strategies apply various selection cuts to separate glitches from astrophysical signals which are tuned by adjusting thresholds to minimize the estimated background rate of transient noise glitches Abbott et al. (2009d, c); Abadie et al. (2012).
Bayesian hypothesis testing has been used in searches for GWs from a timing glitch in the Vela pulsar Abadie et al. (2011) using a damped sinusoid that abruptly starts at times associated with the pulsar timing glitch as the signal model. A recently developed search pipeline Lynch et al. () uses excess power to identify interesting data segments and a matchedfiltering followup with a sineGaussian template for signal characterization Essick et al. (2014). The “coherent vs. incoherent” Bayes factor is used to distinguish between noise and signal Veitch and Vecchio (2010).
BayesWave employs a different approach by using a parameterized model for the LIGO/Virgo data, noise and signal included, and forward modeling, i.e. predicting, the detector output. The data model has three distinct components: A gravitational wave signal that is elliptically polarized and is coherent across the network of detectors; glitches that are independent in each interferometer; and stationary Gaussian noise which is fully characterized by its power spectral density as modeled by BayesLine Littenberg and Cornish (2015). At its core, BayesWave is a Markov chain Monte Carlo (MCMC) algorithm Gamerman (1997). BayesWave uses parallel tempering Swendsen and Wang (1986) and thermodynamic integration Goggans and Chi (2004) to compute the evidence for each model. The MCMC implementation and evidence calculation is described in detail in Refs. (Cornish and Littenberg, 2015; Littenberg and Cornish, 2015). For results in this work we utilize an adaptive temperature scheme as suggested in Vousden et al. (2015).
Because we do not know a priori the functional form of glitch or GW burst waveforms, our model for both must be flexible. We use a linear combination of MorletGabor wavelets as our waveform model where the number of wavelets included in the linear combination, , is itself a model parameter. Each basis function (wavelet) is described by parameter vector with components for central frequency ; central time ; amplitude ; quality factor ; and phase offset . A wavelet is expressed in the time domain as:
(1) 
where and . BayesWave uses a reversible jump Markov chain Monte Carlo Green (2003) to marginalize over the number of wavelets needed for the model to be consistent with the data.
ii.1 Bayesian hypothesis testing or model selection
The likelihood that hypothesis , parameterized by , would have produced the data is calculated by
(2) 
Directly integrating Eq 2 is seldom practical and a wide variety of alternative means for arriving at have been devised. Our method of choice for computing the integral in Eq. 2 is thermodynamic integration Goggans and Chi (2004).
Once the evidence has been computed it provides a relative measure of how well one hypothesis is supported by the data over another through the “odds ratio”
(3) 
where is the prior probability for the hypothesis and is the likelihood ratio, or “Bayes factor,” for the two hypotheses.
ii.2 Modeling signals versus glitches
Consider a GW network consisting of the two LIGO detectors in Hanford, WA (H) and Livingston, LA (L). For a candidate event BayesWave calculates the Bayesian evidence for each of three models: signal, glitch, or Gaussian noise. We can then use the Bayes factor between any two models to quantify the degree of supporting evidence for one model over the other. Within each model the likelihood is computed by
(4) 
where is the residual of the data minus the signal or glitch model, is the noise weighted inner product, is the noise power spectral density estimated from the data by BayesLine, and is the duration of the data. This work will focus only on examples where we need to distinguish between the signal and glitch models. We assume either will be preferred over the Gaussian noise model.
ii.2.1 : The glitch model :
The data contain Gaussian noise and glitches independent in each detector . The parameters are comprised of independent sets of intrinsic parameters
which determine the shape of each wavelet. The glitch model is computed for each detector as an independent linear combination of wavelets
where is the Fourier transform of , can take on any value between with the caveat that at least one wavelet must be used in the model for the whole network. is typically 20. The glitchmodel likelihood is computed using Eq. 4 with the residual
ii.2.2 : The signal model :
The data contain Gaussian noise and an elliptically polarized gravitational wave signal coherent across the network of detectors. The parameters are a common set of intrinsic parameters
referenced at the center of the Earth and four “extrinsic parameters”
which define the skylocation , the polarization angle and an ellipticity parameter relating the two gravitational wave polarizations and . The signalmodel likelihood is computed using Eq. 4 with the residual
The geocenter signal wavelets are projected onto the network using each detector’s unique time delay operators , and antenna beam pattern response functions , Anderson et al. (2001):
(5) 
Iii Distinguishing signals from glitches
While BayesWave uses a computationally expensive numerical integration to compute the evidence for each model, we will build intuition for how BayesWave successfully distinguishes signals from glitches using the Laplace approximation to the evidence and several simplifying assumptions about the model and the data. As our results will show, the simple analytic treatment derived here leads to useful approximations for when signals and glitches are distinguishable and in forecasting the most significant background event. A more detailed derivation and discussion of the Laplace approximation to BayesWave’s signal and glitch model evidence can be found in the appendix.
iii.1 LaplaceFisher approximation to the evidence
If an event has enough SNR to be a strong candidate for detection () the integrand of Equation (2) will be sharply peaked around the maximum a posteriori (MAP) parameter values of the model . The evidence can be estimated as
(6) 
which is the product of the MAP likelihood , the prior evaluated at the MAP parameters, and the determinant of the parameter covariance matrix which is a measure of the posterior volume. is the dimension of the model. The covariance matrix can be approximated by the inverse of the Fisher information matrix , and we replace with .
The term is the “Occam factor” that penalizes the likelihood by the model’s size. If two models achieve the same likelihood the Occam factor, and therefore the evidence, will be smaller for the model that requires more (constrained) parameters to achieve that fit. Consider a simple model with a single parameter and uniform prior over an interval . The covariance matrix is simply the variance of the likelihood . In this case the Occam factor is proportional to which leads to a simple, intuitive, interpretation: The Occam factor is the fraction of the prior taken up by the posterior. We will return to this interpretation when predicting the most significant background event for BayesWave.
For the glitch or signal model, the expectation value for the intrinsic parameter log likelihood is proportional to Röver (2007)
(7) 
For uniform priors where is the volume of the intrinsic parameter space. BayesWave uses uniform priors for all but the amplitude parameter, with a function of the wavelet’s SNR Cornish and Littenberg (2015). For simplicity we will neglect the parameterdependence of the amplitude prior in favor of the simpler scaling. A similar but more detailed derivation including the true amplitude used by BayesWave can be found in the appendix.
The determinant of the intrinsic parameter Fisher matrix for a single wavelet is
(8) 
If we assume little overlap between wavelets in the parameter space the correlations between wavelet parameters are negligible and the Fisher matrix is block diagonal. The determinant for the full covariance matrix with wavelets is then Cornish and Littenberg (2015)
(9) 
Neglecting the extrinsic parameters for the signal model, and the BayesLine parameters which are common to all models, the dimension where is the number of wavelets used in the fit. To simplify the expression we define to absorb the and additional factors of 2 and . Now the log evidence becomes
(10) 
From this expression we see that the Bayes factor for either the glitch or signal model versus the Gaussian noise model goes as .
For the glitch model, the prior and posterior volume terms are summed over the number of detectors () in the network. The signal model, on the other hand, picks up an additional and Occam factor term for the extrinsic parameters which govern the projection of the signal onto the network. is the extrinsic parameter dimension, is the signal parameter covariance matrix, and is the extrinsic parameter prior volume. Including these details into Eq. 10 we find the log evidence for the glitch and signal models is
(11) 
respectively, where and the extrinsic parameter dimension while the prior volume for extrinsic parameters is .
iii.2 Two detector network
Consider a fairly loud gravitational wave signal in the two detector LIGO network. The optimal extrinsic parameters for detection will result in similar signal strength in each of the interferometers such that where the index is for each wavelet and the index is for each detector. For such events the glitch model will use similar wavelets as the signal model , but because it treats each detector independently, will need two copies–one for each interferometer . One final simplifying assumption is that the signal to noise ratio of each wavelet is the same: .
Substituting these simplifications into Eq. III.1 we arrive at a simple expression for the log Bayes factor :
(12) 
and immediately see that . As a consequence, at fixed SNR, waveform morphologies that require more wavelets to reconstruct have a higher likelihood of being classified as signals. This is an important departure from traditional SNRbased ranking statistics. The Bayes factor computed by BayesWave is more sensitive to signal complexity than signal strength. Heuristically, the naturally encodes how increasingly unlikely it is for the detectors to simultaneously and coherently produce glitches with nontrivial timefrequency structure. This is a significant difference from existing Burst pipelines which put greater emphasis on signal strength in forming their detection statistic, and are thus hamstrung by the detectors’ tendency to produce loud noise transients at a higher rate than the universe supplies gravitational wave signals. We find this fundamental difference allows BayesWave to assign detection candidates high confidence in data prone to loud glitches while existing pipelines have not.
iii.3 Single wavelet examples in simulated noise
To verify the predictions from the Laplace approximation we used BayesWave to recover simulated sine Gaussian gravitational wave signals in Gaussian noise, drawing waveform parameters from our prior distributions: Hz, s, , rad, , rad, rad, , and amplitude drawn from the distribution described in the appendix and Ref. Cornish and Littenberg (2015). For this study we analyze segments of LIGO data collected during the sixth science run which took place from 20092010 in which we have purposefully added GW signals. The priors used for this analysis reflect what is being used for lowfrequency triggers in the first advanced LIGO observing run (O1) during which BayesWave relies on the coherent WaveBurst pipeline to provide the segments of data which warrant followup analysis (for details see Refs. Klimenko et al. (2008); Kanner et al. (2015)).
BayesWave calculates Bayes factors for each combination of models along with an estimate of the error in that calculation, using thermodynamic integration. We do not anticipate the agreement between numerical simulations and the analytic approximations to be perfect. Many of the approximations we have made along the way to arrive at Eqs. 10 and 12 are known to be inadequate for the gravitational wave detection problem Cornish and Littenberg (2007), particularly our use of the covariance matrix to estimate the posterior volume and, even more egregiously, our use of the Fisher matrix as the inverse covariance matrix Vallisneri (2008).
Fisher matrix approximations are particularly bad for . The extrinsic parameter space is rife with degeneracies between parameters and nonGaussian, multimodal likelihood distributions which often span the full extent of the prior range. Fisher matrix arguments would predict a scaling for the determinant of which is much too strong for any burst source in a two detector network. Using numerical experiments to get a rough understanding of the extrinsic parameter posterior volume, we find an with ranging from at low SNRs to at (See Fig. 6 in the appendix).
In Figure 1 the left panel shows the glitch to noise (red) and signal to glitch (blue) log Bayes factors as a function of the simulated signals’ SNR along with the Laplace approximation predictions. The predicted scaling laws for signals and are generally obeyed by the numerical results. The observed agreement reinforces the intuition developed from considering the analytic expressions, and we can be confident that the numerical integration is performing well. The right panel demonstrates BayesWave’s glitch rejection capabilities by comparing for simulated sineGaussian glitches (gray crosses) and signals (blue circles). The glitches were simulated by adding sineGaussians to each detector with parameters drawn independently from the prior. Negative corresponds to data with higher likelihood for the glitch model.
iii.4 Multiple wavelet examples in real noise
Equation 12 predicts that the Bayes factor grows with SNR more rapidly for waveforms that have more timefrequency structure, thus requiring more wavelets to account for all of the excess power in the data. For astrophysical signals the number of wavelets necessary will not be known a priori, and furthermore will not be constant, depending on the SNR. As the signal strength increases, more detailed structure in the waveforms will be detectable, and more wavelets will be favored by the model selection. Through numerical experiments we find simple relationships for the number of basis functions and the average SNR per wavelet in terms of the true SNR:
(13) 
where the coefficients and index are different for different kinds of signals with corresponding to sine Gaussian waveforms and and increasing while decreases with increasing signal complexity (see Fig. 5 in the appendix).
To demonstrate this important feature of BayesWave we add simulated gravitational wave signals from different waveform families into real detector data. Figure 2 shows as a function of SNR for the different simulations. Red points are sine Gaussian waveforms, blue points correspond to signals from the merger of two 50 M black holes modeled using nonspinning Effective One Body (EOB) waveforms Buonanno et al. (2007), and the black points are results from “white noise bursts”–unpolarized, bandlimited, white noise signals used to test LIGO/Virgo burst detection pipelines. We can empirically determine that is larger for more complicated signal morphologies. Results agree well enough with the analytic predictions that the insight gained in the analytic study is applicable, but the Laplace approximation is clearly no substitute for the numerical integration. The large scatter in Bayes factors is due to failings in the Laplace approximation, signals that violate our assumption about roughly equal SNR in each detector, and segments of data that contain both signals and glitches.
It is important to note that the high degree of scatter in the white noise burst results is also to be expected because these signals are unpolarized, while BayesWave assumes and are related by Eq. II.2.2. In a two detector network we generally cannot reliably measure the GW polarizations independently. Introducing the additional degrees of freedom to independently solve for and will hinder our ability to reject glitches because the number of signal model and glitch model parameters will be comparable for a wider variety of waveform morphologies. While there is no reason to expect a priori that GW bursts will be elliptically polarized, selection effects by the detection pipelines which identify segments of data for BayesWave to followup in a real analysis, and the similar orientation of the LIGO detectors, favor signals which are well approximated by a single polarization (causing many of the degeneracies between extrinsic parameters discussed in the previous section). This assumption will need to be relaxed when more detectors are added to the network, and in future studies we will investigate strategies for optimizing BayesWave’s performance on unpolarized detection candidates even in the two detector case.
Iv Background estimation
We have shown that BayesWave predictably favors the signal model over the glitch model for simulated GW events, i.e. BayesWave is robust against false dismissal of gravitational wave signals. This is only half of the battle: Any useful data analysis procedure must also be robust against false alarms, i.e. misidentifying noise events as being astrophysical signals, and be able to assign significance to a detection. While the right panel of Figure 1 demonstrates how BayesWave can reject glitches in the trivial case of random sineGaussian waveforms, how it will fair against real glitches, and how to assign significance to candidate events, requires more careful attention.
To understand BayesWave’s glitch rejection capabilities, imagine that a glitch waveform in LIGO Hanford () is well represented by a linear combination of wavelets with parameters and a coincident (i.e. within the light travel time between detectors) collection of wavelets is found in LIGO Livingston (). If the signal is astrophysical in nature, the waveform in must have parameters that are consistent with , within the measurement uncertainties up to the appropriate time, phase, and amplitude shifts due to the geometry of the detector locations and orientations. On the other hand, if the data represent coincident glitches, then a priori there is no reason for the glitch in Livingston to match the parameters in Hanford. Instead, the wavelet in Livingston is chosen at random. One can consider glitches to be random draws from space and false alarms (glitches that appear as signals) are draws that overlap within the size . If the posteriors do not overlap the data is not consistent with the signal model, i.e. the signal model likelihood will be lower than the glitch model likelihood, and the Bayes Factor will favor the glitch model (c.f. Figure 1).
We can use the same logic to estimate the background rate of glitches that are consistent with the signal model. Assume, in a given two detector data set, there are coincident glitches. If we assume our signal/glitch model can achieve a perfect match to glitches in the data, the recovered SNR of the signal and the glitch model will be equal when the glitches overlap in the space and the Bayes factor will be again well approximated by Eq. 12.
Recall the Occam factor is interpreted as the fraction of the prior covered by the posterior , i.e. the Occam factor is the size of the “target” the second glitch must hit to be misidentified as a signal. Put another way, a glitch has probability to be consistent with the signal model. Therefore finding a background event with a Bayes factor consistent with Eq 12 will require analysis of something like coincident glitches. In our application the Occam factor thus takes on an additional interpretation as the expected number of trials (coincident noise transients) needed for two random glitches to have sufficient overlap in parameter space to look like a signal.
We can loosely turn this into an argument for the maximum Bayes factor–the one that occurs only once in a span of LIGO data–as having an Occam factor of , i.e. the maximum Bayes factor for a background noise event is
(14) 
This limit is not robust. The loudest noise event is obviously in the extreme tail of the background distribution and will therefore fluctuate wildly for different realizations of the data. Nor is this a statement about the population of glitches beyond the assumption that the parameters are chosen at random for glitches in each detector. It is also important to point out this is may be a conservative estimate. Most glitches are at low SNR in any realistic glitch population, and so low values of the Occam Factor will likely be much more common than high values.
We use our estimate of the most significant background event to approximate the false alarm rate. To do so we need to know the rate of coincident glitches, , which is a carefully studied quantity within LIGO. The single detector glitch rate was known during S6 to typically have values between 1 and 0.1 Hz Aasi et al. (2015). The light travel time between LIGO detectors is 10 ms, leading to a coincident glitch rate of .
False alarm rates are estimated by analyzing timeshifted data, or “time slides.” If the data from one detector is shifted by more than the light travel time to another detector, there will be no coincident gravitational wave signals. Because the rate of glitches completely dominates the rate of GW signals, analyzing timeshifted data all but guarantees that any coincidences are due to noise artifacts.
Consider the last quarter of LIGO’s sixth science run (S6D) which lasted for days. A socalled “three sigma” detection requires an event more significant than any background coincidences found in time slides. The background estimate from 300 time slides corresponds to 40 years of data, and . Equation 14 predicts that events with would be detected with better than threesigma confidence.
To test this prediction we compute the Bayes factors for the coincident events in time slides of the S6D data found by the coherent WaveBurst algorithm Klimenko et al. (2008). Figure 3 shows the cumulative glitch rate as a function of i.e. the yaxis is the rate at which coincident glitches were found with Bayes factors greater than the corresponding value on the xaxis. The distribution steeply decreases with increasing Bayes factor, and does not show evidence of levelingoff with a broad “tail” in the background that has limited previous searches. See Ref. Kanner et al. (2015) for a detailed study of how BayesWave can improve detection confidence of existing burst searches. Furthermore, the distribution ends at which is consistent with our analytic prediction for the background. Ultimately we should be able to turn arguments about the expected background rate into a prior odds ratio between the glitch and signal model. For the immediate future we elect to take a more conservative approach and continue using background studies to estimate the false alarm rate and therefore the detection significance. There is no guarantee that the nonGaussian noise in future GW data will bear any resemblance to what was found during S6.
Comparing our inferred background rate to Figure 2 we find that sineGaussian waveforms in a two detector network will be detected at false alarm rates that suggest marginal significance at any reasonable SNR, similar to performance seen in past burst searches. However, unlike previous burst searches, we find that IMBH and white noise bursts are detectable with very high significance. Figure 5 in the Appendix shows the number of wavelets used to recover each waveform morphology as a function of injected SNR and provides supporting evidence that waveforms that require more wavelets typically provide higher Bayes factors.
What is required for a high confidence, or “fivesigma,” detection? For this case, we seek a pvalue of less than , and so demand our event be louder than the loudest event in time slides. For S6D this leads to , and an expected loudest event . We have already seen that single wavelet events can not reach this level at any reasonable SNR but applying the scaling law in Equation 12, we find that such a “goldplated” detection could be achieved at reasonable SNR with as few as two or three wavelets. For example, the IMBH and white noise burst signals in Figure 2 added to the same data we used to estimate the background by far exceed the Bayes factor which corresponds to a false alarm probability of . This is an important feature of the BayesWave pipeline: Goldplated detections of shortduration signals are possible even in the presence of a significant glitch population.
V Discussion
In this paper we have demonstrated BayesWave’s utility as a followup analysis for GW burst searches. By analyzing data from the sixth LIGO science run (S6) which took place from 20092010 we have shown that high confidence detections are achievable using BayesWave as a followup analysis despite the high rate of noise transients in the data. When used to followup shortduration gravitational wave triggers, BayesWave has been shown to significantly reduce the rate of falsealarms while remaining sensitive to a wide range of signals Kanner et al. (2015). For insight into how BayesWave takes advantage of Bayesian model selection to separate signals and glitches we presented an analytical framework and found simple expressions which provide approximations to our full numerical analysis on real data. The results show that BayesWave has several novel features, when compared with other Burst pipelines:

The detection statistic directly compares the evidence for an astrophysical signal with a glitch model, as opposed to calculating a likelihood derived from Gaussian noise.

BayesWave places emphasis on the timefrequency complexity and network coherence of an event, rather than just its strength, to distinguish signals from glitches

The background distribution shows no evidence of “tails” at high values.
In order to emphasize the importance of including the glitch model in a statistical framework, best fit waveforms for the signal and glitch models for the most “signallike” background events in S6D are shown in Figure 4. For these examples of real glitches, the signal and glitch model are shown to very nearly agree. Because glitches can be so successful in imitating real gravitational wave signals, pipelines which attempt to reject these events with tunings and cuts face a major challenge. Instead, BayesWave attempts to accurately assess the probability of such coincident glitches arising from chance. This approach places a lower weight to events with simple timefrequency structure that could plausibly arise simultaneously in two or more instruments, regardless of their SNR.
The detection statistic described in this work, , represents the likelihood ratio for two competing models: the data contains a glitch, or the data contains an astrophysical signal. The purist may object to this application of the Bayes factor, instead favoring the Bayesian odds ratio between the signal and glitch model. The prior odds ratio between these models is the ratio of the expected coincident and coherent glitch rate to the expected rate of GW signals. While the rate of GW signals is unknown, we have shown that the measured background distribution is consistent with our analytic predictions using the LIGO glitch rate. This consistency suggests that the BayesWave model is a good fit to actual LIGO data and Bayes factors calculated by BayesWave will serve as a robust means for correctly identifying signals and glitches. In principle, glitches with nonflat distributions in f and Q, especially if similarly distributed in multiple detectors, could invalidate this agreement. Should that be the case, the posterior distribution of background events can easily be folded in to our analysis as a prior on the glitch model. Because the glitch population in earlier LIGO data will likely differ from that of the advanced detectors, we will continue to rely on the bruteforce approach of using time slides to estimate the significance of a candidate event and use what is learned to further improve our priors for subsequently collected data.
As the capabilities of ground based detectors continues to improve so too must our analysis. The work presented here represents a snap shot of BayesWave’s capabilities as the algorithm continues to advance. Further development is underway to relax the requirement of elliptical polarization for the signal model (improving the detection efficiency for unpolarized signals) and to account for glitches and signals appearing in the same segment of data (reducing false dismissals due to nearcoincidence with glitches). Nonetheless, based on the thorough performance studies in real LIGO data reported in this work we conclude that BayesWave is prepared to decisively aid in the detection and characterization of GW bursts in the advanced detector era.
Vi Acknowledgments
We acknowledge numerous discussions with Reed Essick, Erik Katsavounidis, and Salvatore Vitale which helped motivate the detailed analytic derivation of the evidence in the Appendix; James Clark for thorough comments and suggestions on an earlier draft of this paper; Sergey Klimenko and Francesco Salemi for help finding and understanding the cWB output files; and Kent Blackburn, Vicky Kalogera, Tjonnie Li, Patricia Schmidt, and Michele Vallisneri for helpful conversations about this work. TBL acknowledges the support of NSF LIGO grant, award PHY1307020. LIGO was constructed by the California Institute of Technology and Massachusetts Institute of Technology with funding from the National Science Foundation and operates under cooperative agreement PHY0757058. This paper carries LIGO Document Number LIGOP1500083.
*
Appendix A Bayes Factors
The Laplace approximation for the evidence is given by
(15) 
where are the maximum a posteriori parameters, is the model dimensions, and is the parameter covariance matrix, which we can estimate from the inverse of the Fisher information matrix . If the prior is uniform for all parameters, the prior density is equal to the inverse of the prior volume: . We recognize the collection of terms as the posterior volume. Thus, for uniform priors, the evidence is given by the product of the MAP likelihood times the ratio of the posterior to prior volume, which is referred to as the Occam penalty. In the case of BayesWave the priors on most parameters are flat, with the important exception of the amplitude or the signaltonoise ratio (which depends of the amplitude, quality factor, central frequency and noise spectral density).
Dropping terms down by factors of relative to leading order, the Fisher matrix for a single wavelet using the parameters is given by
(16) 
The determinant of is
(17) 
The expectation value of the MAP log likelihood is given by (see page 31 of Ref. Röver (2007) and references therein)
(18) 
The constant is independent of the signal model. The term comes from more complicated models being able to better fit features in the Gaussian noise.
Each wavelet is described by 5 parameters, and has
(19) 
where is the signaltonoise ratio for wavelet . Assuming that the wavelets used in the reconstruction have little overlap with each other, the total posterior volume for the wavelet model is
(20) 
BayesWave has a nontrivial amplitude prior which needs to be taken into account. One choice would be a uniform in volume prior on the source distribution, which is equivalent to a prior on the distance that scales as . Since amplitude and distance are inversely related, we have . Here we have made the change of variables to since this is the parameter used to compute the Fisher matrix. This prior is improper, and to make it proper a minimum amplitude cutoff (maximum distance) has to be introduced. The properly normalized uniforminvolume prior is
(21) 
An alternative approach, used by BayesWave in this work, is to adopt different physically motivated priors on the signals and glitches that are given as functions of the SNR. For glitches the SNR is given by
(22) 
while for signals the SNR is given by the same expression, but with the individual detector noise spectral density replaced by the network average
(23) 
Thus the signalmodel SNR depends on . The priors on the signal model and glitch model are given in terms of a prior on the SNR, . Making the change of variables from SNR to yields
(24) 
for the glitch model and
(25) 
for the signal model.
If we further assume little correlation between the wavelet model and the Gaussian noise model, then the expected value for the log Bayes Factor between the glitch plus noise model and the noise model in a single detector is
(26) 
Here is the signaltonoise ratio of the signal or glitch in that detector. Later when considering a network of detectors the will refer to the network signaltonoise ratio of the signal.
If the wavelet model in one detector uses wavelets, and assuming little overlap between wavelets, then
(27) 
Based on simulations, we find that the average number of wavelets used by BayesWave increases linearly with the total , and that the average per wavelet increases with the total as a waveformdependent power law. Writing and , we find that the values of , , and depend on the waveform morphology, with and increasing, and decreasing, as the timefrequency structure of the waveform becomes more complicated. In the case of a constrained model using a fixed number of wavelets the average SNR per wavelet always increases linearly with the total SNR, though with a proportionality less than one for anything other than sinegaussians.
Fig. 5 shows the average number of wavelets (left panel) and SNR per wavelet (right panel) as a function of SNR for the three different waveform morphologies studied in this paper–sine Gaussians, binary black hole mergers, and white noise bursts–added to simulated Gaussian noise from a single detector at Advanced LIGO sensitivity. Each simulation was repeated for several Gaussian noise realizations. Plotted are the average and one standard deviations of the mean, plus lines that show the scaling relations using the best fit values for .
Starting with a simplified model of to aligned collocated detectors, the signal model does not need any extrinsic parameters and the log Bayes factors are
(28) 
If we assume that all the wavelets have the same quality factor and individual signaltonoise ratios , then the individual SNRs in each detector are . Thus for the glitch model , while for the signal model . Then the Bayes Factor between the signal and glitch models for two collocated detectors is
(29) 
Here is the time frequency volume, and is the prior range for . Note that the contributions from the amplitude prior introduce important SNR scalings into the Bayes factor. For a single sineGaussian model the posterior volume terms introduce a scaling to the log Bayes factor. For BayesWave the scaling with SNR is far more complicated. On complex waveforms both and scale linearly with the SNR, so the posterior volume introduces terms that scale as . The amplitude prior for the signal model introduces a similar scaling, along with a more complicated scaling of the form . The amplitude prior for the glitch model introduces terms, in addition to a term that scales as , though this term does not start to dominate until very high SNRs ( for typical choices of ). In the fixed dimension case the BayesWave scaling is dominantly of the form for moderate SNRs. At very high SNRs the quadratic dependence of the full BayesWave scaling is replaced by a linear scaling in SNR.
There are several assumptions that went into the derivation of the signaltoglitch Bayes Factor for BayesWave that are rather crude. The worst approximations are that the wavelets used in the reconstruction all have roughly the same and signaltonoise ratio. While on average the scaling is quite robust, the for individual wavelets never go much below the value set by the peak of the SNR prior, so that . This means that the linear scaling typically only holds for network SNRs greater than around 10 or 12. Rather than assuming the same quality factor for each wavelet we could use the average value. For distributed uniformly in the range we have
(30) 
and
(31) 
a.1 Extrinsic Parameters
Implicit in the preceding derivation was the assumption that the overlap of any two wavelets and, as a consequence, the parameter correlation matrix for the wavelets was blockdiagonal. This assumption is reasonable since each wavelet collects the power in a certain timefrequency volume disfavoring highly overlapping wavelets. For misaligned detectors we can write the parameter correlation matrix for the signal model in blockform as
(32) 
where is the correlation matrix for the intrinsic wavelet parameters, is the correlation matrix for the extrinsic parameters and is the crosscorrelation matrix that mixes the extrinsic and intrinsic parameters. The Fisher matrix can similarly be decomposed:
(33) 
Now, for partitioned symmetric matrices we have (see page 46 of the Matrix Cookbook Petersen and Pedersen (2012))
(34) 
which implies that the volume of the posterior factors into extrinsic and intrinsic pieces, where the intrinsic part has exactly the same form as for the glitch model:
(35)  
Putting all the pieces together we have
(36)  
From here we can insert the SNR scalings for , and the and include the explicit expression for the intrinsic parameter volumes in an effort to make quantitative predictions. While the expressions are more complicated than the aligned collocated case the scalings with SNR are the same.
We are unable to derive an analytic expression for . Additionally, there is the problem that some of the extrinsic parameters, most notably the ellipticity and polarization angle, are poorly constrained and Fisher matrix estimates are unreliable. As a result the posterior volume does not scale as as we naively expect from the Fisher matrix, but as some lower power such as or . One way to incorporate the restriction that the posterior not exceed the prior is to elevate the extrinsic parameters from their fundamental domain (with periodic boundary conditions) to the universal cover, and introduce a Gaussian prior on the parameters that restricts the posterior volume to be no larger than the prior volume. The negative Hessian of second derivatives of the log of this prior is added to the Fisher matrix (so that the Fisher matrix describes the curvature of the posterior, not just the likelihood). Numerically computing the posterior volume as a function of SNR using this approach shows that the posterior volume scales as , where the exponent is weakly dependent on the SNR, varying between roughly 2 and 3 across the range of SNRs we expects to encounter, as shown in Fig. 5.
The end result is that including the intrinsic parameters increases the dimension of the signal model by between 2 and 3 degrees of freedom, not 4 as we would naively expect. Thus, the overall scaling for the single sineGaussian Bayes factor should scales as . The scaling for more elaborate waveforms is far more complicated. Ultimately it is this added complexity that enables BayesWave to assign highconfidence to detection candidates of nontrivial GW signals.
References
 Abbott et al. (2009a) B. Abbott et al. (LIGO Scientific Collaboration), Rept. Prog. Phys. 72, 076901 (2009a), arXiv:0711.3041 [grqc] .
 Accadia et al. (2012) T. Accadia et al., Journal of Instrumentation 7, P03012 (2012).
 Abbott et al. (2009b) B. Abbott et al. (LIGO Scientific Collaboration), Rept.Prog.Phys. 72, 076901 (2009b), arXiv:0711.3041 [grqc] .
 Blackburn et al. (2008) L. Blackburn, L. Cadonati, S. Caride, S. Caudill, S. Chatterji, et al., Class.Quant.Grav. 25, 184004 (2008), arXiv:0804.0800 [grqc] .
 Abadie et al. (2010) J. Abadie et al. (LIGO Scientific Collaboration, Virgo Collaboration), Phys. Rev. D 82, 102001 (2010), arXiv:1005.4655 [grqc] .
 Aasi et al. (2012) J. Aasi et al. (VIRGO Collaboration), Class.Quant.Grav. 29, 155002 (2012), arXiv:1203.5613 [grqc] .
 Abbott et al. (2009c) B. Abbott et al. (LIGO Scientific Collaboration), Phys. Rev. D 80, 102002 (2009c), arXiv:0904.4910 [grqc] .
 Abbott et al. (2009d) B. Abbott et al. (LIGO Scientific Collaboration), Phys.Rev. D80, 102001 (2009d), arXiv:0905.0020 [grqc] .
 Abadie et al. (2012) J. Abadie et al. (LIGO Scientific Collaboration, Virgo Collaboration), Phys. Rev. D 85, 122007 (2012), arXiv:1202.2788 [grqc] .
 Thrane and Coughlin (2015) E. Thrane and M. Coughlin, Phys. Rev. Lett. 115, 181102 (2015), arXiv:1507.00537 [astroph.IM] .
 Veitch et al. (2015) J. Veitch, V. Raymond, B. Farr, W. Farr, P. Graff, et al., Phys.Rev. D91, 042003 (2015), arXiv:1409.7215 [grqc] .
 Essick et al. (2014) R. Essick, S. Vitale, E. Katsavounidis, G. Vedovato, and S. Klimenko, (2014), arXiv:1409.2435 [astroph.HE] .
 (13) R. Lynch, S. Vitale, R. Essick, E. Katsavounidis, and F. Robinet, “2015,” In preperation.
 Veitch and Vecchio (2010) J. Veitch and A. Vecchio, Phys. Rev. D 81, 062003 (2010), arXiv:0911.3820 [astroph.CO] .
 Cornish and Littenberg (2015) N. J. Cornish and T. B. Littenberg, Class. Quant. Grav. 32, 135012 (2015), arXiv:1410.3835 [grqc] .
 Littenberg and Cornish (2015) T. B. Littenberg and N. J. Cornish, Phys.Rev. D91, 084034 (2015), arXiv:1410.3852 [grqc] .
 Klimenko et al. (2008) S. Klimenko, I. Yakushin, A. Mercer, and G. Mitselmakher, Class. Quantum Grav. 25, 114029 (2008), arXiv:arXiv:0802.3232 [grqc] .
 Kanner et al. (2015) J. B. Kanner, T. B. Littenberg, N. Cornish, M. Millhouse, E. Xhakaj, F. Salemi, M. Drago, G. Vedovato, and S. Klimenko, ArXiv eprints (2015), arXiv:1509.06423 [astroph.IM] .
 Chatterji et al. (2004) S. Chatterji, L. Blackburn, G. Martin, and E. Katsavounidis, 8th Gravitational Wave Data Analysis Workshop (GWDAW 2003) Milwaukee, Wisconsin, December 1720, 2003, Class. Quant. Grav. 21, S1809 (2004), arXiv:grqc/0412119 [grqc] .
 Chatterji (2005) S. Chatterji, The search for gravitational wave bursts in data from the second LIGO science run, Ph.D. thesis, Massachusettes Institute of Technology, http://hdl.handle.net/1721.1/34388 (2005).
 Chatterji et al. (2006) S. Chatterji, A. Lazzarini, L. Stein, P. J. Sutton, A. Searle, et al., Phys.Rev. D74, 082005 (2006), arXiv:grqc/0605002 [grqc] .
 Sutton et al. (2010) P. J. Sutton et al., New J. Phys. 12, 053034 (2010), arXiv:0908.3665 [grqc] .
 Abadie et al. (2011) J. Abadie et al. (LIGO), Phys. Rev. D83, 042001 (2011), arXiv:1011.1357 [grqc] .
 Gamerman (1997) D. Gamerman, Markov Chain Monte Carlo: Stocastic Simulation of Bayesian Inference (Chapman and Hall, London, 1997).
 Swendsen and Wang (1986) R. Swendsen and J. Wang, Phys. Rev. Lett. 57, 2607 (1986).
 Goggans and Chi (2004) P. Goggans and Y. Chi, Bayesian Inference and Methods in Science and Engineering, edited by J. Rychert, G. Erickson, and C. Smith (American Institute of Physics, USA, 2004).
 Vousden et al. (2015) W. Vousden, W. M. Farr, and I. Mandel, ArXiv eprints (2015), arXiv:1501.05823 [astroph.IM] .
 Green (2003) P. Green, Highly Structured Stochastic Systems, edited by P. Green, N. Hjort, and S. Richardson (Oxford University Press, 2003).
 Anderson et al. (2001) W. Anderson, P. Brady, D. Chin, J. Creighton, K. Riles, and J. Whelan, “Beam pattern response functions and times of arrival for earthbound interferometer,” LIGOT01011000Z (2001).
 Röver (2007) C. Röver, Bayesian inference on astrophysical binary inspirals based on gravitationalwave measurements., Ph.D. thesis, The University of Auckland, URL:http://hdl.handle.net/2292/2356 (2007).
 Cornish and Littenberg (2007) N. J. Cornish and T. B. Littenberg, Phys. Rev. D 76, 083006 (2007), arXiv:arXiv:0704.1808 [grqc] .
 Vallisneri (2008) M. Vallisneri, Phys. Rev. D 77, 042001 (2008), arXiv:grqc/0703086 .
 Buonanno et al. (2007) A. Buonanno et al., Phys. Rev. D 76, 104049 (2007), arXiv:0706.3732 [grqc] .
 Aasi et al. (2015) J. Aasi et al. (LIGO Scientific, VIRGO), Class.Quant.Grav. 32, 115012 (2015), arXiv:1410.7764 [grqc] .
 Petersen and Pedersen (2012) K. B. Petersen and M. S. Pedersen, The Matrix Cookbook (Technical University of Denmark, URL:http://www2.imm.dtu.dk/pubdb/p.php?3274, 2012).