Sky marginalization in black hole spectroscopy and tests of the area theorem

Alex Correia
University of Massachusetts Dartmouth, 285 Old Westport Rd, North Dartmouth, MA 02747

Collin D. Capano [email protected]
Department of Physics, Syracuse University, Syracuse, NY 13244
University of Massachusetts Dartmouth, 285 Old Westport Rd, North Dartmouth, MA 02747

Abstract

Direct observation of gravitational waves from binary black hole (BBH) mergers has made it possible to test the laws of black-hole thermodynamics using real astrophysical sources. These tests rely on accurate and unbiased parameter estimates from the pre- and post-merger portions of a signal. Due to numerical complications, previous analyses have fixed the sky location and coalescence time when independently estimating the parameters of the pre- and post-merger signal. Here we overcome the numerical complications and present a novel method of marginalizing over sky location and coalescence time. Doing so, we find that it is not possible to model only the pre- or post-merger portions of the signal while marginalizing over timing uncertainty. We surmount this problem by simultaneously yet independently modelling the pre- and post-merger signal, with only the sky location and coalescence time being shared between the models. This allows us to marginalize over all parameters. We use our method to measure the change in area $\Delta A_{\rm measured}=A_{f}-A_{i}$ between the final and initial black holes in the BBH merger GW150914. To measure the final black hole’s area $A_{f}$ we do an analysis using quasi-normal modes (QNMs) to model the post-merger signal, and another analysis using the post-merger portion of an inspiral-merger-ringdown (IMR) template. We find excellent agreement with expectations from General Relativity. The Hawking area theorem (which states that $A_{f}\geq A_{i}$ ) is confirmed to $95.4\%$ and $99.5\%$ confidence using the QNM and IMR post-merger models, respectively. Both models yield $\Delta A_{\rm measured}/\Delta A_{\rm expected}\sim 1$ , where $\Delta A_{\rm expected}$ is the expected change in area derived from fits to numerical relativity simulations.

I Introduction

The direct detection of gravitational waves (GWs) throughout the past decade has been one of the most significant advancements in observational relativity. These observations not only confirmed the existence of GWs, but also opened the door to direct tests of General Relativity (GR) in the strong-field regime. One such prediction that can be verified is Hawking’s area theorem, which states that the remnant from a binary black hole (BBH) merger must have an event horizon surface area no smaller than the sum of the progenitor horizon areas [1].

Tests of the area theorem using GWs have been carried out previously [2, 3, 4]. Generally, they involve measuring the initial two black holes’ mass and spin (from which the total initial area $A_{i}$ is derived) from the pre-merger part of the signal, during which the two black holes inspiral into each other. The area of the final black hole is independently measured using the GW that is emitted during the post-merger, or “ringdown” phase. In both cases Bayesian inference is used to produce a “posterior” probability density on the black holes’ parameters. Ideally, all parameters that describe the BBH should be allowed to vary in the analysis, in order to fully account for all statistical uncertainties. However, previous tests of the area theorem have fixed the sky position and coalescence time $t_{c}$ of the events to nominal values when doing their analysis [2, 4]. Fixing the values in this way may lead to biases in the resultant parameter estimates, obfuscating the true nature of the system [3, 5]. At the very least, it may cause an underestimate of the statistical uncertainty of measured parameters, yielding constraints on deviations from GR that are misleadingly strong.

The sky location and $t_{c}$ have been fixed in earlier studies due to technical hurdles in calculating the likelihood function. In order to independently analyze the pre- and post-merger portion of the signal it is necessary to excise the post- and pre-merger data, respectively, from the analysis. Excising the pre-merger data is also necessary when just analyzing the post-merger signal for tests of the no-hair theorem using quasi-normal modes (known as black hole spectroscopy). In either case, the excision (or “gate”) results in a modified likelihood function that cannot be solved using conventional, frequency-domain means. Existing pipelines for doing such analyses in the time domain, such as pyring [6, 7] and ringdown [8], instead calculate the likelihood by numerically inverting the noise covariance matrix for the data. This calculation can be numerically costly [9, 5]. If the sky location and $t_{c}$ are not fixed, the gate will vary in time, and the full likelihood will need to be recalculated for every unique gate time, significantly increasing computational costs. However, if the sky location and $t_{c}$ are fixed, the gate is static, and the most computationally demanding part of the calculation need only be done once.

Finch & Moore [10] devised a method to overcome this issue and vary the sky location and $t_{c}$ . In their method the inspiral and merger are modeled with a wavelet series stitched to the beginning of the ringdown. This allows them to use the traditional frequency-domain likelihood, and vary the start time of the ringdown. However, this does not allow for an area theorem test, since the parameters of the initial black holes cannot be estimated from the wavelet signal model. Furthermore, the traditional frequency-domain likelihood causes the start of the ringdown to be coupled to information from just before the merger due to convolution with the whitening filter. This means that recovered pre- and post-merger parameters will not be independent measurements.

This paper presents a novel method of calculating the likelihood function using the parameter estimation code PyCBC Inference [11]. This code already utilizes “gating and in-painting” [12] to excise data from the analysis, which allows for cheaper likelihood evaluation under certain conditions as compared to the methods used by pyring and ringdown [9, 5]. However, an additional normalization factor has traditionally been omitted from this calculation, as this also required computationally expensive numerical methods to evaluate. This paper presents a method of linear interpolation that calculates this normalization factor with good approximation. Together, these methods allow for fast likelihood calculations regardless of gate position, allowing for marginalization over $t_{c}$ and sky location and a full accounting of parameter uncertainties.

This paper is structured as follows. Section II describes the modifications to the likelihood calculation in PyCBC in detail. Full waveform analyses can be conducted with these modifications to marginalize over sky location and $t_{c}$ . However, the method cannot be used in partial waveform analyses; Section III documents the issues that arise in these models. Using this method, a test of the Hawking area theorem is conducted using data from GW150914 [13]. Section IV describes the configuration of the analyses used in this test, and Section V describes and analyzes the results. Section VI summarizes the findings and the implications of the marginalization method for future GW analyses.

II Determinant approximation

PyCBC Inference [11] utilizes Bayesian inference to conduct parameter estimation on GW events. Bayes’ theorem is used to extract information about the parameter space $\bm{\vartheta}$ from a data set $\bm{s}$ [14]. Assuming the data set is composed of a signal $h$ and noise $n$ such that $\bm{s}=\bm{n}+\bm{h}$ , the probability of observing a specific $\bm{\vartheta}$ is given by

p(\bm{\vartheta}|\bm{s},h)=\frac{p(\bm{s}|\bm{\vartheta},h)\,p(\bm{\vartheta}|h)}{p(\bm{s}|h)}.

(1)

The term $p(\bm{s}|\bm{\vartheta},h)$ is the likelihood, which describes the probability of observing a signal $\bm{s}$ assuming the event has a parameter space $\bm{\vartheta}$ . The term $p(\bm{\vartheta}|h)$ is the prior, a distribution that describes the a priori probability of observing $\bm{\vartheta}$ given a signal model $h$ . The denominator $p(\bm{s}|h)$ is the evidence, a normalization factor used to compare analyses using different models for $h$ . The resultant probability distribution on the left-hand side of the equation is known as the posterior.

The prior is chosen at the discretion of the analyst based on assumed plausible values for $\bm{\vartheta}$ , whereas the likelihood is calculated directly from the data. For a system of $K$ detectors each taking $N$ time series samples of an event with a stochastic Gaussian noise background, the likelihood can be written as [15]

p(\bm{s}_{net}|n)=\frac{\exp\left[-\frac{1}{2}\sum_{d=1}^{K}\bm{s}^{T}_{d}\bm{\Sigma}^{-1}_{d}\bm{s}_{d}\right]}{\left[(2\pi)^{NK}\prod_{d=1}^{K}\det\bm{\Sigma}_{d}\right]^{1/2}},

(2)

where $\bm{\Sigma}_{d}$ is the covariance matrix associated with $n$ in detector $d$ .
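For small $N$, Eq. (2) can be evaluated by brute force, which makes the role of each term concrete. The Python sketch below does this for a network of detectors, assuming stationary noise so that each $\bm{\Sigma}_{d}$ is a Toeplitz matrix built from an autocovariance sequence. The function name `network_loglike`, its arguments, and the toy autocovariance are illustrative assumptions and are not part of PyCBC. The Cholesky factorization used here scales as $O(N^{3})$, so this is a reference calculation rather than a practical method.

```python
import numpy as np
from scipy.linalg import toeplitz, cho_factor, cho_solve


def network_loglike(residuals, autocovariances):
    """Brute-force logarithm of Eq. (2), summed over detectors.

    residuals: list of 1D arrays, one per detector (s_d for the noise-only
        hypothesis, or s_d - h_d when a signal model is subtracted).
    autocovariances: list of 1D arrays giving the first column of each
        Sigma_d (assumes stationary noise, so Sigma_d is Toeplitz).
    """
    logl = 0.0
    for resid, acf in zip(residuals, autocovariances):
        resid = np.asarray(resid, dtype=float)
        n = resid.size
        sigma = toeplitz(np.asarray(acf, dtype=float)[:n])
        # Cholesky factorization gives both Sigma^{-1} r and log det Sigma.
        factor = cho_factor(sigma, lower=True)
        quad = resid @ cho_solve(factor, resid)
        logdet = 2.0 * np.sum(np.log(np.diag(factor[0])))
        logl += -0.5 * quad - 0.5 * (n * np.log(2.0 * np.pi) + logdet)
    return logl


# Toy example: two detectors with the same exponentially decaying
# autocovariance (an assumption made only for illustration).
rng = np.random.default_rng(0)
acf = 1.5 * 0.8 ** np.arange(64)
resid = [rng.normal(size=64), rng.normal(size=64)]
logl = network_loglike(resid, [acf, acf])
print(logl)
```

For white noise the covariance is diagonal and the result reduces to a sum of independent one-dimensional Gaussian log-densities, which is a convenient sanity check.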

To evaluate the likelihood, further assumptions must be made to calculate $\bm{\Sigma}_{d}^{-1}$ . The result of these assumptions is:

p(\bm{s}_{net}|n)\propto\exp\bigg[-\frac{1}{2}\sum^{K}_{d=1}\langle\bm{s}_{d},\bm{s}_{d}\rangle\bigg],

(3)

where the inner product is defined by Eq. (22). (The full derivation of this likelihood function is given in Appendix A.) This gives the likelihood function for data that are assumed to be entirely noise. To get the likelihood function with respect to the signal model $h$ evaluated for a parameter space $\bm{\vartheta}$ , Eq. (3) can be rewritten by substituting $\bm{s}_{d}\rightarrow\bm{s}_{d}-\bm{h}_{d}(\bm{\vartheta})$ on the right-hand side, yielding

\log p(\bm{s}_{net}|\bm{\vartheta},h)=-\frac{1}{2}\sum^{K}_{d=1}\langle\bm{s}_{d}-\bm{h}_{d}(\bm{\vartheta}),\bm{s}_{d}-\bm{h}_{d}(\bm{\vartheta})\rangle-\frac{1}{2}\bigg[NK\log 2\pi+\sum^{K}_{d=1}\log\det\bm{\Sigma}_{d}\bigg].

(4)

(Here and throughout, we use $\log$ to refer to the natural logarithm.) For a GW analysis that examines the entirety of $\bm{s}$ , this expression is easily evaluated using the approximated eigenvalues of $\bm{\Sigma}_{d}$ . However, many analyses only examine a portion of the time series (generally either the pre- or post-merger signal). Therefore, a region of $\bm{s}$ is omitted, or “gated”, to reduce numerical biases due to the merger. Doing this also requires excising rows and columns from $\bm{\Sigma}_{d}$ , thereby breaking the Toeplitz form of the matrix and the corresponding approximations. The inverse of the truncated covariance matrix $\bm{\Sigma}_{d,tr}$ is evaluated in PyCBC using “gating and in-painting” (explained in detail in [12] and Appendix B), allowing for easy calculation of the first term of Eq. (4).

The second term, however, requires the calculation of $\log\det\bm{\Sigma}_{d,tr}$ , which is a notoriously expensive numerical problem. The most widely-used matrix decomposition methods are $\sim\!O(N^{3})$ [16], which become impractical for GW time series such as GW150914, where $N\!\sim\!10^{4}$ . This problem is amplified when varying sky position and $t_{c}$ in a gated analysis. The start and end times of the gate must be converted between the geocentric and individual detector frames. The time conversions directly depend on the time of merger $t_{c}$ and the sky location of the event relative to the detectors. Therefore, when varying sky location and $t_{c}$ , the gate times in the detector frames will also vary. Subsequently, the elements of $\bm{\Sigma}_{d,tr}$ will vary with each unique sky location and $t_{c}$ value. Analyses that marginalize over these parameters require recalculating $\log\det\bm{\Sigma}_{d,tr}$ on multiple steps of the sampler, which would be impractical using numerical decomposition methods. This is not an issue if the sky location and $t_{c}$ are fixed, since in that case $\det\bm{\Sigma}_{d,tr}$ will not change throughout the analysis (and in fact can be ignored, as it amounts to a constant normalization term).

In order to marginalize over sky location and $t_{c}$ it is necessary to have a fast method for calculating $\det\bm{\Sigma}_{d,tr}$ . We find that $\log\det\bm{\Sigma}_{d,tr}$ is strongly linearly correlated to the length of the power spectral density (PSD) of the data, which is equivalent to the number of rows and columns in $\bm{\Sigma}_{d,tr}$ . Fig. 1 shows this relationship using $\log\det\bm{\Sigma}_{d,tr}$ values calculated for various gate sizes applied to GW150914 data. Using a least squares fit to the points, the correlation coefficient was calculated as $0.999<R^{2}<1$ , which for a sample of 8 points implies a highly significant correlation [17].

Figure 1: Plot of $\log\det\bm{\Sigma}_{d,tr}$ versus PSD length for GW150914 in the Hanford detector. Blue points represent determinants calculated exactly using SciPy functions [18], while the black dashed line is a least squares linear fit. The correlation coefficient of $0.999<R^{2}<1$ indicates a highly significant linear correlation between $\log\det\bm{\Sigma}_{d,tr}$ and the size of $\bm{\Sigma}_{d,tr}$ .

We also find that the position of the gate in the time series has no significant effect on the value of $\log\det\bm{\Sigma}_{d,tr}$ . For a fixed gate length, the values varied over a maximum range of $\sim\!10^{-6}$ as the gate was moved. Since the values themselves are generally of order $\sim\!10^{6}$ , this corresponds to a maximum fractional change of $\sim\!10^{-12}$ over the entire time domain. The dependence on gate position is therefore negligible, and the value is well approximated as a constant for a static gate length.

Using these two facts, we calculate the normalization term in Eq. (4) using a linear interpolation based solely on the size of $\bm{\Sigma}_{d,tr}$ . Specifically, a least squares linear fit is generated using the determinant values for the full matrix and three differently-sized truncated matrices. The full matrix determinant is calculated using its approximated eigenvalues, while the truncated determinants are calculated numerically using SciPy [18]. The determinant can then be calculated on each step through linear interpolation based on the size of the truncated covariance matrix.
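The interpolation described above can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the PyCBC implementation: the covariance is built from an exponentially decaying autocovariance rather than a measured PSD, every determinant (including the full-matrix one) is computed exactly with `numpy.linalg.slogdet`, and the helper names `truncated_logdet` and `fit_logdet_vs_size` are hypothetical.

```python
import numpy as np
from scipy.linalg import toeplitz


def truncated_logdet(acf, gate_start, gate_len):
    """Exact log det of the covariance with the gated rows/columns removed."""
    n = len(acf)
    keep = np.ones(n, dtype=bool)
    keep[gate_start:gate_start + gate_len] = False
    sigma_tr = toeplitz(acf)[np.ix_(keep, keep)]
    _, logdet = np.linalg.slogdet(sigma_tr)
    return logdet


def fit_logdet_vs_size(acf, gate_lens, gate_start):
    """Least squares line through (matrix size, log det) for the full matrix
    and one truncated matrix per entry of gate_lens."""
    n = len(acf)
    sizes = [n] + [n - g for g in gate_lens]
    logdets = [truncated_logdet(acf, 0, 0)]
    logdets += [truncated_logdet(acf, gate_start, g) for g in gate_lens]
    slope, intercept = np.polyfit(sizes, logdets, 1)
    return slope, intercept


# Toy autocovariance (assumption): AR(1)-like noise with 512 samples.
n, rho, var = 512, 0.9, 2.0
acf = var * rho ** np.arange(n) / (1.0 - rho**2)

# Fit once, using the full matrix and three truncated matrices.
slope, intercept = fit_logdet_vs_size(acf, [64, 128, 192], gate_start=200)

# Afterwards, any gate length is handled by evaluating the line.
gate_len = 100
approx = slope * (n - gate_len) + intercept
exact = truncated_logdet(acf, 150, gate_len)
print(approx, exact)
```

The expensive decompositions are done only once, when the line is fit; each subsequent likelihood evaluation needs only the size of the truncated matrix.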

III Need for Joint Analyses

The strategy outlined in Sec. II fixes the technical hurdle of calculating the likelihood when the gate length and position are varied. Even with that, however, we find that it is not possible to only model a portion of a signal — whether it be the post-merger or the pre-merger — if $t_{c}$ or sky location are uncertain. This is because a larger likelihood can generally be obtained if the gate is shifted such that it removes as much of a signal as possible.

From Eq. (4) it is evident that the likelihood is maximized as $\bm{s}_{d}-\bm{h}_{d}(\bm{\vartheta})\rightarrow 0$ . When analyzing the entire time series this can only occur if the template $\bm{h}_{d}(\bm{\vartheta})$ is similar to the signal that exists in the data (as desired). However, if a variable gate is involved, then a large likelihood can also be obtained if the gate is shifted such that it excises most (or all) of the signal in the data. The template then only needs to match the remaining noise. Since the noise manifold is larger than the signal manifold, a larger volume of prior space will generally be able to match the noise than the volume that matches the signal. For example, when using a QNM model, if the gate is shifted such that it excises the entire signal then one only needs to reduce the amplitude of the QNM template below noise level to get a relatively large likelihood. The same likelihood would then be obtained regardless of what the other parameters describing the template are set to. The end result is that the posterior probability will favor excising the entire signal even if the template that is being used can match the signal.

Specifically, in pre-merger-only analyses the gate will be shifted to as early a time as possible, while in post-merger-only analyses the gate will be shifted to as late a time as possible. These phenomena are illustrated in the middle row of Fig. 2. Shown are the results from separate pre-merger and post-merger analyses of a simulated signal in which the gate time is varied due to time and sky-location marginalization. The maximum likelihood template in the pre-merger-only analysis has a $t_{c}$ roughly $25\,\text{ms}$ earlier than what was injected. This significantly shortens the pre-merger signal, leading to inaccurate measurements of parameters such as total spin. Meanwhile, the maximum likelihood template for the post-merger-only analysis has a ringdown start time $\sim\!10M_{\odot}$ later than the injected value, well into the regime where the signal is expected to be noise-dominated. This leads to a “prior-in prior-out” posterior (a roughly uniform distribution across parameter space), as the signal model can fit any arbitrary model to late-time noise.

Fundamentally, this problem is due to the gated signal model not being the appropriate model for the observed data. Excising data from the analysis is mathematically equivalent to assuming that the excised data is Gaussian noise and marginalizing over all possible realizations of it. A property of multivariate Gaussian distributions is that marginalizing over a subset of dimensions yields another multivariate Gaussian distribution with the marginalized dimensions excised from the covariance matrix. This is exactly the same form as Eq. (3); in our case, each time sample is a dimension in the multivariate Gaussian. The problem is the excised times are not Gaussian noise. They contain a signal, albeit a portion of the signal that we want to ignore. Consequently, a gated signal model is not representative of the data.
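The Gaussian marginalization property invoked above can be checked numerically on a small example. The Python sketch below integrates one dimension out of a three-dimensional Gaussian and compares the result with the two-dimensional Gaussian obtained by deleting the corresponding row and column of the covariance matrix. The covariance values and the function name `marginal_by_integration` are arbitrary choices made for illustration.

```python
import numpy as np
from scipy.integrate import quad
from scipy.stats import multivariate_normal

# Arbitrary positive-definite covariance for three "time samples".
cov = np.array([[2.0, 0.6, 0.3],
                [0.6, 1.5, 0.5],
                [0.3, 0.5, 1.0]])
full = multivariate_normal(mean=np.zeros(3), cov=cov)


def marginal_by_integration(x0, x2):
    """Integrate the middle sample out of the 3D density numerically."""
    value, _ = quad(lambda x1: full.pdf([x0, x1, x2]), -np.inf, np.inf)
    return value


# "Gating" the middle sample: delete its row and column from the covariance.
keep = [0, 2]
reduced = multivariate_normal(mean=np.zeros(2), cov=cov[np.ix_(keep, keep)])

lhs = marginal_by_integration(0.4, -0.7)
rhs = reduced.pdf([0.4, -0.7])
print(lhs, rhs)
```

The two numbers agree to within the accuracy of the numerical integration, which is the statement that excising samples is equivalent to marginalizing over them under the assumption that they are Gaussian noise.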

This issue could be mitigated by modifying the prior to exclude the portion of parameter space that matches the noise while keeping the portion that matches the signal. With a QNM template this would mean setting a lower bound on the amplitude such that it is above noise level. However, modifying the prior in this manner is challenging to do in an unbiased way, as it would require a priori knowledge of both the noise and signal properties. It also does not solve the fundamental problem that the gated signal model is not representative of the data.

We resolve this issue by simultaneously yet independently modeling both the pre- and post-merger signals. Each domain is treated as a separate gated analysis. The signal parameters used in each domain are independent of each other, except for a set of common parameters $\bm{\vartheta}_{\rm com}$ . In our case the common parameters are the right ascension $\alpha$ , declination $\delta$ , and geocentric coalescence time $t_{c}$ . These parameters determine the coalescence time $t^{d}_{c}$ in each detector $d$ , which the gate time depends on. The likelihood is calculated in each domain, then combined to get the overall likelihood.

Explicitly, our algorithm to calculate the likelihood for a given set of parameter values $\bm{\vartheta}=\{\bm{\vartheta}_{\rm insp},\bm{\vartheta}_{\rm rd},\bm{% \vartheta}_{\rm com}\}$ is:

1. Generate the pre-merger (“inspiral”) template $h^{\rm insp}$ using parameters $\bm{\vartheta}_{\rm insp}$.
2. Project the template into each detector’s frame using $\bm{\vartheta}_{\rm com}=\{\alpha,\delta,t_{c}\}$.
3. Excise times after $t^{d}_{c}$ by gating and in-painting the residual $s_{d}-h_{d}^{\rm insp}(\bm{\vartheta}_{\rm insp},\bm{\vartheta}_{\rm com})$ using a $1\,$s gate that spans $[t^{d}_{c},t^{d}_{c}+1)$.
4. Calculate the pre-merger likelihood $p(\bm{s}_{net}|\bm{\vartheta}_{\rm insp},\bm{\vartheta}_{\rm com},h^{\rm insp})$ via Eq. (4).
5. Repeat steps 1-4 for the post-merger (“ringdown”) template $h^{\rm rd}$ to get the post-merger likelihood $p(\bm{s}_{net}|\bm{\vartheta}_{\rm rd},\bm{\vartheta}_{\rm com},h^{\rm rd})$. However, for the ringdown the gate spans $[t^{d}_{c}-1,t^{d}_{c})$; i.e., it ends at $t^{d}_{c}$ whereas it starts at $t^{d}_{c}$ for the pre-merger. The data used in each domain is thereby (nearly) mutually exclusive (see Appendix B for more details).

The total likelihood is then

p(\bm{s}_{net}|\bm{\vartheta},h)=p(\bm{s}_{net}|\bm{\vartheta}_{\rm insp},\bm{\vartheta}_{\rm com},h^{\rm insp})\,p(\bm{s}_{net}|\bm{\vartheta}_{\rm rd},\bm{\vartheta}_{\rm com},h^{\rm rd}).

(5)

We fix the issue of the gate trying to excise the signal by using this hierarchical likelihood. The two domains offset each other: in order for the post-merger gate to shift to later times the pre-merger template must match more of the signal, and vice versa. This also addresses the fundamental issue with the gated signal model highlighted above: our global model for the entire data set now contains a non-Gaussian element (the other domain’s signal model) in each domain’s excised region. Note also that no coupling occurs across the domain boundaries due to the whitening filter.
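The structure of the joint likelihood, and the way it blocks the gate from drifting, can be illustrated with a deliberately simplified Python sketch. Everything here is an assumption made for illustration: a single detector with no sky projection, white noise (so that gating reduces to dropping samples and in-painting is unnecessary), and toy waveforms `toy_imr` and `toy_qnm` that stand in for the real templates. None of these names come from PyCBC.

```python
import numpy as np


def toy_imr(t, tc, amp):
    """Toy full waveform: growing 40 Hz pre-merger, damped 250 Hz post-merger."""
    dt = t - tc
    pre = np.exp(-np.abs(dt) / 0.1) * np.cos(2 * np.pi * 40.0 * dt)
    post = np.exp(-np.abs(dt) / 0.02) * np.cos(2 * np.pi * 250.0 * dt)
    return amp * np.where(dt < 0, pre, post)


def toy_qnm(t, tc, amp):
    """Toy ringdown-only template: zero before tc, damped sinusoid after."""
    dt = t - tc
    post = np.exp(-np.abs(dt) / 0.02) * np.cos(2 * np.pi * 250.0 * dt)
    return amp * np.where(dt >= 0, post, 0.0)


def gated_loglike(data, template, keep, sigma):
    """White-noise Gaussian log likelihood using only the kept samples."""
    resid = (data - template)[keep]
    norm = resid.size * np.log(2 * np.pi * sigma**2)
    return -0.5 * np.sum(resid**2) / sigma**2 - 0.5 * norm


def pre_merger_loglike(data, t, tc, amp, sigma, gate=1.0):
    keep = ~((t >= tc) & (t < tc + gate))      # gate spans [tc, tc + 1)
    return gated_loglike(data, toy_imr(t, tc, amp), keep, sigma)


def post_merger_loglike(data, t, tc, amp, sigma, gate=1.0):
    keep = ~((t >= tc - gate) & (t < tc))      # gate spans [tc - 1, tc)
    return gated_loglike(data, toy_qnm(t, tc, amp), keep, sigma)


def joint_loglike(data, t, tc, amp_insp, amp_rd, sigma):
    """Logarithm of Eq. (5): the two domains share only tc in this toy."""
    return (pre_merger_loglike(data, t, tc, amp_insp, sigma)
            + post_merger_loglike(data, t, tc, amp_rd, sigma))


t = np.arange(0.0, 4.0, 1.0 / 2048)
tc_true, sigma = 2.0, 0.1
data = toy_imr(t, tc_true, 1.0)                # zero-noise injection

# Post-merger only: gating out the signal with a zero-amplitude template
# is as likely as matching the signal at the true time.
rd_true = post_merger_loglike(data, t, tc_true, 1.0, sigma)
rd_late = post_merger_loglike(data, t, tc_true + 0.2, 0.0, sigma)

# Joint: the same late gate is penalized by the pre-merger domain.
joint_true = joint_loglike(data, t, tc_true, 1.0, 1.0, sigma)
joint_late = joint_loglike(data, t, tc_true + 0.2, 1.0, 0.0, sigma)
print(rd_true - rd_late, joint_true - joint_late)
```

In this toy the post-merger-only likelihood cannot distinguish the true configuration from one in which the signal has been gated away, whereas the joint likelihood strongly disfavors the shifted gate because the pre-merger template must then account for data it cannot match.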

The bottom plots in Fig. 2 show the maximum likelihood waveform templates from an analysis with this configuration. Besides sky location and $t_{c}$ , both models generated independent parameter measurements. The resultant waveform templates closely match those obtained by fixing sky location and $t_{c}$ to the injected values (top row of Fig. 2); the erroneous gate motion observed in the pre- and post-merger-only analyses is no longer present.

IV Methods

We performed eight analyses in total. Each analysis was differentiated by the waveform used, post-merger approximant, and whether or not sky location and $t_{c}$ were marginalized over. All other aspects of the analyses were kept constant. Both the pre- and post-merger signals in all models utilized the PyCBC model GatedGaussianMargPol, which inherits the normalization protocols described in Section II. The dynesty sampler [20] was used to generate posterior distributions. The samplers in these analyses used 4000 live points to ensure the convergence of each model (see Appendix C).

Half of the analyses used the original data of GW150914 with a sample rate of 2048 Hz obtained from the Gravitational Wave Open Science Center [21]. To validate these results, we repeat each run on a simulated signal in zero noise. The simulated signal was generated using the IMRPhenomXPHM waveform approximant [22] with the maximum likelihood parameters for GW150914 from Ref. [19].

All template models utilized IMRPhenomXPHM (abbreviated from here on as IMR) to model the pre-merger signal of the waveform. Half of the models used this IMR approximant to model the post-merger signal. The other half utilized a quasinormal mode (QNM) approximant to model the post-merger section of the waveform. Table 1 lists the sampled parameters and priors used in the pre-merger models, and Table 2 lists the same for the post-merger models. The QNM approximant was configured such that the ringdown was composed of a dominant $(2,2,0)$ mode and a subdominant $(2,2,1)$ mode as proposed by [23]. All models apply the ringdown model starting at merger time $t_{c}$ . The priors of the QNM post-merger models were restricted such that the $(2,2,0)$ mode contribution to the post-merger signal-to-noise ratio (SNR) was at least 2. This condition was imposed to ensure that the dominant mode was present in the model, preventing possible “label-switching” that may occur due to the $(2,2,1)$ mode erroneously matching to the $(2,2,0)$ mode in the signal.

Only half of the models allowed for sky location and $t_{c}$ to vary. The other half fixed these parameters to nominal values for GW150914 to replicate previous works. Specifically, the fixed parameter analyses set $\alpha=1.95$ , $\delta=-1.27$ , and $t_{c}=1126259462.408$ , in accordance with the maximum likelihood values in [23].

Parameter	Description	Prior Dist.	Prior Range
$t_{c}$	Coalescence time	uniform	$1126259462.43+[-0.05,0.05]$ s
$\alpha$	Right ascension	uniform	$[0,2\pi]$
$\delta$	Declination	sine angle	$[-\pi/2,\pi/2]$
$\iota$	Inclination	sine angle	$[0,\pi]$
$M_{chirp}$	Source frame chirp mass	$M_{1}$ , $M_{2}$	$[23,42]M_{\odot}$
$q$	Mass ratio $M_{1}/M_{2}$	$M_{1}$ , $M_{2}$	$[1,4]$
$\chi_{a,(1/2)}$	Spin magnitude	uniform	$[0,0.99]$
$\chi_{\theta,(1/2)}$	Spin polar angle	solid angle	$[0,\pi]$
$\chi_{\phi,(1/2)}$	Spin azimuthal angle	solid angle	$[0,2\pi]$
$\phi_{c}$	Reference phase	uniform	$[0,2\pi]$
$V_{C}$	Comoving volume	uniform	$[5000,92918664351]$ Mpc³

Table 1: Varied parameters in IMR models and their associated prior distributions. The subscript $(1/2)$ indicates that the same prior was used for the primary and secondary masses. The third column indicates the sampling method for each prior. Parameters listed in this column indicate uniform sampling over those parameters rather than what is listed in column 1. (For example, $M_{chirp}$ is sampled using uniform priors for $M_{1}$ and $M_{2}$.)

Parameter	Description	Prior Dist.	Prior Range
$M_{f}$	Source frame final mass	uniform	$[10,200]M_{\odot}$
$\chi_{f}$	Final spin	uniform	$[-0.99,0.99]$
$A_{220}$	Initial $(2,2,0)$ mode amplitude	$\log_{10}$	$[10^{-25},8\times 10^{-17}]$
$\phi_{220}$	$(2,2,0)$ mode phase	uniform	$[0,2\pi]$
$A_{221}/A_{220}$	Initial $(2,2,1)$ mode amplitude (as ratio of $A_{220}$ )	uniform	$[0,5]$
$\phi_{221}$	$(2,2,1)$ mode phase	uniform	$[0,2\pi]$

Table 2: Varied parameters in ringdown models and their associated priors. The $t_{c}$, $\iota$, and sky location priors used in ringdown analyses were identical to those shown in Table 1. The amplitude priors indicate the amplitudes of the corresponding quasinormal modes at the start of the ringdown model (i.e. at the merger). The third column indicates the sampling method for each prior. Here, $\log_{10}$ indicates a uniform distribution over the base 10 logarithm of the given parameter.

As a preliminary test, the sky location posteriors of the analyses with variable sky/ $t_{c}$ parameters on real data were plotted. As seen in Fig. 3, the analyses were able to recover most of the posterior for the full IMR analysis conducted in [19].

V Area theorem

The simplest area theorem test is to compare the progenitor horizon areas $A_{1}$ and $A_{2}$ to the remnant horizon area $A_{f}$ to check that

A_{1}+A_{2}\equiv A_{i}\leq A_{f}.

(6)

In natural units ( $G=c=1$ ) the area of each black hole is given by [1]:

A=8\pi M^{2}(1+\sqrt{1-\chi^{2}}),

(7)

where $M$ is the black hole’s mass and $\chi=J/M^{2}$ is its dimensionless spin.
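Eq. (7) and the inequality in Eq. (6) translate directly into code. The Python sketch below is a minimal illustration; the function name `horizon_area` is hypothetical, and the masses and spins in the example are round illustrative numbers of roughly the right scale for a GW150914-like system, not results from this analysis.

```python
import numpy as np


def horizon_area(mass, spin):
    """Kerr horizon area from Eq. (7), in units where G = c = 1.

    mass and spin may be scalars or arrays (e.g. posterior samples);
    spin is the dimensionless spin chi with |chi| <= 1.
    """
    mass = np.asarray(mass, dtype=float)
    spin = np.asarray(spin, dtype=float)
    return 8.0 * np.pi * mass**2 * (1.0 + np.sqrt(1.0 - spin**2))


# Illustrative values only (assumed, in solar masses with G = c = 1).
m1, chi1 = 36.0, 0.0
m2, chi2 = 29.0, 0.0
mf, chif = 62.0, 0.67

area_initial = horizon_area(m1, chi1) + horizon_area(m2, chi2)
area_final = horizon_area(mf, chif)
print(area_initial, area_final, area_final >= area_initial)
```

Because the area scales as $M^{2}$, the final area can exceed the sum of the initial areas even though the final mass is smaller than the total initial mass.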

A more robust test can be performed by comparing the measured change in area to the expected change in area [2]

H=\frac{A_{f,\rm measured}-A_{i}}{A_{f,\rm expected}-A_{i}}.

(8)

Here, $A_{f,\rm measured}$ is the area of the final black hole inferred from the post-merger analysis and $A_{i}$ is the sum of the initial areas inferred from the pre-merger analysis. The expected area of the final black hole $A_{f,\rm expected}$ is derived from the initial masses and spins measured from the pre-merger analysis. These are converted into an estimated final mass and spin using fits from numerical relativity, which is then converted into an area via Eq. (7).

Since $A_{f,\rm expected}$ is derived from numerical relativity fits to the initial masses and spins, the denominator of Eq. (8) is positive. Therefore, if $H<0$ , then $A_{f,\rm measured}<A_{i}$ and the area theorem is violated. The confidence in favor of the area theorem is the fraction of the posterior for which $H>0$ . More stringently, $H=1$ indicates that the signal is consistent with GR specifically, not just the more generic class of theories that satisfy the area increase law.
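Given posterior samples of the three areas, Eq. (8) and the associated confidence can be computed as in the following Python sketch. The function names are hypothetical, the expected final area is taken as an input rather than computed (no particular numerical relativity fit is assumed here), and the use of the 5th and 95th percentiles for the 90% interval is an assumption about how the interval is defined. The sample values are made up purely to exercise the code.

```python
import numpy as np


def area_ratio(area_initial, area_final_measured, area_final_expected):
    """H from Eq. (8), evaluated sample by sample."""
    a_i = np.asarray(area_initial, dtype=float)
    a_meas = np.asarray(area_final_measured, dtype=float)
    a_exp = np.asarray(area_final_expected, dtype=float)
    return (a_meas - a_i) / (a_exp - a_i)


def summarize(h_samples):
    """Median, 5th and 95th percentiles, and the fraction of samples with H > 0."""
    h = np.asarray(h_samples, dtype=float)
    median = np.median(h)
    low, high = np.percentile(h, [5.0, 95.0])
    prob_positive = np.mean(h > 0)
    return median, low, high, prob_positive


# Made-up samples (assumption): four draws with a common initial area.
a_i = np.full(4, 100.0)
a_exp = np.full(4, 160.0)
a_meas = np.array([40.0, 130.0, 160.0, 190.0])

h = area_ratio(a_i, a_meas, a_exp)
median, low, high, prob_positive = summarize(h)
print(h, median, prob_positive)
```

In a real analysis each array would hold one entry per posterior sample, with the pre- and post-merger quantities taken from the same joint sample so that correlations through the shared parameters are preserved.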

Figure 4 shows the posteriors of the final mass $M_{f}$ and final spin $\chi_{f}$ of the QNM post-merger models on real data. Specifically, the $M_{f}$ and $\chi_{f}$ posteriors from the analysis with variable sky/ $t_{c}$ parameters are compared with the corresponding posteriors from the fixed-gate analysis. All posteriors contain within them the distribution from a full IMR analysis from 4-OGC [19]. Furthermore, both posteriors from the variable sky/ $t_{c}$ run overlap almost completely with their corresponding fixed parameter posteriors.

A similar plot is shown in Fig. 5, except with the analyses that used an IMR post-merger model on real data. Again, the pre-merger and post-merger posteriors from the analysis with variable sky/ $t_{c}$ parameters are compared to their fixed counterparts as well as the full IMR posterior from [19]. Notably, while still in very close agreement, the variable sky/ $t_{c}$ posteriors have a greater discrepancy from their fixed counterparts than seen in Fig. 4. Specifically, the post-merger posterior tends towards slightly higher masses and spins, while the pre-merger posterior includes lower spin values.

Directly comparing the pre-merger posteriors in Figs. 4 and 5 shows a slight discrepancy in final mass and spin estimates. Namely, the analysis with a QNM post-merger model tends towards higher final mass and spin estimates than the IMR post-merger model. This indicates minor coupling between the pre- and post-merger models caused by their shared sky and time parameters. However, as made evident by these two plots, this is not a major effect, and both pre-merger models maintain strong consistency with the full IMR posterior.

Figure 6 compares the $H$ posteriors between the four analyses of the real GW150914 data. All posteriors agree with the expected value $H=1$ , with each distribution peaking near this value. While the QNM post-merger posteriors are almost in exact agreement with each other, there is a slight discrepancy between the IMR post-merger posteriors. Specifically, the fixed sky/ $t_{c}$ posterior has a much sharper peak at a lower value than the variable gate counterpart. This may be explained by Fig. 5, where the fixed IMR post-merger posterior was observed to contain lower masses and spins than the variable gate model. This could lead to the slight bias to lower $H$ values seen here.

Table 3 summarizes the $H$ credible intervals and area theorem confidence intervals for all eight analyses. All credible intervals are in agreement with the expected value $H=1$ . Additionally, all analyses of the real GW150914 signal have $H>0$ at greater than 95% confidence, indicating very high agreement with the area theorem. Results from the analysis of the simulated signal are largely consistent with these results (see Appendix C for equivalent figures).

The area theorem confidence interval of any given analysis appears to be uncorrelated to whether or not the sky location and coalescence time were allowed to vary. However, IMR post-merger models tended to have slightly higher agreement with the area theorem than corresponding QNM post-merger models. This pattern may be explained by trends in the pre- and post-merger mass and spin results. The IMR post-merger models in Fig. 5 had post-merger measurements that were either in agreement with or greater than the corresponding pre-merger estimates. This corresponds to a very high concentration of $H$ measurements approximating or exceeding 1, with very few points skewed to negative $H$ values. Conversely, the post-merger posteriors shown in Fig. 4 have significant amounts of points with higher and lower mass and spin values than their pre-merger counterparts. This leads to a wider distribution of $H$ values to both higher and lower values, allowing for a higher concentration of negative measurements.

GW150914 analysis	Credible interval	$P(H>0)$
Variable sky/ $t_{c}$ , QNM post-merger, real waveform	$1.0^{+1.4}_{-1.0}$	95.4%
Variable sky/ $t_{c}$ , IMR post-merger, real waveform	$1.1^{+0.6}_{-0.6}$	99.5%
Fixed sky/ $t_{c}$ , QNM post-merger, real waveform	$1.2^{+1.2}_{-1.0}$	97.4%
Fixed sky/ $t_{c}$ , IMR post-merger, real waveform	$1.0^{+0.6}_{-0.6}$	98.4%
Variable sky/ $t_{c}$ , QNM post-merger, zero-noise injection	$0.9^{+1.5}_{-1.3}$	89.1%
Variable sky/ $t_{c}$ , IMR post-merger, zero-noise injection	$0.9^{+0.6}_{-0.6}$	98.4%
Fixed sky/ $t_{c}$ , QNM post-merger, zero-noise injection	$1.1^{+0.9}_{-0.9}$	97.3%
Fixed sky/ $t_{c}$ , IMR post-merger, zero-noise injection	$1.0^{+0.5}_{-0.5}$	98.8%

Table 3: Summary of posteriors for $H=(A_{f,{\rm measured}}-A_{i})/(A_{f,{\rm expected}}-A_{i})$ from various GW150914 analyses. The first column lists the properties of each model, namely whether or not sky location and $t_{c}$ were allowed to vary, the post-merger model used, and the waveform data used. The approximant IMRPhenomXPHM [22] was used to model the post-merger signals of analyses labeled “IMR post-merger,” as were all pre-merger signals. Analyses with injected waveforms used a zero-noise injection of the GW150914 waveform as input data. The second column lists the median and the 90% credible interval for each $H$ posterior. The third column lists the percentage of posterior points for which $H>0$, thereby agreeing with the area theorem.
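As an illustration of how the quantities in Table 3 can be obtained from posterior samples, the following minimal sketch computes $H$, its median with a 90% credible interval, and the fraction of samples with $H>0$. It assumes that the initial area $A_{i}$ is the sum of the two Kerr horizon areas and that the expected remnant mass and spin are supplied by the caller (for example from numerical relativity fits). The function names and the Gaussian toy samples are hypothetical and are not the posteriors analyzed in this paper.

```python
import numpy as np


def kerr_area(mass, spin):
    """Horizon area of a Kerr black hole in geometric units (G = c = 1)."""
    return 8.0 * np.pi * mass**2 * (1.0 + np.sqrt(1.0 - spin**2))


def area_ratio_h(m1, chi1, m2, chi2, mf, chif, mf_exp, chif_exp):
    """H = (A_f,measured - A_i) / (A_f,expected - A_i), sample by sample."""
    a_initial = kerr_area(m1, chi1) + kerr_area(m2, chi2)
    a_final = kerr_area(mf, chif)
    a_final_expected = kerr_area(mf_exp, chif_exp)
    return (a_final - a_initial) / (a_final_expected - a_initial)


def summarize_h(h):
    """Median, lower/upper widths of the 90% credible interval, and P(H > 0)."""
    low, median, high = np.percentile(h, [5, 50, 95])
    return median, median - low, high - median, np.mean(h > 0)


# Toy samples for illustration only (hypothetical values, not real posteriors).
rng = np.random.default_rng(42)
n_samples = 5000
m1 = rng.normal(36.0, 2.0, n_samples)
m2 = rng.normal(30.0, 2.0, n_samples)
chi1 = rng.uniform(0.0, 0.3, n_samples)
chi2 = rng.uniform(0.0, 0.3, n_samples)
mf = rng.normal(63.0, 4.0, n_samples)
chif = np.clip(rng.normal(0.68, 0.05, n_samples), 0.0, 0.99)

h_samples = area_ratio_h(m1, chi1, m2, chi2, mf, chif, 63.0, 0.68)
median, minus, plus, p_positive = summarize_h(h_samples)
print(f"H = {median:.2f} +{plus:.2f} -{minus:.2f}, P(H>0) = {100 * p_positive:.1f}%")
```

Because $H$ is a ratio of area differences, the choice of mass units cancels as long as all masses use the same units.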

VI Conclusions

This paper presented a novel method of marginalizing over sky location and coalescence time when performing a pre- or post-merger analysis, which allows for a full accounting of uncertainties in parameter estimates. Using this method, tests of the area theorem were conducted on data from GW150914. It was found that the data for this event agrees very well with the area theorem regardless of whether sky position and $t_{c}$ were allowed to vary. The only noticeable changes in agreement with the area theorem were caused by the post-merger approximant used, though in general this was a very minor change.

While we focus on tests of the area theorem here, our method for marginalizing over sky location and $t_{c}$ applies to any analysis in which only a portion of the signal is modelled. In particular, black hole spectroscopy involves analyzing the post-merger signal using a QNM template in order to perform a test of the no-hair theorem [24, 25, 26]. As with previous tests of the area theorem, previous black hole spectroscopy studies have fixed the sky location and $t_{c}$ for the reasons discussed in Sec. II [23, 27, 28]. Our finding that a signal model is needed for the entire observable signal applies equally well to black hole spectroscopy, even though the pre-merger signal is purposely excluded in such analyses. The hierarchical method we develop in Sec. III is therefore equally relevant for marginalizing over sky location and $t_{c}$ when doing black hole spectroscopy. In Ref. [29] we use our QNM analysis of GW150914 to investigate the evidence for the presence of the $(2,2,1)$ mode, which has been hotly contested in the literature [23, 27, 30, 5].

In all our analyses here we end the pre-merger analysis when the post-merger analysis begins. This is necessitated by our finding that some model must exist for the entire observable signal when the gate time is allowed to vary. With no gap between our pre- and post-merger templates, we obtained agreement with the area theorem in excess of $95\%$ . A more rigorous test of the area theorem is to excise the merger from the data, since any biases introduced by including the merger as part of the pre-merger model would be omitted. Refs. [4, 2] did this additional test with fixed sky location and $t_{c}$ . However, this is not possible when marginalizing over sky location and $t_{c}$ for the reasons highlighted in Sec. III. Introducing a gap between the pre- and post-merger will once again favor points that excise as much of the signal as possible, even when both the pre- and post-merger are modeled simultaneously.

Introducing a gap between the pre- and post-merger models could be achieved by using three sub-domains rather than two: one each for the inspiral, merger, and ringdown. The gate for one domain would start/end at the end/start of the next. For the merger domain, an arbitrary signal model using wavelets could be used, similar to what Finch and Moore used in Ref. [10]. This would ensure that pre- and post-merger parameters are fully unbiased by the merger while ensuring that the merger itself is not arbitrarily gated out. The initial black hole areas could then be estimated using an inspiral model and the final area using a QNM model. This would also be useful in black hole spectroscopy studies involving fundamental angular QNMs. These modes are not expected to become relevant until $\sim 10M$ after merger, necessitating a gap between the merger and the start of the QNM model. We plan to investigate this in a future study.

This research was conducted using PyCBC [31]. Our data is available at [32].

VII Acknowledgements

A.C. was supported by funds from the Massachusetts Space Grant Consortium. C.C. acknowledges support from NSF award PHY-2309356. All computations were performed on Unity, a collaborative, multi-institutional high-performance computing cluster managed by UMass Amherst Research Computing and Data.

This research has made use of data or software obtained from the Gravitational Wave Open Science Center (gwosc.org), a service of the LIGO Scientific Collaboration, the Virgo Collaboration, and KAGRA. This material is based upon work supported by NSF’s LIGO Laboratory which is a major facility fully funded by the National Science Foundation, as well as the Science and Technology Facilities Council (STFC) of the United Kingdom, the Max-Planck-Society (MPS), and the State of Niedersachsen/Germany for support of the construction of Advanced LIGO and construction and operation of the GEO600 detector. Additional support for Advanced LIGO was provided by the Australian Research Council. Virgo is funded, through the European Gravitational Observatory (EGO), by the French Centre National de Recherche Scientifique (CNRS), the Italian Istituto Nazionale di Fisica Nucleare (INFN) and the Dutch Nikhef, with contributions by institutions from Belgium, Germany, Greece, Hungary, Ireland, Japan, Monaco, Poland, Portugal, Spain. KAGRA is supported by Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan Society for the Promotion of Science (JSPS) in Japan; National Research Foundation (NRF) and Ministry of Science and ICT (MSIT) in Korea; Academia Sinica (AS) and National Science and Technology Council (NSTC) in Taiwan.

Appendix A Likelihood Function Derivation

To calculate the posterior from Bayes’ theorem (Eq. (1)), one requires a model for both the signal $h$ and the noise $n$ . While $h$ can be determined from the wave solution of the linearized Einstein field equations, $n$ requires more statistical considerations.

To start, consider a network of $K$ gravitational wave detectors that each sample at a time interval $\Delta t$ over a total time length $T$ . Defining the total number of samples $N=T/\Delta t$ , the full data series $\bm{s}_{net}$ can be expressed as a set of $K$ $N$ -dimensional vectors $\bm{s}_{d}$ (one per detector $d$ ) such that $\bm{s}_{d}=\{s_{d,0},s_{d,1},...,s_{d,N-1}\}$ and $\bm{s}_{net}=\{\bm{s}_{1},\bm{s}_{2},...,\bm{s}_{K}\}$ . To simplify calculations, assume that the signal is zero such that $\bm{s}_{net}=\bm{n}$ . Additionally, assume that the noise is a Gaussian stochastic process that is uncorrelated between detectors. Under these assumptions, the noise likelihood function is:

p(\bm{s}_{net}|n)=\frac{\exp{[-\frac{1}{2}\sum_{d=1}^{K}\bm{s}^{T}_{d}\bm{\Sigma}^{-1}_{d}\bm{s}_{d}]}}{[(2\pi)^{NK}\prod_{d=1}^{K}\det\bm{\Sigma}_{d}]^{1/2}},

(9)

where $\bm{\Sigma}_{d}$ is the covariance matrix of the noise model for detector $d$ , defined using the ensemble average:

\bm{\Sigma}_{d}[j,k]=\langle\bm{s}_{d}[j]\bm{s}_{d}[k]\rangle.

(10)

(By dropping the $d$ subscripts, one obtains an equivalent expression for the full covariance matrix and data. We do so in the following steps for brevity.) This is the exact expression of the likelihood function for noise. However, this function is infeasible to calculate analytically due to the inverse covariance matrix in the numerator.

To make the calculation tractable, one may expand the covariance matrix definition in Eq. (10) as follows, defining $\Delta_{kj}=k-j$ :

$\displaystyle\bm{\Sigma}[j,k]$	$\displaystyle=\langle\bm{s}[j]\bm{s}[k]\rangle$
	$\displaystyle=\langle\bm{s}[j]\bm{s}[\Delta_{kj}+j]\rangle$
	$\displaystyle=\lim_{n\to\infty}\frac{1}{n}\sum^{n-1}_{l=0}s^{l}[j]s^{l}[\Delta_{kj}+j],$	(11)

where in the last step the ensemble average is written out fully.

In general, this expression is dependent on time $t_{j}=j\Delta t$ and displacement $\tau_{kj}=\Delta_{kj}\Delta t$ . However, one can make the assumption that the noise is wide-sense stationary, meaning that the mean is constant in time and the covariance between two samples depends only on their separation $\Delta_{kj}$ . Under this assumption, any constant can be added to the indices in Eq. (10) to obtain the same result. This makes $\bm{\Sigma}$ symmetric (since the factors in Eq. (11) commute) and Toeplitz (since the elements along the diagonals are equal) [33]. Additionally, since $\bm{\Sigma}[0,\Delta_{kj}]=\bm{\Sigma}[-\Delta_{kj},0]=\bm{\Sigma}[0,-\Delta_{kj}]$ , the elements of $\bm{\Sigma}$ are even functions of $\Delta_{kj}$ .

Additionally, one can assume that the data is ergodic, meaning that averaging over time within a single realization of $\bm{s}$ is equivalent to averaging over an ensemble of realizations. Under this assumption and the properties of the elements of $\bm{\Sigma}$ , the ensemble averages in Eq. (11) can be replaced with time averages:

$\displaystyle\bm{\Sigma}[j,k]$	$\displaystyle=\lim_{n\to\infty}\frac{1}{n}\sum^{n-1}_{l=0}s^{l}[0]s^{l}[\Delta_{kj}]$
	$\displaystyle=\lim_{n\to\infty}\frac{1}{n}\sum^{n-1}_{l=0}s^{l}[l]s^{l}[\Delta_{kj}+l]$
	$\displaystyle=\lim_{n\to\infty}\frac{1}{2n}\sum^{n-1}_{l=-n}s^{l}[l]s^{l}[\Delta_{kj}+l]$
	$\displaystyle=\frac{1}{2}R_{ss}((k-j)\Delta t).$	(12)

The last step defines the autocorrelation function $R_{ss}(\tau)$ , which describes the correlation between points in the time series $\bm{s}$ . If $R_{ss}(\tau)$ goes to zero in some finite time $\tau_{max}$ , then all diagonals with $|\Delta_{kj}|>\lfloor\tau_{max}/\Delta t\rfloor=\Delta_{max}$ will equal zero. This is similar to the form of a circulant matrix $\bm{C}$ , a special case of a Toeplitz matrix where each row is a right-cycle permutation of the same vector (in this case, $\bm{s}$ ). The eigenvectors of circulant matrices are well-known [33]:

\bm{u}_{p}[k]=\frac{1}{\sqrt{N}}\exp{(-2\pi ikp/N)}

(13)

This generally is not true for Toeplitz matrices, but one may take advantage of Eq. (13) by recognizing that the matrix described by Eq. (12) asymptotes to a circulant matrix for large $N$ :

\lim_{N\to\infty}|\bm{\Sigma}-\bm{C}|=\bm{0}

(14)
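The convergence in Eq. (14) can be checked numerically. The sketch below builds a banded Toeplitz matrix and its circulant counterpart, then evaluates the normalized Frobenius ("weak") norm of their difference for increasing $N$. The geometric autocorrelation, its decay constant, and the cutoff lag are toy assumptions chosen for illustration; they are not a model of detector noise, and the choice of the weak norm is likewise an assumption about how to read $|\cdot|$ in Eq. (14).

```python
import numpy as np
from scipy.linalg import circulant, toeplitz


def banded_autocorr(n, rho=0.8, lag_max=20):
    """Toy autocorrelation: rho**lag up to lag_max, exactly zero beyond it."""
    r = np.zeros(n)
    lags = np.arange(lag_max + 1)
    r[: lag_max + 1] = rho**lags
    return r


def weak_norm_difference(n):
    """sqrt(||Sigma - C||_F^2 / n) for the Toeplitz/circulant pair of size n."""
    r = banded_autocorr(n)
    sigma = toeplitz(r)  # symmetric Toeplitz covariance
    k = np.arange(n)
    c = r[np.minimum(k, n - k)]  # wrap the lags around to make it circulant
    circ = circulant(c)
    return np.linalg.norm(sigma - circ, "fro") / np.sqrt(n)


sizes = (64, 256, 1024)
norms = [weak_norm_difference(n) for n in sizes]
for n, value in zip(sizes, norms):
    print(f"N = {n:5d}: weak norm of (Sigma - C) = {value:.4f}")
```

The two matrices differ only in their corners, and the number of differing entries is set by the cutoff lag rather than by $N$, so the normalized difference shrinks as $N$ grows.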

Therefore, the eigenvalues $\lambda_{p}$ of $\bm{\Sigma}$ can be evaluated using the usual eigenvalue equation as long as $\Delta_{max}\ll N/2$ :

\bm{\Sigma}\bm{u}_{p}\approx\lambda_{p}\bm{u}_{p}

(15)

Therefore, using the fact that $\bm{\Sigma}$ is symmetric and $R_{ss}(l)$ is even:

$\displaystyle\lambda_{p}$	$\displaystyle=\frac{1}{2}Re\bigg{\{}\sum^{N/2}_{l=-N/2}R_{ss}(l)\exp(-2\pi ipl/N)\bigg{\}}$
	$\displaystyle=\frac{1}{2}Re\bigg{\{}\sum^{N-1}_{l=0}R_{ss}(l)\exp(-2\pi ipl/N)\bigg{\}}$
	$\displaystyle=\frac{1}{2}Re\{\tilde{R}_{ss}(p)/\Delta t\},$	(16)

where $\tilde{R}_{ss}(p)$ is the discrete Fourier transform of the autocorrelation function:

\tilde{R}_{ss}(p)=\Delta t\sum^{N-1}_{k=0}{R}_{ss}(k)\exp(-2\pi ipk/N)

(17)
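For an exactly circulant matrix, the statements in Eqs. (13), (15), and (16) hold without approximation, which is easy to verify numerically. The following sketch uses a toy symmetric first column (an assumption for illustration), sets $\Delta t=1$, and absorbs the factor of $1/2$ into that column, so the eigenvalues are simply the real part of its discrete Fourier transform.

```python
import numpy as np
from scipy.linalg import circulant

n = 128
k = np.arange(n)

# Toy symmetric first column: c[k] == c[n - k], as for an even autocorrelation.
c = 0.8 ** np.minimum(k, n - k)
C = circulant(c)

# Eigenvalues as the DFT of the first column (real because c is symmetric).
lam = np.fft.fft(c).real

# Columns of U are the eigenvectors of Eq. (13): U[k, p] = exp(-2 pi i k p / n) / sqrt(n).
U = np.exp(-2j * np.pi * np.outer(k, k) / n) / np.sqrt(n)

# Largest entry of C u_p - lambda_p u_p over all p.
residual = np.abs(C @ U - U * lam).max()
print(f"max |C u_p - lambda_p u_p| = {residual:.2e}")
```

The residual is at the level of floating-point round-off, whereas for the Toeplitz matrix $\bm{\Sigma}$ the same relation holds only approximately.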

To simplify Eq. (16), one may invoke the Wiener-Khinchin Theorem [34], which defines the power spectral density $S_{n}$ as the Fourier transform of $R_{ss}$ for a wide-sense stationary stochastic process:

\lambda_{p}=\frac{S_{n}[p]}{2\Delta t}

(18)

One can then construct the inverse of $\bm{\Sigma}$ using the eigenvalues from Eq. (18) and the eigenvectors of Eq. (13):

	$\displaystyle\bm{\Sigma}^{-1}[j,k]$	$\displaystyle=\frac{2\Delta t}{N}\sum^{N-1}_{p=0}\frac{\exp[-2\pi ip(j-k)/N]}{S_{n}(p)}$
		$\displaystyle={2\Delta f(\Delta t)^{2}}\sum^{N/2-1}_{p=0}\frac{e^{-2\pi ip(j-k)/N}+e^{2\pi ip(j-k)/N}}{S_{n}(p)},$		(19)

using the fact that $S_{n}$ is symmetric about $N/2$ and defining the frequency resolution $\Delta f=1/T=1/(N\Delta t)$ . Since $\bm{s}$ is real and

(\Delta t)^{2}\sum^{N-1}_{j,k=0}\bm{s}[j]\bm{s}[k](e^{-2\pi ip(j-k)/N}+e^{2\pi ip(j-k)/N})=2|\tilde{\bm{s}}|^{2}[p],

(20)

one can write:

\bm{s}^{T}\bm{\Sigma}^{-1}\bm{s}=2\Delta f\sum^{N/2-1}_{p=0}\frac{2|\tilde{\bm{s}}|^{2}[p]}{S_{n}(p)}=4\Delta f\sum^{N/2-1}_{p=0}\frac{|\tilde{\bm{s}}|^{2}[p]}{S_{n}(p)}.

(21)

Finally, if the inner product between two arbitrary vectors $\bm{a}$ and $\bm{b}$ is defined as:

\langle\bm{a},\bm{b}\rangle=4Re\bigg{\{}\Delta f\sum^{N/2-1}_{p=0}\frac{\tilde{\bm{a}}^{*}[p]\tilde{\bm{b}}[p]}{S_{n}(p)}\bigg{\}},

(22)

Eq. (21) can be written as an inner product:

\bm{s}^{T}\bm{\Sigma}^{-1}\bm{s}=\langle\bm{s},\bm{s}\rangle,

(23)

and the likelihood function can be expressed as:

p(\bm{s}_{net}|n)=\frac{\exp{[-\frac{1}{2}\sum_{d=1}^{K}\langle\bm{s}_{d},\bm{s}_{d}\rangle]}}{[(2\pi)^{NK}\prod_{d=1}^{K}\det\bm{\Sigma}_{d}]^{1/2}}

(24)
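The chain of steps leading to Eq. (23) can be sanity-checked with a small numerical experiment. The sketch below assumes an exactly circulant toy covariance, with an exponentially decaying first column chosen purely for illustration, and white-noise data. It compares the time-domain quadratic form $\bm{s}^{T}\bm{\Sigma}^{-1}\bm{s}$ with its frequency-domain counterparts, using $S_{n}[p]=2\Delta t\lambda_{p}$ from Eq. (18) and $\tilde{\bm{s}}=\Delta t\times$ DFT of $\bm{s}$.

```python
import numpy as np
from scipy.linalg import circulant

rng = np.random.default_rng(0)
n = 1024
dt = 1.0 / 2048  # sample interval in seconds
df = 1.0 / (n * dt)  # frequency resolution
k = np.arange(n)

# Toy circulant covariance (assumption: exponentially decaying correlation).
c = np.exp(-np.minimum(k, n - k) / 5.0)
sigma = circulant(c)
s = rng.standard_normal(n)  # toy data vector

# Time domain: s^T Sigma^{-1} s via a direct linear solve.
quad_time = s @ np.linalg.solve(sigma, s)

# Frequency domain: eigenvalues, PSD from Eq. (18), and the scaled DFT of s.
lam = np.fft.fft(c).real
psd = 2.0 * dt * lam
s_tilde = dt * np.fft.fft(s)

# Sum over all N frequency bins: exact for a circulant covariance.
quad_two_sided = 2.0 * df * np.sum(np.abs(s_tilde) ** 2 / psd)

# One-sided sum over p = 0 .. N/2 - 1, i.e. <s, s> as in Eq. (22).
half = slice(0, n // 2)
quad_one_sided = 4.0 * df * np.sum(np.abs(s_tilde[half]) ** 2 / psd[half])

print(quad_time, quad_two_sided, quad_one_sided)
```

The two-sided sum reproduces the time-domain value to round-off. The one-sided sum differs slightly because folding the spectrum about $N/2$ counts the zero-frequency bin twice and omits the Nyquist bin, an effect that becomes negligible as $N$ grows.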

Appendix B Gating and In-Painting

Generally, BBH models do not require the entire waveform to be analyzed at once. For example, the analyses throughout this paper independently examine the pre- and post-merger portions of the GW150914 waveform. To maintain independence between the models, any points not corresponding to the respective model (i.e. after $t_{c}$ for the pre-merger model, or before $t_{c}$ for the post-merger model) were excised, or “gated,” from the data. Here, a gate of length $M$ applied starting at a sample $a$ will excise all samples within the range $[a,a+M]$ , corresponding to a gate of time length $t=M\Delta t$ . The gated time series will therefore take the form $\bm{s}_{d,tr}=\{s_{0},s_{1},...s_{a},s_{a+M},...s_{N-1},s_{N}\}$ .

By doing this, the simplifications made to derive the likelihood function are no longer valid, since $\bm{\Sigma}_{d,tr}$ is no longer Toeplitz (and, subsequently, no longer approximately circulant for large $N$ ). There are numerical methods to calculate the matrix inverse directly, but in general they can be unstable and time-intensive. Therefore, a method known as “gating and in-painting” is employed for gated waveform analyses [12].

First, the method assumes that the noise time series $\bm{n}$ is the sum of $\bm{n}_{g}$ , the noise series with the gated times zeroed out, and $\bm{x}$ , a time series containing only the gated samples in $\bm{n}$ . The goal of in-painting is to solve the following equation in the gated region:

\bm{\Sigma}^{-1}(\bm{n}_{g}+\bm{x})=\bm{0}

(25)

If the nonzero elements of $\bm{x}$ are such that $(\bm{\Sigma}^{-1}(\bm{n}_{g}+\bm{x}))[k]=0$ for all samples $k$ in the gate $[a,a+M]$ , then the inner product $(\bm{n}_{g}+\bm{x})^{T}\bm{\Sigma}^{-1}(\bm{n}_{g}+\bm{x})$ of the in-painted data will equal the inner product of the truncated data set computed with the truncated covariance matrix. Since $\bm{x}$ is zero outside of the gate, evaluating $\bm{\Sigma}^{-1}\bm{x}$ within the gate involves only the $M\times M$ Toeplitz submatrix formed by the $[a,a+M]$ rows and columns of $\bm{\Sigma}^{-1}$ . Therefore, Eq. (25) can be rewritten within the gated region as

\bm{\Sigma}^{-1}\bm{x}=-\bm{\Sigma}^{-1}\bm{n}_{g},

(26)

and adding $\bm{x}$ to $\bm{n}_{g}$ will give the same result as truncating $\bm{n}$ and $\bm{\Sigma}$ . Unlike trying to solve for the inverse directly, this solution is readily found using a Toeplitz solver [18, 35]. Given gated data $\bm{s}_{g}$ containing some gated signal $\bm{h}_{g}$ , Eq. (26) can be evaluated using $\bm{n}_{g}=\bm{s}_{g}-\bm{h}_{g}$ , and the value $\bm{x}+\bm{s}_{g}-\bm{h}_{g}$ can be used to calculate the likelihood.
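The in-painting step can be illustrated on a toy problem. The sketch below is a minimal example under stated assumptions, not the PyCBC implementation: the covariance is exactly circulant with a toy exponentially decaying first column, the data are white-noise samples, and the gate is taken to be the $M$ samples $a,\ldots,a+M-1$. It solves Eq. (26) inside the gate with `scipy.linalg.solve_toeplitz` and compares the resulting inner product with the one obtained by explicitly truncating the data and the covariance matrix.

```python
import numpy as np
from scipy.linalg import circulant, solve_toeplitz

rng = np.random.default_rng(1)
n, a, m = 256, 100, 32  # series length, gate start, gate length (toy values)
k = np.arange(n)

# Toy circulant covariance and the first column of its (circulant) inverse.
c = np.exp(-np.minimum(k, n - k) / 5.0)
sigma = circulant(c)
inv_col = np.fft.ifft(1.0 / np.fft.fft(c).real).real
sigma_inv = circulant(inv_col)

noise = rng.standard_normal(n)  # toy data vector
gate = slice(a, a + m)
n_g = noise.copy()
n_g[gate] = 0.0  # gated series: samples inside the gate zeroed out

# Solve the gate rows of Sigma^{-1} x = -Sigma^{-1} n_g. Restricted to the gate,
# Sigma^{-1} is an m x m symmetric Toeplitz block with first column inv_col[:m].
rhs = -(sigma_inv @ n_g)[gate]
x = np.zeros(n)
x[gate] = solve_toeplitz(inv_col[:m], rhs)
inpainted = n_g + x

# Inner product of the in-painted series using the full inverse covariance.
quad_inpainted = inpainted @ sigma_inv @ inpainted

# Reference: drop the gated samples and the matching rows/columns of Sigma.
keep = np.ones(n, dtype=bool)
keep[gate] = False
sigma_tr = sigma[np.ix_(keep, keep)]
quad_truncated = noise[keep] @ np.linalg.solve(sigma_tr, noise[keep])

print(quad_inpainted, quad_truncated)
```

The agreement between the two values follows from the Schur complement identity for the inverse of a block matrix, which is why in-painting reproduces the truncated likelihood without inverting the non-Toeplitz truncated covariance.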

The analyses conducted in this paper utilize gating and in-painting to apply a gate starting/ending at $t_{c}$ to the waveform template. In theory, the gates should extend to the edges of the analysis segment (i.e., the pre-merger gate starts at the segment start time, and the post-merger gate ends at segment end). However, the in-painting algorithm, which is the dominant cost in our analysis, is $\sim\!O(M^{2})$ operations for a gate of $M$ samples. Although the observable signal is only $\sim 0.2\,$ s long [13], we use an analysis segment that is $4\,$ s in duration in order to resolve line artifacts in the PSD. We also use a sample rate of $2048\,$ Hz, in order to fully capture all observable signal power. The in-painting would therefore need to span $M\sim 4096$ samples if the gates were to extend to the beginning/end of the analysis segment. This is computationally expensive: some of our analyses would take $\sim 1$ month to complete (utilizing 64 CPU cores).

To reduce the computational cost, we instead use a 1-second gate starting (ending) at $t_{c}$ and ensure that the remainder of the pre-merger (post-merger) template $\bm{h}$ after (before) the gate is zero. This is equivalent to applying a full gate to the template when a whitening filter is applied, since the early- and late-time template will be identically zero in both cases.
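As a rough worked example of the savings, the sample counts quoted above can be combined with the $\sim\!O(M^{2})$ scaling. The sketch assumes that $t_{c}$ lies near the middle of the analysis segment (which reproduces the quoted $M\sim 4096$) and that the in-painting cost scales purely as $M^{2}$; the resulting factor is an order-of-magnitude estimate under those assumptions, not a measured speed-up.

```python
sample_rate = 2048  # Hz
segment_duration = 4.0  # seconds
gate_duration = 1.0  # seconds

segment_samples = int(sample_rate * segment_duration)

# Assumption: t_c sits near the middle of the segment, so a gate reaching the
# segment edge spans about half of the samples.
full_gate_samples = segment_samples // 2
short_gate_samples = int(sample_rate * gate_duration)

# Assumption: cost scales purely as M**2.
cost_ratio = (full_gate_samples / short_gate_samples) ** 2

print(f"segment: {segment_samples} samples")
print(f"full gate: {full_gate_samples} samples, 1 s gate: {short_gate_samples} samples")
print(f"estimated in-painting cost ratio: {cost_ratio:.0f}x")
```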

We do not zero the data $\bm{s}$ outside of the gates, however, since doing so would introduce additional ringing artifacts at the beginning/end of the analysis segment not accounted for by the in-painting. This means that the pre- and post-merger models will share data outside of the gates (namely, greater than 1 second before and after $t_{c}$ ). However, the data is expected to be noise-dominated in this region, yielding an insignificant contribution to the likelihood. This can be seen in Appendix C, which portrays the remnant posteriors for analyses using zero-noise waveform injections. Since the data in these analyses contain no noise, the shared data has identically zero contribution to the likelihood. These results are in strong agreement with Figs. 4, 5, and 6, indicating that any effects due to early- and late-time shared noise are indeed negligible.

Appendix C Simulation results

This section contains additional plots showing results from the analysis of the GW150914-like simulated signal in zero noise. Figure 7 shows the $M_{f}$ and $\chi_{f}$ posteriors for the QNM post-merger models, and Fig. 8 shows the same for the IMR post-merger models. Figure 9 shows the $H$ posteriors for the zero-noise injection models.

Appendix D Sampler convergence

To test the convergence of the sampler we repeated the analyses with 2000 and 4000 live points. Figure 10 shows the $M_{f}$ and $\chi_{f}$ posteriors for QNM post-merger models on real data with 2000 and 4000 live points, while Fig. 11 depicts the same for the IMR post-merger models. Both the QNM and IMR models were able to converge to similar distributions regardless of the number of live points used. All sky/ $t_{c}$ results are reported using 4000 live points.

References

Bardeen et al. [1973] J. M. Bardeen, B. Carter, and S. W. Hawking, The four laws of black hole mechanics, Communications in Mathematical Physics 31, 161 (1973).
Kastha et al. [2022] S. Kastha, C. D. Capano, J. Westerweck, M. Cabero, B. Krishnan, and A. B. Nielsen, Model systematics in time domain tests of binary black hole evolution, Physical Review D 105, 10.1103/physrevd.105.064042 (2022).
Cabero et al. [2018] M. Cabero, C. D. Capano, O. Fischer-Birnholtz, B. Krishnan, A. B. Nielsen, A. H. Nitz, and C. M. Biwer, Observational tests of the black hole area increase law, Physical Review D 97, 10.1103/physrevd.97.124069 (2018).
Isi et al. [2021] M. Isi, W. M. Farr, M. Giesler, M. A. Scheel, and S. A. Teukolsky, Testing the black-hole area law with GW150914, Physical Review Letters 127, 10.1103/physrevlett.127.011103 (2021).
Wang et al. [2023] Y.-F. Wang, C. D. Capano, J. Abedi, S. Kastha, B. Krishnan, A. B. Nielsen, A. H. Nitz, and J. Westerweck, A frequency-domain perspective on GW150914 ringdown overtone, (2023), arXiv:2310.19645 [gr-qc] .
Carullo et al. [2019] G. Carullo, W. Del Pozzo, and J. Veitch, Observational black hole spectroscopy: A time-domain multimode analysis of GW150914, Physical Review D 99, 10.1103/physrevd.99.123029 (2019).
Carullo et al. [2023] G. Carullo, W. Del Pozzo, and J. Veitch, pyRing: a time-domain ringdown analysis python package, git.ligo.org/lscsoft/pyring (2023).
Isi and Farr [2021a] M. Isi and W. M. Farr, maxisi/ringdown: Initial ringdown release (2021a).
Isi and Farr [2021b] M. Isi and W. M. Farr, Analyzing black-hole ringdowns, (2021b), arXiv:2107.05609 [gr-qc] .
Finch and Moore [2022] E. Finch and C. J. Moore, Searching for a ringdown overtone in GW150914, Physical Review D 106, 10.1103/physrevd.106.043005 (2022).
Biwer et al. [2019] C. M. Biwer, C. D. Capano, S. De, M. Cabero, D. A. Brown, A. H. Nitz, and V. Raymond, PyCBC Inference: A Python-based parameter estimation toolkit for compact binary coalescence signals, Publications of the Astronomical Society of the Pacific 131, 024503 (2019).
Zackay et al. [2021] B. Zackay, T. Venumadhav, J. Roulet, L. Dai, and M. Zaldarriaga, Detecting gravitational waves in data with non-stationary and non-gaussian noise, Physical Review D 104, 10.1103/physrevd.104.063034 (2021).
Abbott et al. [2016] B. P. Abbott et al. (LIGO Scientific Collaboration and Virgo Collaboration), Observation of gravitational waves from a binary black hole merger, Physical Review Letters 116, 061102 (2016).
Sivia and Skilling [2006] D. Sivia and J. Skilling, Data analysis: a Bayesian tutorial (Oxford University Press, Oxford, 2006).
Finn [1992] L. S. Finn, Detection, measurement, and gravitational radiation, Physical Review D 46, 5236 (1992).
Pan et al. [1997] V. Pan, Y. Yu, and C. Stewart, Algebraic and numerical techniques for the computation of matrix determinants, Computers & Mathematics with Applications 34, 43 (1997).
Taylor [1996] J. R. Taylor, An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements, 2nd ed. (University Science Books, 1996).
et al. [2020a] P. Virtanen et al., SciPy 1.0: fundamental algorithms for scientific computing in Python, Nature Methods 17, 261 (2020a).
Nitz et al. [2022] A. H. Nitz, S. Kumar, Y. Wang, S. Kastha, S. Wu, M. Schäfer, R. Dhurkunde, and C. D. Capano, 4-OGC: Catalog of gravitational waves from compact-binary mergers (2022), arXiv:2112.06878 [astro-ph.HE] .
Speagle [2020] J. S. Speagle, dynesty: a dynamic nested sampling package for estimating Bayesian posteriors and evidences, Monthly Notices of the Royal Astronomical Society 493, 3132 (2020).
et al. [2021] R. Abbott et al., Open data from the first and second observing runs of Advanced LIGO and Advanced Virgo, SoftwareX 13, 10.1016/j.softx.2021.100658 (2021).
Pratten et al. [2021] G. Pratten, C. García-Quirós, M. Colleoni, A. Ramos-Buades, H. Estellés, M. Mateu-Lucena, R. Jaume, M. Haney, D. Keitel, J. E. Thompson, and S. Husa, Computationally efficient models for the dominant and subdominant harmonic modes of precessing binary black holes, Physical Review D 103, 10.1103/physrevd.103.104056 (2021).
Isi et al. [2019] M. Isi, M. Giesler, W. M. Farr, M. A. Scheel, and S. A. Teukolsky, Testing the no-hair theorem with GW150914, Physical Review Letters 123, 10.1103/physrevlett.123.111102 (2019).
Vishveshwara [1970] C. V. Vishveshwara, Scattering of Gravitational Radiation by a Schwarzschild Black-hole, Nature 227, 936 (1970).
Chandrasekhar and Detweiler [1975] S. Chandrasekhar and S. L. Detweiler, The quasi-normal modes of the Schwarzschild black hole, Proc. Roy. Soc. Lond. A 344, 441 (1975).
Dreyer et al. [2004] O. Dreyer, B. J. Kelly, B. Krishnan, L. S. Finn, D. Garrison, and R. Lopez-Aleman, Black hole spectroscopy: Testing general relativity through gravitational wave observations, Class. Quant. Grav. 21, 787 (2004), arXiv:gr-qc/0309007 .
Cotesta et al. [2022] R. Cotesta, G. Carullo, E. Berti, and V. Cardoso, Analysis of ringdown overtones in GW150914, Physical Review Letters 129, 10.1103/physrevlett.129.111102 (2022).
Abbott et al. [2021] R. Abbott et al. (LIGO Scientific, VIRGO, KAGRA), Tests of General Relativity with GWTC-3, (2021), arXiv:2112.06861 [gr-qc] .
Correia et al. [2023] A. Correia, Y.-F. Wang, and C. D. Capano, Low evidence for ringdown overtone in GW150914 when marginalizing over time and sky location uncertainty, (2023), arXiv:2312.14118 [gr-qc] .
Isi and Farr [2022] M. Isi and W. M. Farr, Revisiting the ringdown of GW150914 (2022), arXiv:2202.02941 [gr-qc] .
Nitz et al. [2024] A. Nitz, I. Harry, D. Brown, C. M. Biwer, J. Willis, T. D. Canton, C. Capano, T. Dent, L. Pekowsky, G. S. C. Davies, S. De, M. Cabero, S. Wu, A. R. Williamson, B. Machenschalk, D. Macleod, F. Pannarale, P. Kumar, S. Reyes, dfinstad, S. Kumar, M. Tápai, L. Singer, P. Kumar, veronica villa, maxtrevor, B. U. V. Gadre, S. Khan, S. Fairhurst, and A. Tolley, gwastro/pycbc: v2.3.3 release of pycbc (2024).
Correia and Capano [2024] A. Correia and C. Capano, Sky marginalization in black hole spectroscopy (2024).
Gray [2001] R. Gray, Toeplitz and circulant matrices: a review, Foundations and Trends in Communications and Information Theory 2 (2001).
Chatfield [1996] C. Chatfield, The analysis of time series: an introduction, 6th ed. (CRC Press, Florida, US, 1996).
et al. [2020b] C. R. Harris et al., Array programming with NumPy, Nature 585, 357 (2020b).