PSR J0437-4715: The Argentine Institute of Radioastronomy 2019–2020 Observational Campaign

, , , , , , , , , , , , , and

Published 2021 February 19 © 2021. The American Astronomical Society. All rights reserved.
, , Citation V. Sosa Fiscella et al 2021 ApJ 908 158 DOI 10.3847/1538-4357/abceb3

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/908/2/158

Abstract

We present the first-year data set of high-cadence, long-duration observations of the bright millisecond pulsar J0437−4715 obtained in the Argentine Institute of Radioastronomy (IAR). Using two single-dish 30 m radio antennas, we gather more than 700 hr of good-quality data with timing precision better than 1 μs. We characterize the white and red timing noise in IAR's observations, we quantify the effects of scintillation, and we perform single-pulsar searches of continuous gravitational waves, setting constraints in the nHz–μHz frequency range. We demonstrate IAR's potential for performing pulsar monitoring in the 1.4 GHz radio band for long periods of time with a daily cadence. In particular, we conclude that the ongoing observational campaign of the millisecond pulsar J0437−4715 can contribute to increase the sensitivity of the existing pulsar-timing arrays.

Export citation and abstract BibTeX RIS

1. Introduction

The Argentine Institute of Radioastronomy (IAR) is equipped with two single-dish 30 m antennas—dubbed A1 and A25 —capable of performing daily observations of pulsars in the southern hemisphere at 1.4 GHz. These antennas were recently refurbished to obtain high-quality timing observations as described in Gancio et al. (2020).

Pulsar Monitoring in Argentina6 is a scientific collaboration dedicated to pulsar observations from the southern hemisphere. As part of IAR's observatory developing stage, accurate timing observations of the millisecond pulsar (MSP) J0437−4715 with both antennas have been carried out since 2019 April 22, with a daily follow-up only interrupted during hardware upgrades or bad weather conditions.

The MSP J0437−4715 was discovered in 1993 by Johnston et al. (1993), and it is one of the brightest (mean flux density S1400 = 150.2 mJy) and closest (d = 156.79 ± 0.25 pc) pulsars. It has a short period (P = 5.758 ms) and it is one of the most massive pulsars known to date (m = 1.44 ± 0.07 M;  Reardon 2018). This pulsar is in a binary system and in an almost circular orbit of period 5.74 days. The secondary star is a low-mass (∼0.2 M), helium white dwarf, with strong visible emission (Danziger et al. 1993). In the interstellar region, an optical bow shock was also reported by Bell et al. (1993). In addition, J0437−4715 was the first MSP detected in X-rays (Becker & Trümper 1993) and the only one for which individual pulses have been studied. It is also the first one detected in the ultraviolet, although in this wavelength its spectrum is consistent with that of a blackbody (Lorimer & Kramer 2012) and pulsed emission was not observed (Kargaltsev et al. 2004).

Because of its proximity to Earth, J0437−4715 is one of the two pulsars with a well-determined three-dimensional orientation of the orbit (van Straten et al. 2001). In addition, its radio emission does not present much nulling, short-scale variation of its integrated profile, or mode-changing (Vivekanand et al. 1998), phenomena associated with longer-period pulsars. This suggests that the origin of the radiative processes in this pulsar is different from the mechanisms in regular pulsars. Moreover, J0437−4715 displays intrinsic and quasiperiodic variations in its flux, (not observed in other pulsars; Vivekanand et al. 1998), and extrinsic variations, due to interstellar medium scintillations (Osłowski et al. 2014).

J0437−4715 stands out for having an extremely stable rotation rate which makes it a natural clock with a similar stability to that of an atomic clock (Hartnett & Luiten 2011) and better over timescales longer than a year (Matsakis et al. 1997). Only two other pulsars, PSR B1855+09 and PSR B1937+21, have a comparable stability (Kaspi et al. 1994). These characteristics of J0437−4715 make it an ideal candidate for pulsar-timing studies. Its high declination in the southern hemisphere makes its observation from the northern hemisphere difficult to achieve (see Figure 2 of Ferdman et al. 2010). This MSP is also in the opposite direction to the Galactic center, where few pulsars are observed. For these reasons, performing daily observations of J0437−4715 is a key science project at IAR, improving upon the weekly to monthly cadence of other observatories in the International Pulsar Timing Array consortium (IPTA; Perera et al. 2019; Lam & Hazboun 2020). This is currently of particular interest as the NANOGrav collaboration is on the verge of detecting an isotropic stochastic gravitational wave background (GWB; Arzoumanian et al. 2020).

In this work, we use the properties of J0437−4715—high rotational stability, high luminosity, and short period—to assess the quality of the observations at IAR with both antennas. This builds upon the preliminary analysis presented in Gancio et al. (2020) which suggested they reach a precision of ≲1 μs.

The paper is organized as follows: Section 2 introduces the observations and the reduction methods. In Section 3 we describe the observations in terms of their signal-to-noise ratio (S/N) and its relation to interstellar scintillation. In Section 4 we present the timing results and study the influence of the S/N and bandwidth (BW) on the timing analysis; further details on this analysis are provided in the Appendix. In Section 5 we use the ENTERPRISE software to perform a noise analysis of the observations and estimate the contribution of a GWB. Finally, in Section 6 we present the main conclusions of our study.

2. Observations

As described in more detail in Gancio et al. (2020), the design of the antennas allows us to observe a source continuously over 220 minutes. Their receivers are not currently refrigerated and have a system temperature of Tsys ∼ 100 K. The back-end is based on two software-defined radios which acquire raw samples with a maximum rate of 56 MHz per board. A1 uses these two digital plates in consecutive radio frequencies with a total bandwidth of 112 MHz in a single polarization mode, while A2 uses those digital plates in one per polarization, thus covering a bandwidth of 56 MHz. Those characteristics are summarized in Table 1. In 2019 November, A1's receiver front-end went into commissioning. The electronics and systems were verified and improved, resulting in a slightly higher sensitivity and the recovery of the second polarization. Nonetheless, the observations were retaken with the previous configuration to have a homogeneous data set.

Table 1.  Parameters of the Observations Analyzed in This Work

  A1 A2
Number of observations 170 (145*) 197 (171*)
MJD start–MJD finish 58596.7–58999.6
Total observation time (hr) 391 (372*) 393 (381*)
Central frequency (MHz) 1400, 1415, 1428 1428
Bandwidth 112 MHz 56 MHz
Polarization modes 1 2
Frequency channels (nchan) 64/128 64
Time resolution (μs) 73.14
Phase bins (nbin) 512/1024

Note. Values marked with (*) correspond to the restricted data set used in Section 5 (observations lasting more than 40 minutes that achieve a S/N > 40 and σTOA < 1 μs).

Download table as:  ASCIITypeset image

In this work we present the analysis of a data set of 170 observations with A1 and 197 with A2 over an interval of 13 months, from 2019 April 23 to 2020 May 30. This includes days with multiple observations (89 days with two observations, 24 days with three, and one day with four). The observations add up to over 390 hr of observation with each antenna (Table 1), leading to an observation efficacy of 0.26 for both antennas. This efficacy is aimed to be improved in future considering that (i) A1 underwent maintenance between 2019 October 8 and November 29, (ii) an unusually loud source of local radio-frequency interference (RFI) was particularly active in 2019 June–July during morning time, affecting A1 more notably, (iii) during 2020 February the observations stopped due to tests in the new automated pointing software and scheduler, (iv) A2 had lost observing time due to problems with a hard disk.

The receptor in A1 became more sensitive to local RFI after its upgrade in 2019 December. We found that the program RFIClean7 (Maan et al. 2020) gave better results than the rfifind task in PRESTO to clean RFI. We therefore ran both software programs in all A1 observations carried out from 2019 November onward.

The observations, stored in filterbank format,8 were folded and de-dispersed with PRESTO (Ransom et al. 2003; Ransom 2011) using nbins = 512 or 1024 phase bins9 and nchan = 64 frequency channels for A2 observations and nchan = 64 or 128 for A1 observations. The data were folded using the timing flag of the task prepfold and the parameter (.par) file provided by IPTA,10 "Combination B" with edits adapted to the IAR site. We then calculated the time of arrival (TOA) of the pulses using the pat package in PSRCHIVE (Hotan et al. 2004) with a Fourier phase gradient-matching template fitting (Taylor 1992). The template was obtained by applying a smoothing wavelet algorithm to a best profile; a more detailed discussion of the template selection is provided in Appendix A.2. The TOAs in this data set were fixed of clock systematics on 2019 April 22 (MJD 58595), when we reached an accuracy of  < 1 μs (see Gancio et al. 2020 for details on clock settings).

3. Analysis of the Observations

3.1. S/N of the Observations

In order to characterize the S/N of the observations we use the functions getDuration and getSN of the Python package PyPulse11 (Lam 2017). In Figure 1 we show the S/N of each observation as a function of its duration. The mean S/N of observations with A1 is 151 and with A2 is 105, with mean observing times of 147 minutes and 116 minutes, respectively. However, we note that these numbers are affected by many short and low-quality observations.

Figure 1.

Figure 1. Signal-to-noise ratio of the observations of each antenna as a function of their tobs. We also plot $f({t}_{\mathrm{obs}})=a\sqrt{{t}_{\mathrm{obs}}}$, where a = 13.1 min−1/2 for A1 and a = 11.1 min−1/2 for A2.

Standard image High-resolution image

When we restrict our analysis to observations with S/N > 50, the mean S/N for observations with A1 increases to 166 and with A2 to 122, with mean observing times of 162 minutes and 124 minutes, respectively. We summarize these and other values in Table 2.

Table 2.  Number of Observations N, Mean S/N, and Mean tobs Expressed in Minutes per S/N Subset per Antenna

  S/N > 1 S/N > 50 S/N > 80 S/N > 110 S/N > 140 S/N > 170
  N 〈S/N〉 tobs N 〈S/N〉 tobs N 〈S/N〉 tobs N 〈S/N〉 tobs N 〈S/N〉 tobs N 〈S/N〉 tobs
A1 170 151 147 159 160 155 150 166 160 120 183 166 96 197 180 59 223 187
A2 197 105 116 164 120 121 128 136 146 88 153 178 58 168 192 22 192 194
A1+A2 367 127 130 323 140 140 278 152 154 208 170 171 154 186 182 81 214 191

Download table as:  ASCIITypeset image

We observe a positive correlation between S/N and tobs, fitting to a ${\rm{S}}/{\rm{N}}\propto \sqrt{{t}_{\mathrm{obs}}}$ as expected (Lorimer & Kramer 2012):

Equation (1)

where P is the pulsar period and W its width, Tpeak is its maximum amplitude, Tsys is the noise temperature of the system, tobs is the observing time, and nP is the number of polarizations observed.

We collect the observations per S/N for each antenna and display them as histograms in Figure 2. We observe a distribution for A1 with a mean higher than the corresponding distribution for A2, perhaps due to the broader band sensitivity of A1.

Figure 2.

Figure 2. Histogram of the observations for each antenna, A1 and A2, according to their S/N.

Standard image High-resolution image

We collect the observations into sets of S/N > 1, 50, 80, 110, 140, and 170, corresponding to roughly the position of the larger bins in the A2 histogram. In Table 2 we specify the number of observations, mean duration, and mean S/N for each of these sets.

3.2. Scintillations

In what follows we assume that the expected S/N scales $\propto \sqrt{{t}_{\mathrm{obs}}}$ and that additional variations in the S/N are due to scintillation. We note that the observations described in Section 2 lack absolute flux calibrations and thus possible variations in Tsys are not accounted for. Moreover, RFI is also variable and its mitigation leads to variations in the effective bandwidth of each observation, so additional dispersion in the S/N versus tobs relation is also expected.

To quantify the variations due to scintillation we build a projected pulse S/N as ${\rm{S}}/{{\rm{N}}}_{\mathrm{proj}}={\rm{S}}/{\rm{N}}\sqrt{{t}_{\max }/{t}_{\mathrm{obs}}}$, with ${t}_{\max }=217\,\mathrm{minutes}$. Given that short observations have a large uncertainty in their determined S/N, we only use observations with tobs > 20 minutes (which is roughly half of the scintillation timescale). Figure 3 shows a histogram of the projected pulse S/N for A1 and A2. The line shows the estimated probability density function from scintillation (Cordes & Chernoff 1997)

Equation (2)

where nISS is the number of scintles, S0 is the mean value of the signal S (i.e., S0 = 〈S/N〉), and Θ is the Heaviside step function. We calculate nISS by fitting the normalized12 data for each antenna. We obtain nISS = 2.67 ± 0.31 for A1 and nISS = 2.17 ± 0.25 for A2, with S0 = 127.27 for A1 and S0 = 87.16 for A2. The bin size is determined using the Knuth's rule (Knuth 2006) algorithm provided in astropy (Astropy Collaboration et al. 2013, 2018), though we confirm that the obtained values do not depend on the binning by repeating the analysis for different bin sizes.

Figure 3.

Figure 3. Histograms of projected pulse S/N for  J0437−4715 for A1 (left column) and A2 (right column). The top row is for the full observations and the bottom row for observations split in segments such that ${t}_{\min }=2000\,{\rm{s}}$. The line shows the estimated scintillation distribution from fitting nISS in Equation (2).

Standard image High-resolution image

In addition, we make use of the long duration of the observations, which is significantly larger than the typical scintillation timescale for J0437−4715. We split the observations in segments lasting ${t}_{\min }=2000\,{\rm{s}}$ and ${t}_{\min }=5000\,{\rm{s}}$ and repeat the previous analysis. In this case we obtain larger values of nISS ∼ 5.

For each of these fittings we perform a Kolmogorov–Smirnov (KS) test for goodness of fit. This test quantifies the distance between the empirical distribution of the sample (obtained from the projected S/N) and the cumulative distribution function of the reference distribution (obtained from fitting nISS in Equation (2)) under the null hypothesis that the sample is drawn from the reference distribution. The null hypothesis can be rejected at a given confidence level α if the resulting p-value is lower than 1−α. The p-values obtained are summarized in Table 3. For α = 0.9 (90% confidence level) we find that the goodness of fit cannot be statistically rejected for complete observations with either A1 or A2, or split observations of A1, all of which have a large p-value. However, the fits to the split observations of A2 fail this test, suggesting that, for short observations with A2, Equation (2) may not be entirely valid or that the estimate of the projected S/N becomes unreliable.

Table 3.  Adjusted Values of nISS for Each Set of Observations and the Kolmogorov–Smirnov Test p-value for Each Fitting

  A1 A2
  nISS Error p nISS Error p
No split 2.67 0.31 0.38 2.17 0.25 0.24
Split tmin = 5000 s 6.33 0.54 0.90 5.53 1.04 0.009
Split tmin = 2000 s 5.50 0.36 0.70 4.63 0.43 0.004

Download table as:  ASCIITypeset image

We compare our values of nISS with theoretical estimations following Lam & Hazboun (2020). We scale the scintillation parameters given at the frequency of 1.5 GHz by Keith et al. (2013) to match our observations centered at 1.4 GHz and obtain the scintillation bandwidth Δνd = 740 MHz and scintillation timescale Δtd = 2290 s. We calculate nISS via the usual formula

Equation (3)

where ηt and ην are filling factors ∼0.2. The estimated nISS for T = 220 minutes are 2.22 for A1 (BW = 112 MHz) and 2.18 for A2 (BW = 56 MHz). We confirm that the value obtained with A2 is consistent with the expectations, although for A1 it is larger than expected, perhaps due to additional factors affecting the variability observed.

Gwinn et al. (2006) found two scintillation scales observing J0437−4715 in 327 MHz. Rescaling those scales to our observing frequency, 1400 MHz, we find timescales of Δtd,1 = 5727 s and Δtd,2 = 515 s, leading to nISS,1 = 1.46 for both antennas and nISS,2 = 6.58 for A1 and nISS,2 = 6.35 for A2. The latter values are close to those displayed in Table 3 for the split observations, consistent with the shorter observations being more sensitive to the shorter-scale scintillations. Note also that those scintillations scales have been observed to vary notably between epochs (Smirnova et al. 2006).

4. Timing Analysis

Here we discuss the timing-error dependence on three parameters: (i) the S/N of the observations, (ii) the number of bins used in the reduction of the observations, and (iii) the BW of the observations. In addition we study and quantify other sources of systematic errors.

4.1. Timing Residuals

We compute the timing residuals of the TOAs using Tempo2 (Hobbs et al. 2006) and its Python wrapper, libstempo (Vallisneri 2020), with the timing model given in the file J04374715.par provided by IPTA and adapted to the IAR observatory.13 Tempo2 returns: (i) the MJD, residual, and template-fitting error (σTOA) of each observation, and (ii) the timing model parameters, the weighted errors of the residuals (rms), and the ${\chi }_{\mathrm{red}}^{2}={\chi }^{2}/{n}_{\mathrm{free}}$ of the timing model fit to the residuals. The χ2 test considers a good fit when ${\chi }_{\mathrm{red}}^{2}\sim 1;$ instead, a value of ${\chi }_{\mathrm{red}}^{2}\gg 1$—assuming the timing model is correct—indicates the presence of outliers or an underestimation of the residual errors. In this case we can:

  • 1.  
    assume a certain systematic error in the computation of the TOAs due, for instance, to instrumental errors such as observation timestamp, reduced BW, hidden RFI, etc.;
  • 2.  
    define a criterion to discard the outliers, for instance by vetting residuals above a certain value.

Figure 4 shows the timing residuals of the observations taken with each antenna. The values of the ${\chi }_{\mathrm{red}}^{2}$ from the fits are greater than 1, indicating the presence of outliers or underestimated errors.

Figure 4.

Figure 4. Timing residuals for the complete data set for A1 and A2.

Standard image High-resolution image

To account for possible systematic errors, we adopt a simplified approach14 in which we add quadratically a common σsys to all the σTOA, producing a total error

Equation (4)

We calculate the value of σsys that leads to ${\chi }_{\mathrm{red}}^{2}=1$, obtaining σsys ∼ 0.67 μs for the observations with A1 and σsys ∼ 1.0 μs for the observations with A2. We recompute the rms using the corrected errors in the residuals by adding σsys as in Equation (4). We obtain rms = 0.72 μs for A1 and rms = 1.05 μs for A2.

In order to determine the effect of the outliers measurements we set a 3σ criterion, but since the σtot itself depends on the assumed value of σsys, we apply the following iterative process.

  • 1.  
    Given an initial ${\sigma }_{\mathrm{sys}}^{(i)}$ (as obtained previously), to each TOA we assign an error ${\sigma }_{\mathrm{tot}}^{(i)\ 2}={\sigma }_{\mathrm{TOA}}^{2}+{\sigma }_{\mathrm{sys}}^{(i)\ 2}$.
  • 2.  
    If the residual of an observation is such that $\left|\delta t\right|\gt 3{\sigma }_{\mathrm{tot}}^{(i)}$, then this observation is discarded as an outlier.
  • 3.  
    If the residual is such that $\left|\delta t\right|\leqslant 3{\sigma }_{\mathrm{tot}}^{(i)}$, then we keep this observation and its TOA error is given the new value
    Equation (5)
    where ${\sigma }_{\mathrm{sys}}^{(i+1)}$ is chosen such that when the new residuals are computed we get ${\chi }_{\mathrm{red}}^{2}=1$. In practice, the process converges after one or two iterations.

In this way, we eliminate all the outliers in our data set (five observations for A1 and 24 for A2) and obtain refined values of the systematic errors σsys ∼ 0.50 μs for A1, σsys ∼ 0.66 μs for A2, and σsys ∼ 0.59 μs for A1+A2.

4.2. Timing versus S/N

We study the timing residuals for each S/N subset for each antenna; these are shown in Figure A2. By filtering out the low-S/N observations, those with large residuals are eliminated. Thus we conclude that outliers tend to have low S/N; we note, however, that some low-S/N observations also have small residuals.

We perform a timing analysis for A1, A2, and A1+A2. In all cases—even for large S/N values—we obtain ${\chi }_{\mathrm{red}}^{2}\gg 1$. We interpret this as indicative of unaccounted-for systematic errors and we perform the procedure detailed in Section 4.1 to find the values of σsys that lead to ${\chi }_{\mathrm{red}}^{2}\approx 1$. Taking as a reference the case for S/N > 50, we obtain σsys = 0.5 μs for A1, 0.66 μs for A2, and 0.59 μs for A1+A2. We note that these values change if we do not remove the 3σ outliers, leading to σsys = 0.67 μs for A1, 0.99 μs for A2, and 0.83 μs for A1+A2.

In Figure 5 we display the values of σsys and rms for each subset of observations with their 1σ error bars (∼68% confidence limits). The error bars for σsys are computed as the values ${\sigma }_{\mathrm{sys},\min }$ that yield ${\chi }_{\mathrm{red}}^{2}({n}_{\mathrm{free}},\alpha /2)$ and ${\sigma }_{\mathrm{sys},\max }$ that yield ${\chi }_{\mathrm{red}}^{2}({n}_{\mathrm{free}},1-\alpha /2)$, with α = 0.32.

Figure 5.

Figure 5. Top: σsys for the 〈S/N〉 of each subset of observations for each antenna and their corresponding error bars. Bottom: rms from recomputed time-of-arrival errors augmented by σsys and their corresponding error bars.

Standard image High-resolution image

The timing rms diminishes (i.e., improves) for higher S/N observations. The value of the rms is well-constrained to ≳0.5 μs, though values a little higher (≈0.7 μs) are obtained for A2 when low-S/N ( < 100) observations are included. In particular, for S/N > 140 we get rms ≈ 0.52 μs for A1 and 0.55 μs for A2, which is a slight improvement over those reported in Gancio et al. (2020) (0.55 μs for A1 and 0.81 μs for A2).

We also obtain a consistent value of σsys ≈ 0.5 μ s. There is a systematic trend of lower σsys toward increasing S/N (Figure 5), though with a small significance (close to or below the 1σ level). We conclude that the systematic errors of both IAR's antennas are of the order of 0.4–0.6 μs when accounting for outliers and S/N effects.

Finally, the values of σsys and rms are smaller for A1 than for A2 for each subset of ${\rm{S}}/{{\rm{N}}}_{\min };$ this behavior subsists at the same 〈S/N〉 (Figure 5). Given that the main differences between the two antennas are BW and nP, we explore those dependences in detail to understand the reason(s) behind the improved timing precision of A1.

4.3. Timing versus Bandwidth

In Section 4.2 we found that the A1 observations have a lower timing rms than the corresponding observations from A2. Since both antennas differ in their BW (112 MHz for A1 and 56 MHz for A2), and nP (1 and 2 for A1 and A2, respectively), we reduce the observations to the same BW and nP in order to quantify the effect of those hardware differences on errors. For this analysis we use the six subsets of observations defined by their S/N in Section 3.1. We split the A1 observations into two subintervals of BW = 56 MHz using pat for the scrunching with the options -j "F {n}", where n = 2 is the number of subintervals. For the observations with A2 we would like to split the two polarizations separately; however, this is not possible as these observations only store the sum of both polarization modes. From the radiometer equation (Lorimer & Kramer 2012)

Equation (6)

we see that the errors scale with ${n}_{{\rm{P}}}^{-1/2};$ hence, we multiply the errors in the A2 residuals by a factor $\sqrt{2}$ to simulate a case with nP = 1 (assuming we are not strongly affected by the polarization of the source).

In this way, for each subset of S/Nmin we have five groups of observations: three from A1 (one with BW = 112 MHz and two reduced to 56 MHz), and two from A2 (with errors modeled to two and one polarization modes). We model S/N(σTOA) and then compute 〈S/N〉 for each of these subset from the σTOA of their observations.

The resulting rms values are plotted in Figure 6. For a given value of S/Nmin, the higher-frequency sub-band of the A1 observations has lower rms values than the lower-frequency sub-band, which in turn is similar to the case of the A2 observations in one polarization. The inclusion of all the BWs for A1 or both polarizations of A2 shows consistently lower rms. These results can be interpreted as due to:

  • 1.  
    RFI affecting more the lower-frequency sub-band,
  • 2.  
    effects of differential scintillation,
  • 3.  
    dispersion effects being better modeled at higher frequencies.

Figure 6.

Figure 6. Timing residual rms values with the A1 observations scrunched to BW = 56 MHz and those observed with A2 reduced to one polarization. We also reproduce the full A1 and A2 original residuals.

Standard image High-resolution image

In conclusion, we found that the main difference in the timing errors between the antennas can be attributed to the difference in BW, followed by nP and an increase of S/N in the selection of the observations. An increase in BW seems paramount for improving the timing errors.

4.4. Timing versus Observation Length (with Split of Observations)

The rms values improve both with longer observation times and with a higher number of TOAs (Lorimer & Kramer 2012; Wang 2015). Here we investigate whether it is possible to improve the overall timing by splitting the long (>200 minute) observations into multiple subintegrations, producing various TOAs from each observation. In this way, we obtain additional data points at the expense of lower timing precision in each of them.

We start with a set of 268 unsplit observations with tobs > 75 minutes and σTOA < 1.0 μs. First we calculate their rms (dashed line in Figure 7). Then we systematically split these observations for different values of the minimum duration of the subintervals considered, from 10 to 75 minutes (the shorter the subinterval, the larger the number of TOAs obtained). We plot the rms as a function of tmin in Figure 7, and specify the total number of points obtained from splitting in each case. We see that the rms diminishes monotonously as tmin increases, showing that this method is not suitable for improving the timing of our observations. This is most likely a sign of the S/N being a major factor affecting our current timing precision; for ${t}_{\min }\lt 70\,\mathrm{minutes}$ it is also possible that jitter affects the TOAs.

Figure 7.

Figure 7. Timing obtained when splitting observations. The total number of points after splitting is detailed as N. The horizontal dashed line corresponds to no splitting. The projected crossing of curves occurs at about 90 minutes.

Standard image High-resolution image

5. Noise Analysis

In the following sections, we analyze: (i) the white noise in our data set, which is needed to estimate the systematic timing errors; (ii) the red noise, which is correlated in time and has a larger amplitude at low frequencies; (iii) the GWB at μHz frequencies, which is produced from a variety of sources that we cannot identify individually. For this purpose, we use the software ENTERPRISE (Enhanced Numerical Toolbox Enabling a Robust PulsaR Inference SuitE), a pulsar-timing analysis code which performs noise analysis, GW searches, and timing-model analysis (Ellis et al. 2019). ENTERPRISE uses the timing model, previously fit with Tempo2, as the basis to construct a design matrix centered around the timing parameters. This is then used to find the maximum-likelihood fit for the white- and red-noise parameters.

5.1. White-noise Analysis

As described by Alam et al. (2021), the white noise is modeled using three parameters.

  • 1.  
    EQUAD accounts for sources of uncorrelated and systematic (Gaussian) white noise in addition to the template-fitting error in the TOA calculations.
  • 2.  
    EFAC is a dimensionless constant multiplier to the TOA uncertainty from template-fitting errors. It accounts for possible systematics that lead to underestimated uncertainties in the TOAs.
  • 3.  
    ECORR describes short-timescale noise processes that have no correlation between observing epoch, but are completely correlated between TOAs that were obtained simultaneously at different observing frequencies. This parameter accounts for wide-band noise processes such as pulse jitter (Osłowski et al. 2011; Shannon et al. 2014).

Considering σTOA to be the template-fitting error of a given observation, the resulting white-noise model is modeled by the noise covariance matrix (Lentati et al. 2014)

Equation (7)

where t and ν are the time and frequency of the observation, respectively.

Given that we have multiple TOAs per day (see Section 2), we need to consider an ECORR contribution. We then incorporate all these noise components and timing-model parameters (as specified in Appendix A.1) into a joint likelihood using ENTERPRISE. We sample the posterior distribution using the sampler PTMCMCSampler (Ellis & van Haasteren 2017), setting uniform prior distributions.

First, we investigate the consistency between the analysis with ENTERPRISE and the independent analysis we presented in Section 4.2. To this end, we use the same set of observations as in the aforementioned analysis while we fix the value EFAC = 1 and we exclude the ECORR parameter from our analysis, so that the Gaussian white noise EQUAD becomes equivalent to the parameter σsys in Equation (4). We obtain a notorious agreement between the values of EQUAD and σsys: when removing 3σ outliers (Section 4.2) we obtain EQUAD ≈ 0.57 μs, fully consistent with the systematic error of σsys ≈ 0.59 μs that we found in Section 4.2 for A1+A2 and S/N > 50;  without removing the outliers, the results are EQUAD ≈ 0.80 μs and σsys ≈ 0.83 μs, which again are fully consistent.

Second, we repeat the previous analysis, now taking both EFAC and EQUAD as free parameters. By doing so, we obtain $\mathrm{EFAC}={2.48}_{-0.30}^{+0.29}$ and ${\mathrm{log}}_{10}\mathrm{EQUAD}=-{6.30}_{-0.07}^{+0.10}$ (EQUAD ≈ 0.5 μ s) as the best-fit parameters. The quoted error bars correspond to the 1σ ( ≈ 68%) confidence limits that were obtained using the lower-level function corner.quantile from the corner.py Python module (Foreman-Mackey 2016) and taking the 16th and 84th percentiles. We present a corner plot for these parameters in Figure 8. In this plot we also show confidence intervals considering that the relevant 1σ contour level for a 2D histogram of samples is 1 − e−0.5 ∼ 0.393 (39.3%). Values of EFAC ∼ 1 would suggest that observing and timing procedures result in near-true TOA uncertainty estimates; thus, the adjusted values of EFAC ∼ 2.5 indicate that the TOAs error bars are considerably underestimated.

Figure 8.

Figure 8. White noise ENTERPRISE timing analysis for J0437−4715 using the A1+A2 data set.

Standard image High-resolution image

5.2. Red-noise Analysis

The red noise is assumed to be a stationary Gaussian process, which is parameterized with a power-law model in frequency, such that the spectral power density is given by (see Hazboun et al. 2020)

Equation (8)

where f is a given Fourier frequency in the power spectrum, fref is a reference frequency (in this case, 1 yr−1), Arn is the amplitude of the red noise at the frequency fref, and Γrn is the spectral index. We take a prior on the red-noise amplitude that is uniform on ${\mathrm{log}}_{10}({A}_{\mathrm{rn}}\,[{\mathrm{yr}}^{3/2}])\in [-14.5,-12]$, and a prior on the red-noise index that is uniform on Γrn ∈ [0, 2.6]. The spectrum is evaluated at 30 linearly spaced frequencies f ∈ [1/Tspan, 30/Tspan], where Tspan is the span of the pulsar's data set (in this case, 1.1 yr).

In the following analysis we use a total of 319 observations obtained with A1 and A2 between 2019 April and 2020 June that meet the criteria tobs > 40 minutes, S/N > 40, and σTOA < 1 μs. Details of this data set are summarized in Table 2.

We analyze the data sets of each antenna both independently and altogether. As described in Section 5.1, ECORR accounts for noise that is correlated between observations that were obtained simultaneously at different frequencies. Since such observations are not available for a single antenna, we exclude the ECORR parameter from the analysis of the individual data sets. However, we do include this parameter when analyzing the A1+A2 data set in order to profit from the simultaneous observations at different frequencies. A corner plot for these parameters and their errors is shown in Figure 9.

Figure 9.

Figure 9. ENTERPRISE timing analysis of red noise for 1.1 yr of observations of J0437−4715 (A1+A2).

Standard image High-resolution image

The fitted values to the white- and red-noise parameters for the different data sets are presented in Table 4. To complement this we explore the possibility of splitting long-duration observations into two subintegrations of ${t}_{\min }=75\,\mathrm{minute}$ in order to sample shorter timing frequencies. The adjusted values for this case, also presented in Table 4, are consistent within 1σ to those obtained without the splitting. We therefore conclude that splitting long observations does not improve the timing analysis, in line with the conclusion from Section 4.4.

Table 4.  Adjusted Values for the White- and Red-noise Parameters

  EFAC ${\mathrm{log}}_{10}\mathrm{EQUAD}$ Γrn ${\mathrm{log}}_{10}{A}_{\mathrm{rn}}$
A1 ${2.43}_{-0.23}^{+0.25}$ $-{6.3}_{-0.06}^{+0.06}$ ${1.22}_{-0.48}^{+0.13}$ $-{13.88}_{-0.44}^{+0.40}$
A2 ${2.82}_{-0.30}^{+0.32}$ $-{6.34}_{-0.09}^{+0.08}$ ${1.02}_{-0.42}^{+0.36}$ $-{13.51}_{-0.26}^{+0.20}$
A1+A2 ${2.48}_{-0.24}^{+0.26}$ $-{6.32}_{-0.07}^{+0.09}$ ${0.97}_{-0.38}^{+0.37}$ $-{13.63}_{-0.27}^{+0.21}$
A1+A2* ${2.76}_{-0.19}^{+0.24}$ $-{6.47}_{-0.14}^{+0.17}$ ${0.80}_{-0.37}^{+0.39}$ $-{13.54}_{-0.29}^{+0.43}$

Note. Values marked with (*) were obtained by splitting the observations as described in Section 4.4.

Download table as:  ASCIITypeset image

We obtain EQUAD ≈ 0.5 μs in all cases. The value of Γrn is less constrained and consistent within 0.5–1.5, while the amplitude is Arn ≈ (2–7) × 10−13. The obtained Arn lies within the expected order of magnitude, whereas Γrn falls below the expected value by at least a factor two (Wang 2015), which we interpret as due to the relatively short baseline of our current data set.

5.3. GW Analysis

We now embark on setting the first bounds to the GW amplitude from massive binary black holes using observations from IAR. In doing so, we aim to exploit the high cadence of these observations.

5.3.1. GW Analysis: Stochastic Background

The contribution of the GWB coming from an ensemble of supermassive black hole (SMBH) binaries or primordial fluctuations during the big bang is modeled similarly to that of the red noise (Equation (8)). Any GWB component is modeled as a single stationary Gaussian process with a power-law timing-residual spectral density

Equation (9)

The analysis is nearly identical to the red-noise analysis described in Section 5.2. The prior on the GWB amplitude is taken uniform on ${\mathrm{log}}_{10}({A}_{\mathrm{gwb}}\,[{\mathrm{yr}}^{3/2}])\in [-14.4,-11]$, whereas the prior on the GWB index is uniform on Γgwb ∈ [0, 3.2]. Moreover, we fix EFAC and EQUAD to the values adjusted in Section 5.2 for each data set.

In this analysis we also consider both the original data sets and the data sets obtained by splitting the observations in subintegrations with tobs ≥ 75 minutes (see Section 4.4). The best-fit values to each GWB parameter and for each set of observations are presented in Table 5.

Table 5.  Best-fit Values to the GWB Parameters

Parameter A1 A2 A1+A2
  No split Split No split Split No split Split
Γgwb ${0.50}_{-0.26}^{+0.25}$ ${0.38}_{-0.21}^{+0.20}$ ${0.12}_{-0.04}^{+0.04}$ ${0.10}_{-0.03}^{+0.03}$ ${0.38}_{-0.29}^{+0.28}$ ${0.28}_{-0.18}^{+0.17}$
${\mathrm{log}}_{10}{A}_{\mathrm{gwb}}$ $-{13.48}_{-0.23}^{+0.25}$ $-{13.37}_{-0.20}^{+0.20}$ $-{13.33}_{-0.21}^{+0.23}$ $-{13.22}_{-0.17}^{+0.18}$ $-{13.48}_{-0.23}^{+0.24}$ $-{13.41}_{-0.18}^{+0.18}$

Download table as:  ASCIITypeset image

Using the split observations, we get a higher cadence at the cost of worsening the S/N (and therefore the TOA precision) of each data point. Our results show consistent values of Agw ≈ (3 ± 2) × 10−14 and Γgw ≈ 0.3 ± 0.2 for all data sets, both with and without the splitting. In general, splitting the observations leads to slightly lower values of Γgw and slightly higher values of Agw, though these differences are not significant as they are within 1σ of the values obtained without the splitting.

While the amplitude we find is consistent with expected bounds for the stochastic background, Γgw falls short from the expected 13/3 for a stochastic GW background of SMBH binaries (Siemens et al. 2013), possibly due to our current relatively short observational baseline of ≈1.1 yr.

In order to account for a background of SMBH binaries, we repeat this analysis including a red-noise model with a uniform prior on the spectral index Γrn ∈ [0, 7] and an extra red-noise process with Γgwb set to 4.33. We also fix all of the white-noise parameters to the values obtained in Section 5.1. In Figure 10 we show a corner plot of the fit to the joint A1+A2 data sets. We find values of Arn ≈ (4 ± 3) × 10−14, consistent with our previous results, and Γrn ≈ 3.81 ± 2.1. Such uncertainties may be attributed to the short time span.

Figure 10.

Figure 10. ENTERPRISE gravitational wave analysis for J0437−4715 (A1+A2) including a red-noise process with Γgwb = 4.33.

Standard image High-resolution image

5.3.2. GW Analysis: Continuous Source

A single SMBH binary system produces continuous GWs because the system does not evolve notably over the few years of a pulsar-timing data set. We used the Python package Hasasia (Hazboun et al. 2019a) to calculate the single-pulsar sensitivity curve of our data set of J0437−4715 for detecting a deterministic GW source averaged over its initial phase, inclination, and sky location. The dimensionless characteristic strain is calculated for each sampled frequency as

Equation (10)

where S is the strain-noise power spectral density for the pulsar. This is related to the power spectrum of the induced timing residuals of Equation (9) by (see Jenet et al. 2006)

Equation (11)

The white- and red-noise parameters that were adjusted in Sections 5.1 and 5.2 using ENTERPRISE are loaded into the package in order to account for these effects in the calculations. Since our observations have a time baseline of Tobs = 1.1 yr and a nearly daily cadence, we calculate the curve across a frequency range between 1/(10 Tobs) ∼ 2.8 × 10−9 Hz and 1/(1 day) ∼ 1.2 × 10−5 Hz.

The resulting sensitivity curve is shown in Figure 11. It is readily seen that there is a loss of sensitivity at a frequency of (1 yr)−1, caused by fitting the pulsar's position, and at a frequency of (PB)−1 ∼ 2μHz (with PB the orbital period), caused by fitting the orbital parameters of the binary system. The additional spikes seen at frequencies higher than (PB)−1 correspond to harmonics of the binary orbital frequency.

Figure 11.

Figure 11. Sensitivity curve for J0437−4715 using 1.1 yr observations at IAR, including pulsar noise characteristics, for a single deterministic gravitational wave source averaged over its initial phase, inclination, and sky location (A1+A2; blue curve). The vertical green line corresponds to a frequency of 1/Tobs, the dotted red line to 1/Tyr, and the dotted purple line to 1/PB (orbital period). The black crosses correspond to the mean values of the ${\mathrm{log}}_{10}{h}_{\mathrm{gw}}$ distributions obtained using ENTERPRISE.

Standard image High-resolution image

In addition, the sensitivity at lower frequencies is reduced by (i) the fit of a quadratic polynomial to the TOAs required to model the pulsar spin-down and (ii) the fitting of "jumps" to connect the timing residuals obtained with different backends (Yardley et al. 2010). The frequency dependence ( ∼ f−3/2) at low frequencies is evidence of a fit to a quadratic spin-down model for the pulsar spin frequency. As a result, the minimum of the sensitivity curve should be attained at a frequency of 1/Tobs. However, given that the Tobs of our data set is close to 1 yr, this feature coincides with the loss of sensitivity at (1 yr)−1. We expect to obtain a well-defined minimum at  ≈ 1/Tobs in future by accumulating more observations and achieving a significantly longer time baseline.

For completeness, we tested the significance of the red-noise contribution by calculating a sensitivity curve without this component. The curve was essentially insensitive to those changes in the priors. This is expected, since the injection of red noise should lead to a flat sensitivity curve around the minimum (Hazboun et al. 2019b), though in our case it is coincident with the spike at (1 yr)−1.

For comparison, we used ENTERPRISE to perform a fixed-frequency Markov chain Monte Carlo procedure at four different frequencies. We obtained a posterior distribution for ${\mathrm{log}}_{10}{h}_{\mathrm{gw}}$ at each of these frequencies with a mean value in great agreement with the curve obtained with Hasasia, as shown in Figure 11.

These first results on GW sensitivity are encouraging, though we still need to achieve a sensitivity of at least a factor 10 higher in order to observe even the most favorable SMBH binary merger events. For instance, the six billion solar mass source of 3C 186 at z ≈ 1 produced a GW of h ∼ 10−14 at the time of arrival to our Galaxy, roughly a million years ago (Lousto et al. 2017).

6. Conclusions

We presented the first detailed analysis of the observational campaign toward the bright MSP J0437−4715 using the two antennas at IAR's observatory. This data set comprises over a year of high-cadence (up to daily) observations with both antennas, A1 and A2.

We quantified the timing precision and noise parameters using the current setup for A1 and A2. We also explored the effect of different reduction parameters of the raw data. We conclude that as follows.

  • 1.  
    The number of phase bins used in the reduction does not have an impact on the timing precision as long as nbins ≥ 256.
  • 2.  
    The S/N of the individual observations plays a crucial role in determining the timing precision. In particular, to achieve a timing precision  < 1 μs, observations with S/N > 140 are required, a condition that is currently fulfilled by ∼1/3 of the observations taken with A2 and ∼1/2 of the observations taken with A1.
  • 3.  
    Splitting long observations into shorter intervals does not improve the timing precision, most likely due to current limitations in S/N for short observations.
  • 4.  
    A1 slightly outperforms A2, probably due to its larger bandwidth configuration.
  • 5.  
    The systematic errors of the observations are σsys ≈ 0.5 μs, although this value is likely to be S/N-limited. The rms of the data set is  ≈ 0.5–0.6 μs
  • 6.  
    The white-noise analysis performed with ENTERPRISE indicates that the error bars are typically underestimated by a factor ∼3 when accounting for EQUAD and EFAC.
  • 7.  
    We placed upper limits to the GWB in the tens of nHz to sub-μHz frequency range. Although the current sensitivity is not sufficient for placing physically interesting constraints, the ongoing campaign—together with incoming hardware upgrades—is likely to significantly improve in the next five to ten years (see also Lam & Hazboun 2020). In particular, observations lasting over 3 hr are promising for exploring GW signals with frequencies above 0.1 μHz by splitting them into hour-scale subintegrations.

Ongoing and future hardware upgrade of IAR's antennas, such as installing larger-bandwidth boards, promise to expand IAR's observational capabilities and improve its achievable timing precision. Such upgrades would allow us to reduce the systematical errors of the antennas and to include (sub)daily high-precision timing of other MSPs of interest, such as J2241−5236.

The authors thank various members of the IAR's technical staff and board for their work and support, as well as numerous members of the LIGO-Virgo and NANOGrav collaborations for very valuable discussions, in particular P. Baker, J. Hazboun, M. Lam, and S. Taylor. We thank the anonymous referee for comments that helped to improve the manuscript. Part of this work was supported by the National Science Foundation (NSF) from Grants No. PHY-1912632, No. PHY-1607520, and No. PHY-1726215. V.S.F. acknowledges financial support from a summer fellowship of the Asociación Argentina de Astronomía (AAA). F.G. and J.A.C. acknowledge support by PIP 0102 (CONICET). This work received financial support from PICT-2017-2865 (ANPCyT). J.A.C. was also supported by grant PID2019-105510GB-C32/AEI/10.13039/501100011033 from the Agencia Estatal de Investigación of the Spanish Ministerio de Ciencia, Innovación y Universidades, and by Consejería de Economía, Innovación, Ciencia y Empleo of Junta de Andalucía as research group FQM- 322, as well as FEDER funds.

Facility: IAR. -

Software: ENTERPRISE (Ellis et al. 2019), PRESTO (Ransom et al. 2003; Ransom 2011), PSRCHIVE (Hotan et al. 2004), PyPulse (Lam 2017), Hasasia (Hazboun et al. 2019a).

Appendix: Details of the Analysis

A.1. Reduction of Observations

To de-disperse and fold the observations we use the software PRESTO (Ransom et al. 2003; Ransom 2011). It has a variety of tools for the reduction of observations. The processed data are s stored in a .pfd file that contains the pulse profile for different time and frequency bins. In addition to this profile, PRESTO outputs a .polycos file that contains the coefficients of a polynomial modeling the variation of the pulsar period. These coefficients allow us to determine the period of pulsation in a topocentric reference system and are necessary to compute the timing residuals.

If the observation is in the file obs.fil and the mask in the file m.mask, then the command-line used has the following syntax:

Equation (A1)

where the option -timing indicates prepfold to generate a file .polycos based on the pulsar parameters This process is currently automatized through local Python scripts.

A.2. Templates

Considering that A1 and A2 have different configurations (number of polarizations and BW; see Table 1), it is possible that slight differences arise in the integrated profile seen by each antenna. We therefore study whether the template used has a significant impact on the timing residuals.

To create each template we choose observations with nbins = 1024 phase bins and nchan = 64 frequency channels. We select for each antenna data the highest-S/N observation and extract the noise using the task psrsmooth in the package psrchive. This choice of templates seems adequate since J0437−4715 is a very bright pulsar and selected individual observations produce a high enough S/N to create a template. We highlight that the large span of our observations (over 3 hr in many cases) mitigates the impact of the intrinsic jitter of the pulsar (Liu et al. 2012). The selected templates for each antenna correspond to the profiles with nbins = 1024 phase bins in Figure A1. The relative error between them is below 5% near the peak, with larger relative differences toward the wings, but those do not have a major influence in the determination of the TOAs.

Figure A1.

Figure A1. Top: templates for each antenna for different values of nbins. Bottom: rms found for each subset per nbins, and its corresponding 1σ error bars.

Standard image High-resolution image

In the preliminary timing analysis of J0437−4715 presented in Gancio et al. (2020), we used the same template on both antennas to determine TOAs. We show in our separated analysis of A1 and A2 data that this assumption was valid to the current level accuracy, producing an rms = 0.8 μs residual for A2 observations with the use of either template. Notwithstanding this a posteriori verification, we consistently use different templates for A1 and A2 throughout this work.

A.3. Timing versus Number of Phase Bins

In order to study the effect of the number of phase bins (nbins) used in the folding of observations on the timing residuals, we have taken data folded originally with nbins = 1024, and processed with the routine bscrunch of the psrchive package for Python, to generated copies of the observations and their corresponding templates for each antenna, but with values of nbins = 512, 256, 128, 64, and 32. Through this process of scrunching we obtained six sets of observations for each antenna only differing by their nbins. Figure A1 shows the effect of the nbins on the templates for each antenna. While for nbins = 32 we lose temporal resolution, the differences beyond nbins ≥ 256 are almost negligible to our precision.

Next we compute the timing residuals for each nbins subset. Interestingly, only for nbins ≤ 64 are the timing errors too large; for nbins ≥ 128 the derived TOAs are very consistent, the size of the error bar being the main difference (with smaller error bars obtained for larger nbins). The rms of the residuals for each subset after adjusting σsys as a function of nbins is shown in Figure A1. The rms decreases with increasing nbins significantly for 64 to 256 bins, showing that we cannot attain good timing for nbins ≤ 64 and need at least 256 bins to obtain a precision higher than 1 μs for a pulsar like J0437−4715. This corresponds to a time interval much smaller than the full width at half-maximum of the pulse, that is, 0.3 μs at 1400 MHz.

A.4. Timing versus S/N

Here we present additional figures that support the hypothesis that our timing studies are limited due to the S/N of the observations. This effect has a larger impact for A2, as can be seen in Figure A2.

Figure A2.

Figure A2. Residuals of each subset of observations with A1 (left) and A2 (right) grouped in different data sets according to their minimum S/N (see legends).

Standard image High-resolution image

Footnotes

  • In 2019 the antennas A1 and A2 were renamed "Varsavsky" and "Bajaja," respectively.

  • In Appendix A.3 we show that the number of phase bins does not affect the posterior analysis as long as nbins ≥ 256.

  • 10 
  • 11 
  • 12 

    We normalize the number of observations in each S/N bin by the total number of observations of each antenna.

  • 13 

    In this .par we also included four JUMPs to account for the different central frequencies of the observations, and the corresponding antenna (A1/A2; see Table 1).

  • 14 

    In Section 5 we compare the results of this simplified model with those obtained using a standard and a more refined white noise model as in Arzoumanian et al. (2016).

Please wait… references are loading.
10.3847/1538-4357/abceb3