On the Fast Fourier Transform to compute the autocovariance#

Here we show the effects of computing the autocovariance function (ACVF) using the Fast Fourier Transform. We will show these effects using functions where we know the analytical Fourier Transform. Which are the exponential and squared exponential autocovariance functions. First we compute the ACVF on an arbitrary grid and then we compute the ACVF on an extended grid of frequencies.

	Autocovariance \(R(\tau)\)	Power spectrum \(\mathcal{P}(f)\)
Exponential	\(\dfrac{A}{2\gamma} \exp{(-\|\tau\|\gamma)}\)	\(\dfrac{A}{\gamma^2 +4\pi^2 f^2}\)
Squared exponential	\(A\exp{(-2 \pi^2 \tau^2 \sigma^2)}\)	\(\dfrac{A}{\sqrt{2\pi}\sigma}\) \(\exp\left(-\dfrac{f^2}{2\sigma^2} \right)\)

Arbitrary grid of frequencies#

Squared exponential ACVF#

First, we define the autocovariance function and its Fourier transform, i.e. the power spectrum.

We generate a grid of frequencies from \(0\) to \(f_N\) with a spacing \(\Delta f=f_0\). The power spectrum will be of length \(N\).

f0 = 1e-4 # min frequency 
df = f0 
fN = 5. # max frequency
# grid of frequencies, from 0 to fend-df with step size df
f = np.arange(0,fN,df)
# number of points in the psd including the zeroth freq
N = len(f)
# define the parameters of the psd
A , sigma = 1 , 1.e-1

P = SqExpo_PSD(f,A,sigma)

../_images/e46cd24d52eb7dc04798198f612b2c8cdcb2e89e264aec244336dd2b41f7e793.png

As we know the power spectrum is symmetric for negative frequencies, we can use the real inverse Fourier transform to compute the autocovariance function.

We use the function np.fft.irfft to compute the inverse Fourier transform. The discrete inverse Fourier transform returns a real array of length \(2N-2\). Where x[0] is the autocovariance at \(\tau=0\) and x[N-1] is the autocovariance at \(\tau=\tau_\mathrm{max}\), after which the autocovariance is mirrored for negative lags. The time separation between the autocovariance values is given by \(\Delta \tau = \tau_\mathrm{max}/(N-1)\), where \(\tau_\mathrm{max}=1/2\Delta f\) is the maximum lag for which we compute the autocovariance function.

To obtain the true autocovariance function we divide the result by the sampling frequency \(\Delta f\).

tmax = 0.5 / df 
dt = 1/df/(N-1)/2

t = np.arange(0,tmax+dt,dt)

R_theo = SqExpo_ACV(t,A,sigma) # theoretical inverse FT of the lorentzian

# compute the IFFT, with a normalisation factor 1/M, M=2*(len(y)-1)
R_dift = np.fft.irfft(P) 
assert (R_dift[1:N-1]-np.flip(R_dift[N:])<np.finfo(float).eps).all(), "The IRFFT did not return a mirrored ACV"
R_num = R_dift[:N]/dt

Below, we plot the resulting autocovariance function and the error between the numerical and analytical autocovariance function. For plotting purposes, we only plot the autocovariance function for positive lags and up to \(\tau=10\).

We can see that in the case of the squared exponential autocovariance function, the numerical Fourier transform is exact within the numerical precision.

../_images/abbb2627bef9dead973dc273cfcc8226ad8b28d43de824e019ae521717862be3.png

We can check the Fourier pairs properties of the autocovariance function and the power spectrum by comparing the sums.

P(0)           : 3.98942e+00
Sum of ACV     : 3.98942e+00
Sum of ACVtheo : 3.98942e+00
------Error----: 1.33227e-15

Sum of PSD     : 1.000e+00
ACVtheo[0]     : 1.000e+00
ACV[0]         : 1.000e+00
-----Error-----: 0.000e+00

Exponential ACVF#

f0 = 1e-3
df = f0 
fN = 1. # max frequency
f = np.arange(0,fN,df)
N = len(f)
A , sigma = 1 , 1e-1
P = Expo_PSD(f,A,sigma)

Unlike in the previous case, the power spectrum is less steep, it does not decay as fast to zero. This means that the autocovariance function at the firs time lags will be more affected by the numerical Fourier transform.

../_images/1bc9660c142916632bd2af1feb2916cc1c8664dc714275e8e880b871fc2e94d1.png

We see that this time FFT does not give an exact result within the numerical precision, this is due to the power spectrum does not decay as fast to zero as in the case of the squared exponential autocovariance function. The Discrete Fourier Transform assumes the signal is periodic, the power spectrum is replicated at the end of the grid of frequencies but because the power spectrum does not decay to zero, the aliasing effect is more important. We will see how to mitigate this effect in the next section.

../_images/79f9ffff48df60fc3faecfd2c8f65e494a742d8b558c9f6b0124027ba6e0fba5.png

P(0)           : 1.00000e+02
Sum of ACV     : 1.00000e+02
Sum of ACVtheo : 1.00021e+02
------Error----: 2.84217e-14

Sum of PSD     : 4.949e+00
ACVtheo[0]     : 5.000e+00
ACV[0]         : 4.949e+00
-----Error-----: 0.000e+00

Extending the grid of frequencies#

As we saw in the previous section, the autocovariance obtained via FFT on a grid of frequencies can be affected by the aliasing effect. To mitigate this effect, we can extend the grid of frequencies.

Let us consider the case of a signal of duration \(T=400\) with a sampling period of \(\Delta T=1\). We want to compute the ACVF of the signal given our two PSD models. The minimal grid of frequencies to compute the ACVF is \(f_\mathrm{min}=1/T\) and the maximal grid of frequencies is given by the Nyquist frequency \(f_\mathrm{max}=1/2\Delta T\). As we saw in the previous section, the autocovariance obtained via FFT on a grid of frequencies can be affected by aliasing.

To mitigate this effect, we extend the grid of frequencies with two factors: \(S_\mathrm{low}\) and \(S_\mathrm{high}\), for low and high frequencies, respectively. The grid of frequencies is then given by \(f_0 = f_\mathrm{min}/S_\mathrm{low} = \Delta f\) to \(f_N = f_\mathrm{max}S_\mathrm{high}=N \Delta f\).

Extending the high frequencies#

First, we start by studying the effect of \(S_\mathrm{high}\) on the autocovariance function. We will use the exponential autocovariance function as an example.

To obtain the values of the autocovariance function up to lag \(\tau=T\) (time separation between the first and last value), \(S_\mathrm{low}\) should be larger than \(2\). This is due to \(\tau_\mathrm{max}=1/2\Delta f\) and \(\Delta f=f_0=\frac{1}{S_\mathrm{low}T}\).

<>:21: SyntaxWarning: invalid escape sequence '\m'
<>:21: SyntaxWarning: invalid escape sequence '\m'
/tmp/ipykernel_1722/717671293.py:21: SyntaxWarning: invalid escape sequence '\m'
  ax.set_title("Extended frequency grid with $S_\mathrm{low}=%i$ and $S_\mathrm{high}=%i$"%(S_low,S_high))

../_images/7f4b4226866e6463e8a45efddd93324d5acbe3df98cddaeaffb86278bf4e82a0.png

For several values of \(S_\mathrm{high}\), we compute the autocovariance function and the error between the numerical and analytical autocovariance function. We plot the absolute difference up to \(\tau_\mathrm{max}\) with an inset for lags up to \(\tau=10\).

The error at the first lags and all other lags is decreasing as \(S_\mathrm{high}\) increases, this reduces aliasing, the power spectrum at high frequencies is slowly reaching zero as more frequencies are added. Also the error appears to attain a plateau for high values of \(\tau\), a plateau which is always smaller for higher values of \(S_\mathrm{high}\).

<>:30: SyntaxWarning: invalid escape sequence '\m'
<>:38: SyntaxWarning: invalid escape sequence '\m'
<>:30: SyntaxWarning: invalid escape sequence '\m'
<>:38: SyntaxWarning: invalid escape sequence '\m'
/tmp/ipykernel_1722/3567205907.py:30: SyntaxWarning: invalid escape sequence '\m'
  ax.plot(t,np.abs(R_theo-R_num),label=f'$S_\mathrm{{high}}={{{S_high}}}$',alpha=0.8)#,marker='s')
/tmp/ipykernel_1722/3567205907.py:38: SyntaxWarning: invalid escape sequence '\m'
  axins.plot(t,np.abs(R_theo-R_num),label=f'$S_\mathrm{{high}}={{{S_high}}}$',alpha=0.8)#,marker='s')

../_images/c80cba88eba7f2368643da734c9db37ee5ee22a7d1285b7133b991debf5efd67.png

Extending the low frequencies#

This time, we study the effect of \(S_\mathrm{low}\) on the autocovariance function. We keep \(S_\mathrm{high}=1\) and we vary \(S_\mathrm{low}\).

Below, we plot the power spectrum on the extended grid of frequencies, more frequencies are added at low frequencies, as \(f_0\) decreases with \(S_\mathrm{low}\) the sampling in frequency is very dense.

<>:21: SyntaxWarning: invalid escape sequence '\m'
<>:21: SyntaxWarning: invalid escape sequence '\m'
/tmp/ipykernel_1722/1677276729.py:21: SyntaxWarning: invalid escape sequence '\m'
  ax.set_title("Extended frequency grid with $S_\mathrm{low}=%i$ and $S_\mathrm{high}=%i$"%(S_low,S_high))

../_images/3e2f69ff9b008b22b81a4d047284d1aee43fd4981e1c50b4f4061e0428590bb6.png

As \(S_\mathrm{low}\) increases, the error for the first lags decreases very slowly. The error appears to attain a plateau for high values of \(\tau\), a plateau which is always the same for all values of \(S_\mathrm{low}\) except for \(S_\mathrm{low}=2\). Comparing with the previous figure, we can clearly see that the effect of \(S_\mathrm{high}\) is more important than the effect of \(S_\mathrm{low}\).

<>:30: SyntaxWarning: invalid escape sequence '\m'
<>:39: SyntaxWarning: invalid escape sequence '\m'
<>:30: SyntaxWarning: invalid escape sequence '\m'
<>:39: SyntaxWarning: invalid escape sequence '\m'
/tmp/ipykernel_1722/2695329910.py:30: SyntaxWarning: invalid escape sequence '\m'
  ax.plot(t,np.abs(R_theo-R_num),label=f'$S_\mathrm{{low}}={{{S_low}}}$',alpha=0.8)#,marker='s')
/tmp/ipykernel_1722/2695329910.py:39: SyntaxWarning: invalid escape sequence '\m'
  axins.plot(t,np.abs(R_theo-R_num),label=f'$S_\mathrm{{low}}={{{S_low}}}$',alpha=0.8)#,marker='s')

../_images/6149cfab022b940cb86b88dc2a382ac52256f67602cfd654ddeb8597d5a8266b.png

Conclusion#

In practice, we should use both \(S_\mathrm{low}\) and \(S_\mathrm{high}\) to extend the grid of frequencies. The optimal values of \(S_\mathrm{low}\) and \(S_\mathrm{high}\) depend on the power spectrum but also on the desired time lags, it might be necessary to try several values of \(S_\mathrm{low}\) and \(S_\mathrm{high}\) to check that the computation of the autocovariance function gives a good estimate of the true autocovariance function.