While the Autocorrelation Function (ACF) provides a broad overview of how a time series observation correlates with its past values, it doesn't distinguish between direct and indirect relationships. For instance, the correlation between yt and yt−2 measured by the ACF includes the effect of yt−1, because yt is likely correlated with yt−1, which in turn is correlated with yt−2. Sometimes, we need to isolate the direct relationship between yt and a specific lag yt−k, removing the influence of the intermediate observations yt−1,yt−2,…,yt−k+1. This is precisely what the Partial Autocorrelation Function (PACF) helps us measure.
Think of partial autocorrelation at lag k as the correlation between yt and yt−k that is not explained by their mutual correlations with the time series values at the intervening lags (t−1,t−2,…,t−k+1).
You can imagine calculating it by:
This process effectively removes the linear dependence associated with the intermediate lags, leaving only the direct connection between yt and yt−k.
Difference between ACF and PACF when considering the relationship between yt and yt−2. PACF isolates the direct link after accounting for yt−1.
The primary utility of the PACF lies in identifying the order (p) of an Autoregressive (AR) model. Recall that an AR(p) model expresses yt as a linear combination of its previous p values plus an error term:
yt=c+ϕ1yt−1+ϕ2yt−2+⋯+ϕpyt−p+ϵtBy definition, in a pure AR(p) process, yt has a direct linear dependency on yt−1,…,yt−p. However, once you account for these p lags, there should be no direct linear relationship between yt and further lags like yt−p−1,yt−p−2, etc. Any correlation seen at these further lags in the ACF is indirect, flowing through the first p lags.
Therefore, for a stationary AR(p) process, we expect the PACF plot to show:
This distinct pattern contrasts with the ACF of an AR(p) process, which typically shows a more gradual decay towards zero.
Similar to ACF plots, PACF plots display the partial autocorrelation values for different lags on the y-axis against the lag number on the x-axis. They also typically include confidence intervals (often represented by a shaded area, usually at the 95% level).
By examining the PACF plot alongside the ACF plot (as we'll discuss how to generate in the next section), you gain valuable clues about the underlying structure of your time series and can make informed decisions about which model type (AR, MA, or ARMA) might be appropriate and what order parameters to investigate. For identifying AR model orders, the PACF is particularly informative.
© 2025 ApX Machine Learning